☁️ Healthcare Natural Language API - (PoC)

Project Overview

This project is a Proof of Concept (PoC) using Google Cloud's Healthcare Natural Language API. The goal is to test the API's ability to extract medical entities and relationships from unstructured clinical notes, leveraging pre-trained natural language models.

About Healthcare Natural Language API

The Healthcare Natural Language API is part of the Google Cloud Healthcare API. It uses natural language processing (NLP) models to extract healthcare-related information from medical text.

Key Features

The API can identify and extract:

🏥 Medical concepts such as medications, procedures, and health conditions.
📅 Functional attributes like temporal relationships, subjects, and certainty assessments.
🔗 Relationships between entities, such as side effects or drug dosages.

Core Functionality

In this tutorial, we will focus on the following function:

Entity Analysis: The analyzeEntities method inspects medical text to detect and return medical concepts and their relationships.

Prerequisites

Before running this PoC, ensure you have completed the following steps:

✅ Google Cloud account: You must have a Google Cloud account set up.
🌐 Enable APIs: Ensure the Cloud Healthcare API and Healthcare Natural Language API are enabled.
🛠️ Install Google Cloud CLI (gcloud): Download and install the Google Cloud CLI.
📄 Create a .env file: This file will store the necessary environment variables.

Create a `.env` file

To configure the environment variables, create a .env file in your project directory and add the following content:

PROJECT_ID = "project_name"
LOCATION = "location_name"
TOKEN = "token_value"

How to obtain the token value

Follow these steps to authenticate and get your access token:

Authenticate with Google Cloud by running the following command in your terminal:

gcloud auth login

Get the access token by running:

gcloud auth print-access-token

Copy the token value and paste it into the .env file under the TOKEN variable.

📊 Streamlit Dashboard

This project includes an interactive Streamlit dashboard for visualizing and analyzing the Healthcare API results.

Features

The dashboard provides three main sections:

1. Dashboard Tab

Compact metrics cards showing file info, total entities, unique types, and texts
Interactive confidence filtering to filter entities by subject confidence
Visual charts for entity type distribution, subject analysis, and temporal assessment

2. Entity Table Tab

Filterable data table with all entity details
Export functionality to download results as CSV
Advanced filtering by type, subject, and confidence levels

3. Note Tab

Original medical note with color-coded entity highlighting
Dynamic filtering to show/hide specific entity types
Interactive legend showing colors for each entity type
Real-time highlighting based on selected filters

Sample Data

The data/ folder includes:

note_es.txt - Sample Spanish medical note (fictitious)
entities_*.json - Example API response with extracted entities

Running the Streamlit App

Install dependencies:

cd streamlit
pip install -r requirements.txt

Run the application:

streamlit run app.py

Access the dashboard: Open your browser and navigate to http://localhost:8501

App Structure

streamlit/
├── app.py              # Main Streamlit application
├── google_api.py       # Google Healthcare API service
├── data_etl.py         # Data transformation functions
├── requirements.txt    # Python dependencies
└── README.md          # Setup instructions

Goals

Test entity extraction from clinical notes.
Evaluate how well the API identifies medical concepts and maps relationships.
Understand the output format and how to integrate the extracted data into a larger cloud architecture.
Visualize and analyze results through an interactive dashboard.

Next Steps

Set up Google Cloud Healthcare API.
Enable the Healthcare Natural Language API.
Implement entity analysis using the analyzeEntities method.
Analyze the results and review entity mapping.
Use the Streamlit dashboard to explore and visualize the extracted entities.

⚠️ Important Note: The medical note included in this repository is entirely fictitious. It was generated for a fictional healthcare professional and does not belong to any real patient. This data is solely for testing and demonstration purposes.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
data		data
img		img
streamlit		streamlit
.gitignore		.gitignore
README.md		README.md
main.ipynb		main.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

☁️ Healthcare Natural Language API - (PoC)

Project Overview

About Healthcare Natural Language API

Key Features

Core Functionality

Prerequisites

Create a `.env` file

How to obtain the token value

📊 Streamlit Dashboard

Features

1. Dashboard Tab

2. Entity Table Tab

3. Note Tab

Sample Data

Running the Streamlit App

App Structure

Goals

Next Steps

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

☁️ Healthcare Natural Language API - (PoC)

Project Overview

About Healthcare Natural Language API

Key Features

Core Functionality

Prerequisites

Create a .env file

How to obtain the token value

📊 Streamlit Dashboard

Features

1. Dashboard Tab

2. Entity Table Tab

3. Note Tab

Sample Data

Running the Streamlit App

App Structure

Goals

Next Steps

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Create a `.env` file

Packages