ReportVision

Overview

ReportVision is a tool that automates the reading and extracting of labs from PDF’s

Steps

Annotate Template for a Lab Report
Extract Data based on selected annotations
Conversion of Extracted Data to PDF’s

Getting Started

Prerequisites

Python 3.8
Node 23.1
Tesseract 5.5 (brew install tesseract)
Java21
PostgreSQL
Docker (required for DB and middleware set up)

Installation and Development Guides

For Frontend
For Middleware
For OCR OCR README.

High Level Architecture

flowchart TD
    A[ReportVision UI] --> B[ReportVision Backend]
    B --> C[OCR API]
    B --> D[Data Storage - Postgres]
    C --> D

The ReportVision application is composed of the following core components:

Components

1. React-Based Single Page Application (SPA)

Purpose: Serves as the user interface for the application.

2. ReportVision Middleware

Purpose: Acts as middleware to handle communication between the UI, OCR API, and data storage.

3. OCR API

Purpose: Performs Optical Character Recognition (OCR) on provided input.

4. Data Storage (PostgreSQL)

Purpose: Stores saved templates and extracted data.

Infrastructure and Cloud Components

Hosting

The application is hosted in Azure

Infrastructure Guide

For detailed information on how the application is deployed and managed in Azure, refer to:
- ReportVision’s Terraform Setup
- Github Action workflows for building and deploying ReportVision in a dev environment