Skip to content

Extract Text and Data from Document with OCR NER

Notifications You must be signed in to change notification settings

TZNcse209/OCR_NER

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

OCR_NER

  1. Data Preparation
  • Extracting Text from Business Card Images
  • Cleaning and Labeling the Data
  1. Data Preprocessing
  • Loading and Preparing the Data
  • Converting Data to Spacy Training Format
  • Train-Test Split
  1. Training NER Model with Spacy
  • Model Architecture and Training Process
  • Evaluation and Performance Metrics
  1. Testing the Trained Model
  • Data Preparation on New Images
  • Prediction and Bounding Box Generation
  • Evaluation and Refinement
  1. Document Scanner
  • Edge Detection and Morphological Transformations
  • Perspective Transform and Image Cropping
  1. Web App Development in Flask
  • Document Upload and Scanning
  • Manual Adjustment with JavaScript Canvas
  • Text Extraction and Entity Prediction
  • Entity Placement and Visualization

About

Extract Text and Data from Document with OCR NER

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages