Dental AI Automation Suite

AI-Powered Document Processing & Verification Workflows

Project Walkthrough

Dental AI Automation Suite - Full Walkthrough

Dental AI Automation Suite — Executive Summary

Project Type: Freelance Assignment | Scope: 3 Workflows | Tech Stack: n8n, Google Gemini Vision 2.0 Flash, Google Sheets API, HTML-to-Image API
Status: Completed & Production-Ready | Outcome: 3 fully functional AI automation workflows


🎯 Project Overview

Built a comprehensive AI-powered document automation suite for a mid-size dental supplies e-commerce operation, solving three critical operational bottlenecks through intelligent workflow orchestration. The system combines computer vision, OCR, and automated document generation to eliminate manual data entry and streamline inventory management.

The Challenge: The operations team was drowning in manual work—generating product labels from spreadsheets, verifying clinic images for compliance, and extracting data from supplier invoices. Each task consumed hours daily and introduced human error into critical business processes.

The Solution: Three production-grade n8n workflows powered by Google Gemini Vision 2.0 Flash, each solving a specific pain point while maintaining a unified architecture for scalability and maintainability.


💼 Why This Project Matters

For Clients

  • Proven execution: Delivered 3 complex workflows from PRD to production in a structured, professional manner
  • Real-world impact: Solved actual business problems, not tutorial exercises
  • Production-ready code: Includes error handling, validation, and comprehensive documentation
  • Scalable architecture: Modular design allows easy extension to additional workflows

For Recruiters

  • AI/ML integration: Hands-on experience with Google Gemini Vision API for OCR and image analysis
  • Workflow orchestration: Advanced n8n patterns including webhook loops, error recovery, and rate limiting
  • Full-stack thinking: Combined frontend (HTML/CSS), backend (n8n), and AI (Gemini) into cohesive solutions
  • Documentation excellence: Created detailed PRDs, setup guides, and technical documentation for each workflow

🔧 The Three Workflow Modules

1️⃣ Product Label Image Generator

Problem: Manual creation of product labels from Google Sheets data was time-consuming and error-prone.

Solution: Automated pipeline that reads product data from Google Sheets, generates professional PNG labels with barcodes using HTML-to-Image API, and outputs print-ready files.

Key Features:

  • Dynamic barcode generation from SKU data
  • Custom HTML/CSS templating matching brand guidelines
  • Batch processing with error isolation
  • Validation for required fields (SKU, MRP, manufacturer info)
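
As a rough illustration of the validation and error-isolation steps above, here is a minimal sketch of what a custom code node could do before labels are rendered. The field names (`sku`, `mrp`, `manufacturer`) and the HTML markup are assumptions made for this example, not the client's actual sheet columns or template. Barcode rendering and the HTML-to-Image call are left out.

```javascript
// Hypothetical column names; the real sheet headers are not given in this summary.
const REQUIRED_FIELDS = ["sku", "mrp", "manufacturer"];

// Escape values before placing them in the HTML label template.
function escapeHtml(value) {
  return String(value)
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;");
}

// Return the names of required fields that are missing or blank.
function missingFields(row) {
  return REQUIRED_FIELDS.filter((field) => {
    const value = row[field];
    return value === undefined || value === null || String(value).trim() === "";
  });
}

// Render one label as an HTML string (illustrative markup only).
function renderLabelHtml(row) {
  return [
    '<div class="label">',
    `  <div class="sku">${escapeHtml(row.sku)}</div>`,
    `  <div class="mrp">MRP: ${escapeHtml(row.mrp)}</div>`,
    `  <div class="mfr">${escapeHtml(row.manufacturer)}</div>`,
    "</div>",
  ].join("\n");
}

// Process every row; a bad row is recorded as a failure instead of aborting the batch.
function buildLabels(rows) {
  const labels = [];
  const failures = [];
  rows.forEach((row, index) => {
    const missing = missingFields(row);
    if (missing.length > 0) {
      failures.push({ index, reason: `missing: ${missing.join(", ")}` });
      return;
    }
    labels.push({ sku: String(row.sku), html: renderLabelHtml(row) });
  });
  return { labels, failures };
}
```

Returning failures alongside the successful labels keeps one incomplete row from blocking the rest of the batch, and the failure list can be written back to the sheet for follow-up.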

Impact: Eliminated 100% of manual label creation work, reduced errors, and enabled same-day label generation for new products.

🎥 Live Demonstration & Architecture

Product Label Generator Architecture


2️⃣ Dental Clinic Image Analysis

Problem: Manual verification of clinic images for compliance checks (person count, location validation, equipment inventory) was slow and inconsistent.

Solution: Computer vision workflow using Gemini Vision to extract structured data from clinic verification photos—person count, pincode from metadata overlays, GPS coordinates, timestamp, and visible equipment list.

Key Features:

  • Multi-element detection (people, text overlays, equipment)
  • Structured JSON output with confidence scores
  • Graceful handling of missing/unclear data
  • Real-time processing via manual trigger or webhook
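
To make the structured output and graceful handling of missing data more concrete, here is a minimal sketch of a post-processing step that turns the model's text reply into a fixed shape. The field names and the 6-digit pincode rule are assumptions for illustration; the actual schema lives in the workflow's PRD.

```javascript
// Field names below are hypothetical; the real schema is defined in the project's PRD.
function emptyResult() {
  return {
    ok: false,
    personCount: null,
    pincode: null,
    gps: null,
    timestamp: null,
    equipment: [],
    confidence: {},
  };
}

// Pull a JSON object out of a reply that may carry extra text or a code fence around it.
function extractJson(replyText) {
  const start = replyText.indexOf("{");
  const end = replyText.lastIndexOf("}");
  if (start === -1 || end <= start) return null;
  try {
    return JSON.parse(replyText.slice(start, end + 1));
  } catch (err) {
    return null;
  }
}

// Accept GPS only as numeric lat/lng within valid ranges.
function toGps(value) {
  if (!value || typeof value.lat !== "number" || typeof value.lng !== "number") return null;
  if (Math.abs(value.lat) > 90 || Math.abs(value.lng) > 180) return null;
  return { lat: value.lat, lng: value.lng };
}

// Map the reply onto a fixed shape; anything missing or malformed becomes null or empty.
function normalizeClinicResult(replyText) {
  const parsed = extractJson(String(replyText ?? ""));
  if (parsed === null) return emptyResult();
  const pincodeText = String(parsed.pincode ?? "");
  const count = parsed.personCount;
  const stamp = parsed.timestamp;
  return {
    ok: true,
    personCount: Number.isInteger(count) && count >= 0 ? count : null,
    pincode: /^\d{6}$/.test(pincodeText) ? pincodeText : null,
    gps: toGps(parsed.gps),
    timestamp: typeof stamp === "string" && stamp.trim() !== "" ? stamp.trim() : null,
    equipment: Array.isArray(parsed.equipment)
      ? parsed.equipment.filter((item) => typeof item === "string")
      : [],
    confidence:
      parsed.confidence && typeof parsed.confidence === "object" ? parsed.confidence : {},
  };
}
```

Downstream nodes can then rely on every key being present and only need to check for null, rather than guarding against a reply that is malformed or incomplete.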

Impact: Reduced verification time from 5+ minutes per image to under 10 seconds, with consistent accuracy across all checks.

🎥 Live Demonstration & Architecture

Clinic Compliance Verification Architecture


3️⃣ Invoice OCR Extractor

Problem: Extracting data from pink thermal invoices (PIN codes, item descriptions) required manual typing, introducing errors into inventory tracking.

Solution: OCR workflow optimized for the thermal invoice format, extracting 6-digit PIN codes and up to 7 item descriptions with high accuracy despite the challenging pink background.

Key Features:

  • Specialized prompting for pink thermal invoice format
  • Regex validation for PIN code format
  • Flexible item extraction (handles 1-10 items)
  • Partial data recovery when full extraction fails
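
The PIN validation and partial-recovery features above might be sketched as follows. The input shape (`pin`, `items`) and the status labels are assumptions for this example. The item cap is a parameter because this summary mentions both 7 and 10 as limits.

```javascript
const PIN_PATTERN = /^\d{6}$/;

// Clean one extracted invoice. `raw` is assumed to look like { pin, items },
// which is a hypothetical shape, not the workflow's documented schema.
function cleanInvoice(raw, maxItems = 10) {
  const warnings = [];

  // OCR often inserts stray spaces, so strip whitespace before validating.
  const pinText = String(raw?.pin ?? "").replace(/\s+/g, "");
  const pin = PIN_PATTERN.test(pinText) ? pinText : null;
  if (pin === null) warnings.push("pin missing or not 6 digits");

  // Keep non-empty item descriptions, up to the configured cap.
  const rawItems = Array.isArray(raw?.items) ? raw.items : [];
  const items = rawItems
    .map((item) => String(item ?? "").trim())
    .filter((item) => item.length > 0)
    .slice(0, maxItems);
  if (items.length === 0) warnings.push("no item descriptions found");

  // Partial recovery: keep whatever was usable and report what was not.
  let status = "complete";
  if (warnings.length > 0) {
    status = pin !== null || items.length > 0 ? "partial" : "failed";
  }
  return { pin, items, status, warnings };
}
```

A "partial" result still carries the usable fields, so an operator only has to fill in what the OCR missed instead of retyping the whole invoice.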

Impact: Automated 90%+ of invoice data entry, freeing operations team for higher-value work.

🎥 Live Demonstration & Architecture

Invoice OCR Extractor Architecture


🛠️ Technology Stack & Architecture

Core Technologies

  • n8n — Workflow orchestration and automation engine
  • Google Gemini Vision 2.0 Flash — Computer vision and OCR processing
  • Google Sheets API — Data source integration
  • HTML-to-Image API — Dynamic document generation
  • Node.js — Custom code nodes for validation and parsing

Architectural Patterns

  • Modular workflow design — Each workflow is self-contained and independently deployable
  • Error isolation — Failures in one workflow don't cascade to others
  • Structured prompting — Optimized Gemini prompts for consistent JSON output
  • Validation layers — Pre-processing validation + post-processing verification
  • Graceful degradation — Partial success handling when full extraction isn't possible
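
As one possible reading of the structured prompting pattern above, the sketch below builds a prompt that names each field and pins down the output format. The wording of the prompt and the `buildExtractionPrompt` helper are illustrative assumptions; the prompts used in the actual workflows are not reproduced here.

```javascript
// Build a prompt that states the task, lists each field, and fixes the output format.
// `fields` is an array of { name, type, description } objects (a hypothetical shape).
function buildExtractionPrompt(task, fields) {
  const fieldLines = fields.map(
    (field) => `- ${field.name} (${field.type}): ${field.description}`
  );
  // Show the model the exact keys expected, with null as the placeholder value.
  const emptyShape = Object.fromEntries(fields.map((field) => [field.name, null]));
  return [
    `Task: ${task}`,
    "Extract these fields from the image:",
    ...fieldLines,
    "Rules:",
    "- Respond with a single JSON object and nothing else.",
    "- Use null for any field that is missing or unreadable; do not guess.",
    "Output shape:",
    JSON.stringify(emptyShape, null, 2),
  ].join("\n");
}

// Example usage with made-up field definitions.
const invoicePrompt = buildExtractionPrompt("Read the thermal invoice in the image.", [
  { name: "pin", type: "string", description: "6-digit PIN code" },
  { name: "items", type: "string[]", description: "item descriptions, top to bottom" },
]);
```

Generating the prompt from a field list keeps the prompt and the post-processing validation in step, since both can be driven by the same definitions.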

📊 Execution Quality & Proof

What Makes This Production-Ready

  • Comprehensive PRDs: Each workflow has a detailed Product Requirements Document defining inputs, outputs, edge cases, and success criteria
  • Working demos: Video walkthroughs showing each workflow in action with real data
  • Error handling: Retry logic, validation, and clear error messages for debugging
  • Documentation: Setup guides, prompt engineering notes, and technical architecture docs
  • Sanitized for public showcase: All sensitive client data (SKUs, invoice details, customer info) removed or anonymized
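
The retry logic mentioned above could take several forms. As a minimal sketch, assuming a simple exponential backoff around any async call (for example a vision API request), it might look like this. The helper name, attempt count, and delays are illustrative and not taken from the delivered workflows.

```javascript
const sleep = (ms) => new Promise((resolve) => setTimeout(resolve, ms));

// Run an async operation, retrying on failure with exponentially growing delays.
// Defaults (3 attempts, 500 ms base delay) are placeholder values.
async function withRetry(operation, { attempts = 3, baseDelayMs = 500 } = {}) {
  let lastError;
  for (let attempt = 1; attempt <= attempts; attempt += 1) {
    try {
      return await operation(attempt);
    } catch (err) {
      lastError = err;
      if (attempt < attempts) {
        // Wait 1x, 2x, 4x, ... the base delay before the next try.
        await sleep(baseDelayMs * 2 ** (attempt - 1));
      }
    }
  }
  // Surface a clear message that includes the last underlying error.
  throw new Error(`failed after ${attempts} attempts: ${lastError.message}`);
}
```

Wrapping the final error with the attempt count gives whoever is debugging a failed run both the cause and the fact that retries were already exhausted.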

Proof Assets

  • 3 live YouTube demo videos showing end-to-end workflow execution
  • 3 detailed PRDs with input/output schemas and edge case handling
  • Workflow JSON exports for each n8n workflow (sanitized)
  • Sample mock images demonstrating OCR and image analysis capabilities

🎓 Key Takeaways

What I Learned

  1. Prompt engineering is critical — Gemini Vision's accuracy jumped 30%+ with structured prompts and explicit output format instructions
  2. Edge cases matter — Pink thermal invoices, metadata overlays, and missing data all required special handling
  3. Validation saves time — Pre-flight checks prevented wasted API calls and caught data issues early
  4. Modular design scales — Keeping workflows independent made debugging and iteration much faster

What Clients Get

  • Speed — Delivered 3 complex workflows in a structured, professional timeline
  • Quality — Production-ready code with error handling and documentation
  • Communication — Clear PRDs, regular updates, and comprehensive handoff materials
  • Value — Solved real business problems with measurable time savings


Built by Aman Suryavanshi — AI Solutions Architect & Full-Stack Automation Developer
Portfolio: amansuryavanshi.me | GitHub: @AmanSuryavanshi-1