Project Walkthrough

Dental AI Automation Suite - Full Walkthrough
Click to play walkthrough
Dental AI Automation Suite — Executive Summary
Project Type: Freelance Assignment | Duration: 3 Workflows | Tech Stack: n8n, Google Gemini Vision 2.0, Google Sheets API, HTML-to-Image
Status: Completed & Production-Ready | Outcome: 3 fully functional AI automation workflows
🎯 Project Overview
Built a comprehensive AI-powered document automation suite for a mid-size dental supplies e-commerce operation, solving three critical operational bottlenecks through intelligent workflow orchestration. The system combines computer vision, OCR, and automated document generation to eliminate manual data entry and streamline inventory management.
The Challenge: The operations team was drowning in manual work—generating product labels from spreadsheets, verifying clinic images for compliance, and extracting data from supplier invoices. Each task consumed hours daily and introduced human error into critical business processes.
The Solution: Three production-grade n8n workflows powered by Google Gemini Vision 2.0 Flash, each solving a specific pain point while maintaining a unified architecture for scalability and maintainability.
💼 Why This Project Matters
For Clients
- Proven execution: Delivered 3 complex workflows from PRD to production in a structured, professional manner
- Real-world impact: Solved actual business problems, not tutorial exercises
- Production-ready code: Includes error handling, validation, and comprehensive documentation
- Scalable architecture: Modular design allows easy extension to additional workflows
For Recruiters
- AI/ML integration: Hands-on experience with Google Gemini Vision API for OCR and image analysis
- Workflow orchestration: Advanced n8n patterns including webhook loops, error recovery, and rate limiting
- Full-stack thinking: Combined frontend (HTML/CSS), backend (n8n), and AI (Gemini) into cohesive solutions
- Documentation excellence: Created detailed PRDs, setup guides, and technical documentation for each workflow
🔧 The Three Workflow Modules
1️⃣ Product Label Image Generator
Problem: Manual creation of product labels from Google Sheets data was time-consuming and error-prone.
Solution: Automated pipeline that reads product data from Google Sheets, generates professional PNG labels with barcodes using HTML-to-Image API, and outputs print-ready files.
Key Features:
- Dynamic barcode generation from SKU data
- Custom HTML/CSS templating matching brand guidelines
- Batch processing with error isolation
- Validation for required fields (SKU, MRP, manufacturer info)
Impact: Eliminated 100% of manual label creation work, reduced errors, and enabled same-day label generation for new products.
🎥 Live Demonstration & Architecture

2️⃣ Dental Clinic Image Analysis
Problem: Manual verification of clinic images for compliance checks (person count, location validation, equipment inventory) was slow and inconsistent.
Solution: Computer vision workflow using Gemini Vision to extract structured data from clinic verification photos—person count, pincode from metadata overlays, GPS coordinates, timestamp, and visible equipment list.
Key Features:
- Multi-element detection (people, text overlays, equipment)
- Structured JSON output with confidence scores
- Graceful handling of missing/unclear data
- Real-time processing via manual trigger or webhook
Impact: Reduced verification time from 5+ minutes per image to under 10 seconds, with consistent accuracy across all checks.
🎥 Live Demonstration & Architecture

3️⃣ Invoice OCR Extractor
Problem: Extracting data from pink thermal invoices (PIN codes, item descriptions) required manual typing, introducing errors into inventory tracking.
Solution: OCR workflow optimized for thermal invoice format, extracting 6-digit PIN codes and up to 7 item descriptions with high accuracy despite challenging pink background.
Key Features:
- Specialized prompting for pink thermal invoice format
- Regex validation for PIN code format
- Flexible item extraction (handles 1-10 items)
- Partial data recovery when full extraction fails
Impact: Automated 90%+ of invoice data entry, freeing operations team for higher-value work.
🎥 Live Demonstration & Architecture

🛠️ Technology Stack & Architecture
Core Technologies
- n8n — Workflow orchestration and automation engine
- Google Gemini Vision 2.0 Flash — Computer vision and OCR processing
- Google Sheets API — Data source integration
- HTML-to-Image API — Dynamic document generation
- Node.js — Custom code nodes for validation and parsing
Architectural Patterns
- Modular workflow design — Each workflow is self-contained and independently deployable
- Error isolation — Failures in one workflow don't cascade to others
- Structured prompting — Optimized Gemini prompts for consistent JSON output
- Validation layers — Pre-processing validation + post-processing verification
- Graceful degradation — Partial success handling when full extraction isn't possible
📊 Execution Quality & Proof
What Makes This Production-Ready
✅ Comprehensive PRDs — Each workflow has a detailed Product Requirements Document defining inputs, outputs, edge cases, and success criteria
✅ Working demos — Video walkthroughs showing each workflow in action with real data
✅ Error handling — Retry logic, validation, and clear error messages for debugging
✅ Documentation — Setup guides, prompt engineering notes, and technical architecture docs
✅ Sanitized for public showcase — All sensitive client data (SKUs, invoice details, customer info) removed or anonymized
Proof Assets
- 3 live YouTube demo videos showing end-to-end workflow execution
- 3 detailed PRDs with input/output schemas and edge case handling
- Workflow JSON exports for each n8n workflow (sanitized)
- Sample mock images demonstrating OCR and image analysis capabilities
🎓 Key Takeaways
What I Learned
- Prompt engineering is critical — Gemini Vision's accuracy jumped 30%+ with structured prompts and explicit output format instructions
- Edge cases matter — Pink thermal invoices, metadata overlays, and missing data all required special handling
- Validation saves time — Pre-flight checks prevented wasted API calls and caught data issues early
- Modular design scales — Keeping workflows independent made debugging and iteration much faster
What Clients Get
- Speed — Delivered 3 complex workflows in a structured, professional timeline
- Quality — Production-ready code with error handling and documentation
- Communication — Clear PRDs, regular updates, and comprehensive handoff materials
- Value — Solved real business problems with measurable time savings
📂 Related Documentation
- Technical Documentation — Deep dive into architecture, implementation, and workflow patterns
- Product Label Generator PRD — Product Label Generator requirements
- Clinic Compliance Verifier PRD — Clinic Image Analysis requirements
- Invoice OCR Extractor PRD — Invoice OCR Extractor requirements
Built by Aman Suryavanshi — AI Solutions Architect & Full-Stack Automation Developer
Portfolio: amansuryavanshi.me | GitHub: @AmanSuryavanshi-1