Back to Portfolio

Talk-to-Data Platform

Agentic Natural Language Intelligence with Multi-Source Integration

An enterprise-grade conversational data platform powered by multi-agent AI orchestration that transforms natural language questions into actionable insights across SQL, NoSQL, and unstructured data sources. Features intelligent query routing, autonomous data validation, semantic schema understanding, and self-healing query correction with built-in governance and audit trails.

The Challenge

Critical barriers to data democratization

🚫

Business users locked out of data insights - require SQL expertise and data team bottlenecks delay critical decisions.

πŸ—‚οΈ

Data scattered across multiple sources (SQL databases, data lakes, APIs, documents) with no unified query interface.

⚠️

Existing NL2SQL tools generate incorrect queries for complex business logic, lack context awareness, and produce unreliable results.

πŸ›‘οΈ

No enterprise governance - query history lost, no audit trails, PII exposure risks, and zero compliance controls.

System Architecture

Multi-agent orchestration with self-healing capabilities

πŸ’Ύ Data Source Layer
Azure SQL DB
Production Data
Cosmos DB
NoSQL
Blob Storage
Documents
External APIs
SaaS
↓
↓
🧠 Agentic Orchestration Layer
Master Coordinator Agent
GPT-5 β€’ Query Understanding & Routing
Schema Agent
GPT-4.5
Query Builder
GPT-5
Data Validator
GPT-4.1
Governance Agent
GPT-4
↓
↓
πŸ”„ Self-Healing & Validation
Query Validation
Syntax & Semantic Checks
Error Recovery Agent
Automatic Correction
↓
πŸ“Š Visualization Engine
Visualization Agent
GPT-5 β€’ Auto Chart Selection
Power BI
Interactive Dashboards
↓
πŸ›‘οΈ Enterprise Governance & Security
PII Detection
Auto-Masking
Audit Trail
Complete History
RLS Enforcement
Row-Level Security
↓
πŸ’¬ API & User Interface
FastAPI Backend
WebSocket + REST
React Frontend
Conversational UI

System Components

Multi-agent architecture building blocks

πŸ“š

Intelligent Data Catalog

Azure Purview + SQL Server 2025

Automated schema discovery with natural language business glossary mapping. Tracks data lineage, classifies sensitive data (PII), and maintains real-time metadata synchronization across all sources.

🧠

Orchestration Agent

Azure OpenAI

Central intelligence that interprets user intent, decomposes complex questions into sub-tasks, routes work to specialized agents, and synthesizes final responses. Maintains conversation context across multi-turn dialogues.

πŸ—ΊοΈ

Schema Intelligence Agent

Azure OpenAI

Semantic understanding of database schemas through business glossary and metadata. Maps natural language concepts to technical columns, understands relationships and joins, and maintains context-aware schema embeddings.

πŸ”§

SQL Generation Agent

Azure OpenAI + SQL Server 2025

Advanced NL2SQL with multi-step query decomposition. Leverages SQL Server 2025's native semantic search and AI functions for context-aware query generation. Handles complex aggregations, subqueries, and cross-database joins.

βœ“

Validation & Quality Agent

Azure OpenAI + Rules Engine

Real-time query validation with syntax checking, semantic correctness verification, and performance prediction. Detects potential data quality issues, validates results against expected patterns, and flags anomalies.

πŸ”„

Error Recovery Agent

AutoGen Multi-Agent Framework

Autonomous error detection and correction system. Analyzes failed queries, generates alternative approaches, learns from successful corrections, and maintains feedback loops. Continuously improves accuracy through reinforcement learning.

πŸ›‘οΈ

Security & Audit Agent

Azure AD + Purview DLP

Enterprise-grade security with automatic PII detection and masking, row-level security enforcement, column-level permissions, and approval workflows for sensitive queries. Complete audit trail with user attribution and compliance reporting.

πŸ“Š

Auto-Visualization Agent

Azure OpenAI + Power BI

Intelligent chart selection based on data characteristics and user intent. Automatically generates dashboards, detects visualization opportunities, and exports to multiple formats. Learns user preferences for consistent styling.

Data Flow: Question to Insight

Multi-agent collaboration in action

1
User Query Received
Business user submits complex question through conversational interface.
"Show me revenue trends by region for products launched in the last quarter, excluding returns"
2
Master Coordinator Agent
Intent Analysis & Agent Routing
Master Coordinator analyzes query complexity and distributes work to specialized agents.
  • Identifies: Time-series + Geographic + Product filtering + Exclusion logic
  • Complexity: High (requires multi-step decomposition)
  • Agent Routing: Schema β†’ Query Builder β†’ Validator
3
Schema Intelligence Agent
Semantic Schema Mapping
Maps natural language to database objects using Azure Purview metadata.
  • "revenue" β†’ [Sales].[TotalAmount]
  • "region" β†’ [Geography].[RegionName] (via JOIN)
  • "products launched" β†’ [Products].[LaunchDate] + FILTER
  • "excluding returns" β†’ WHERE [Sales].[Type] != 'Return'
βœ“ Complete semantic schema understanding achieved
4
Query Builder Agent
Multi-Step Query Decomposition
Breaks complex question into executable sub-queries using SQL Server 2025 semantic layer.
  • Step 1: Identify products launched in last 90 days
  • Step 2: Calculate revenue by region (excluding returns)
  • Step 3: Generate time-series aggregation
  • Step 4: Final JOIN and aggregation
5
Validation Agent
Pre-Execution Validation
Performs comprehensive checks before query execution.
  • SQL Syntax: βœ“ Valid
  • Semantic Correctness: βœ“ Matches intent
  • Performance Estimate: ~2.3 seconds
  • Security Check: βœ“ User has access
βœ“ Query approved for execution
6
Query Execution
Execute against production databases with real-time monitoring.
1,247 rows returned in 1.9 seconds
7
Validation Agent
Result Quality Check
Post-execution validation to ensure data quality.
⚠️ Anomaly detected: Region "APAC" shows 300% spike (flagged for review)
8
Visualization Agent
Intelligent Chart Selection
Auto-generates appropriate visualizations based on data characteristics.
  • Data type: Time-series + Numeric + Categorical
  • Best fit: Line chart with regional comparison
  • Secondary: Geographic heat map
9
Error Recovery Agent (Background)
Self-Healing Monitoring
Monitors execution - NO errors detected this time.
βœ“ Query success rate: 100%
βœ“ Pattern logged for future learning
10
Governance Agent
Audit Trail & Compliance
Complete audit record for compliance and security.
  • User: jane.doe@company.com
  • Query Intent: Revenue analysis by region
  • Data Accessed: Sales, Products, Geography
  • PII Exposure: None detected
  • Compliance: βœ“ GDPR compliant
11
Interactive Visualization Delivered
User receives complete analysis with anomaly alerts and drill-down options.
βœ“ Revenue trends by region visualized
βœ“ Anomaly alert provided
βœ“ Export options available
βœ“ Total time: 2.1 seconds

What Makes This Different?

Unlike traditional NL2SQL tools that rely on a single LLM and often fail on complex queries, this platform uses a multi-agent architecture where specialized AI agents collaborate to understand your question, validate queries before execution, and automatically correct errors. The result: 95% accuracy on complex business questions with built-in enterprise governance that ensures data security, complete audit trails, and regulatory compliance.