DataSentinel AI is an AI-powered Data Reliability Platform designed to protect modern data pipelines by detecting schema drift, validating data quality, and preventing bad data from reaching data warehouses.
Modern organizations depend on data pipelines. A small schema change, a spike in missing values, or an unexpected anomaly can break dashboards, degrade ML models, and mislead business decisions.
DataSentinel AI acts as an intelligent guard layer between data sources and data warehouses.
Data Source
↓
DataSentinel AI
↓
Data Warehouse
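As a rough illustration of the guard-layer idea in the flow above, the sketch below runs a list of checks over a batch and only hands the batch to the warehouse loader when no check reports a problem. The names `guard`, `require_user_id`, and the `user_id` field are hypothetical and chosen for this example only; they are not part of the project's code.

```python
from typing import Callable, List, Sequence

# A check takes a batch of records and returns human-readable problems
# (an empty list means the batch passed that check).
Check = Callable[[Sequence[dict]], List[str]]


def guard(
    batch: Sequence[dict],
    checks: Sequence[Check],
    load: Callable[[Sequence[dict]], None],
) -> List[str]:
    """Run every check; call `load` only if no problems were found."""
    problems: List[str] = []
    for check in checks:
        problems.extend(check(batch))
    if not problems:
        load(batch)  # clean batch: pass it on to the warehouse
    return problems


def require_user_id(batch: Sequence[dict]) -> List[str]:
    """Hypothetical check: every record must carry a non-null user_id."""
    return [
        f"row {i}: missing user_id"
        for i, row in enumerate(batch)
        if row.get("user_id") is None
    ]


# A plain list stands in for the data warehouse in this sketch.
warehouse: List[dict] = []

blocked = guard([{"user_id": None}], [require_user_id], warehouse.extend)
accepted = guard([{"user_id": 7}], [require_user_id], warehouse.extend)
```

Here `blocked` carries the reported problems and nothing is loaded, while the second batch passes and lands in `warehouse`. A real deployment would likely quarantine rejected batches rather than simply drop them.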
- Detect missing columns
- Detect new columns
- Identify data type changes
- Missing value analysis
- Duplicate record detection
- Invalid data checks
- Data quality scoring
- Explain data issues
- Identify possible root causes
- Suggest smart fixes
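The schema drift and data quality checks listed above could be sketched with Pandas roughly as follows. This is a minimal illustration under stated assumptions, not the project's implementation: the function names, the sample columns, and in particular the scoring formula (the average of cell completeness and row uniqueness) are hypothetical. Invalid-data rules and the AI explanation features are not covered here.

```python
import pandas as pd


def detect_schema_drift(expected: dict, df: pd.DataFrame) -> dict:
    """Compare an expected {column: dtype name} mapping with a DataFrame."""
    actual = {col: str(dtype) for col, dtype in df.dtypes.items()}
    shared = sorted(set(expected) & set(actual))
    return {
        "missing_columns": sorted(set(expected) - set(actual)),
        "new_columns": sorted(set(actual) - set(expected)),
        "type_changes": {
            col: (expected[col], actual[col])
            for col in shared
            if expected[col] != actual[col]
        },
    }


def quality_report(df: pd.DataFrame) -> dict:
    """Count missing values and duplicate rows, then derive a 0-100 score."""
    missing = df.isna().sum()
    missing_cells = int(missing.sum())
    duplicate_rows = int(df.duplicated().sum())
    # Assumed scoring rule: average of completeness and uniqueness.
    completeness = 1.0 if df.size == 0 else 1 - missing_cells / df.size
    uniqueness = 1.0 if len(df) == 0 else 1 - duplicate_rows / len(df)
    return {
        "missing_per_column": {col: int(n) for col, n in missing.items()},
        "duplicate_rows": duplicate_rows,
        "score": 100 * (completeness + uniqueness) / 2,
    }


# Hypothetical contract and incoming batch, for illustration only.
expected_schema = {"price": "float64", "quantity": "float64", "region": "object"}
batch = pd.DataFrame(
    {
        "price": [9.5, 20.0, 20.0, None],
        "quantity": ["1", "2", "2", "4"],  # arrived as text instead of numbers
        "source": ["web", "app", "app", "web"],
    }
)

drift = detect_schema_drift(expected_schema, batch)
report = quality_report(batch)
```

In this sample, `drift` flags `region` as missing, `source` as new, and `quantity` as a type change, while `report` counts one missing `price` value and one duplicate row. Comparing dtype names as strings keeps the sketch simple; a stricter version might normalize equivalent dtypes before comparing.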
| Category | Technologies |
|---|---|
| Language | Python |
| Data Processing | Pandas |
| Dashboard | Streamlit |
| API | FastAPI |
| Database | PostgreSQL |
| AI/ML | Scikit-learn, LLM APIs |
- ✅ Repository Setup
- 🚧 MVP Development
- 🔜 Interactive Dashboard
- 🔜 AI Intelligence Engine
- 🔜 Enterprise SaaS Platform
DataSentinel-AI/
│
├── src/
├── data/
├── assets/
├── tests/
├── README.md
└── LICENSE
To become the trust layer between raw data and business decisions.
Contributions, suggestions, and feedback are welcome!
If you like this project, consider giving it a star ⭐