GovChime Analytics Platform
Government Contracts Intelligence
Designed 7-package monorepo with 3 SmartSync microservices (Data sync, Materialized view refresh, ClickHouse sync) and Express.js API. 50+ materialized views reduce query times by 100-300% on 70M+ row datasets. Self-hosted CI/CD with 24+ GitHub Actions workflows, Komodo deployment orchestration, and Stripe paywall integration.
Key Features
- 3 SmartSync microservices: Data sync, Materialized view refresh, ClickHouse sync
- 50+ materialized views reducing query times by 100-300% on 70M+ row datasets
- ClickHouse OLAP integration alongside PostgreSQL for real-time analytics
- Express.js API with Stripe paywall integration and PostHog analytics
- 24+ GitHub Actions CI/CD workflows on self-hosted runner
- Komodo HTTP API for Docker deployment orchestration
Tech Stack
Backend
Database & OLAP
Infrastructure
Frontend
AI & Dev Tools
Challenges & Solutions
Slow Analytics Queries on 70M+ Rows
Real-time aggregation queries across 70M+ rows with complex JOINs took seconds, making dashboards unusable for end users.
Designed ClickHouse OLAP integration alongside PostgreSQL with 50+ materialized views for common aggregations. Query times reduced by 100-300% — dashboards became instant.
Multi-Service CI/CD for Sole Engineer
7 packages with interdependent builds and deploys needed reliable CI/CD without a dedicated DevOps team. Frontend ISR depends on backend being live, services must deploy atomically.
Architected 24+ GitHub Actions workflows on a self-hosted runner with dynamic port allocation, Komodo HTTP API for Docker orchestration, and a Build → Verify → Deploy pipeline ensuring frontend validates against temp backend before any production deploy.
Data Quality at Scale
Raw SamgovAPI data contained inconsistencies, missing fields and unstructured descriptions making it difficult to search and match contracts.
Built AI pipeline using LLM APIs for automated data sanitization, opportunity matching and description generation. Structured output validation ensures consistent data quality.