Live Projects


TheMarketCast.ai

2025

3,000+ organic weekly users tracking private fundraising activity

↗ Visit Site

Platform for parsing and analyzing SEC Form D filings daily, enabling investors to track private fundraising activity in real time. Attracts 3,000+ organic weekly users via SEO/GEO optimization with real-time tracking of 450,000+ filings. Enriched company data via Exa.

Python PostgreSQL Redis PostHog Exa SEC EDGAR

StrataLens AI

2025

Equity research agent with 91% accuracy on FinanceBench

↗ Visit Site

Equity research platform for financial analysts. Built latency-optimized agentic RAG system using Qwen3-235B that synthesizes earnings calls, SEC filings, and real-time news via Tavily to answer complex financial queries. SEC filings sub-agent achieved 91% accuracy on FinanceBench benchmark with LLM-as-judge evaluation. Built text-to-SQL stock screener using DuckDB for natural language financial queries.

Python FastAPI React TypeScript PostgreSQL Redis Langchain Logfire

Technical Projects


Web Search Engine

2026

Distributed crawler over 10M+ pages with full-text search

Built fault-tolerant distributed web crawlers in Rust across 5 instances with politeness policies and Redis queue coordination, storing 10M+ tech-focused pages (200GB) in AWS S3 with metadata in PostgreSQL, provisioned via Terraform. Built full-text search over the crawled corpus using Tantivy inverted index with BM25 ranking and a search UI for querying curated tech content.

Rust Redis AWS S3 PostgreSQL Tantivy Terraform

Financial Podcast Platform

2025

50+ early beta users, personalized portfolio podcasts on-demand

Real-time AI podcast platform using OpenAI Whisper and GPT-4o with Celery workers for async audio generation. React frontend with WebSocket streaming for live audio updates, backend deployed on AWS. Organically acquired 50+ users in early beta with portfolio-based personalized podcast generation on-demand. Market data pipeline processing 9,000 tickers every 20 minutes using SERP API and Redis caching.

Python React Celery WebSocket OpenAI Whisper GPT-4o AWS

SourceMapR

2025

One line of code to add observability to your RAG pipelines

↗ GitHub

Drop-in observability library for RAG pipelines. Traces retrieval quality, latency, and relevance with minimal instrumentation, designed so engineers can monitor and debug RAG systems without changing their existing pipeline code. Provides observability from the source documents with citations to detect broken chunking, incorrect parsing, and other ingestion-level failures that are otherwise invisible at query time.

Python RAG Observability LLM

Open Source


PySyft, OpenMined

2019–2022

Core contributor to one of the most widely used privacy-preserving ML libraries

↗ GitHub

Implemented FALCON protocol operations, the first Python implementation of an honest-majority maliciously secure framework for private deep learning. Planned the SyMPC library roadmap and performed code reviews for the secure multi-party computation library. Contributed to core privacy-preserving ML infrastructure using PyTorch, TensorFlow, and differential privacy.

Python PyTorch TensorFlow SMPC Differential Privacy

GreyNSights

2021

Privacy-preserving data analysis framework for Pandas

↗ GitHub

Framework for privacy-preserving data analysis using Pandas with a pointer-based architecture for flexible EDA on private data without direct access. Implements differential privacy for individual row protection and federated analytics using secure multi-party computation.

Python Pandas Differential Privacy SMPC