Real-Time Fraud Signals Pipeline
Kafka -> Spark Structured Streaming -> Delta -> dbt -> Streamlit. Exactly-once processing, anomaly detection, dbt tests.
- Apache Kafka
- Spark Structured Streaming
- Delta Lake
- dbt
- Streamlit
- Python
Senior Data / Analytics Engineer
I build production data platforms — streaming pipelines, lakehouse architectures, and analytics that move the needle on revenue and risk.
Senior data and analytics engineer with six years at AT&T building production data platforms on Azure and Databricks. I work the full stack from PySpark ingestion through dbt analytics models to BI delivery, with a focus on streaming, data quality, and turning ambiguous business problems into tractable pipelines. Microsoft DP-203 (Data Engineering) and DP-100 (Data Science) certified.
Medallion (Bronze/Silver/Gold) lakehouse over synthetic call detail record (CDR) data. Airflow + MinIO + Iceberg + Great Expectations + dbt.
CLI that uses Claude to suggest Spark/Snowflake query rewrites and partition strategies. Benchmarked against a 50-query corpus.