Archive Post
TFuzzyScore: A Hybrid Similarity Metric Combining TF-IDF and Fuzzy Logic for Entity Mapping in Structured Text
📄 Read the full peer-reviewed preprint on TechRxiv: View Full…
Extending Power BI Usage Metrics Beyond 30 Days Using Azure and Databricks
Abstract: Power BI is a powerful business intelligence platform widely used…
Optimizing Null Checks and Multi-Granularity Deduplication in Scalable SQL Pipelines
Abstract: Large-scale data pipelines, especially those built on Delta Lake and…