Archive Post
TFuzzyScore: A Hybrid Similarity Metric Combining TF-IDF and Fuzzy Logic for Entity Mapping in Structured Text
📄 Read the full peer-reviewed preprint on TechRxiv: View Full Paper (DOI) Introducing TFuzzyScore In…
Extending Power BI Usage Metrics Beyond 30 Days Using Azure and Databricks
AbstractPower BI is a powerful business intelligence platform widely used across enterprises for interactive reporting…
Optimizing Null Checks and Multi-Granularity Deduplication in Scalable SQL Pipelines
AbstractLarge-scale data pipelines, especially those built on Delta Lake and Spark SQL, require robust mechanisms…