Archive Post

TFuzzyScore: A Hybrid Similarity Metric Combining TF-IDF and Fuzzy Logic for Entity Mapping in Structured Text

📄 Read the full peer-reviewed preprint on TechRxiv: View Full Paper (DOI). Introducing TFuzzyScore In…

Extending Power BI Usage Metrics Beyond 30 Days Using Azure and Databricks

Abstract: Power BI is a powerful business intelligence platform widely used across enterprises for interactive reporting…

Optimizing Null Checks and Multi-Granularity Deduplication in Scalable SQL Pipelines

Abstract: Large-scale data pipelines, especially those built on Delta Lake and Spark SQL, require robust mechanisms…