
Databricks

A collection of 17 issues

RAG Does Not Fail Because of the Model. It Fails Because of the Data.

Every enterprise AI strategy deck I have seen in the past few years contains the same promise: “We will build a RAG-based knowledge assistant…

Why Z-Ordering Fails on Skewed Data — and Liquid Clustering Does Not

Z-ordering and Liquid Clustering both aim to improve Databricks query performance through data skipping. But when your data is skewed, one of them quietly becomes useless. A visual explanation of why — and how the Hilbert curve changes everything.

You Migrated From Teradata to Spark and Threw Away the One Thing That Made It Fast

If you have spent any amount of time working with Teradata, you know that the Primary Index is one of the most important design decisions you make. It determines how data is distributed across AMPs and whether your joins are fast or slow. Choosing the wrong Primary Index is one

The 15-Year Detour: How the Data Industry Spent Billions Reinventing SQL

Teradata Hybrid: Bridge or Destination?

Introduction to Apache Spark: A Powerful Solution for Big Data Processing and Analytics

Collect Statistics in Teradata - Evaluation

After collecting every combination considered necessary and helpful, you can check the result of the collected statistics on a table by looking at

* the time it took to collect and the then prevailing circumstances
* the collection results

Consider the lengthier collection time when

Warning: Teradata 16.20 Upgrade May Affect Reporting Queries - Potential Fix Found

Become a Teradata Vantage Expert with Our Free Android App: Download Now from Google Play Store!

Become a Teradata Vantage expert by downloading our new Android app for free from the Google Play store and practicing.

Efficient Teradata Date Calculations Avoiding INTEGER Values

Learn how Teradata stores dates internally as INTEGER values and how to efficiently calculate dates after 1900-01-01 using a simple formula. Get more useful Teradata date calculations in this article.
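As a rough illustration of the internal encoding mentioned in this teaser, here is a minimal Python sketch. It assumes the commonly documented Teradata rule that a DATE is stored as (year - 1900) * 10000 + month * 100 + day. The function names `to_teradata_int` and `from_teradata_int` are hypothetical helpers for this example, not part of any Teradata API, and the linked article may present its formula differently.

```python
from datetime import date


def to_teradata_int(d: date) -> int:
    """Encode a date as the INTEGER value Teradata is assumed to store.

    Assumed rule: (year - 1900) * 10000 + month * 100 + day.
    """
    return (d.year - 1900) * 10000 + d.month * 100 + d.day


def from_teradata_int(value: int) -> date:
    """Decode the assumed internal INTEGER value back into a date."""
    # Adding 19000000 turns the internal value into a plain YYYYMMDD number.
    yyyymmdd = value + 19_000_000
    year = yyyymmdd // 10000
    month = (yyyymmdd // 100) % 100
    day = yyyymmdd % 100
    return date(year, month, day)


if __name__ == "__main__":
    encoded = to_teradata_int(date(2024, 3, 15))
    print(encoded)                     # 1240315
    print(from_teradata_int(encoded))  # 2024-03-15
```

Under this assumption, dates in the 1900s come out as six-digit values such as 991231, while dates from 2000 onward gain a leading 1, which is why comparing a DATE column against a hand-typed integer literal is easy to get wrong.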