5 pronunciation exercises on the serialisation and storage format names that data engineers encounter daily.
0 / 5 completed
1 / 5
How is "YAML" pronounced?
YAML (YAML Ain't Markup Language) is pronounced "YAM-ul" /ˈjæməl/ — two syllables, rhyming with "camel." The Y is /j/ (like "yes"), the A is short /æ/, and the second syllable reduces to a schwa /məl/. It is always spoken as a word. The name is a recursive acronym. In a sentence: "Kubernetes manifests are written in YAML, which is sensitive to indentation errors."
2 / 5
How is "Parquet" pronounced?
Parquet is pronounced "par-KAY" /pɑːrˈkeɪ/ — two syllables, stress on the second, and the final T is silent (French borrowing). The second syllable ends in /eɪ/ like "day." This is the same pronunciation as the flooring term "parquet." A very common mistake is pronouncing the T — in French, final consonants in borrowed words are typically silent. In a sentence: "Store the analytics events as Parquet files in S3 so Athena can query them efficiently."
3 / 5
How is "Avro" pronounced?
Avro is pronounced "AV-roh" /ˈævroʊ/ — two syllables: AV (stressed, short /æ/ like "have") + roh (/roʊ/, long O). Stress is on the first syllable. The Apache Avro format was named after the Avro aircraft company. In a sentence: "Kafka messages are serialised with Avro schemas registered in the Schema Registry."
4 / 5
How is "Protobuf" pronounced?
Protobuf (Protocol Buffers) is pronounced "PRO-toh-buf" /ˈproʊtəbʌf/ — three syllables: PRO (stressed, long /oʊ/) + toh (unstressed schwa-like /tə/) + buf (/bʌf/, short U like "buff"). Stress is firmly on the first syllable. In a sentence: "Replace the JSON API responses with Protobuf to reduce payload size and improve parsing speed."
5 / 5
How is "ORC" pronounced?
ORC (Optimized Row Columnar) is pronounced "ork" /ɔːrk/ — one syllable, rhyming with "fork" or "stork." The vowel is /ɔːr/ and the C is a hard /k/. It is always said as a word, coincidentally sharing pronunciation with the fantasy creature "orc." In a sentence: "ORC files compress Hive table data more aggressively than plain text and support predicate pushdown."