SQL DatabasesPostgreSQL, MySQL; ACID, transactions, isolation levels, MVCCNoSQL DatabasesDocument, key-value, wide-column, graph databases and their use casesDatabase SelectionConsistency needs, query patterns, scale, operational complexity, costData ModelingNormalization (1NF–3NF, BCNF), denormalization, schema evolutionIndexing StrategiesB-tree, hash, composite, covering, partial, full-text, vector indexesQuery OptimizationEXPLAIN plans, N+1 problem, eager vs lazy loading, query hintsACID TransactionsAtomicity, consistency, isolation, durability; distributed transactions (2PC)CAP Applied to DatabasesChoosing CP vs AP databases for specific use casesConsistent HashingVirtual nodes, hot-spot avoidance, rebalancing on node changesSharding & PartitioningHorizontal sharding, range vs hash vs directory-based, cross-shard queriesReplicationMaster-slave, multi-master, read replicas, replication lag, conflict resolutionOLTP vs OLAPTransactional vs analytical workloads, HTAP databasesData WarehousingStar/snowflake schema, columnar storage (Redshift, BigQuery, Snowflake)Data Lake & LakehouseDelta Lake, Apache Iceberg, Apache Hudi; ACID on object storageData Pipelines & ETL/ELTBatch vs streaming, Apache Spark, Flink, dbt, AirflowEvent Streaming as StorageKafka log compaction, event sourcing on Kafka, schema registryDatabase ReliabilityConnection pooling, backups, point-in-time recovery, failoverVector DatabasesEmbeddings, ANN search, HNSW index, pgvector, Pinecone, Weaviate