Curated Resources

Library

Useful resources, documentation, books, and tools for data engineers and architects.

📖

Essential Books

  • Designing Data-Intensive Applications - Martin Kleppmann (a must-read for every data engineer)
  • Fundamentals of Data Engineering - Joe Reis, Matt Housley (comprehensive overview of the modern data stack)
  • Database Internals - Alex Petrov (deep dive into storage engines)
  • Streaming Systems - Tyler Akidau et al. (the bible of stream processing)
  • The Data Warehouse Toolkit - Ralph Kimball (a dimensional modeling classic)
📚

Official Documentation

  • PostgreSQL Docs - postgresql.org/docs (widely regarded as some of the best database documentation)
  • Apache Kafka Documentation - kafka.apache.org/documentation
  • Kubernetes Docs - kubernetes.io/docs/home
  • AWS Data Services - docs.aws.amazon.com
  • dbt Documentation - docs.getdbt.com (best practices for SQL transformations)
🎓

Online Courses

  • MIT 6.824 - Distributed Systems (legendary course, materials available for free)
  • DataCamp - Data Engineering track (hands-on practice)
  • Coursera - Big Data Specialization (UC San Diego)
  • A Cloud Guru - AWS/GCP/Azure data certification prep
  • LinkedIn Learning - Apache Spark, Airflow courses
🔗

Useful Blogs

  • Martin Kleppmann's blog - martin.kleppmann.com
  • Confluent Blog - Kafka best practices and stream processing
  • Netflix Tech Blog - netflixtechblog.com (scale stories)
  • Uber Engineering - eng.uber.com (real-world data challenges)
  • High Scalability - highscalability.com (architecture deep dives)
🛠️

Development Tools

  • DBeaver - Universal database client (free, cross-platform)
  • DataGrip - JetBrains SQL IDE (paid, excellent for productivity)
  • Postman - API testing (essential for REST APIs)
  • k9s - Terminal UI for Kubernetes management
  • Lens - Kubernetes IDE (GUI for k8s clusters)
📄

Research Papers

  • MapReduce - Google (2004) - foundation of big data processing
  • Bigtable - Google (2006) - wide-column store design
  • Dynamo - Amazon (2007) - eventually consistent key-value store
  • Spanner - Google (2012) - globally distributed database
  • Delta Lake - Databricks (2020) - ACID transactions on data lakes
🎙️

Podcasts & Newsletters

  • Data Engineering Podcast - dataengineeringpodcast.com
  • Software Engineering Daily - frequently covers data topics
  • Data Engineering Weekly - newsletter with curated articles
  • Postgres Weekly - newsletter for the PostgreSQL community
  • DB Weekly - broad database newsletter
🏆

Certifications

  • AWS Certified Data Analytics - Specialty certification
  • Google Professional Data Engineer - GCP certification
  • Databricks Certified Data Engineer - Spark expertise
  • Confluent Certified Developer - Kafka mastery
  • Snowflake SnowPro Core - Cloud data warehouse
👥

Communities

  • dbt Community Slack - getdbt.com/community
  • Data Engineering Subreddit - r/dataengineering
  • Kafka Users Slack - confluent.io/community
  • DataTalks.Club - online meetups and courses
  • Local Meetups - meetup.com (Data Engineering groups)