Data Orbit Lab
Blog
Nástroje
Analýzy
Případové studie
Benchmarky
Mindset
Knihovna
O nás
Kontakt
Kurátorované zdroje
Knihovna
Užitečné zdroje, dokumentace, knihy a nástroje pro datové inženýry a architekty.
📖
Základní knihy
Designing Data-Intensive Applications
- Martin Kleppmann (must-read pro každého data engineera)
Fundamentals of Data Engineering
- Joe Reis, Matt Housley (komplexní overview moderního data stacku)
Database Internals
- Alex Petrov (deep dive do storage engines)
Streaming Systems
- Tyler Akidau et al. (Bible stream processingu)
The Data Warehouse Toolkit
- Ralph Kimball (dimensional modeling classic)
📚
Oficiální dokumentace
PostgreSQL Docs
- postgresql.org/docs (nejlepší databázová dokumentace)
Apache Kafka Documentation
- kafka.apache.org/documentation
Kubernetes Docs
- kubernetes.io/docs/home
AWS Data Services
- docs.aws.amazon.com
dbt Documentation
- docs.getdbt.com (SQL transformace best practices)
🎓
Online kurzy
MIT 6.824
- Distributed Systems (legendární kurz, materials zdarma)
DataCamp
- Data Engineering track (hands-on practice)
Coursera
- Big Data Specialization (University of California)
A Cloud Guru
- AWS/GCP/Azure data certifications prep
LinkedIn Learning
- Apache Spark, Airflow courses
🔗
Užitečné blogy
Martin Kleppmann's blog
- martin.kleppmann.com
Confluent Blog
- Kafka best practices a stream processing
Netflix Tech Blog
- netflixtechblog.com (scale stories)
Uber Engineering
- eng.uber.com (real-world data challenges)
High Scalability
- highscalability.com (architecture deep dives)
🛠️
Development Tools
DBeaver
- Universal database client (free, cross-platform)
DataGrip
- JetBrains SQL IDE (paid, výborný for productivity)
Postman
- API testing (essential for REST APIs)
k9s
- Terminal UI pro Kubernetes management
Lens
- Kubernetes IDE (GUI for k8s clusters)
📄
Research Papers
MapReduce
- Google (2004) - foundation of big data processing
Bigtable
- Google (2006) - wide-column store design
Dynamo
- Amazon (2007) - eventually consistent key-value store
Spanner
- Google (2012) - globally distributed database
Delta Lake
- Databricks - ACID transactions on data lakes
🎙️
Podcasts & Newslettery
Data Engineering Podcast
- dataengineeringpodcast.com
Software Engineering Daily
- častá data témata
Data Engineering Weekly
- newsletter s kurátorovanými články
Postgres Weekly
- newsletter pro PostgreSQL komunitu
DB Weekly
- široký database newsletter
🏆
Certifikace
AWS Certified Data Analytics
- Specialty certification
Google Professional Data Engineer
- GCP certification
Databricks Certified Data Engineer
- Spark expertise
Confluent Certified Developer
- Kafka mastery
Snowflake SnowPro Core
- Cloud data warehouse
👥
Komunity
dbt Community Slack
- getdbt.com/community
Data Engineering Subreddit
- r/dataengineering
Kafka Users Slack
- confluent.io/community
DataTalks.Club
- online meetups a kurzy
Local Meetups
- meetup.com (Data Engineering groups)