AWS Data Engineer Associate (DEA-C01) Study Guide 2026
Complete study guide for the AWS Certified Data Engineer Associate (DEA-C01) exam. Covers all five domains, key services (Glue, Redshift, Kinesis, Lake Formation), and a proven study plan to pass on your first attempt.
Posted by
Related reading
How to Pass the AWS Cloud Practitioner Exam in 2026 (CLF-C02)
AWS Certified Cloud Practitioner CLF-C02 study guide: exam domains, best free AWS resources, practice tips, and a 3-week plan for complete beginners.
AWS Cloud Practitioner vs Solutions Architect: Which First?
AWS Cloud Practitioner vs Solutions Architect Associate compared: difficulty gap, prerequisites, time to study, job roles unlocked, and salary data for 2026.
Google Associate Cloud Engineer (ACE) Exam Study Guide 2026
Complete study guide for the Google Associate Cloud Engineer exam: how it differs from PCA, the 5 domains, core GCP services, essential gcloud commands, and top resources.
Key Takeaways
- The DEA-C01 exam has 65 questions and a 130-minute time limit with a 720/1000 passing score.
- Five domains: Data Ingestion and Transformation (34%), Data Store Management (26%), Data Operations and Support (22%), Data Security and Governance (18%).
- AWS Glue, Redshift, Kinesis, Lake Formation, and Athena are the most frequently tested services.
- Data engineering roles on AWS command average salaries of $142,000–$168,000 (LinkedIn Salary, 2025).
- The DEA-C01 replaced the old AWS Big Data Specialty (BDS-C01) as the recommended data certification at associate level.
About the AWS Data Engineer Associate Exam
Launched in 2023, the AWS Certified Data Engineer Associate (DEA-C01) is AWS's newest associate-level certification. It targets data engineers, ETL developers, and analytics professionals who design, build, and maintain data pipelines on AWS. Unlike the former Big Data Specialty, the DEA-C01 is positioned at the associate level — meaning it is more accessible but still comprehensive.
AWS recommends candidates have at least 2 years of experience with AWS data services and a working knowledge of SQL, Python, or Spark. However, many candidates with strong cloud fundamentals and 3–4 months of focused data engineering study pass without meeting the full experience recommendation.
DEA-C01 Exam Details
| Attribute | Detail |
|---|---|
| Exam code | DEA-C01 |
| Questions | 65 (scored) + up to 15 unscored |
| Duration | 130 minutes |
| Passing score | 720 / 1000 |
| Cost | $150 USD |
| Format | Multiple choice, multiple response, ordering, matching |
| Delivery | Pearson VUE (in-person or online) |
| Validity | 3 years |
Domain 1: Data Ingestion and Transformation (34%)
The largest domain covers the core ETL/ELT patterns on AWS. Key services include AWS Glue (serverless ETL, Glue Data Catalog, Glue DataBrew for no-code transformation), Amazon Kinesis (Data Streams for real-time ingestion, Firehose for delivery to S3/Redshift/OpenSearch), and AWS Database Migration Service (DMS) for migrating on-premises databases to AWS.
You must understand when to use batch vs. streaming ingestion, how to partition data in S3 for performance, and how to handle schema evolution in Glue Data Catalog. Questions frequently test the difference between Kinesis Data Streams (you manage shards/consumers) and Kinesis Firehose (fully managed, 60-second delivery window).
Domain 2: Data Store Management (26%)
Covers storage decisions across the AWS ecosystem. Amazon Redshift is the centerpiece — know distribution styles (KEY, EVEN, ALL), sort keys, Redshift Spectrum (querying S3 from Redshift), and Redshift Serverless. Amazon S3 remains the backbone of data lakes — understand storage classes (S3 Intelligent-Tiering, Glacier), S3 lifecycle policies, and data lake organization patterns (raw/processed/curated zones).
Other tested services: DynamoDB (NoSQL, partition key design, Global Tables for multi-region), Aurora (relational, supports PostgreSQL/MySQL), and AWS Lake Formation (centralized permissions for data lake access control, column/row-level security, governed tables).
Stop guessing. Start understanding.
Certify Copilot AI explains any certification practice question in real-time, directly on your screen. Try it free with 10 credits, no card required.
Try Certify Copilot AI FreeDomain 3: Data Operations and Support (22%)
Covers monitoring, orchestration, and cost optimization. AWS Step Functions orchestrates multi-step workflows including Glue jobs, Lambda functions, and EMR clusters. Amazon MWAA (Managed Workflows for Apache Airflow) is tested for complex DAG-based pipelines. CloudWatch is the primary monitoring service — know how to set alarms on Glue job metrics, Redshift query performance, and Kinesis consumer lag.
Cost optimization questions test your knowledge of Redshift reserved nodes vs. on-demand, S3 Intelligent-Tiering breakeven analysis, and EMR Spot Instance strategies. Operational questions often involve troubleshooting — a Glue job that fails because of missing partitions, a Kinesis stream with high GetRecords latency due to too few shards, or a Redshift query with poor performance due to a missing sort key.
Domain 4: Data Security and Governance (18%)
Covers encryption, access control, and compliance. All S3 data should be encrypted at rest — know SSE-S3 (AWS-managed keys), SSE-KMS (customer-managed keys with audit trail via CloudTrail), and SSE-C (customer-provided keys). Lake Formation column/row-level security, AWS Macie for PII detection in S3, and AWS Glue DataBrew's data quality rules are tested.
For governance, understand the difference between AWS Glue Data Catalog (technical metadata) and AWS DataZone (data marketplace with business glossary and data access subscriptions). GDPR/CCPA compliance questions test your knowledge of data masking, tokenization, and deletion workflows.
12-Week DEA-C01 Study Plan
- Weeks 1–2: AWS fundamentals refresher (VPC, IAM, S3 core) + Kinesis deep dive. Build a streaming pipeline in the AWS free tier.
- Weeks 3–4: AWS Glue — ETL scripts, Data Catalog, crawlers, DataBrew. Run at least 5 hands-on Glue jobs.
- Weeks 5–6: Redshift — cluster setup, distribution styles, Spectrum, Serverless. Practice WLM configuration.
- Weeks 7–8: Lake Formation, Athena, EMR. Focus on Lake Formation permissions model — it's heavily tested.
- Weeks 9–10: Step Functions, MWAA, CloudWatch, cost optimization. Build a sample orchestrated pipeline.
- Weeks 11–12: Two to three full practice exams. Review all incorrect answers. Focus on services you scored lowest on in Week 11 during Week 12.
Frequently Asked Questions
Is the DEA-C01 harder than the AWS Solutions Architect Associate?
Most candidates with a data engineering background find them similarly difficult. The SAA-C03 has broader service coverage; the DEA-C01 goes deeper on fewer services. If you have hands-on Glue, Redshift, and Kinesis experience, the DEA-C01 may feel more natural. Without data engineering experience, the SAA-C03 is typically easier to pass first as foundational preparation.
What prerequisite certifications should I have before DEA-C01?
AWS recommends but does not require prior certifications. Most successful DEA-C01 candidates hold the AWS Cloud Practitioner (CCP) or Solutions Architect Associate (SAA-C03) first. The SAA-C03 covers networking, IAM, and S3 fundamentals that appear across DEA-C01 domains, making it the most valuable prerequisite in practice.
What are the best practice test resources for DEA-C01?
Tutorials Dojo by Jon Bonso is the most recommended third-party practice test for DEA-C01 — high question quality, detailed explanations, and regular updates. AWS's official practice exam (available on AWS Training) is shorter but uses the same question format and difficulty level as the real exam. Use both for best results.
Do I need to know Apache Spark for DEA-C01?
Basic familiarity with Spark concepts helps but is not required for most questions. The exam tests AWS-managed Spark via EMR and Glue ETL scripts (which use PySpark under the hood). You should understand RDDs vs. DataFrames conceptually, and know that Glue uses dynamic frames (an extension of Spark DataFrames) — but you will not need to write Spark code on the exam.
How does DEA-C01 compare to the old AWS Big Data Specialty?
The DEA-C01 is the successor to the retired Big Data Specialty (BDS-C01). The DEA-C01 covers newer services like Lake Formation, DataZone, and Redshift Serverless that did not exist when the BDS-C01 was designed. The BDS-C01 placed more emphasis on Elasticsearch/OpenSearch and Kinesis Analytics (now Managed Flink). The DEA-C01 is overall more current and reflects modern AWS data architectures.
Stop guessing. Start understanding.
Certify Copilot AI explains any certification practice question in real-time, directly on your screen. Try it free with 10 credits, no card required.
Try Certify Copilot AI Free