- Simply cleared the AWS Licensed Information Engineer – Affiliate DEA-C01 examination with a rating of 930/1000.
- AWS Licensed Information Engineer – Affiliate DEA-C01 examination is the most recent AWS examination launched on twelfth March 2024.
AWS Licensed Information Engineer – Affiliate DEA-C01 Examination Content material
Refer AWS Licensed Information Engineer – Affiliate DEA-C01 Examination Information
AWS Licensed Information Engineer – Affiliate DEA-C01 Examination Abstract
- DEA-C01 examination consists of 65 questions in 130 minutes, and the time is greater than ample in case you are well-prepared.
- DEA-C01 examination contains two varieties of questions, multiple-choice and multiple-response.
- DEA-C01 has a scaled rating between 100 and 1,000. The scaled rating wanted to cross the examination is 720.
- Affiliate exams at the moment price $ 150 + tax.
- You will get an extra half-hour if English is your second language by requesting Examination Lodging. It may not be wanted for Affiliate exams however is useful for Skilled and Specialty ones.
- AWS exams may be taken both remotely or on-line, I choose to take them on-line because it supplies numerous flexibility. Simply be sure you have a correct place to take the examination with no disturbance and nothing round you.
- Additionally, in case you are taking the AWS On-line examination for the primary time attempt to be a part of a minimum of half-hour earlier than the precise time as I’ve had points with each PSI and Pearson with lengthy wait instances.
AWS Licensed Information Engineer – Affiliate DEA-C01 Examination Assets
- On-line Programs
- Observe checks
- Signed up with AWS for the Free Tier account which supplies numerous Providers to be tried at no cost with sure limits that are greater than sufficient to get issues going. Remember to decommission companies past the free limits, stopping any surprises 🙂
- Learn the FAQs a minimum of for the vital subjects, as they cowl vital factors and are good for fast evaluate
AWS Licensed Information Engineer – Affiliate DEA-C01 Examination Matters
- DEA-C01 Examination covers the information engineering points when it comes to information ingestion, transformation, orchestration, designing information fashions, managing information life cycles, and guaranteeing information high quality.
Analytics
- Guarantee you already know and canopy all of the companies in-depth, as 80% of the examination focuses on subjects like Glue, Athena, Kinesis, and Redshift.
- AWS Analytics Providers Cheat Sheet
- Glue
- DEA-C01 covers Glue in nice element.
- AWS Glue is a completely managed, ETL service that automates the time-consuming steps of information preparation for analytics.
- helps server-side encryption for information at relaxation and SSL for information in movement.
- Glue ETL engine to Extract, Remodel, and Load information that may robotically generate Scala or Python code.
- Glue Information Catalog is a central repository and chronic metadata retailer to retailer structural and operational metadata for all the information property. It really works with Apache Hive as its metastore.
- Glue Crawlers scan varied information shops to robotically infer schemas and partition constructions to populate the Information Catalog with corresponding desk definitions and statistics.
- Glue Job Bookmark tracks information that has already been processed throughout a earlier run of an ETL job by persisting state data from the job run.
- Glue Streaming ETL permits performing ETL operations on streaming information utilizing repeatedly operating jobs.
- Glue supplies a versatile scheduler that handles dependency decision, job monitoring, and retries.
- Glue Studio provides a graphical interface for authoring AWS Glue jobs to course of information permitting you to outline the stream of the information sources, transformations, and targets within the visible interface and producing Apache Spark code in your behalf.
- Glue Information High quality helps scale back guide information high quality efforts by robotically measuring and monitoring the standard of information in information lakes and pipelines.
- Glue DataBrew helps put together, visualize, clear, and normalize information instantly from the information lake, information warehouses, and databases, together with S3, Redshift, Aurora, and RDS.
- Glue Flex execution choice helps to scale back the prices of pre-production, take a look at, and non-urgent information integration workloads by as much as 34% and is right for buyer workloads that don’t require quick jobs begin instances.
- Glue
FindMatches
remodel helps determine duplicate or matching data within the dataset, even when the data shouldn’t have a typical distinctive identifier and no fields match precisely.
- Kinesis
- Perceive Kinesis Information Streams and Kinesis Information Firehose in-depth.
- Know Kinesis Information Streams vs Kinesis Firehose
- Know Kinesis Information Streams is open-ended for each producer and client. It helps KCL and works with Spark.
- Know Kinesis Firehose is open-ended for producers solely. Information is saved in S3, Redshift, and OpenSearch.
- Kinesis Firehose works in batches with minimal 60secs intervals and in near-real time.
- Kinesis Firehose helps out-of-the-box transformation and customized transformation utilizing Lambda
- Kinesis helps encryption at relaxation utilizing server-side encryption
- Kinesis helps Interface VPC endpoint to maintain site visitors between the VPC and Kinesis Information Streams from leaving the Amazon community and doesn’t require an web gateway, NAT gadget, VPN connection, or Direct Join connection.
- Kinesis Producer Library helps batching
- Kinesis Information Analytics OR Managed Service for Apache Flink
- helps remodel and analyze streaming information in actual time utilizing Apache Flink.
- helps anomaly detection utilizing Random Minimize Forest ML
- helps reference information saved in S3.
- Redshift
- Redshift can also be coated in depth.
- Redshift Superior embrace
- Redshift Distribution Fashion determines how information is distributed throughout compute nodes and helps decrease the affect of the redistribution step by finding the information the place it must be earlier than the question is executed.
- Redshift Enhanced VPC routing forces all COPY and UNLOAD site visitors between the cluster and the information repositories by way of the VPC.
- Workload administration (WLM) permits customers to flexibly handle priorities inside workloads in order that brief, fast-running queries gained’t get caught in queues behind long-running queries.
- Redshift Spectrum
- helps question structured and semistructured information from information in S3 with out having to load the information into Redshift tables.
- can not entry information from Glacier.
- Federated Question characteristic permits querying and analyzing information throughout operational databases, information warehouses, and information lakes.
- Brief question acceleration (SQA) prioritizes chosen short-running queries forward of longer-running queries.
- Concurrency Scaling helps help hundreds of concurrent customers and concurrent queries, with constantly quick question efficiency.
- Redshift Serverless is a serverless choice of Redshift that makes it extra environment friendly to run and scale analytics in seconds with out the necessity to arrange and handle information warehouse infrastructure.
- Streaming ingestion supplies low-latency, high-speed ingestion of stream information from Kinesis Information Streams and Managed Streaming for Apache Kafka right into a Redshift provisioned or Redshift Serverless materialized view.
- Redshift information sharing can securely share entry to dwell information throughout Redshift clusters, workgroups, AWS accounts, and AWS Areas with out manually shifting or copying the information.
- Redshift Information API supplies a safe HTTP endpoint and integration with AWS SDKs to assist entry Redshift information with net companies–primarily based functions, together with AWS Lambda, SageMaker notebooks, and AWS Cloud9.
- Redshift Finest Practices w.r.t number of Distribution fashion, Kind key, importing/exporting information
- COPY command which permits parallelism, and performs higher than a number of COPY instructions
- COPY command can use manifest information to load information
- COPY command handles encrypted information
- COPY command which permits parallelism, and performs higher than a number of COPY instructions
- Redshift Resizing cluster choices (elastic resize didn’t help node kind adjustments earlier than, however does now)
- Redshift helps encryption at relaxation and in transit
- Redshift helps encrypting an unencrypted cluster utilizing KMS. Nevertheless, you’ll be able to’t allow {hardware} safety module (HSM) encryption by modifying the cluster. As a substitute, create a brand new, HSM-encrypted cluster and migrate your information to the brand new cluster.
- Know Redshift views to manage entry to information.
- Athena
- is a serverless, interactive analytics service constructed on open-source frameworks, supporting open-table and file codecs.
- supplies a simplified, versatile approach to analyze information in an S3 information lake and 30 information sources, together with on-premises information sources or different cloud programs utilizing SQL or Python with out loading the information.
- integrates with QuickSight for visualizing the information or creating dashboards.
- makes use of a managed Glue Information Catalog to retailer data and schemas concerning the databases and tables for the information saved in S3.
- Workgroups can be utilized to separate customers, groups, functions, or workloads, to set limits on the quantity of information every question or your entire workgroup can course of, and to trace prices.
- Athena finest practices
- Information partitioning,
- Partition projection, and
- Columnar file codecs like ORC or Parquet as they help compression and are splittable.
- Elastic Map Scale back
- Perceive EMRFS
- Use Constant view to ensure S3 objects referred by totally different functions are in sync. Though, it’s not wanted now.
- Know EMR Finest Practices (trace: begin with many small nodes as an alternative of few massive nodes)
- Know EMR Encryption choices
- helps SSE-S3, SS3-KMS, CSE-KMS, and CSE-Customized encryption for EMRFS
- helps LUKS encryption for native disks
- helps TLS for information in transit encryption
- helps EBS encryption
- Hive metastore may be externally hosted utilizing RDS, Aurora, and AWS Glue Information Catalog
- Perceive EMRFS
- OpenSearch
- OpenSearch is a search service that helps indexing, full-text search, faceting, and so on.
- OpenSearch can be utilized for evaluation and helps visualization utilizing OpenSearch Dashboards which may be real-time.
- OpenSearch Service Storage tiers help Scorching, UltraWarm, and Chilly and the information may be transitioned utilizing Index State administration.
- QuickSight
- Know Supported Information Sources
- QuickSight supplies IP addresses that must be whitelisted for QuickSight to entry the information retailer.
- QuickSight supplies direct integration with Microsoft AD
- QuickSight helps row-level safety utilizing dataset guidelines to manage entry to information at row granularity primarily based on permissions related to the person interacting with the information.
- QuickSight helps ML insights as effectively
- QuickSight helps customers outlined through IAM or e-mail signup.
- AWS Lake Formation
- is an built-in information lake service that helps to find, ingest, clear, catalog, remodel, and safe information and make it accessible for evaluation.
- robotically manages entry to the registered information in S3 by way of companies together with AWS Glue, Athena, Redshift, QuickSight, and EMR
- supplies central entry management for the information, together with table-and-column-level entry controls, and encryption for information at relaxation.
- Easy Storage Service – S3 as a storage service
- Information Pipeline for information switch helps automate and schedule common information motion and information processing actions in AWS.
- Step Features assist construct distributed functions, automate processes, orchestrate microservices, and create information and ML pipelines.
- AppFlow is a completely managed integration service to securely trade information between software-as-a-service (SaaS) functions, resembling Salesforce, and AWS companies, resembling Easy Storage Service (S3) and Redshift.
Safety, Identification & Compliance
Administration & Governance Instruments
- Perceive AWS CloudWatch for Logs and Metrics.
- CloudWatch Logs Subscription Filters can be utilized to route information to Kinesis Information Streams, Kinesis Information Firehose, and Lambda.
On the Examination Day
- Ensure you are relaxed and get some good evening’s sleep. The examination shouldn’t be robust in case you are well-prepared.
- In case you are taking the AWS On-line examination
- Attempt to be a part of a minimum of half-hour earlier than the precise time as I’ve had points with each PSI and Pearson with lengthy wait instances.
- The net verification course of does take a while and normally, there are glitches.
- Bear in mind, you wouldn’t be allowed to take the take in case you are late by greater than half-hour.
- Ensure you have your desk clear, no hand-watches, or exterior screens, hold your telephones away, and no one can enter the room.
Lastly, All of the Finest 🙂