AWS DBS真题 No.1-100

0%
0 投票, 0 平均值
0

Report a question

You cannot submit an empty report. Please add some details.

SOA

AWS Certified SysOps Administrator – Associate

AWS DBS真题 No.1-100

中英双语,人工翻译,带完整解析 AWS DBS真题 No.1-100

1 / 100

分类: DBS

1. 1. A data engineer is about to perform a major upgrade to the DDL contained within an Amazon Redshift cluster to support a new data warehouse application. The upgrade scripts will include user permission updates, view and table structure changes as well as additional loading and data manipulation tasks. The data engineer must be able to restore the database to its existing state in the event of issues. Which action should be taken prior to performing this upgrade task?

一名数据工程师即将对Amazon Redshift集群中的DDL进行重大升级,以支持一个新的数据仓库应用程序。升级脚本将包括用户权限更新、视图和表结构更改,以及额外的加载和数据操作任务。数据工程师必须能够在出现问题时将数据库恢复到现有状态。应该在执行此升级任务之前采取什么措施?

2 / 100

分类: DBS

2. 2. The department of transportation for a major metropolitan area has placed sensors on roads at key locations around the city. The goal is to analyze the flow of traffic and notifications from emergency services to identify potential issues and to help planners correct trouble spots. A data engineer needs a scalable and fault-tolerant solution that allows planners to respond to issues within 30 seconds of their occurrence. Which solution should the data engineer choose?

2. 一个主要大都市区的交通部门在城市周围的关键位置的道路上安装了传感器。目标是分析交通流量和紧急服务的通知,以识别潜在问题,并帮助规划者修正问题点。数据工程师需要一个可扩展且容错的解决方案,使规划者能够在问题发生后的30秒内做出响应。数据工程师应该选择哪个解决方案?

3 / 100

分类: DBS

3. 3. An Amazon Redshift Database is encrypted using KMS. A data engineer needs to use the AWS CLI to create a KMS encrypted snapshot of the database in another AWS region. Which three steps should the data engineer take to accomplish this task? (Choose three.)

3. 一个 Amazon Redshift 数据库使用 KMS 进行加密。数据工程师需要使用 AWS CLI 在另一个 AWS 区域创建数据库的 KMS 加密快照。数据工程师应该采取哪三个步骤来完成此任务?(选择三项。)

4 / 100

分类: DBS

4. 4. You have two different groups using Redshift to analyze data of a petabyte-scale data warehouse. Each query issued by the first group takes approximately 1-2 hours to analyze the data while the second group’s queries only take between 5-10 minutes to analyze data. You don’t want the second group’s queries to wait until the first group’s queries are finished. You need to design a solution so that this does not happen. Which of the following would be the best and cheapest solution to deploy to solve this dilemma?
4. 你有两个不同的团队使用Redshift分析一个PB级数据仓库的数据。第一个团队发出的每个查询大约需要1到2个小时来分析数据,而第二个团队的查询只需要5到10分钟就能分析数据。你不希望第二个团队的查询在第一个团队的查询完成之前等待。你需要设计一个解决方案,确保这种情况不会发生。以下哪种解决方案是最合适且最经济的?

5 / 100

分类: DBS

5. 5. A telecommunications company needs to predict customer churn (i.e., customers who decide to switch to a competitor). The company has historic records of each customer, including monthly consumption patterns, calls to customer service, and whether the customer ultimately quit the service. All of this data is stored in Amazon S3. The company needs to know which customers are likely going to churn soon so that they can win back their loyalty. What is the optimal approach to meet these requirements?

5. 一家电信公司需要预测客户流失(即决定转向竞争对手的客户)。该公司拥有每个客户的历史记录,包括每月消费模式、客服电话记录以及客户是否最终停止使用该服务。所有这些数据都存储在Amazon S3中。公司需要知道哪些客户可能很快会流失,以便能够挽回他们的忠诚度。为了满足这些需求,最佳的做法是什么?

6 / 100

分类: DBS

6. 6. Your social media marketing application has a component written in Ruby running on AWS Elastic Beanstalk. This application component posts messages to social media sites in support of various marketing campaigns. Your management now requires you to record replies to these social media messages to analyze the effectiveness of the marketing campaign in comparison to past and future efforts. You’ve already developed a new application component to interface with the social media site APIs in order to read the replies. Which process should you use to record the social media replies in a durable data store that can be accessed at any time for analytics of historical data?
6. 你们的社交媒体营销应用程序包含一个用 Ruby 编写的组件,该组件在 AWS Elastic Beanstalk 上运行。这个应用程序组件向社交媒体网站发布消息,以支持各种营销活动。现在,管理层要求你记录这些社交媒体消息的回复,以便分析营销活动的效果,并与过去和未来的努力进行对比。你已经开发了一个新的应用组件,用于与社交媒体网站的 API 接口,以读取回复。那么,应该使用哪种流程将社交媒体的回复记录在一个耐用的数据存储中,以便随时进行历史数据的分析?

7 / 100

分类: DBS

7. 7. An organization needs to design and deploy a large-scale data storage solution that will be highly durable and highly flexible with respect to the type and structure of data being stored. The data to be stored will be sent or generated from a variety of sources and must be persistently available for access and processing by multiple applications. What is the most cost-effective technique to meet these requirements?

7. 一个组织需要设计并部署一个大规模的数据存储解决方案,该解决方案在数据存储类型和结构方面具有高度的耐用性和灵活性。要存储的数据将来自各种来源,并且必须始终可用,以便多个应用程序能够访问和处理这些数据。满足这些要求的最具成本效益的技术是什么?

8 / 100

分类: DBS

8. 8. You have been asked to handle a large data migration from multiple Amazon RDS MySQL instances to a DynamoDB table. You have been given a short amount of time to complete the data migration. What will allow you to complete this complex data processing workflow?

8. 您被要求处理从多个 Amazon RDS MySQL 实例到 DynamoDB 表的大规模数据迁移。您被给定了很短的时间来完成数据迁移。什么方法将帮助您完成这个复杂的数据处理工作流?

9 / 100

分类: DBS

9. 9. A retailer exports data daily from its transactional databases into an S3 bucket in the Sydney region. The retailer’s Data Warehousing team wants to import this data into an existing Amazon Redshift cluster in their VPC at Sydney. Corporate security policy mandates that data can only be transported within a VPC. What combination of the following steps will satisfy the security policy? Choose 2 answers

9. 一家零售商每天将数据从其事务数据库导出到位于悉尼地区的S3存储桶中。该零售商的数据仓库团队希望将这些数据导入到他们悉尼VPC中现有的Amazon Redshift集群中。公司安全政策要求数据只能在VPC内部传输。以下哪些步骤的组合能够满足安全政策?请选择两个答案。

10 / 100

分类: DBS

10. 10. You have an application that is currently in the development stage but is expected to write 2,400 items per minute to a DynamoDB table, each 2Kb in size or less and then fluctuate to 4,800 writes of items (of the same size) per minute on weekends. There may be other fluctuations within that range in the future as the application develops. It is important to the success of the application that the vast majority of user requests are met in a cost-effective way. How should this table be created?

10. 你有一个应用程序,目前处于开发阶段,但预计每分钟向DynamoDB表写入2,400个项目,每个项目的大小为2Kb或更小,然后在周末波动到每分钟4,800个项目的写入(大小相同)。随着应用程序的发展,未来可能会在该范围内出现其他波动。为了确保应用程序的成功,大多数用户请求需要以具有成本效益的方式得到满足。这个表应该如何创建?

11 / 100

分类: DBS

11. 11. Your company recently purchased five different companies that run different backend databases that include Redshift, MySQL, Hive on EMR and PostgreSQL. You need a single tool that can run queries on all the different platform for your daily ad-hoc analysis. Which tool enables you to do that?
11. 你们公司最近购买了五家不同的公司,这些公司运行着不同的后台数据库,包括Redshift、MySQL、Hive on EMR和PostgreSQL。你需要一个可以在所有这些不同平台上运行查询的工具,用于每天的临时分析。哪个工具可以帮助你实现这一点?

12 / 100

分类: DBS

12. 12. A company has lot of web applications, databases and data warehouse built on Teradata, NoSQL databases, and other types of data stores. They have lot of data assets in terms of logs, documents; excel files, CSV files, PDF documents and others. Web Application has different user workloads at different parts of the day. They are running one of their web application Node.js supported by MongoDB Database. The schema designed is document based. The team wants to migrate the platform on to AWS. Which NoSQL Managed service provides the document management capability?

12. 一家公司拥有大量基于Teradata、NoSQL数据库和其他类型数据存储构建的Web应用程序、数据库和数据仓库。它们拥有大量的数据资产,包括日志、文档、Excel文件、CSV文件、PDF文档等。Web应用程序在一天的不同时间段有不同的用户工作负载。它们正在运行其中一个基于Node.js的Web应用程序,支持MongoDB数据库。所设计的架构是基于文档的。团队希望将平台迁移到AWS。那么,哪个NoSQL托管服务提供文档管理功能?

13 / 100

分类: DBS

13. 13. An International company has deployed a multi-tier web application that relies on DynamoDB in a single region. For regulatory reasons they need disaster recovery capability in a separate region with a Recovery Time Objective of 2 hours and a Recovery Point Objective of 24 hours. They should synchronize their data on a regular basis and be able to provision the web application rapidly using CloudFormation. The objective is to minimize changes to the existing web application, control the throughput of DynamoDB used for the synchronization of data and synchronize only the modified elements. Which design would you choose to meet these requirements?

13. 一家国际公司在单一区域部署了一个多层次的网页应用程序,该应用程序依赖于DynamoDB。由于法规要求,他们需要在另一个区域具备灾难恢复能力,恢复时间目标为2小时,恢复点目标为24小时。他们应该定期同步数据,并能够使用CloudFormation快速配置网页应用程序。目标是尽量减少对现有网页应用程序的更改,控制用于数据同步的DynamoDB的吞吐量,并仅同步已修改的元素。为了满足这些要求,您会选择哪种设计方案?

14 / 100

分类: DBS

14. 14. You work for a start-up that tracks commercial delivery trucks via GPS. You receive coordinates that are transmitted from each delivery truck once every 6 seconds. You need to process these coordinates in real-time from multiple sources and load them into Elasticsearch without significant technical overhead to maintain. Which tool should you use to digest the data?

14. 你在一家跟踪商业配送卡车的初创公司工作,通过GPS跟踪卡车的位置。你每6秒钟接收到一次从每辆配送卡车传输的坐标数据。你需要实时处理来自多个来源的这些坐标,并将它们加载到Elasticsearch中,同时不需要太大的技术维护负担。你应该使用哪个工具来处理这些数据?

15 / 100

分类: DBS

15. 15. A solutions architect works for a company that has a data lake based on a central Amazon S3 bucket. The data contains sensitive information. The architect must be able to specify exactly which files each user can access. Users access the platform through a SAML federation Single Sign On platform. The architect needs to build a solution that allows fine grained access control, traceability of access to the objects, and usage of the standard tools (AWS Console, AWS CLI) to access the data. Which solution should the architect build?
15. 一名解决方案架构师为一家基于中央 Amazon S3 存储桶的数据湖公司工作。该数据包含敏感信息。架构师必须能够精确指定每个用户可以访问哪些文件。用户通过 SAML 联邦单点登录平台访问该平台。架构师需要构建一个解决方案,允许细粒度的访问控制、访问对象的可追溯性,并使用标准工具(AWS 控制台、AWS CLI)来访问数据。架构师应构建哪种解决方案?

16 / 100

分类: DBS

16. 16. A mobile application collects data that must be stored in multiple Availability Zones within five minutes of being captured in the app. What architecture securely meets these requirements?

16. 一个移动应用收集的数据必须在五分钟内存储在多个可用区中。什么架构能安全地满足这些要求?

17 / 100

分类: DBS

17. 17. You are using QuickSight to identify demand trends over multiple months for your top five product lines. Which type of visualization do you choose?

17. 你正在使用QuickSight来识别你前五大产品线在多个月份的需求趋势。你选择哪种类型的可视化?

18 / 100

分类: DBS

18. 18. A company is storing data on Amazon Simple Storage Service (S3). The company’s security policy mandates that data be encrypted at rest. Which of the following methods can achieve this? Choose 3 answers

18. 一家公司将数据存储在Amazon简单存储服务(S3)上。该公司的安全政策要求数据在静态时进行加密。以下哪种方法可以实现这一点?请选择3个答案。

19 / 100

分类: DBS

19. 19. A company that provides economics data dashboards needs to be able to develop software to display rich, interactive, data-driven graphics that run in web browsers and leverages the full stack of web standards (HTML, SVG, and CSS). Which technology provides the most appropriate support for this requirements?

19. 一家公司提供经济数据仪表盘,需要能够开发软件来显示丰富的、互动的、数据驱动的图形,这些图形运行在网页浏览器中,并利用完整的网页标准栈(HTML、SVG 和 CSS)。哪种技术最能满足这些需求?

20 / 100

分类: DBS

20. 20. An enterprise customer is migrating to Redshift and is considering using dense storage nodes in its Redshift cluster. The customer wants to migrate 50 TB of data. The customer’s query patterns involve performing many joins with thousands of rows. The customer needs to know how many nodes are needed in its target Redshift cluster. The customer has a limited budget and needs to avoid performing tests unless absolutely needed. Which approach should this customer use?

20. 一家企业客户正在迁移到Redshift,并考虑在其Redshift集群中使用密集存储节点。客户希望迁移50 TB的数据。客户的查询模式涉及执行许多带有数千行的连接操作。客户需要知道在目标Redshift集群中需要多少节点。客户的预算有限,除非绝对必要,否则需要避免进行测试。该客户应该使用哪种方法?

21 / 100

分类: DBS

21. 21. ABCD has developed a sensor intended to be placed inside of people’s shoes, monitoring the number of steps taken every day. ABCD is expecting thousands of sensors reporting in every minute and hopes to scale to millions by the end of the year. A requirement for the project is it needs to be able to accept the data, run it through ETL to store in warehouse and archive it on Amazon Glacier, with room for a real-time dashboard for the sensor data to be added at a later date. What is the best method for architecting this application given the requirements? Choose the correct answer:

21. ABCD已经开发了一种传感器,旨在放置在人的鞋子内部,监控每天走的步数。ABCD预计每分钟会有成千上万的传感器报告,并希望到年底能够扩展到数百万个传感器。该项目的要求是,它需要能够接受数据,经过ETL处理后存储到数据仓库,并将其归档到Amazon Glacier中,同时为以后添加传感器数据的实时仪表盘留出空间。根据这些要求,架构此应用程序的最佳方法是什么?请选择正确答案:

22 / 100

分类: DBS

22. 22. You need to visualize data from Spark and Hive running on an EMR cluster. Which of the options is best for an interactive and collaborative notebook for data exploration?

22. 您需要可视化来自运行在EMR集群上的Spark和Hive的数据。以下哪个选项最适合用于数据探索的交互式和协作性笔记本?

23 / 100

分类: DBS

23. 23. Your company needs to design a data warehouse for a client in the retail industry. The data warehouse will store historic purchases in Amazon Redshift. To comply with PCI:DSS requirements and meet data protection standards, the data must be encrypted at rest and have keys managed by a corporate on-premises HSM. How can you meet these requirements in a cost-effective manner?

23. 你们公司需要为零售行业的客户设计一个数据仓库。该数据仓库将把历史购买数据存储在 Amazon Redshift 中。为了遵守 PCI:DSS 要求并满足数据保护标准,数据必须在静态时加密,并且密钥由公司内部的 HSM 管理。你如何以成本效益的方式满足这些要求?

24 / 100

分类: DBS

24. 24. A company wants to use Redshift cluster for petabyte-scale data warehousing. Data for processing would be stored on Amazon S3. As a security requirement, the company wants the data to be encrypted at rest. As a solution architect how would you implement the solution?

24. 一家公司希望使用Redshift集群进行PB级数据仓库。处理的数据将存储在Amazon S3上。作为安全要求,公司希望数据在静态时进行加密。作为解决方案架构师,您将如何实现这一解决方案?

25 / 100

分类: DBS

25. 25. An organization needs a data store to handle the following data types and access patterns: Key-value access pattern Complex SQL queries and transactions Consistent reads Fixed schema Which data store should the organization choose?

25. 一个组织需要一个数据存储来处理以下数据类型和访问模式:

  • 键值访问模式
  • 复杂的SQL查询和事务
  • 一致性读取
  • 固定的架构

该组织应该选择哪种数据存储?

26 / 100

分类: DBS

26. 26. A video-sharing mobile application uploads files greater than 10 GB to an Amazon S3 bucket. However, when using the application in locations far away from the S3 bucket region, uploads take extended periods of time, and sometimes fail to complete. Which combination of methods would improve the performance of uploading to the application? (Select TWO.)

26. 一个视频分享移动应用将大于10 GB的文件上传到Amazon S3存储桶。然而,当在距离S3存储桶区域较远的位置使用该应用时,上传需要较长时间,有时甚至无法完成。以下哪种方法的组合可以提高上传性能?(选择两项。)

27 / 100

分类: DBS

27. 27. A company is collected real time senstive data using Amazon Kinesis. As a security requirement, the Amazon Kinesis stream needs to be encrypted. Which approach should be used to accomplish this task?

27. 一家公司正在使用 Amazon Kinesis 收集实时敏感数据。作为安全要求,Amazon Kinesis 流需要进行加密。应该使用哪种方法来完成这个任务?

28 / 100

分类: DBS

28. 28. A customer has a machine learning workflow that consists of multiple quick cycles of reads-writes-reads on Amazon S3. The customer needs to run the workflow on EMR but is concerned that the reads in subsequent cycles will miss new data critical to the machine learning from the prior cycles. How should the customer accomplish this?

28. 一位客户有一个机器学习工作流,该工作流由多个快速的读写读取循环组成,使用的是 Amazon S3。客户需要在 EMR 上运行该工作流,但担心在后续循环中的读取操作会错过来自前一个循环的对机器学习至关重要的新数据。客户应该如何实现这一目标?

29 / 100

分类: DBS

29. 29. Managers in a company need access to the human resources database that runs on Amazon Redshift, to run reports about their employees. Managers must only see information about their direct reports. Which technique should be used to address this requirement with Amazon Redshift?

29. 公司中的经理需要访问运行在 Amazon Redshift 上的人力资源数据库,以便生成有关员工的报告。

经理只能查看有关直接下属的信息。应使用哪种技术来满足在 Amazon Redshift 中的此要求?

30 / 100

分类: DBS

30. 30. A company with a support organization needs support engineers to be able to search historic cases to provide fast responses on new issues raised. The company has forwarded all support messages into an Amazon Kinesis Stream. This meets a company objective of using only managed services to reduce operational overhead. The company needs an appropriate architecture that allows support engineers to search on historic cases and find similar issues and their associated responses. Which AWS Lambda action is most appropriate?

30. 一家拥有支持组织的公司需要支持工程师能够搜索历史案例,以便对新提出的问题提供快速响应。该公司已将所有支持消息转发到 Amazon Kinesis Stream 中。这符合公司仅使用托管服务来减少操作开销的目标。公司需要一个合适的架构,允许支持工程师搜索历史案例并找到类似的问题及其相关响应。哪种 AWS Lambda 操作最为合适?

31 / 100

分类: DBS

31. 31. An online retailer is using Amazon DynamoDB to store data related to customer transactions. The items in the table contains several string attributes describing the transaction as well as a JSON attribute containing the shopping cart and other details corresponding to the transaction. Average item size is – 250KB, most of which is associated with the JSON attribute. The average customer generates – 3GB of data per month. Customers access the table to display their transaction history and review transaction details as needed. Ninety percent of the queries against the table are executed when building the transaction history view, with the other 10% retrieving transaction details. The table is partitioned on CustomerID and sorted on transaction date. The client has very high read capacity provisioned for the table and experiences very even utilization, but complains about the cost of Amazon DynamoDB compared to other NoSQL solutions. Which strategy will reduce the cost associated with the client’s read queries while not degrading quality?

31. 一家在线零售商正在使用 Amazon DynamoDB 存储与客户交易相关的数据。表中的项目包含多个字符串属性,用于描述交易,以及一个 JSON 属性,包含购物车和其他与交易相关的详细信息。每个项目的平均大小为 250KB,其中大部分与 JSON 属性相关。每个客户每月生成大约 3GB 的数据。客户访问该表以显示他们的交易历史记录,并根据需要查看交易详细信息。针对该表的 90% 查询是在构建交易历史记录视图时执行的,其余的 10% 用于检索交易详细信息。该表按 CustomerID 分区,并按交易日期排序。客户端为该表预配置了非常高的读取容量,并且利用率非常均匀,但抱怨与其他 NoSQL 解决方案相比,Amazon DynamoDB 的成本过高。哪种策略可以在不降低质量的情况下减少与客户端读取查询相关的成本?

32 / 100

分类: DBS

32. 32. Your client needs to load a 600 GB file into a Redshift cluster from S3, using the Redshift COPY command. The file has several known (and potentially some unknown) issues that will probably cause the load process to fail. How should the client most efficiently detect load errors without needing to perform cleanup if the load process fails?
32. 您的客户需要使用 Redshift COPY 命令将一个 600 GB 的文件从 S3 加载到 Redshift 集群中。该文件有几个已知的问题(可能还包括一些未知的问题),这些问题很可能导致加载过程失败。客户应该如何最有效地检测加载错误,而无需在加载过程失败时执行清理操作?

33 / 100

分类: DBS

33. 33. A company that manufactures and sells smart air conditioning units also offers add-on services so that customers can see real-time dashboards in a mobile application or a web browser. Each unit sends its sensor information in JSON format every two seconds for processing and analysis. The company also needs to consume this data to predict possible equipment problems before they occur. A few thousand pre-purchased units will be delivered in the next couple of months. The company expects high market growth in the next year and needs to handle a massive amount of data and scale without interruption. Which ingestion solution should the company use?

33. 一家制造和销售智能空调的公司还提供附加服务,让客户可以在移动应用程序或网页浏览器中查看实时仪表盘。每台空调每两秒钟以JSON格式发送其传感器信息进行处理和分析。公司还需要消耗这些数据,以预测设备可能出现的问题。在接下来的几个月里,几千台预购的空调将被交付。公司预计明年市场增长迅猛,需要处理海量数据并在不中断的情况下进行扩展。公司应该使用哪种数据摄取解决方案?

34 / 100

分类: DBS

34. 34. A web application is using Amazon Kinesis Streams for clickstream data that may not be consumed for up to 12 hours. As a security requirement, how can the data be secured at rest within the Kinesis Streams?

34. 一个 web 应用程序正在使用 Amazon Kinesis Streams 处理点击流数据,这些数据可能会在最多 12 小时内没有被消费。作为安全要求,如何在 Kinesis Streams 中确保数据在静态时的安全性?

35 / 100

分类: DBS

35. 35. You’re launching a test Elasticsearch cluster with the Amazon Elasticsearch Service, and you’d like to restrict access to only your office desktop computer that you occasionally share with an intern to allow her to get more experience interacting with Elasticsearch. What’s the easiest way to do this?

35. 你正在使用Amazon Elasticsearch Service启动一个测试Elasticsearch集群,且希望将访问权限限制为仅限你偶尔与实习生共享的办公室桌面电脑,以便让她获得更多与Elasticsearch交互的经验。最简单的方法是什么?

36 / 100

分类: DBS

36. 36. Your application development team is building a solution with two applications. The security team wants each application’s logs to be captured in two different places because one of the applications produces logs with sensitive data. How can you meet the requirements with the least risk and effort?

36. 你的应用开发团队正在构建一个包含两个应用的解决方案。安全团队希望每个应用的日志都能在两个不同的地方进行记录,因为其中一个应用生成包含敏感数据的日志。你如何在最小的风险和努力下满足这些要求?

37 / 100

分类: DBS

37. 37. There are thousands of text files on Amazon S3. The total size of the files is 1 PB. The files contain retail order information for the past 2 years. A data engineer needs to run multiple interactive queries to manipulate the data. The Data Engineer has AWS access to spin up an Amazon EMR cluster. The data engineer needs to use an application on the cluster to process this data and return the results in interactive time frame. Which application on the cluster should the data engineer use?

37. 亚马逊S3上有数千个文本文件。文件的总大小为1PB。这些文件包含过去两年的零售订单信息。数据工程师需要运行多个交互式查询来处理数据。数据工程师有AWS访问权限,可以启动一个Amazon EMR集群。数据工程师需要在集群上使用一个应用程序来处理这些数据,并在交互式时间范围内返回结果。数据工程师应该使用集群上的哪个应用程序?

38 / 100

分类: DBS

38. 38. A company hosts a web application on AWS which uses RDS instance to store critical data. As a part of a security audit, it was recommended hardening of RDS instance. What actions would help achieve the same? (Select TWO)

38. 一家公司在AWS上托管一个Web应用程序,该应用程序使用RDS实例存储关键数据。作为安全审核的一部分,建议加固RDS实例。哪些操作有助于实现这一目标?(选择两个)

39 / 100

分类: DBS

39. 39. A data engineer chooses Amazon DynamoDB as a data store for a regulated application. This application must be submitted to regulators for review. The data engineer needs to provide a control framework that lists the security controls from the process to follow to add new users down to the physical controls of the data center, including items like security guards and cameras. How should this control mapping be achieved using AWS?

39. 一名数据工程师选择了 Amazon DynamoDB 作为受监管应用程序的数据存储。这款应用程序必须提交给监管机构进行审查。数据工程师需要提供一个控制框架,列出从添加新用户的过程到数据中心的物理控制(如保安和摄像头)等项的安全控制。应该如何使用 AWS 实现这一控制映射?

40 / 100

分类: DBS

40. 40. You need to filter and transform incoming messages coming from a smart sensor you have connected with AWS. Once messages are received, you need to store them as time series data in DynamoDB. Which AWS service can you use?

40. 你需要过滤并转换来自你连接到AWS的智能传感器的输入消息。一旦消息被接收,你需要将它们作为时间序列数据存储在DynamoDB中。你可以使用哪个AWS服务?

41 / 100

分类: DBS

41. 41. An administrator is processing events in near real-time using Kinesis streams and Lambda. Lambda intermittently fails to process batches from one of the shards due to a 15-minute time limit. What is a possible solution for this problem?

41. 一名管理员正在使用 Kinesis 流和 Lambda 处理接近实时的事件。由于 15 分钟的时间限制,Lambda 间歇性地无法处理来自其中一个分片的批次。这个问题的一个可能解决方案是什么?

42 / 100

分类: DBS

42. 42. A company is using Kinesis data streams to store the log data, which is processed by an application every 12 hours. As the data needs to reside in Kinesis data streams for 12 hours, the Security team wants the data to be encrypted at rest. How can it be secured in a most efficient way?
42. 一家公司正在使用 Kinesis 数据流存储日志数据,数据每 12 小时由应用程序处理一次。由于数据需要在 Kinesis 数据流中存储 12 小时,安全团队希望数据在静态时进行加密。如何以最有效的方式保护数据安全?

43 / 100

分类: DBS

43. 43. A company needs a churn prevention model to predict which customers will NOT renew their yearly subscription to the company’s service. The company plans to provide these customers with a promotional offer. A binary classification model that uses Amazon Machine Learning is required. On which basis should this binary classification model be built?

43. 一家公司需要一个流失预防模型,以预测哪些客户不会续订他们每年的订阅服务。公司计划向这些客户提供促销优惠。需要使用亚马逊机器学习的二分类模型。该二分类模型应基于什么基础进行构建?

44 / 100

分类: DBS

44. 44. A company launched EMR cluster to support their big data analytics requirements. They have multiple data sources built out of S3, SQL databases, MongoDB, Redis, RDS, other file systems. They are looking for distributed processing framework and programming model that helps you do machine learning, stream processing, or graph analytics using Amazon EMR clusters Which EMR Hadoop ecosystem fulfils the requirements?

44. 一家公司启动了 EMR 集群以支持他们的大数据分析需求。他们有多个数据源,包括 S3、SQL 数据库、MongoDB、Redis、RDS 以及其他文件系统。他们正在寻找一个分布式处理框架和编程模型,能够帮助他们使用 Amazon EMR 集群进行机器学习、流处理或图形分析。哪个 EMR Hadoop 生态系统能够满足这些需求?

45 / 100

分类: DBS

45. 45. Your company produces customer commissioned one-of-a-kind skiing helmets combining high fashion with custom technical enhancements. Customers can show off their Individuality on the ski slopes and have access to head-up-displays. GPS rear-view cams and any other technical innovation they wish to embed in the helmet. The current manufacturing process is data rich and complex including assessments to ensure that the custom electronics and materials used to assemble the helmets are to the highest standards. Assessments are a mixture of human and automated assessments you need to add a new set of assessment to model the failure modes of the custom electronics using GPUs with CUDA across a cluster of servers with low latency networking. What architecture would allow you to automate the existing process using a hybrid approach and ensure that the architecture can support the evolution of processes over time?

45. 贵公司生产客户定制的独一无二的滑雪头盔,结合了高端时尚与定制技术增强功能。客户可以在滑雪场上展示他们的个性,并且可以使用抬头显示器、GPS后视摄像头以及他们希望嵌入头盔中的任何其他技术创新。当前的制造过程数据丰富且复杂,包括评估,以确保用于组装头盔的定制电子设备和材料符合最高标准。评估是人工和自动评估的结合,您需要添加一套新的评估来模拟定制电子设备的故障模式,使用GPU与CUDA在低延迟网络的服务器集群上进行计算。什么架构可以允许您使用混合方法自动化现有过程,并确保该架构能够支持过程随着时间的推移而发展?

46 / 100

分类: DBS

46. 46. A company operates an international business served from a single AWS region. The company wants to expand into a new country. The regulator for that country requires the Data Architect to maintain a log of financial transactions in the country within 24 hours of the product transaction. The production application is latency insensitive. The new country contains another AWS region. What is the most cost-effective way to meet this requirement?

46. 一家公司运营着一个从单一AWS区域提供服务的国际业务。该公司希望扩展到一个新国家。该国的监管机构要求数据架构师在产品交易后的24小时内,维护该国的金融交易日志。生产应用程序对延迟不敏感。新国家包含另一个AWS区域。满足此要求的最具成本效益的方式是什么?

47 / 100

分类: DBS

47. 47. You have recently joined a startup company building sensors to measure street noise and air quality in urban areas. The company has been running a pilot deployment of around 100 sensors for 3 months. Each sensor uploads 1KB of sensor data every minute to a backend hosted on AWS. During the pilot, you measured a peak or 10 IOPS on the database, and you stored an average of 3GB of sensor data per month in the database. The current deployment consists of a load-balanced auto scaled Ingestion layer using EC2 instances and a PostgreSQL RDS database with 500GB standard storage. The pilot is considered a success and your CEO has managed to get the attention or some potential investors. The business plan requires a deployment of at least 100K sensors, which needs to be supported by the backend. You also need to store sensor data for at least two years to be able to compare year over year Improvements. To secure funding, you have to make sure that the platform meets these requirements and leaves room for further scaling. Which setup will meet the requirements?

47. 你最近加入了一家创业公司,该公司正在建设用于测量城市地区街道噪音和空气质量的传感器。该公司已经运行了一个大约100个传感器的试点部署,已经持续了3个月。每个传感器每分钟向托管在AWS上的后端上传1KB的传感器数据。在试点期间,你在数据库上测量到了最高10次I/O操作每秒,并且每月在数据库中存储了平均3GB的传感器数据。当前的部署由一个负载均衡的自动扩展的摄取层组成,使用EC2实例和一个500GB标准存储的PostgreSQL RDS数据库。试点被认为是成功的,你的CEO成功吸引了一些潜在投资者的关注。商业计划要求至少部署10万个传感器,并且需要通过后端来支持。你还需要存储至少两年的传感器数据,以便能够比较逐年改进。为了确保资金到位,你必须确保平台满足这些要求并为进一步扩展留下空间。哪种设置将满足这些要求?

48 / 100

分类: DBS

48. 48. A company receives data sets coming from external providers on Amazon S3. Data sets from different providers are dependent on one another. Data sets will arrive at different times and in no particular order. A data architect needs to design a solution that enables the company to do the following: Rapidly perform cross data set analysis as soon as the data become available Manage dependencies between data sets that arrive at different times Which architecture strategy offers a scalable and cost-effective solution that meets these Requirements?

48. 一家公司从外部提供商处接收来自Amazon S3的数据集。不同提供商的数据集彼此之间是相互依赖的。数据集将在不同的时间到达,并且没有特定的顺序。一位数据架构师需要设计一个解决方案,使公司能够做到以下几点:

  • 一旦数据可用,快速执行跨数据集分析
  • 管理在不同时间到达的数据集之间的依赖关系

哪种架构策略能够提供一个可扩展且具有成本效益的解决方案,满足这些要求?

49 / 100

分类: DBS

49. 49. A media advertising company handles a large number of real-time messages sourced from over 200 websites in real time. Processing latency must be kept low. Based on calculations, a 60-shard Amazon Kinesis stream is more than sufficient to handle the maximum data throughput, even with traffic spikes. The company also uses an Amazon Kinesis Client Library (KCL) application running on Amazon Elastic Compute Cloud (EC2) managed by an Auto Scaling group. Amazon CloudWatch indicates an average of 25% CPU and a modest level of network traffic across all running servers. The company reports a 150% to 200% increase in latency of processing messages from Amazon Kinesis during peak times. There are NO reports of delay from the sites publishing to Amazon Kinesis. What is the appropriate solution to address the latency?

49. 一家媒体广告公司处理来自200多个网站的海量实时消息。处理延迟必须保持较低。根据计算,60分片的Amazon Kinesis流足以处理最大的数据吞吐量,即使在流量高峰期。该公司还使用运行在Amazon Elastic Compute Cloud (EC2)上的Amazon Kinesis客户端库(KCL)应用程序,由自动扩展组进行管理。Amazon CloudWatch显示,所有运行中的服务器的CPU平均使用率为25%,网络流量处于适度水平。公司报告称,在高峰时段,从Amazon Kinesis处理消息的延迟增加了150%到200%。没有报告来自发布到Amazon Kinesis的网站的延迟问题。应对延迟的适当解决方案是什么?

50 / 100

分类: DBS

50. 50. An administrator needs to design a strategy for the schema in a Redshift cluster. The administrator needs to determine the optimal distribution style for the tables in the Redshift schema. In which two circumstances would choosing EVEN distribution be most appropriate? (Choose two.)

50. 管理员需要为Redshift集群中的架构设计策略。管理员需要确定Redshift架构中表的最佳分布样式。在以下两种情况下,选择EVEN分布最为合适?(选择两项。)

51 / 100

分类: DBS

51. 51. A solutions architect for a logistics organization ships packages from thousands of suppliers to end customers. The architect is building a platform where suppliers can view the status of one or more of their shipments. Each supplier can have multiple roles that will only allow access to specific fields in the resulting information. Which strategy allows the appropriate level of access control and requires the LEAST amount of management work?

51. 一位物流组织的解决方案架构师负责将包裹从成千上万的供应商运送到最终客户。该架构师正在构建一个平台,供应商可以查看他们一个或多个货件的状态。每个供应商可以拥有多个角色,这些角色仅允许访问结果信息中的特定字段。哪种策略能够实现适当的访问控制,并且需要最少的管理工作?

52 / 100

分类: DBS

52. 52. A utility company is building an application that stores data coming from more than 10,000 sensors. Each sensor has a unique ID and will send a datapoint (approximately 1KB) every 10 minutes throughout the day. Each datapoint contains the information coming from the sensor as well as a timestamp. This company would like to query information coming from a particular sensor for the past week very rapidly and want to delete all the data that is older than 4 weeks. Using Amazon DynamoDB for its scalability and rapidity, how do you implement this in the most cost effective way?

52. 一家公用事业公司正在构建一个应用程序,用于存储来自超过10,000个传感器的数据。每个传感器都有一个唯一的ID,并且每10分钟会发送一个数据点(大约1KB)到应用程序。每个数据点包含来自传感器的信息以及时间戳。该公司希望能够快速查询过去一周来自特定传感器的信息,并希望删除所有超过4周的数据。该公司使用Amazon DynamoDB来满足其可扩展性和快速性需求,如何以最具成本效益的方式实现这一目标?

53 / 100

分类: DBS

53. 53. You need to provide customers with rich visualizations that allow you to easily connect multiple disparate data sources in S3, Redshift, and several CSV files. Which tool should you use that requires the least setup?

53. 你需要为客户提供丰富的可视化功能,使你能够轻松地连接S3、Redshift和多个CSV文件中的不同数据源。你应该使用哪个工具,要求设置工作最少?

54 / 100

分类: DBS

54. 54. You need to create a recommendation engine for your e-commerce website that sells over 300 items. The items never change, and the new users need to be presented with the list of all 300 items in order of their interest. Which option do you use to accomplish this?

54. 你需要为你的电子商务网站创建一个推荐引擎,该网站销售超过300个商品。这些商品不会改变,新的用户需要按兴趣顺序呈现所有300个商品的列表。你使用哪个选项来完成这个任务?

55 / 100

分类: DBS

55. 55. A web application emits multiple types of events to Amazon Kinesis Streams for operational reporting. Critical events must be captured immediately before processing can continue, but informational events do not need to delay processing. What is the most appropriate solution to record these different types of events?

55. 一个Web应用程序将多种类型的事件发送到Amazon Kinesis Streams进行操作报告。关键事件必须在处理继续之前立即捕获,但信息性事件不需要延迟处理。记录这些不同类型事件的最合适解决方案是什么?

56 / 100

分类: DBS

56. 56. You have to identify potential fraudulent credit card transactions using Amazon Machine Learning. You have been given historical labeled data that you can use to create your model. You will also need to the ability to tune the model you pick. Which model type should you use?

56. 你需要使用亚马逊机器学习来识别潜在的欺诈信用卡交易。你已经获得了可以用来创建模型的历史标记数据。你还需要能够调整所选模型的能力。你应该使用哪种模型类型?

57 / 100

分类: DBS

57. 57. You’ve been asked by the VP of People to showcase the current breakdown of the headcount for each department within your organization. What chart do you select to do this to make it easy to compare each department?

57. 人力资源副总裁要求你展示你所在组织各部门当前的人数分布。你选择什么样的图表来展示,以便轻松比较各部门之间的差异?

58 / 100

分类: DBS

58. 58. An online gaming company uses DynamoDB to store user activity logs and is experiencing throttled writes on the company’s DynamoDB table. The company is NOT consuming close to the provisioned capacity. The table contains a large number of items and is partitioned on user and sorted by date. The table is 200GB and is currently provisioned at 10K WCU and 20K RCU. Which two additional pieces of information are required to determine the cause of the throttling? (Choose two.)
58. 一家在线游戏公司使用DynamoDB存储用户活动日志,并且在公司的DynamoDB表上遇到写入限制。该公司并没有接近所提供的容量。该表包含大量的项,并且按用户进行分区,并按日期排序。该表大小为200GB,当前配置为10K WCU和20K RCPU。为了确定限速的原因,还需要哪些额外的信息?(选择两个。)

59 / 100

分类: DBS

59. 59. A Redshift data warehouse has different user teams that need to query the same table with very different query types. These user teams are experiencing poor performance. Which action improves performance for the user teams in this situation?

59. 一个Redshift数据仓库有不同的用户团队需要以非常不同的查询类型查询相同的表。这些用户团队正在经历性能差的问题。在这种情况下,哪种操作可以改善用户团队的性能?

60 / 100

分类: DBS

60. 60. A data engineer needs to collect data from multiple Amazon Redshift clusters within a business and consolidate the data into a single central data warehouse. Data must be encrypted at all times while at rest or in flight. What is the most scalable way to build this data collection process?

60. 一名数据工程师需要从业务中的多个 Amazon Redshift 集群收集数据,并将数据整合到一个单一的中央数据仓库中。在数据静态或传输过程中,必须始终加密数据。构建此数据收集过程的最具可扩展性的方法是什么?

61 / 100

分类: DBS

61. 61. Your company releases new features with high frequency while demanding high application availability. As part of the application’s A/B testing, logs from each updated Amazon EC2 instance of the application need to be analyzed in near real-time, to ensure that the application is working flawlessly after each deployment. If the logs show any anomalous behavior, then the application version of the instance is changed to a more stable one. Which of the following methods should you use for shipping and analyzing the logs in a highly available manner?

61. 你的公司以高频率发布新功能,同时要求高应用可用性。作为应用程序A/B测试的一部分,需要近实时分析每个更新后的Amazon EC2实例的日志,以确保每次部署后应用程序能够完美运行。如果日志显示任何异常行为,则该实例的应用程序版本会更改为更稳定的版本。以下哪种方法适合以高可用性方式传输和分析日志?

62 / 100

分类: DBS

62. 62. Your company is in the process of developing a next generation pet collar that collects biometric information to assist families with promoting healthy lifestyles for their pets. Each collar will push 30kb of biometric data In JSON format every 2 seconds to a collection platform that will process and analyze the data providing health trending information back to the pet owners and veterinarians via a web portal Management has tasked you to architect the collection platform ensuring the following requirements are met. Provide the ability for real-time analytics of the inbound biometric data to ensure processing of the biometric data is highly durable, Elastic and parallel. The results of the analytic processing should be persisted for data mining. Which architecture outlined below will meet the initial requirements for the collection platform?

62. 你们公司正在开发一款下一代宠物项圈,收集生物特征信息,以帮助家庭促进宠物健康的生活方式。每个项圈每2秒钟将30KB的生物特征数据以JSON格式推送到一个数据收集平台,该平台将处理和分析数据,并通过一个网络门户向宠物主人和兽医提供健康趋势信息。管理层已委托你设计数据收集平台,确保以下要求得到满足:提供实时分析功能,以确保生物特征数据的处理具有高度的持久性、弹性和并行性。分析处理结果应当持久化,以便数据挖掘。以下哪种架构能够满足收集平台的初步要求?

63 / 100

分类: DBS

63. 63. A social media customer has data from different data sources including RDS running MySQL, Redshift, and Hive on EMR. To support better analysis, the customer needs to be able to analyze data from different data sources and to combine the results. What is the most cost-effective solution to meet these requirements?

63. 一个社交媒体客户拥有来自不同数据源的数据,包括运行MySQL的RDS、Redshift和EMR上的Hive。为了支持更好的分析,客户需要能够分析来自不同数据源的数据并结合结果。满足这些需求的最具成本效益的解决方案是什么?

64 / 100

分类: DBS

64. 64. Management has requested a comparison of total sales performance in the five North American regions in January. They’re hoping to determine how to allocate a budget to regions based on performance in that single period. What sort of visualization do you use in Amazon QuickSight?
64. 管理层要求对1月份五个北美地区的总销售表现进行比较。他们希望根据该单一时期的表现来决定如何分配预算给各个地区。你会在Amazon QuickSight中使用什么样的可视化方式?

65 / 100

分类: DBS

65. 65. A new client is requesting a tool that will provide fast query performance for enterprise reporting and business intelligence workloads, particularly those involving extremely complex SQL with multiple joins and sub-queries. They also want the ability to give analysts access to a central system through tradition SQL clients that allow them to explore and familiarize themselves with the data. What solution do you initially recommend they investigate?

65. 一位新客户要求提供一个工具,该工具能够为企业报告和商业智能工作负载提供快速的查询性能,特别是涉及多个连接和子查询的极其复杂的SQL。他们还希望能够通过传统的SQL客户端为分析师提供访问中央系统的能力,使他们能够探索和熟悉数据。你最初推荐他们调查哪种解决方案?

66 / 100

分类: DBS

66. 66. A company stores data in an S3 bucket. Some of the data contains sensitive information. They need to ensure that the bucket complies with PCI DSS (Payment Card Industry Data Security Standard) compliance standards. Which of the following should be implemented to fulfill this requirement? (Select TWO)

66. 一家公司将数据存储在S3桶中。一些数据包含敏感信息。他们需要确保该桶符合PCI DSS(支付卡行业数据安全标准)合规性要求。为了满足这一要求,以下哪些措施应该实施?(选择两个)

67 / 100

分类: DBS

67. 67. Your company sells consumer devices and needs to record the first activation of all sold devices. Devices are not activated until the information is written on a persistent database. Activation data is very important for your company and must be analyzed daily with a MapReduce job. The execution time of the data analysis process must be less than three hours per day. Devices are usually sold evenly during the year, but when a new device model is out, there is a predictable peak in activation’s, that is, for a few days there are 10 times or even 100 times more activation’s than in average day. Which of the following databases and analysis framework would you implement to better optimize costs and performance for this workload?

67. 贵公司销售消费类设备,并需要记录所有销售设备的首次激活信息。设备在信息写入持久化数据库之前不会被激活。激活数据对贵公司非常重要,必须通过MapReduce作业进行每日分析。数据分析过程的执行时间必须少于每天三小时。设备通常在全年均匀销售,但当新设备型号发布时,激活量会出现可预测的峰值,也就是说,在几天内,激活量是平均日的10倍甚至100倍。以下哪种数据库和分析框架将有助于更好地优化此工作负载的成本和性能?

68 / 100

分类: DBS

68. 68. Your company is storing millions of sensitive transactions across thousands of 100-GB files that must be encrypted in transit and at rest. Analysts concurrently depend on subsets of files, which can consume up to 5TB of space, to generate simulations that can be used to steer business decisions. You are required to design an AWS solution that can cost effectively accommodate the long-term storage and in-flight subsets of data.
68. 你的公司正在存储数百万条敏感交易数据,这些数据分布在数千个100GB的文件中,必须在传输和静态时进行加密。分析人员同时依赖部分文件,这些文件最多可占用5TB的空间,用于生成可以用于引导商业决策的模拟。你需要设计一个AWS解决方案,以经济有效的方式容纳长期存储和传输中的数据子集。

69 / 100

分类: DBS

69. 69. An administrator needs to design the event log storage architecture for events from mobile devices. The event data will be processed by an Amazon EMR cluster daily for aggregated reporting and analytics before being archived. How should the administrator recommend storing the log data?

69. 一名管理员需要为来自移动设备的事件设计事件日志存储架构。事件数据将在每天由Amazon EMR集群处理,用于聚合报告和分析,然后再进行归档。管理员应该如何推荐存储日志数据?

70 / 100

分类: DBS

70. 70. Your company uses DynamoDB to support their mobile application and S3 to host the images and other documents shared between users. DynamoDB has a table with 60 partitions and is being heavily accessed by users. The queries run by users do not fully use the per-partition’s throughput. However there are times when in less than 3 minutes, a heavy load of queries flow in and this happen occasionally. Sometimes there are many background tasks that are running in background. How can DynamoDB be configured to handle the workload?
70. 你们公司使用DynamoDB来支持他们的移动应用程序,并使用S3来托管用户之间共享的图像和其他文档。DynamoDB有一个包含60个分区的表,并且正在被用户大量访问。用户执行的查询并没有完全利用每个分区的吞吐量。然而,有时在不到3分钟的时间内,会有大量查询涌入,这种情况偶尔发生。有时后台还会有许多任务在运行。如何配置DynamoDB以处理这些工作负载?

71 / 100

分类: DBS

71. 71. An online photo album app has a key design feature to support multiple screens (e.g, desktop, mobile phone, and tablet) with high-quality displays. Multiple versions of the image must be saved in different resolutions and layouts. The image-processing Java program takes an average of five seconds per upload, depending on the image size and format. Each image upload captures the following image metadata: user, album, photo label, upload timestamp.The app should support the following requirements: Hundreds of user image uploads per second Maximum image upload size of 10 MB Maximum image metadata size of 1 KB Image displayed in optimized resolution in all supported screens no later than one minute after image upload Which strategy should be used to meet these requirements?

71. 一个在线相册应用程序具有一个关键设计特性,支持多个屏幕(例如,桌面、手机和平板)并提供高质量的显示效果。必须以不同的分辨率和布局保存图像的多个版本。图像处理的Java程序每次上传平均需要五秒钟,具体取决于图像的大小和格式。每次图像上传都会捕获以下图像元数据:用户、相册、照片标签、上传时间戳。

该应用程序应支持以下要求:

  • 每秒数百次用户图像上传
  • 最大图像上传大小为10 MB
  • 最大图像元数据大小为1 KB
  • 在所有支持的屏幕上,图像应在上传后不超过一分钟内以优化的分辨率显示

应该使用哪种策略来满足这些要求?

72 / 100

分类: DBS

72. 72. A company is using Amazon Machine Learning as part of a medical software application. The application will predict the most likely blood type for a patient based on a variety of other clinical tests that are available when blood type knowledge is unavailable. What is the appropriate model choice and target attribute combination for this problem?

72. 一家公司正在将 Amazon Machine Learning 用作医疗软件应用的一部分。该应用将根据在血型信息不可用时可以获取的其他临床测试,预测患者最可能的血型。对于这个问题,适当的模型选择和目标属性组合是什么?

73 / 100

分类: DBS

73. 73. A company is developing a video application that will emit a log stream. Each record in the stream may contain up to 400 KB of data. To improve the video-streaming experience, it is necessary to collect a subset of metrics from the stream to be analyzed for trends over time using complex SQL queries. A Solutions Architect will create a solution that allows the application to scale without customer interaction. Which solution should be implemented to meet these requirements?

73. 一家公司正在开发一款视频应用程序,该程序将发出日志流。日志流中的每条记录可能包含最多400 KB的数据。为了改善视频流体验,有必要从流中收集一部分指标,以便通过复杂的SQL查询分析这些指标的趋势。解决方案架构师将创建一个解决方案,使得该应用程序可以在无需客户互动的情况下进行扩展。为了满足这些要求,应该实施哪种解决方案?

74 / 100

分类: DBS

74. 74. A data engineer in a manufacturing company is designing a data processing platform that receives a large volume of unstructured data. The data engineer must populate a well-structured star schema in Amazon Redshift. What is the most efficient architecture strategy for this purpose?

74. 一家制造公司的数据工程师正在设计一个数据处理平台,该平台接收大量非结构化数据。数据工程师必须在 Amazon Redshift 中填充一个结构良好的星型模式。为了这个目的,最有效的架构策略是什么?

75 / 100

分类: DBS

75. 75. You are deploying an application to track GPS coordinates of delivery trucks in the United States. Coordinates are transmitted from each delivery truck once every three seconds. You need to design an architecture that will enable real-time processing of these coordinates from multiple consumers. Which service should you use to implement data ingestion?

75. 你正在部署一个应用程序,用于跟踪美国送货卡车的GPS坐标。每辆送货卡车每三秒钟会传输一次坐标。你需要设计一个架构,以便实现来自多个消费者的这些坐标的实时处理。你应该使用哪个服务来实现数据摄取?

76 / 100

分类: DBS

76. 76. You have a customer-facing application running on multiple M3 instances in two AZs. These instances are in an auto-scaling group configured to scale up when load increases. After taking a look at your CloudWatch metrics, you realize that during specific times every single day, the auto-scaling group has a lot more instances than it normally does. Despite this, one of your customers is complaining that the application is very slow to respond during those time periods every day. The application is reading and writing to a DynamoDB table which has 400 Write Capacity Units and 400 Read Capacity Units. The primary key is the company ID, and the table is storing roughly 20 TB of data. Which solution would solve the issue in a scalable and cost-effective manner?

76. 您有一个面向客户的应用程序,在两个可用区的多个M3实例上运行。这些实例属于一个自动扩展组,在负载增加时配置为扩展。当您查看CloudWatch指标时,您意识到每天的特定时间,自动扩展组的实例数量远远多于通常的数量。尽管如此,其中一位客户抱怨说,在这些时间段内,应用程序的响应非常慢。该应用程序正在读取和写入一个DynamoDB表,该表具有400个写入容量单元和400个读取容量单元。主键是公司ID,该表存储了大约20 TB的数据。哪种解决方案可以以可扩展且具有成本效益的方式解决此问题?

77 / 100

分类: DBS

77. 77. Your enterprise application requires key-value storage as the database. The data is expected to be about 10 GB the first month and grow to 2 PB over the next two years. There are no other query requirements at this time. What solution would you recommend?

77. 你的企业应用需要使用键值存储作为数据库。预计数据在第一个月约为10 GB,并将在接下来的两年内增长到2 PB。目前没有其他查询要求。你会推荐什么解决方案?

78 / 100

分类: DBS

78. 78. A system needs to collect on-premises application spool files into a persistent storage layer in AWS. Each spool file is 2 KB. The application generates 1 M files per hour. Each source file is automatically deleted from the local server after an hour. What is the most cost-efficient option to meet these requirements?

78. 一个系统需要将本地应用程序的打印队列文件收集到AWS中的持久存储层。每个打印队列文件为2 KB。该应用程序每小时生成1百万个文件。每个源文件在一小时后会从本地服务器自动删除。最具成本效益的选项是什么?

79 / 100

分类: DBS

79. 79. A data engineer needs to architect a data warehouse for an online retail company to store historic purchases. The data engineer needs to use Amazon Redshift. To comply with PCI:DSS and meet corporate data protection standards, the data engineer must ensure that data is encrypted at rest and that the keys are managed by a corporate on-premises HSM. Which approach meets these requirements in the most cost-effective manner?

79. 一名数据工程师需要为一家在线零售公司设计一个数据仓库,用于存储历史购买数据。该数据工程师需要使用Amazon Redshift。为了遵守PCI:DSS并满足企业数据保护标准,数据工程师必须确保数据在静态时被加密,并且密钥由企业本地HSM进行管理。哪种方法能够以最具成本效益的方式满足这些要求?

80 / 100

分类: DBS

80. 80. Your application generates a 1 KB JSON payload that needs to be queued and delivered to EC2 instances for applications. At the end of the day, the application needs to replay the data for the past 24 hours. In the near future, you also need the ability for other multiple EC2 applications to consume the same stream concurrently. What is the best solution for this?

80. 您的应用程序生成一个 1 KB 的 JSON 负载,需要将其排队并传送到 EC2 实例以供应用程序使用。每天结束时,应用程序需要重放过去 24 小时的数据。在不久的将来,您还需要其他多个 EC2 应用程序能够同时消费相同的数据流。对此,最佳解决方案是什么?

81 / 100

分类: DBS

81. 81. An organization is designing an application architecture. The application will have over 100 TB of data and will support transactions that arrive at rates from hundreds per second to tens of thousands per second, depending on the day of the week and time of the day. All transaction data must be durably and reliably stored. Certain read operations must be performed with strong consistency. Which solution meets these requirements?

81. 一个组织正在设计一个应用架构。该应用将拥有超过100 TB的数据,并且支持每秒数百到数万个的交易请求,具体取决于星期几和一天中的时间。所有交易数据必须可靠且持久地存储。某些读取操作必须在强一致性的要求下执行。哪种解决方案能够满足这些需求?

82 / 100

分类: DBS

82. 82. A data engineer wants to use an Amazon Elastic Map Reduce for an application. The data engineer needs to make sure it complies with regulatory requirements. The auditor must be able to confirm at any point which servers are running and which network access controls are deployed. Which action should the data engineer take to meet this requirement?

82. 一名数据工程师希望为一个应用程序使用 Amazon Elastic Map Reduce。该数据工程师需要确保它符合监管要求。审计员必须能够随时确认哪些服务器正在运行,以及哪些网络访问控制已部署。数据工程师应该采取什么措施来满足这一要求?

83 / 100

分类: DBS

83. 83. You are using IOT sensors to monitor the movement of a group of hikers on a three day trek and send the information into an Kinesis Stream. They each have a sensor in their shoe and you know for certain that there is no problem with mobile coverage so all the data is getting back to the stream. You have used default settings for the stream. At the end of the third day the data is sent to an S3 bucket. When you go to interpret the data in S3 there is only data for the last day and nothing for the first 2 days. Which of the following is the most probable cause of this?

83. 你正在使用物联网传感器监控一组徒步旅行者在三天旅行中的运动情况,并将信息发送到Kinesis流中。每个人的鞋子里都有一个传感器,你可以确定手机信号覆盖没有问题,因此所有数据都能顺利返回流中。你使用了流的默认设置。第三天结束时,数据被发送到一个S3存储桶。当你去S3中解析数据时,只能看到最后一天的数据,前两天的数据却没有。以下哪项是最可能导致这种情况的原因?

84 / 100

分类: DBS

84. 84. A research scientist is planning for the one-time launch of an Elastic MapReduce cluster and is encouraged by her manager to minimize the costs. The cluster is designed to ingest 200TB of genomics data with a total of 100 Amazon EC2 instances and is expected to run for around four hours. The resulting data set must be stored temporarily until archived into an Amazon RDS Oracle instance. Which option will help save the most money while meeting requirements?

84. 一名研究科学家计划一次性启动一个Elastic MapReduce集群,并受到她经理的鼓励,尽量减少成本。该集群旨在处理200TB的基因组数据,共有100个Amazon EC2实例,预计运行约四小时。生成的数据集必须暂时存储,直到归档到Amazon RDS Oracle实例中。哪种选项能够在满足要求的同时节省最多的费用?

85 / 100

分类: DBS

85. 85. A company needs to deploy a data lake solution for their data scientists in which all company data is accessible and stored in a central S3 bucket. The company segregates the data by business unit, using specific prefixes. Scientists can only access the data from their own business unit. The company needs a single sign-on identity and management solution based on Microsoft Active Directory (AD) to manage access to the data in Amazon S3. Which method meets these requirements?

85. 一家公司需要为其数据科学家部署一个数据湖解决方案,其中所有公司数据都可以访问并存储在一个中央S3桶中。公司按照业务部门划分数据,使用特定的前缀。科学家只能访问自己业务部门的数据。公司需要一个基于Microsoft Active Directory (AD)的单一登录身份和管理解决方案来管理对Amazon S3中数据的访问。哪种方法符合这些要求?

86 / 100

分类: DBS

86. 86. A company has several teams of analysts. Each team of analysts has their own cluster. The teams need to run SQL queries using Hive, Spark-SQL, and Presto with Amazon EMR. The company needs to enable a centralized metadata layer to expose the Amazon S3 objects as tables to the analysts. Which approach meets the requirement for a centralized metadata layer?

86. 一家公司有多个分析团队。每个分析团队都有自己的集群。团队需要使用Hive、Spark-SQL和Presto在Amazon EMR上运行SQL查询。公司需要启用一个集中式元数据层,将Amazon S3对象作为表暴露给分析人员。哪种方法可以满足集中式元数据层的要求?

87 / 100

分类: DBS

87. 87. A Company has two batch processing applications that consume financial data about the day’s stock transactions. Each transaction needs to be stored durably and guarantee that a record of each application is delivered so the audit and billing batch processing applications can process the data. However, the two applications run separately and several hours apart and need access to the same transaction information. After reviewing the transaction information for the day, the information no longer needs to be stored. What is the best way to architect this application? Choose the correct answer from the options below

87. 一家公司有两个批处理应用程序,它们消耗关于当天股票交易的财务数据。每个交易需要持久化存储,并保证每个应用程序的记录都能够被传递,以便审计和计费批处理应用程序能够处理数据。然而,这两个应用程序是分开运行的,且相隔几个小时,需要访问相同的交易信息。在审查完当天的交易信息后,该信息不再需要存储。最佳的架构方式是什么?请选择以下选项中的正确答案。

88 / 100

分类: DBS

88. 88. Your website is serving on-demand training videos to your workforce. Videos are uploaded monthly in high resolution MP4 format. Your workforce is distributed globally often on the move and using company-provided tablets that require the HTTP Live Streaming (HLS) protocol to watch a video. Your company has no video transcoding expertise and it required you might need to pay for a consultant. How do you implement the most cost-efficient architecture without compromising high availability and quality of video delivery?

88. 您的网站正在为员工提供按需培训视频。视频每月上传一次,采用高分辨率的MP4格式。您的员工分布在全球各地,常常在外出差,并使用公司提供的平板电脑观看视频,这些平板电脑需要HTTP实时流(HLS)协议才能观看视频。贵公司没有视频转码方面的专业知识,因此可能需要支付顾问费用。您如何在不妥协视频交付的高可用性和质量的前提下,实施最具成本效益的架构?

89 / 100

分类: DBS

89. 89. You need to create an Amazon Machine Learning model to predict how many inches of rain will fall in an area based on the historical rainfall data. What type of modeling will you use?

89. 您需要创建一个亚马逊机器学习模型,以预测根据历史降雨数据,一个地区将降下多少英寸的雨。您将使用哪种类型的建模?

90 / 100

分类: DBS

90. 90. A company has launched EMR cluster to support their big data analytics requirements. AFS has multiple data sources built out of S3, SQL databases, MongoDB, Redis, RDS, other file systems. They are looking for a web application to create and share documents that contain live code, equations, visualizations, and narrative text. Which EMR Hadoop ecosystem fulfils the requirements?

90. 一家公司已经启动了EMR集群,以支持他们的大数据分析需求。AFS构建了多个数据源,包括S3、SQL数据库、MongoDB、Redis、RDS和其他文件系统。他们正在寻找一个Web应用程序,用于创建和共享包含实时代码、方程式、可视化和叙述文本的文档。哪个EMR Hadoop生态系统能满足这些需求?

91 / 100

分类: DBS

91. 91. A company is building a new application in AWS. The architect needs to design a system to collect application log events. The design should be a repeatable pattern that minimizes data loss if an application instance fails, and keeps a durable copy of a log data for at least 30 days. What is the simplest architecture that will allow the architect to analyze the logs?

91. 一家公司正在AWS中构建一个新的应用程序。架构师需要设计一个系统来收集应用程序日志事件。该设计应是一个可重复的模式,能够最小化应用程序实例故障时的数据丢失,并保持至少30天的日志数据持久副本。什么是最简单的架构,能够让架构师分析日志?

92 / 100

分类: DBS

92. 92. A large oil and gas company needs to provide near real-time alerts when peak thresholds are exceeded in its pipeline system. The company has developed a system to capture pipeline metrics such as flow rate, pressure, and temperature using millions of sensors. The sensors deliver to AWS IoT. What is a cost-effective way to provide near real-time alerts on the pipeline metrics?

92. 一家大型石油和天然气公司需要在其管道系统的峰值阈值被超越时提供近实时的警报。该公司已开发出一个系统,通过数百万个传感器捕捉管道的指标,如流量、压力和温度。这些传感器将数据传输到AWS IoT。提供管道指标近实时警报的经济有效的方式是什么?

93 / 100

分类: DBS

93. 93. You need real-time reporting on logs generated from your applications. In addition, you need anomaly detection. The processing latency needs to be one second or less. Which option would you choose if your team has no experience with Machine learning libraries and doesn’t want to have to maintain any software installations yourself?

93. 你需要对应用程序生成的日志进行实时报告。此外,你还需要异常检测。处理延迟需要在一秒钟或更短时间内。如果你的团队没有机器学习库的经验,并且不希望自己维护任何软件安装,你会选择哪个选项?

94 / 100

分类: DBS

94. 94. A city has been collecting data on its public bicycle share program for the past three years. The 5PB dataset currently resides on Amazon S3. The data contains the following datapoints: Bicycle origination points Bicycle destination points Mileage between the points Number of bicycle slots available at the station (which is variable based on the station location) Number of slots available and taken at a given time The program has received additional funds to increase the number of bicycle stations available. All data is regularly archived to Amazon Glacier. The new bicycle stations must be located to provide the most riders access to bicycles. How should this task be performed?

94. 一座城市在过去三年里一直在收集其公共自行车共享项目的数据。目前,5PB的数据集存储在Amazon S3上。数据包含以下数据点:

  • 自行车起始点
  • 自行车目的地点
  • 两点之间的里程
  • 车站可用的自行车车位数量(根据车站位置有所不同)
  • 特定时间内可用和已占用的车位数量

该项目已获得额外资金,以增加可用的自行车站数量。所有数据定期存档到Amazon Glacier。新的自行车站必须选址,以便为最多的骑行者提供自行车。该任务应如何执行?

95 / 100

分类: DBS

95. 95. A large grocery distributor receives daily depletion reports from the field in the form of gzip archives of CSV files uploaded to Amazon S3. The files range from 500MB to 5GB. These files are processed daily by an EMR job. Recently it has been observed that the file sizes vary, and the EMR jobs take too long. The distributor needs to tune and optimize the data processing workflow with this limited information to improve the performance of the EMR job. Which recommendation should an administrator provide?

95. 一个大型杂货分销商每天从现场接收以gzip压缩格式上传到Amazon S3的CSV文件的消耗报告。这些文件的大小范围从500MB到5GB。这些文件每天由EMR作业处理。最近观察到,文件大小存在变化,且EMR作业执行时间过长。分销商需要在有限的信息下调整和优化数据处理工作流,以提高EMR作业的性能。管理员应该提供什么建议?

96 / 100

分类: DBS

96. 96. An administrator is deploying Spark on Amazon EMR for two distinct use cases: machine learning algorithms and ad-hoc querying. All data will be stored in Amazon S3. Two separate clusters for each use case will be deployed. The data volumes on Amazon S3 are less than 10 GB. How should the administrator align instance types with the cluster’s purpose?

96. 一名管理员正在为两个不同的使用场景在Amazon EMR上部署Spark:机器学习算法和临时查询。所有数据将存储在Amazon S3中。每个使用场景将部署两个独立的集群。Amazon S3上的数据量小于10 GB。管理员应该如何根据集群的目的来选择实例类型?

97 / 100

分类: DBS

97. 97. A customer’s nightly EMR job processes a single 2-TB data file stored on Amazon Simple Storage Service (S3). The Amazon Elastic Map Reduce (EMR) job runs on two On-Demand core nodes and three On-Demand task nodes. Which of the following may help reduce the EMbR job completion time? Choose 2 answers
97. 客户的夜间EMR作业处理存储在Amazon Simple Storage Service (S3)上的单个2-TB数据文件。该Amazon Elastic Map Reduce (EMR)作业在两个按需核心节点和三个按需任务节点上运行。以下哪项可能有助于减少EMR作业的完成时间?选择两个答案。

98 / 100

分类: DBS

98. 98. A travel website needs to present a graphical quantitative summary of its daily bookings to website visitors for marketing purposes. The website has millions of visitors per day, but wants to control costs by implementing the least-expensive solution for this visualization. What is the most cost-effective solution?
98. 一个旅游网站需要向网站访客展示其每日预订的图形化定量摘要,用于营销目的。该网站每天有数百万的访客,但希望通过实施最具成本效益的解决方案来控制费用。最具成本效益的解决方案是什么?

99 / 100

分类: DBS

99. 99. You need to perform ad-hoc business analytics queries on well-structured data. Data comes in constantly at a high velocity. Your business intelligence team can understand SQL. What AWS service(s) should you look to first?

99. 您需要对结构良好的数据执行临时的业务分析查询。数据以高速不断流入。您的商业智能团队可以理解SQL。您应该首先考虑使用哪些AWS服务?

100 / 100

分类: DBS

100. 100. A game company needs to properly scale its game application, which is backed by DynamoDB. Amazon Redshift has the past two years of historical data. Game traffic varies throughout the year based on various factors such as season, movie release, and holiday season. An administrator needs to calculate how much read and write throughput should be provisioned for DynamoDB table for each week in advance. How should the administrator accomplish this task?
100. 一家游戏公司需要正确地扩展其由DynamoDB支持的游戏应用程序。Amazon Redshift包含过去两年的历史数据。游戏流量根据季节、电影上映和假期等各种因素在全年内波动。管理员需要计算每周需要为DynamoDB表预配置多少读取和写入吞吐量。管理员应该如何完成这一任务?

您的分数是

平均分为 0%

0%

评价表

感谢评价

本文地址:https://www.neiwangchuantou.com/2025/03/aws-dbs%e7%9c%9f%e9%a2%98-no-1-100/,禁止转载
0

评论0

AWS SAP-C02真题 No.401-600
AWS SAP-C02真题 No.401-600
9分钟前 有人购买 去瞅瞅看
显示验证码
没有账号?注册  忘记密码?