Tuesday, January 23, 2018

YARN Application Execution

YARN application execution consists of the following steps:
  • Application submission.
  • Bootstrapping the Application Master instance for the application.
  • Application execution managed by the Application Master instance.


Let’s walk through an application execution sequence (steps are illustrated in the diagram):
  1. A client program submits the application, including the necessary specifications to launch the application-specific Application Master.
  2. The Resource Manager assumes responsibility for negotiating a specified Container in which to start the Application Master, and then launches the Application Master.
  3. On boot-up, the Application Master registers with the Resource Manager. The registration allows the client program to query the Resource Manager for details, which allows it to directly communicate with its own Application Master.
  4. During normal operation, the Application Master negotiates appropriate resource Containers via the resource-request protocol.
  5. Upon successful Container allocations, the Application Master launches the Container by providing the Container launch specification to the Node Manager. The launch specification typically includes the necessary information to allow the Container to communicate with the Application Master.
  6. The application code executing within the Container then provides necessary information (progress, status etc.) to its Application Master via an application-specific protocol.
  7. During the application execution, the client that submitted the program communicates directly with the Application Master to get status, progress updates, etc. via an application-specific protocol.
  8. Once the application is complete, the Application Master de-registers with the Resource Manager and shuts down, allowing its own Container to be repurposed.

1 comment:

  1. It is nice blog Thank you porovide importent information and i am searching for same information to save my timeHadoop Administration Online Course
    v

    ReplyDelete

Kafka Architecture

Apache Kafka is a distributed publish-subscribe messaging system and a robust queue that can handle a high volume of data and enables you t...