DataStage is an ETL tool and it uses a graphical representation to build data integration. Datastage is available in various versions.

We will discuss here about Datastage architecture.

The dataStage architecture is a client-server architecture. We have different types of client-server architectures. DataStage 7.5 version was a standalone version where DataStage engine, service and repository (metadata) was all installed in once server and client was installed in local PC and access the servers using the ds-client. Here the users are created in Unix/windows DataStage server and was included to the dataStage group. To give access to the new user just make new Unix/windows user in the DS-server and add them to dstage group. They will have access to the DataStage server from the client.

Client components of DataStage Architecture

DataStage has four client components –

  • Data Stage Administrator – It is responsible for creating, deleting projects and setting the environment variable.
  • Data Stage Designer – It is used for designing the job.
  • Data Stage Director – It is responsible for run, validate and scheduling the jobs.
  • Data Stage Manager – It is used for export and import the project.

Server components of DataStage Architecture

It has three server components –

  • DS Server – It is responsible to run executable server jobs
  • DS Package Installer – It is used to install packaged DS jobs
  • Repository or project – It is a central store containing all info.



Datastage Architecture