About the group:

Cognizant’s Cloud, Infrastructure, and Security Services Practice (CIS), is all about accepting digital transformation by driving core modernization holistically across layers. We help customers transform infrastructure and workplace to meet the constantly evolving needs of the digital era. Our broad approach delivers key results for our customers by achieving cloud driven modernization and workplace and operational transformation to own the business in a secure environment.

EEO Statement & Accommodations

Cognizant is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to sex, gender identity, sexual orientation, race, color, religion, national origin, disability, protected Veteran status, age, or any other characteristic protected by law. If you have a disability that requires a reasonable accommodation to search for a job opening or submit an application, please email [email protected] with your request and contact information.

Role: Kafka Site Reliability Engineer

Location: Austin, Texas

Roles & Responsibilities :

  • Carry out SRE duties for Kafka Streaming Platform. Have detailed understanding on the Kafka architecture along with the concepts of Producer, Consumer, topics, partitions etc. Keep an eye on the platforms and enforce to runbooks/SOPs to run platform and application problems. Familiarize yourself with the cluster maintenance processes and implement changes as per the detailed installation and validation plans.
  • Showcase robust solving and debugging skills, striving to pinpoint and resolve the issue, while also offering advice on how to prevent such problems in the future. Conduct detailed root cause analysis of major production incidents, document for future reference, and put in place proactive measures to improve system reliability. Automate routine tasks using scripts or automation tools to lessen manual work, decrease the chance of human errors, and boost system reliability.
  • At least 2-3 years of experience for a junior level role and 5+ for mid-level/senior level working as a Site reliability engineer for Kafka Platform. Deep level Knowledge on core Kafka components like producers, consumers, topics, partitions etc. Solving both Kafka platform service, application problems and identifying the root cause. Writing Ansible playbooks and automate manual tasks using Ansible, shell scripting and python. Should be familiar with Unix/Linux system internals, networking, and distributed systems

Benefits: Cognizant offers the following benefits for this position, subject to applicable eligibility requirements:

Medical/Dental/Vision/Life Insurance. Paid holidays plus Paid Time Off. 401(k) plan and contributions. Long-term/Short-term Disability. Paid Parental Leave. Employee Stock Purchase Plan. Eligible for Cognizant’s discretionary annual incentive program, based on performance and subject to the terms of Cognizant’s applicable plans.

Disclaimer! The hourly rate, other compensation, and benefits information is accurate as of the date of this posting. Cognizant reserves the right to modify this information at any time.