ansible Remote Jobs

248 Results

+30d

TalanParis, France, Remote

DevOPS ● terraform ● sql ● ansible ● mongodb ● azure ● docker ● postgresql ● kubernetes ● linux ● python ● AWS

Talan is hiring a Remote Consultant Cloud Azure - H/F

Description du poste

Pourquoi nous avons besoin de vous :

En tant que Consultant Cloud Azure F/H, vous travaillerez en mission chez nos clients ou en mode projet depuis nos locaux ou ceux du client, vous apporterez votre expertise sur les technologies Cloud Azure et évoluerez au fil des phases de nos projets IT.

Votre rôle :

Vous accompagnez nos clients à définir leur stratégie cloud en :  

Réalisant la modernisation des Infrastructures et/ou Applications
Définissant les règles, préconisations et mettant en place les outils pour optimiser leur infrastructure.
Participant à la définition d’architectures de référence.
Identifiant les optimisations infrastructure et/ou applicative pour tirer profit du cloud

Vous prenez en charge des projets d’infrastructure de bout en bout :  

Vous validez la faisabilité de la mise en œuvre ou de la migration
Vous participez à l’élaboration de l’architecture technique et des solutions d’automatisation
Vous participez à la rédaction de documentation d’architecture technique détaillée.

Qualifications

Vous vous reconnaissez :

Vous justifiez d’une expérience significative sur des projets sur l’un des cloud provider publique liées à l’infrastructure Datacenter dont au moins 4 années d’expériences.

Compétences techniques :

Cloud : concepts Cloud (SaaS, PaaS, and IaaS solutions)
Gestion des identités : AD, Entra ID
Sécurité : Identités, MFA, Infrastructures
Infra As Code : Template ARM, Terraform, Bicep
InfraOps / SecOps, serait un plus DevOps
Scripting : PowerShell, Shell/Bash, Python, Azure CLI
Solutions de conteneurisation : Docker, Kubernetes
Orchestration : Ansible, Terraform, Docker, Puppet
Operating System : Windows, Linux
Réseau : Firewall, Loadbalancer, ExpressRoute, Hub & Spoke
Base de données : SQL Server, SQL as Service, MongoDB, RedShift, PostGreSQL, …
CI/CD : GitLab, Azure DevOps, GitHub

Une connaissance des technologies suivantes serait un plus :

Gestion de la configuration : DSC, Ansible
FinOps
Autre Cloud Providers (AWS, GCP, etc….)

Les certifications AZ104 et AZ400 seront un réel plus à votre candidature.

Doté d’un bon relationnel, faites preuve d’ouverture d’esprit et disposées de qualités de communication. On vous reconnaît une bonne capacité d’analyse et de synthèse, vous êtes autonome dans la réalisation de vos missions et vous faites preuve d’adaptabilité, de curiosité. Un niveau d’anglais courant est requis.

See more jobs at Talan

Apply for this job

+30d

Senior Site Reliability Engineer

CatalystRemote (US & Canada)

kotlin ● terraform ● airflow ● Design ● ansible ● ruby ● java ● docker ● elasticsearch ● postgresql ● kubernetes ● linux ● python ● AWS ● backend ● Node.js

Catalyst is hiring a Remote Senior Site Reliability Engineer

Company Overview

Totango + Catalyst have joined forces to build a leading customer growth platform that helps businesses protect and grow their revenue. Built by an experienced team of industry leaders, our software integrates with all the tools CS teams already use to provide one centralized view of customer data. Our modern and intuitive dashboards help CS leaders develop impactful workflows and take the right actions to understand health, prevent churn, increase adoption, and drive expansion.

Position Overview

As a Senior Site Reliability Engineer at Totango + Catalyst, you will help shape our infrastructure and build the foundation our team relies on for the rapid delivery of our product. We’ll depend on you to instill best practices for building scalable distributed systems, emphasizing development experience, observability and fault tolerance. Our current stack consists of technologies such as Ruby on Rails, RDS, Elasticsearch, Java, and Kubernetes, and we are moving towards microservices and serverless. If you thrive in a growth-stage startup environment and are looking for more ownership and the ability to have a significant impact, we would love to meet you.

This role is opened to candidates working remotely anywhere in Canada and the U.S.

What You’ll Do

Manage our AWS infrastructure, with an emphasis on configuration as code.
Keep our site and our services up and running, or get it back up and running quickly when a failure occurs
Improve monitoring and work with developers to improve performance and reliability
Participate in technical design reviews and architecture planning
Debugging complex problems across an entire stack and creating solid solutions
Collaborate with product managers and developers to evolve our delivery pipeline
Working closely with internal partners and teams to ensure that we ship software that meets security, SLA, performance, and budget requirements
Help build our on-call policies and runbooks
Take ownership of projects and demonstrate a high level of accountability
Manage our data infrastructure and pipeline
Focus on quality, cost-effective scalability, and distributed system reliability and establish automated mechanisms

Who You Are:

You are passionate about learning. Obstacles and challenges don’t deter you, you find these as opportunities to learn and grow.
You have a positive demeanor and a go-getter attitude!
You are a strong team player. You collaborate well with others, and want to work together to solve common goals.
You are proactive in seeking opportunities to learn and identifying opportunities to improve our processess.

What You’ll Need

5+ years of experience building and maintaining cloud infrastructure for distributed production systems
1+ year of experience as a backend engineer developing enterprise web applications
Excellent communication skills, both verbal and written
Know your way around a Unix/Linux shell, can write shell scripts, and understands Linux internals
Experience debugging complex problems
Experience designing, building, and operating large-scale production systems
Proficiency in Bash, Python, or other scripting languages
Experience in databases and data warehouses
Experience with security requirements for SOC2/ISO
FinOps experience
Strong Project Management skills
A strong desire to show ownership of problems you identify
Optional CKAD, CKS, CKA Exam, AWS Certified Exams

Technologies You’ll Need

Demonstrated experience with configuration and orchestration tools such as Terraform, CloudFormation and Ansible
Experience with containers, such as Docker
Experience with administering, securing, and optimizing Kubernetes clusters
Experience building monitoring, observability, logging, and developer tooling
Experience with Helm, Kustomize, ArgoCD, Grafana, Prometheus, Thanos, VictoriaMetrics, Cilium, Linkerd, Envoy, AWS App Mesh, CoreDNS
Experience creating CI/CD Pipelines for different coding languages
Experience with one or more: Ruby on Rails, Python, Java, Kotlin, Go, Node.js
Experience with version control systems like GitHub
Familiarity with AWS services, AWS best practices and securing AWS accounts
Experience operating and tuning data stores such as PostgreSQL and Elasticsearch
Experience with managing the infrastructure that backs data pipelines and data lakes such as Airflow
Experience managing streaming infrastructure such as Kafka or Kinesis

Why You’ll Love Working Here!

Work from anywhere!
Highly competitive compensation package, including equity
Comprehensive benefits, including up to 100% paid medical, dental, & vision insurance coverage for you & your loved ones
Open vacation policy, encouraging you to take the time you need
Monthly Mental Health Days and Mental Health Weeks twice per year
Ability to influence and drive key technical and architectural decisions
High visibility and impact across the whole company

Your base pay is one part of your total compensation package and is determined within a range. The base salary for this role is from $140,000.00 - $175,000.00 per year. We take into account numerous factors in deciding on compensation, such as experience, job-related skills, relevant education or training, and other business and organizational requirements. The salary range provided corresponds to the level at which this position has been defined.

Totango + Catalyst is an equal opportunity employer, meaning that we do not discriminate based on race, religion, national origin, gender identity, age, sexual orientation, or any other protected class. Diversity is more than just good intentions; we are committed to creating an inclusive environment for all employees

See more jobs at Catalyst

Apply for this job

+30d

Senior Application Integration Engineer

Torc RoboticsBlacksburg, VA; Remote, US

Bachelor's degree ● terraform ● sql ● Design ● ansible ● api ● git ● c++ ● jenkins ● python ● javascript ● PHP

Torc Robotics is hiring a Remote Senior Application Integration Engineer

About the Company

At Torc, we have always believed that autonomous vehicle technology will transform how we travel, move freight, and do business.

A leader in autonomous driving since 2007, Torc has spent over a decade commercializing our solutions with experienced partners. Now a part of the Daimler family, we are focused solely on developing software for automated trucks to transform how the world moves freight.

Join us and catapult your career with the company that helped pioneer autonomous technology, and the first AV software company with the vision to partner directly with a truck manufacturer.

As a member of Torc Robotics' Information Technology team the Application Integration Engineer designs, implements, and supports Torc’s enterprise-wide application ecosystem to deliver optimal performance, security, and value. This role applies a deep understanding of Application Programming Interfaces (APIs) and supports complex data flows between systems while prioritizing data integrity and security. The Integration Engineer defines and supports Torc IT’s automation goals using industry best-practices and methodologies for git, CI/CD, and orchestration. Partners with departments, teams, and users to provide solutions. Serves as the technical steward of the application ecosystem they support.

What you'll be doing:

Maintains knowledge of Torc’s core application offerings and their related API and system integrations; design, develop, test, implement, maintain, and support secure data flow between enterprise information systems and their related applications. Systems include HCM and ERP, CRM, and IAM
Assists in the evaluation, selection, and implementation of new software and systems that integrate with enterprise applications; reviews existing integrations and recommend optimizations that maximize data integrity, process efficiency, and user experience
Provides timely application and integration support as necessary, including responding to tickets, errors, alerts, and incidents; works with vendors and third parties as a liaison to implement, maintain, and support applications for our internal customers
Create and update policies and procedures to define security standards and minimum requirements of API integrations; develops and maintains automation scripts to streamline administrative tasks, reduce toil, and improve system efficiency
Support, secure, and standardize Torc IT’s automation tooling environment, including Ansible and GitHub; understand SDLC best practices and how it applies to applications, systems, and infrastructure
Collaborate with the Development Experience team to ensure IT practices align with company standards and requirements
Develop policy, procedure, standards, and guidelines for IT’s automation platforms; manages and reviews code repository structure, approvals, workflows, and audits
Provide training, direction, and support to the IT organization around how to consume and optimize automation scripting, version control, code review, testing, and scheduling
Lead and participate in application and automation projects, including migrations, upgrades, and new implementations
Write and maintain clear documentation, including diagrams, architecture design reviews, runbooks, test plans, policies, and procedures
Develop detailed project plans that deliver successful project outcomes and staying current on latest industry standards and trends regarding application integration and automation
Contribute, as needed, to other IT Enterprise Application team projects and goals
Support Torc IT’s software management system of application inventory, licensing, ownership, and compliance; collaborate, mentor, and train junior engineers and teammates

What you need to succeed:

Bachelor’s degree in Computer Science, Information Technology, Software Development, or other related field
5+ years working as an integration or automation engineer, or similar role in a high-tech environment; work experience equivalent in lieu of education
Proficiency in at least one software development language and familiarity with various other technologies such as: Python, Powershell, C#, PHP, JSON, JavaScript, RESTful APIs, SQL, XML, YAML, BASH, or Terraform
Experience developing and supporting custom integrations and/or using integration middleware such as Anypoint, Boomi, Jitterbit, or SnapLogic
Knowledge of a variety of databases and ETL (extract, transform, load) tools
Implementation and administration knowledge of orchestration tools such as Ansible
Experience with version control and CI/CD tooling such as GitHub, GitLab, and Jenkins
Strong working knowledge of SDLC (software development life cycle) best-practices and methodologies
Security first mindset with proven experience implementing centralized logging and auditing
Proven analytical and problem-solving abilities, especially including the ability to anticipate, identify, and solve critical incidents proactively
Strong interpersonal skills able to build effective relationships and work collaboratively across a diverse set of technical constituents
May travel occasionally (<10%) to Torc offices or their partner sites
Requires appropriate Personal Protective Equipment (PPE) in areas identified through hazard assessment and continuous technical education and training with a passion for knowledge in the field of study to maintain the highest level of knowledge, ingenuity, and creative thinking
Ability to be flexible on short notice and may work extended hours/weekends/evenings when project demands and ability to work and collaborate across locations over different time zones

Perks of Being a Full-time Torc’r

Torc cares about our team members and we strive to provide benefits and resources to support their health, work/life balance, and future. Our culture is collaborative, energetic, and team focused. Torc offers:

A competitive compensation package that includes a bonus component and stock options
100% paid medical, dental, and vision premiums for full-time employees
401K plan with a 6% employer match
Flexibility in schedule and generous paid vacation (available immediately after start date)
Company-wide holiday office closures
AD+D and Life Insurance

Hiring Range for Job Opening

US Pay Range

$114,400—$137,300 USD

At Torc, we’re committed to building a diverse and inclusive workplace. We celebrate the uniqueness of our Torc’rs and do not discriminate based on race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, veteran status, or disabilities.

Even if you don’t meet 100% of the qualifications listed for this opportunity, we encourage you to apply.

See more jobs at Torc Robotics

Apply for this job

+30d

Senior Software Engineer - Core Infrastructure

LambdaRemote (US & CAN)

DevOPS ● Lambda ● golang ● terraform ● Design ● ansible ● api ● c++ ● kubernetes ● linux ● python ● AWS

Lambda is hiring a Remote Senior Software Engineer - Core Infrastructure

Lambda's GPU cloud is used by deep learning engineers at Stanford, Berkeley, and Carnegie Mellon. Lambda's on-prem systems power research and engineering at Intel, Microsoft, Kaiser Permanente, major universities, and the Department of Defense.

If you'd like to build the world's best deep learning cloud, join us.

What You’ll Do

Design and implement scalable, secure, and highly available Kubernetes clusters to support our growing application portfolio
Bootstrap new on-prem and managed Kubernetes environments from the ground up, including networking, storage, and security configurations
Extend our existing Kubernetes platforms with advanced features such as service mesh, serverless frameworks, and custom resource definitions (CRDs)
Develop and maintain infrastructure-as-code (IaC) templates using Cluster API (CAPI) for automated cluster provisioning and configuration management
Implement robust monitoring, logging, and alerting solutions using OpenTelemetry to ensure platform health and performance
Optimize resource utilization and cost-effectiveness of Kubernetes deployments across multiple cloud providers
Collaborate with teams to design and implement CI/CD pipelines for containerized applications
Troubleshoot complex issues in production Kubernetes environments and lead incident response efforts
Stay up-to-date with the latest Kubernetes ecosystem developments and evaluate new technologies for potential adoption
Mentor junior engineers and contribute to the development of platform engineering best practices

You

Have 5+ years bootstrapping, extending and operating K8s at scale (1,500+ nodes)
Have 5+ years automating the provisioning, configuration management, and deployment of production systems
Have 5+ years building resilient, scalable systems with Python/Go
Have 5+ years managing and securing infrastructure at scale (2,000+ hosts)
Possess Sound experience with Infrastructure as Code (Terraform, Ansible, etc.)
Possess Sound knowledge of DevOps, Infrastructure, and Platform concepts
Possess Strong development skills in Python or Golang
Possess Strong proficiency with Linux command line and debugging tools

Nice to Have

Experience with building complex hybrid environments (AWS and on-premise preferred)
Experience with service mesh technologies (e.g., Istio, Linkerd) and serverless frameworks (e.g., Knative)
Experience with multi-cluster or multi-cloud Kubernetes deployments
Experience in the machine learning or computer hardware industry
Certified Kubernetes Administrator (CKA) and/or Certified Kubernetes Application Developer (CKAD) certification
Contributions to open-source Kubernetes projects or tools
Familiarity with GitOps principles and tools like ArgoCD or Flux

Salary Range Information

Based on market data and other factors, the salary range for this position is $153,000-$240,000. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.

About Lambda

We offer generous cash & equity compensation
Investors include Gradient Ventures, Google’s AI-focused venture fund
We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability
Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG
We have a wildly talented team of 300, and growing fast
Health, dental, and vision coverage for you and your dependents
Commuter/Work from home stipends for select roles
401k Plan with 2% company match
Flexible Paid Time Off Plan that we all actually use

A Final Note:

You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills.

Equal Opportunity Employer

Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

See more jobs at Lambda

Apply for this job

+30d

Senior Network DevOps Engineer

Live PersonHyderabad, Telangana, India (Remote)

DevOPS ● agile ● terraform ● Design ● ansible ● azure ● api ● git ● kubernetes ● linux ● jenkins ● python ● AWS

Live Person is hiring a Remote Senior Network DevOps Engineer

LivePerson (NASDAQ: LPSN) is the global leader in enterprise conversations. Hundreds of the world’s leading brands — including HSBC, Chipotle, and Virgin Media — use our award-winning Conversational Cloud platform to connect with millions of consumers. We power nearly a billion conversational interactions every month, providing a uniquely rich data set and safety tools to unlock the power of Conversational AI for better customer experiences.

At LivePerson, we foster an inclusive workplace culture that encourages meaningful connection, collaboration, and innovation. Everyone is invited to ask questions, actively seek new ways to achieve success, nd reach their full potential. We are continually looking for ways to improve our products and make things better. This means spotting opportunities, solving ambiguities, and seeking effective solutions to the problems our customers care about.

Overview:

Our global NetDevOps team is growing rapidly, requiring engineers to collaborate across US, EMEA, and APAC regions to support our datacenter and cloud environments. This team focuses on the stability and reliability of our global infrastructure leveraging existing standards, processes, and automation solutions. The NetDevOps Engineer will serve as a domain expert in networking technologies and the supporting both datacenter and cloud infrastructure.

You will:

Design, deploy, and manage Kubernetes clusters on the cloud (e.g., GCP) and on-prem to support containerized applications.
Implement best practices for monitoring, logging, and troubleshooting within Kubernetes.
Collaborate with the cloud team to provision, configure, and maintain cloud resources on GCP, ensuring optimal performance and cost efficiency.
Implement automation for resource provisioning and scaling using tools like Terraform and Helm.

Skills:

Strong working knowledge in configuring and troubleshooting routing protocols (BGP, OSPF, and static).
Extensive experience with data center and cloud based networking technologies and infrastructure (LAN, WAN, firewall, SDWAN, BGP, DNS, load balancing, VPN, etc)
Experience with Arista and Cisco configurations and maintenance.
Deep understanding of network protocols and services.
Extensive experience in linux environments and enterprise distros
Experience with software development and strong scripting skills.
Experience with Palo Alto firewall configurations and maintenance.
Experience with F5 LTM and AFM configurations and maintenance.
Experience with networking and securing kubernetes with Calico.
Experience with cloud technologies and IaC deployments.
Experience with GCP, AWS, Azure cloud environments. (Certifications preferred)
Experience with virtual and containerized deployments in both data center and cloud.
Experience with Kubernetes and GKE deployments and networking elements. (CNI, Itsio, Calico)
Experience with CI/CD pipeline components, support, functionality, and tools.
Experience with version control concepts and operations. (Git)
Experience with data formats XML, JSON, YAML and parsing with Python data structures.
Experience working within an Agile development environment
Experience with webhooks, API styles, HTTP Response codes, and authentication mechanisms.
Experience with Ansible deployments and creating ansible playbooks
Experience with Jenkins and parameterization.
Use of automation tools and modules (Rundeck/Puppet/Terraform)
Experience with Network Automation and Programmability Abstraction Layer with Multivendor (NAPALM) framework
Leverage model driven programmability within an Arista networking environment.
Experience with cloud infrastructure such as Compute, Network, Storage and Backup
Understand the need to organize code into methods, functions, classes, and modules
Experience with monitoring performance metrics and KPIs.

Additional requirements:

Collect feedback and requirements from design and technical staff
Create diagrams, business cases, and architectural designs documents.
Support on-call and weekend rotation as needed
Collaborate with cross functional teams.
Able to handle stressful situations with a level headed approach
Excellent verbal and writing skills (English)
Oncall and shift rotation (primarily between US and APAC hours)

Benefits:

Health: medical, dental, and vision
Time away: vacation and holidays
Development: Generous tuition reimbursement and access to internal professional development resources.
Equal opportunity employer
#LI-Remote

Why you’ll love working here:

As leaders in enterprise customer conversations, we celebrate diversity, empowering our team to forge impactful conversations globally. LivePerson is a place where uniqueness is embraced, growth is constant, and everyone is empowered to create their own success. And, we're very proud to have earned recognition from Fast Company, Newsweek, and BuiltIn for being a top innovative, beloved, and remote-friendly workplace.

Belonging at LivePerson:

We are proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants with criminal histories, consistent with applicable federal, state, and local law.

We are committed to the accessibility needs of applicants and employees. We provide reasonable accommodations to job applicants with physical or mental disabilities. Applicants with a disability who require reasonable accommodation for any part of the application or hiring process should inform their recruiting contact upon initial connection.

Apply for this job

+30d

Systems Reliability Engineer (SRE) - Edge

CloudflareHybrid or Remote

sql ● Design ● ansible ● docker ● postgresql ● linux ● python

Cloudflare is hiring a Remote Systems Reliability Engineer (SRE) - Edge

About Us

At Cloudflare, we are on a mission to help build a better Internet. Today the company runs one of the world’s largest networks that powers millions of websites and other Internet properties for customers ranging from individual bloggers to SMBs to Fortune 500 companies. Cloudflare protects and accelerates any Internet application online without adding hardware, installing software, or changing a line of code. Internet properties powered by Cloudflare all have web traffic routed through its intelligent global network, which gets smarter with every request. As a result, they see significant improvement in performance and a decrease in spam and other attacks. Cloudflare was named to Entrepreneur Magazine’s Top Company Cultures list and ranked among the World’s Most Innovative Companies by Fast Company.

We realize people do not fit into neat boxes. We are looking for curious and empathetic individuals who are committed to developing themselves and learning new skills, and we are ready to help you do that. We cannot complete our mission without building a diverse and inclusive team. We hire the best people based on an evaluation of their potential and support them throughout their time at Cloudflare. Come join us!

Available Locations:Lisbon or Remote Portugal; London or Remote UK, Munich or Remote Germany

About the Role

We are looking for talented Systems Reliability Engineers to build and operate our Edge platform running in more than 320 cities in over 120 countries. Our SREs come from diverse technical backgrounds and have built up their knowledge working in different environments, but common factors across all of our reliability-focused engineers include a passion for automation, scalability, and operational excellence. We support our services in a “follow the sun” model with offices in East Asia, Europe and North America.

This is a superb opportunity to join a high-performing team and scale our high-growth network as Cloudflare’s business grows. We live at the boundary between systems, network, and software, and love improving the glue that holds them together. Working with us, you will build tools to constantly improve service availability, performance, and operational velocity. You will nurture a passion for an “automate everything” approach that makes systems failure resistant and ready to scale.

SREs focus on the immediate state and functionality of the Cloudflare platform around the world, leveraging an array of monitoring, alerting and diagnostics tools while developing and enhancing the Cloudflare platform and its capabilities. We own a wide portfolio of applications and services, running a tight feedback loop of developer and operator patterns. The ideal SRE candidate has a passionate curiosity about how the Internet fundamentally works and has a strong knowledge of networking, Linux and TLS along with coding ability in Go or Python.

Requisite Skills

Aptitude for identifying problems, owning them and working with others to solve them
Linux systems experience
3 years experience in an SRE role or a role with similar functions
Software development skills in some programming language such as Go or Python
Understanding of distributed software systems and large scale system design tradeoffs
Intermediate experience of common network protocols like DNS and HTTP
Understanding of routing protocols and concepts such as BGP and IP anycast

Examples of desirable skills, knowledge and experience

Experience with the Linux kernel and Linux software packaging
Performance analysis and debugging
Configuration management systems such as Saltstack, Chef, Puppet or Ansible
Load balancing and reverse proxies such as Nginx, Varnish, HAProxy, Squid or Apache
SQL databases
Time series databases such as OpenTSDB, Graphite, Prometheus or Grafana
Key/Value stores

Bonus Points

Experience with continuous / rapid release engineering
Strong tooling and automation development experience
Experience working in a 24/7/365 service environment
Experience working with large scale production distributed systems
A history of contributing to Open Source Software

Some tools that we use

Nginx
PostgreSQL
Docker
Prometheus
Grafana
Consul
Nomad
Salt

What Makes Cloudflare Special?

We’re not just a highly ambitious, large-scale technology company. We’re a highly ambitious, large-scale technology company with a soul. Fundamental to our mission to help build a better Internet is protecting the free and open Internet.

Project Galileo: We equip politically and artistically important organizations and journalists with powerful tools to defend themselves against attacks that would otherwise censor their work, technology already used by Cloudflare’s enterprise customers--at no cost.

Athenian Project: We created Athenian Project to ensure that state and local governments have the highest level of protection and reliability for free, so that their constituents have access to election information and voter registration.

1.1.1.1: We released 1.1.1.1to help fix the foundation of the Internet by building a faster, more secure and privacy-centric public DNS resolver. This is available publicly for everyone to use - it is the first consumer-focused service Cloudflare has ever released. Here’s the deal - we don’t store client IP addresses never, ever. We will continue to abide by our privacy commitmentand ensure that no user data is sold to advertisers or used to target consumers.

Sound like something you’d like to be a part of? We’d love to hear from you!

This position may require access to information protected under U.S. export control laws, including the U.S. Export Administration Regulations. Please note that any offer of employment may be conditioned on your authorization to receive software or technology controlled under these U.S. export laws without sponsorship for an export license.

Cloudflare is proud to be an equal opportunity employer. We are committed to providing equal employment opportunity for all people and place great value in both diversity and inclusiveness. All qualified applicants will be considered for employment without regard to their, or any other person's, perceived or actual race, color, religion, sex, gender, gender identity, gender expression, sexual orientation, national origin, ancestry, citizenship, age, physical or mental disability, medical condition, family care status, or any other basis protected by law.We are an AA/Veterans/Disabled Employer.

Cloudflare provides reasonable accommodations to qualified individuals with disabilities. Please tell us if you require a reasonable accommodation to apply for a job. Examples of reasonable accommodations include, but are not limited to, changing the application process, providing documents in an alternate format, using a sign language interpreter, or using specialized equipment. If you require a reasonable accommodation to apply for a job, please contact us via e-mail athr@cloudflare.comor via mail at 101 Townsend St. San Francisco, CA 94107.

See more jobs at Cloudflare

Apply for this job

+30d

Linux VmWare Operations Engineer

Information International Associates, Inc.Alexandria, VA, Remote

oracle ● ansible ● UX ● linux

Information International Associates, Inc. is hiring a Remote Linux VmWare Operations Engineer

Job Description

Senior Linux VMWare Operations Engineer

KeyLogic is currently recruiting for a Senior Linux VMWare Operations Engineer and Deputy Team Lead to support our Federal Client in Alexandria, VA with a hybrid telework arrangement.

Description:

The Senior Linux VMWare Operations Engineer will support the client’s Infrastructure Services Division’s (ISD) Operating Systems Operations Section (OSOS) by performing administration and maintenance activities on over 8000 RHEL/HPUX/AIX servers in use. Additionally, the candidate will have a secondary role as the Deputy Team Lead regarding the day-to-day tasks and operations of the team. The candidate should be a self-starter and one not afraid to undertake and lead a project from beginning to end accompanied with broad technical exposure is the ideal candidate. The selected candidate will work the daytime shift Monday through Friday.

The servers supported are located in both production and lab data centers located at the client’s campus in Alexandria, VA. Additional remote support is provided for systems located at the Federal Client’s Alternate Processing Site (APS) located in Manassas, VA.

Duties performed will include, but are not limited to, the following:

· Provide escalation support from subordinates and junior resources, including on-call rotation support.

· Ensure proper planning and execution of major projects and O&M (operations and maintenance) activities.

· Mentor and direct junior staff in the course of daily assignments and projects to promote a collaborative learning environment

· Troubleshoot hardware, Operating System, and software problems with Linux and VMWare servers.

· Develop and maintain installation and configuration procedures for server builds, configurations, and scheduled maintenance activities.

· Provide suggestions and best practices for various activities and communicate them to stakeholders at various technical levels

· Install and configure ESXi hypervisors, vCenter servers, create data centers, clusters, add hosts, and configure their firewall, services and advanced settings.

· Setup and configure virtual switches, port groups, and VLANs in VMWare

· Configure HA, DRS and setup affinity rules on each VMWare cluster as needed.

· Perform storage expansion, migration, and reclamations on block storage from SAN, familiarity with boot-from-SAN on RHEL is preferred.

· Perform cyber-security remediation and server hardening as needed.

· Write custom shell scripts to poll server inventory for health and configuration data within the environment.

· Work on assigned change requests/incident tickets.

· Investigate and responding to alerts generated by the various monitoring systems in use at USPTO.

· Evaluate and implementing potential new tools and technologies.

· Provide vCenter permissions to users as well as creating folders, placing VMs under folders to provide access rights to certain users\groups to be able to manage their VMs.

· Monitor, maintain, and troubleshoot all issues that might arise in the VMware virtual environment.

· Install and troubleshoot physical and virtual server’s performance and connectivity issues.

· Patch and update hypervisor’s baseline using Update Manager.

· Perform Cisco UCS service profile migration.

· Maintain documentation for processes and procedures as required.

· Perform detailed analysis of incidents - utilize log management tools and performance data to author and submit RCA (Root Cause Analysis) reports following service outages

· Perform hardware repair procedures and activities.

Work Experience/Skills Requirements

The successful candidate will have experience in the following areas:

· 7+ years of experience with Red Hat Enterprise Linux/CentOS

· 7+ years of experience with VMWare vCenter and ESXi hypervisors

· 5+ years of experience supporting Tomcat, Apache, JBoss, Oracle, and MySQL.

· Knowledge of Red Hat Virtualization (RHV) or oVirt

· Knowledge of Cisco UCS and managing systems via UCSCentral

· Ability to write shell scripts in bash.

· Ability to write custom Ansible playbooks and run them across the environment

· RHCE Certification is highly recommended

· VCP Certification is highly recommended

· Experience in supervision of teams and personnel

The ideal candidate also has experience in the following areas:

· Rocky Linux (or other Open Source Enterprise Linux)

· Red Hat Satellite 6 or Katello

· Red Hat Identity Management (IDM) or FreeIPA

· Foreman

· Puppet

· Powershell/PowerCLI

· HP-UX

· IBM AIX

· Windows Server

A Bachelor’s Degree is strongly preferred.

Clearance Requirements:

Must be a U.S. Citizen and able to hold a security clearance. You do not need a current/active clearance to apply, but must be able to pass a government Public Trust (SF-85) background investigation.

We are proud to be an EEO/AA employer M/F/D/V. We maintain a drug-free workplace and perform pre-employment substance abuse testing.

Qualifications

Work Experience/Skills Requirements

The successful candidate will have experience in the following areas:

· 7+ years of experience with Red Hat Enterprise Linux/CentOS

· 7+ years of experience with VMWare vCenter and ESXi hypervisors

· 5+ years of experience supporting Tomcat, Apache, JBoss, Oracle, and MySQL.

· Knowledge of Red Hat Virtualization (RHV) or oVirt

· Knowledge of Cisco UCS and managing systems via UCSCentral

· Ability to write shell scripts in bash.

· Ability to write custom Ansible playbooks and run them across the environment

· RHCE Certification is highly recommended

· VCP Certification is highly recommended

· Experience in supervision of teams and personnel

The ideal candidate also has experience in the following areas:

· Rocky Linux (or other Open Source Enterprise Linux)

· Red Hat Satellite 6 or Katello

· Red Hat Identity Management (IDM) or FreeIPA

· Foreman

· Puppet

· Powershell/PowerCLI

· HP-UX

· IBM AIX

· Windows Server

A Bachelor’s Degree is strongly preferred.

See more jobs at Information International Associates, Inc.

Apply for this job

+30d

Windows Support Technician (End User Support Specialist II)

Information International Associates, Inc.Alexandria, VA, Remote

DevOPS ● agile ● jira ● terraform ● Design ● ansible ● azure ● UX ● docker ● kubernetes ● jenkins ● AWS

Information International Associates, Inc. is hiring a Remote Windows Support Technician (End User Support Specialist II)

Job Description

Platform Automation Support Specialist

Job Description or Summary:

KeyLogic is seeking Platform Services Automation Specialist with strong systems, software, and Agile experience to support our program at the USPTO.

Job Duties:

As a DevOps Platform Engineer, you will be working closely with our Automation teams to develop USPTO's Platform Services environment. This is a Full-Time position and work location will be at the KeyLogic's office in Alexandria, VA.

Job Requirements or Skills Required:

Engineer and deploy hybrid-cloud solutions for enterprise environments by leveraging Configuration management tools such as Puppet and IaC tools such as Ansible and Terraform

Design and implementation of automated infrastructure in on-prem and cloud environments.

Design and implementation of CI/CD, testing and operations infrastructure on-premise and in cloud

Required Skills:

5+ years of hands-on experience in Linux/Unix, HP-UX, and Windows server administration

3+ years of hands-on experience developing Puppet modules for platform products.

2+ years of hands-on experience with DevOps using Terraform, Ansible, etc.

2+ years of hands-on experience working with containers, and container orchestration technologies such as Kubernetes, Docker, etc.

2+ years of hands-on experience with CI/CD tools, such as GitHub, Jenkins, Jenkins Pipeline, Maven, and Nexus

At least entry-level certification in at least one of the three major CSPs (AWS, Google, or Azure)

Experience with automation/orchestration platforms and tools such as Red Hat OpenShift, Red Hat CloudForms, Puppet, Chef and Ansible

Experience working with defining, configuring, and building CI/CD pipelines using Jenkins, GitHub actions and other automation techniques.

Experience working within an Agile Environment and working with Agile tools such as JIRA and Rally

Excellent written and verbal communication skills

Education Requirements:

Bachelor’s in computer science or related field