ansible Remote Jobs

248 Results

+30d

Systems Engineer

Hack TheAlimos,Attica,Greece, Remote Hybrid
sqlmobileansiblegitrubyMySQLlinuxpython

Hack The is hiring a Remote Systems Engineer

Ready to embark on the quest of joining Hack The Box?

At the end of this thrilling journey, you'll become a proud member of Hack The Box, with the ultimate mission to help redefine cybersecurity expertise. Get ready for an exciting adventure into the world of cybersecurity! ????????????

✨The core mission of the Systems Engineer:

Our product is developing rapidly, which is why we are on the lookout for an experienced System Engineer to ensure a smooth operation of our infrastructure and a great experience for our community of Hack The Box gamers. 

Your role will be, firstly to understand the scope of the team's activities and then gradually undertake ownership of the certain responsibilities. Finally, as a member of Hack The Box team you’ll be able to develop and sharpen your cyber security knowledge and skills, having unlimited access to all public content. 

We're all hackers at heart, and as such a hacker mentality will enable you to contribute and integrate more efficiently. A slew of certifications or university degrees are not what we're looking for - what we're looking for is passion.

???? The fellowship you’ll be joining:

As a part of Hack The Box, you will work with a group of talented and passionate people. Reporting to the relevant Team Lead within the department, you will be part of the greater Infrastructure team. The team consists of Data Engineers, SREs and, last but not least, System Engineers. 

⚔️Technology tools & weapons you’ll be using:

  • Ansible, vSphere, vCenter, Linux, scripting languages, SQL

???? Interesting resources you should check:

???? The adventures that await you after becoming Systems Engineer at Hack The Box:

  • Monitor and maintain installed systems and infrastructure within the team scope
  • Proactively ensure the highest levels of systems and infrastructure availability
  • Work closely with content developers to support them from an infrastructure perspective
  • Troubleshoot complex problems on critical infrastructure & production networks 
  • Maintain proper documentation on Git 
  • Research on new technologies to improve the quality of our systems

????Skills, knowledge, and experience points required to unlock the role of Systems Engineer at Hack The Box:

  • Strong experience on VMware ecosystem (mainly vSphere, vCenter) 
  • Strong experience on Linux OS and some on Microsoft
  • Previous experience with systems monitoring ( e.g., zabbix)
  • Good scripting skills (e.g., Ansible, powershell, shell scripts, Perl, Ruby, Python, Go)
  • Experience with network troubleshooting
  • Experience in relational databases (ideally MySQL) and ability to conduct solid SQL queries
  • Good understanding of security compliance principles (legal implications, GDPR)
  • Strong written and spoken English

????️ What your Hack The Box adventure will have in store:

  • ????You'll have the exhilarating opportunity to contribute to a product that is highly appreciated by users and the cybersecurity community at large.
  • ???? You'll experience a highly supportive and caring environment, fostering growth, flexibility, and autonomy.
  • ???? You'll embark on an exciting journey of continuous learning and problem-solving, leveling up as our organization grows.
  • ???? Most importantly, you'll have a blast at HTB ???? because fun is an essential ingredient in our recipe for success! Just wait until you see our global meet-ups!

???? The gems you’ll be enjoying as Systems Engineer

  • Private insurance
  • 25 annual leave days
  • Dedicated budget for training and professional development, participation in conferences
  • State-of-the-art equipment (Macbook, iPhone, and mobile plan)
  • Free lunch & snacks at the office
  • Full access to the Hack The Box lab offerings; so you can learn how to hack
  • Flexible/Hybrid working

????️ The Quest of Becoming Hack The Box’s Systems Engineer:

  • Level 1: Like in any game, you start as a Noob. Level one’s objective: submit your application.
  • Level 2: After applying, you unlock the Script Kiddie rank! This level’s objective: pass the screening process.
  • Level 3: Now you’re officially ranked as Hacker and you’re ready to meet the Talent Acquisition team. Level’s objective: highlight your past achievements, ambitions, and values.
  • Level 4: As a Pro-Hacker at level 4, you’ll unlock the “boss level”, which involves meeting the hiring manager. Level’s objective: connect with the hiring manager and share with them your achievements.
  • Level 5: Now you’re an Elite Hacker! Level’s objective: complete an assignment that aligns with day-to-day job-related tasks and responsibilities.
  • Level 6: Congratulations, you're now a Guru! Not many reach this level ????. Level’s objective: have a constructive, final conversation with senior leadership to explore the role and your future at HTB.
  • Level 7: You've achieved the Omniscient rank and officially received an offer from HTB! To complete the last level and the Quest, all you need to do is accept the offer.
  • Quest complete. Congratulations, you’re officially one of us ????????????Your next quest: complete the onboarding.

Hack Your Career, Today. Join us in this epic adventure of cybersecurity at Hack The Box! ????????????

At Hack The Box, we are on a quest to find the most exceptional and enthusiastic talent to join our team. Whether or not you consider yourself a gamer, we value what makes you unique and want to know more about you. This job post provides just a glimpse of the incredible gamified experience our business and consumer customers enjoy through our platforms. So, if you're ready to embark on a journey of growth and adventure, we can't wait to meet you!

About HTB:

Hack The Box is a leading gamified cybersecurity upskilling, certification, and talent assessment platform enabling individuals, businesses, government institutions, and universities to sharpen their offensive and defensive security expertise.

Launched in 2017, Hack The Box brings together the largest global cybersecurity community of more than 2m platform members and is on a mission to create and connect cyber-ready humans and organizations through highly engaging hacking experiences that cultivate out-of-the-box thinking. Offering a fully guided and exploratory skills development environment, Hack The Box is the ideal solution for cybersecurity professionals and organizations to continuously enhance their cyber-attack readiness by improving their red, blue, and purple team capabilities.

Rapidly growing its international footprint and reach, Hack The Box is headquartered in the UK, with additional offices in Greece and the US.

???? Exciting News:

  • We are super proud to share that HTB’s all three entities across the UK, US, and Greece have been Certified as a Great Place to Work(Oct 2023-Oct 2024). 
  • Furthermore, the HTB's Greek entity has been listed by the Great Place to Work Insitute as the#4 Best Workplacein Greece and #7 in Europe for 2023, among more than 3,300 companies???? 
  • Get more insights about our HTB culture and employee experience by visiting our career site and Glassdoor.

At Hack The Box, we are committed to fostering a diverse, inclusive, and equitable workplace. We believe that diversity enriches our performance, services, and the communities we serve. As such, we ensure that all job applications are considered solely based on merit, skills, and qualifications. We do not discriminate on grounds of race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status. We are dedicated to providing a fair and respectful work environment that reflects our values.

See more jobs at Hack The

Apply for this job

+30d

Consultant Devosp senior H/F

Business & DecisionParis, France, Remote
DevOPSansiblegitpython

Business & Decision is hiring a Remote Consultant Devosp senior H/F

Description du poste

•           Expertise de la suite Elastic et Elastic Cloud Enterprise

•           Bonnes connaissances des outils de fabrication, Git, Gitlab, Gitlab CI, Ansible

•           Bonnes connaissances dans le développement Python

•           Bonnes connaissances de la culture DevOps et des pratiques SRE

•           Bonnes connaissances système et réseaux

•           Maîtrise de l’anglais professionnel

Qualifications

-           Curieux et force de propositions

-           Rigueur, sens de la méthode ;

-           Capacités relationnelles et travail en équipe ;

-           Disponibilité et réactivité ;

-           Analyse et résolution d’incidents ;

-           Sens du service et respect des engagements

-           Bonne communication

See more jobs at Business & Decision

Apply for this job

+30d

Technical Enablement Architect, Network Services

SalesMaster’s DegreeBachelor's degree3 years of experienceDesignansibleazurec++pythonAWS

Cloudflare is hiring a Remote Technical Enablement Architect, Network Services

About Us

At Cloudflare, we are on a mission to help build a better Internet. Today the company runs one of the world’s largest networks that powers millions of websites and other Internet properties for customers ranging from individual bloggers to SMBs to Fortune 500 companies. Cloudflare protects and accelerates any Internet application online without adding hardware, installing software, or changing a line of code. Internet properties powered by Cloudflare all have web traffic routed through its intelligent global network, which gets smarter with every request. As a result, they see significant improvement in performance and a decrease in spam and other attacks. Cloudflare was named to Entrepreneur Magazine’s Top Company Cultures list and ranked among the World’s Most Innovative Companies by Fast Company. 

We realize people do not fit into neat boxes. We are looking for curious and empathetic individuals who are committed to developing themselves and learning new skills, and we are ready to help you do that. We cannot complete our mission without building a diverse and inclusive team. We hire the best people based on an evaluation of their potential and support them throughout their time at Cloudflare. Come join us! 

Overview

We are seeking a highly motivated, experienced, and self-driven Technical Enablement Architect to design, develop, and deliver training content on Cloudflare Network Services products and solutions. This role is critical in enabling our stakeholders, including Sales, Pre-Sales, Customer Success, Professional Services, Partners, and Customers.

Reporting to the Senior Manager of Technical & Partner Enablement on the Sales Enablement team, the successful candidate will collaborate closely with product technical marketing, product management, and the specialist organization to empower global field teams. This position is part of an experienced global team of technical enablement architects, responsible for creating and delivering technical training for all Cloudflare products and solutions.

As the Technical Enablement Architect (TEA) for Network Services, you will be an integral part of the Technical Enablement team within the Sales Enablement organization. Your responsibilities will include:

  • Technical Mastery: Maintain a deep technical understanding of Cloudflare Network Services solutions, including new product updates and industry trends.
  • Scalable Solutions: Architect scalable enablement solutions such as labs, demos, apps, and virtual classrooms to benefit our audience.
  • Leadership: Display high standards of leadership by providing subject matter expertise within the Network Services domain.
  • Technical Conversations: Lead technical discussions and drive subject matter expertise within the Network Services product line.
  • Ongoing Enablement: Support technical enablement activities for new hires, developing skills for effective demonstrations, proof of concepts, and product deployments related to Network Services products.
  • Training Materials: Design, develop, update, and implement objective-based training materials for Cloudflare Network Services products and solutions.
  • Training Modules: Develop and update training modules with clear learning paths aligned to Sales Plays and Sales Stages for Pre-Sales and Post-Sales audiences.
  • Hands-On Solutions: Curate enablement content by developing scalable hands-on educational solutions within our sandbox environment (e.g., labs, demos, apps, tools, virtual classrooms).
  • Stakeholder Engagement: Participate in stakeholder interlock meetings, such as NPIs with product management, technical marketing, specialist teams, and Pre-Sales and Post-Sales leaders to review a rolling 4-quarter enablement plan for feedback, refinement, and augmentation.
  • Field Readiness: Drive field readiness for product improvements and future release capabilities by collaborating with product management and technical marketing teams.
  • Strategy Ownership: Own and drive the Enablement strategy and educational offerings for Network Services, aligning with the Go-to-Market strategy.
  • Cross-Functional Collaboration: Navigate across multiple roles within the Enablement ecosystem and stakeholders to develop scalable solutions.

This position offers a unique opportunity to contribute to the growth and success of Cloudflare by enabling our teams with the knowledge and tools they need to excel in their roles.

Locations: Remote US, Remote Canada

About the Team

Do you thrive on creating innovative solutions?  Do you value new capabilities?  Can you help us develop and deliver technical educational programs that are engaging and hands-on to enable our teams?

The Technical Enablement team within the Enablement organization is focused on increasing and improving Cloudflare’s technical product educational offerings for our Sales, Pre-Sales and Post Sales, Partner and Customer organization by advancing our capabilities through state-of-the-art hands-on learning environments such as labs, demos, apps and virtual classrooms. By focusing our educational offerings on product, competition, market, architectural landscapes, and certifications we will help elevate the stakeholders to the next generation of technical value selling and consulting experts. By elevating our training offerings, we will enable our internal customers to use Cloudflare for their cloud strategies while creating customers for life.

Key activities of the team include but are not limited to:

  • Growing our educational product line offerings and revenue
  • Architecting hands on solution across Cloudflare’s key Product lines
  • Driving quality and collaboration in our Product Communities and SME program
  • Developing and maintaining a Technical Enablement sandbox environment to create scalable labs, demos, apps, tool and virtual classrooms
  • Delivering Competitive, Market, Architectural landscape, Certification programs by Product Lines

Qualifications:

Required Education and Experience

Applicants must meet one of the following education and experience requirements: 

  • 5-10 years of relevant experience in the fields of Pre-Sales, Post-Sales, Enablement, Technical Marketing, or Product Marketing
  • A Bachelor’s or a Master’s degree or its equivalent in a technical domain
  • Strong Network Security domain knowledge
  • Minimum 5 years of overall experience in the IT industry.
  • Minimum 3 years of experience in technical pre-sales/sales engineering, consulting, and/or other customer facing role consulting on, delivering technical products or services.

Required Skills

Technical Skills: Strong technical knowledge in network security and proven ability to establish credibility with product management, technical pre-sales, and customers.

    • Proficiency in network topologies, protocols, and technologies (e.g., Ethernet, MPLS, BGP, OSPF, VPN)
    • Familiarity with cloud service providers (e.g., AWS, Azure, Google Cloud) and their networking components.
    • Knowledge of hybrid cloud environments and cloud-native networking services.
    • Expertise in network security principles and best practices.
    • Experience with firewalls, intrusion detection/prevention systems (IDS/IPS), and secure access controls.
    • In-depth knowledge of advanced routing and switching technologies and protocols.
    • Proficiency in designing and implementing load balancing solutions.
    • Experience with traffic management and optimization techniques.
    • Familiarity with network automation tools and scripting languages (e.g., Python, Ansible).
    • Proficiency in network monitoring and diagnostic tools.
    • Strong problem-solving skills and the ability to troubleshoot complex network issues.

Soft Skills: 

    • Excellent verbal and written communication skills for effectively conveying complex technical concepts to stakeholders.
    • Ability to work collaboratively with cross-functional teams, including IT, Product Management, and Product Engineering business units.
    • Ability to align network strategies with overall business goals and objectives.
    • Flexibility to adapt to rapidly changing technologies and industry trends.

Desired Skills

  • Experience with Network Security or Enterprise solutions
  • Experience with Technology Services Learning and Development organizations or projects
  • Self-Starter and interpersonal skills, such as time management, team leadership and managing conflict
  • Experience with professional development – presentation skills, pitching technical solutions, technical negotiation, and consultant skills
  • Proven track record of strong technical solutions selling and ability to think creatively
  • Intellectual curiosity with the desire and ability to understand complex technical concepts
  • Situational fluency, ability to influence and motivate others, and perseverance to handle challenging business situations
  • Able to set priorities and maneuver in a corporate environment with a strong sense of urgency

 

Certifications Preferred

  • Cisco Certified Internetwork Expert (CCIE)
  • Certified Information Systems Security Professional (CISSP)
  • Amazon Web Services (AWS) Certified Solutions Architect
  • Microsoft Certified: Azure Solutions Architect Expert
  • Google Professional Cloud Network Engineer

 

What Makes Cloudflare Special?

We’re not just a highly ambitious, large-scale technology company. We’re a highly ambitious, large-scale technology company with a soul. Fundamental to our mission to help build a better Internet is protecting the free and open Internet.

Project Galileo: We equip politically and artistically important organizations and journalists with powerful tools to defend themselves against attacks that would otherwise censor their work, technology already used by Cloudflare’s enterprise customers--at no cost.

Athenian Project: We created Athenian Project to ensure that state and local governments have the highest level of protection and reliability for free, so that their constituents have access to election information and voter registration.

1.1.1.1: We released 1.1.1.1to help fix the foundation of the Internet by building a faster, more secure and privacy-centric public DNS resolver. This is available publicly for everyone to use - it is the first consumer-focused service Cloudflare has ever released. Here’s the deal - we don’t store client IP addresses never, ever. We will continue to abide by our privacy commitmentand ensure that no user data is sold to advertisers or used to target consumers.

Sound like something you’d like to be a part of? We’d love to hear from you!

This position may require access to information protected under U.S. export control laws, including the U.S. Export Administration Regulations. Please note that any offer of employment may be conditioned on your authorization to receive software or technology controlled under these U.S. export laws without sponsorship for an export license.

Cloudflare is proud to be an equal opportunity employer.  We are committed to providing equal employment opportunity for all people and place great value in both diversity and inclusiveness.  All qualified applicants will be considered for employment without regard to their, or any other person's, perceived or actual race, color, religion, sex, gender, gender identity, gender expression, sexual orientation, national origin, ancestry, citizenship, age, physical or mental disability, medical condition, family care status, or any other basis protected by law.We are an AA/Veterans/Disabled Employer.

Cloudflare provides reasonable accommodations to qualified individuals with disabilities.  Please tell us if you require a reasonable accommodation to apply for a job. Examples of reasonable accommodations include, but are not limited to, changing the application process, providing documents in an alternate format, using a sign language interpreter, or using specialized equipment.  If you require a reasonable accommodation to apply for a job, please contact us via e-mail athr@cloudflare.comor via mail at 101 Townsend St. San Francisco, CA 94107.

See more jobs at Cloudflare

Apply for this job

+30d

Senior Postgres Database Engineer

terraformpostgressqlDesignmobileansibleazuregitc++postgresql

Signify Health is hiring a Remote Senior Postgres Database Engineer

How will this role have an impact?

As a Senior Database Reliability Engineer specializing in PostgreSQL, you will play a critical role in designing, managing, and optimizing our cloud based PostgreSQL database infrastructure. You will ensure the reliability, availability, and performance of our database systems while fostering a collaborative and innovative environment. This position requires deep operational expertise in PostgreSQL, experience with Azure, and a strategic mindset to drive database solutions that align with our commitment to reliability and sustainability.

Key Responsibilities:

  • Database Management: Design, implement, and maintain PostgreSQL database systems, primarily managed instances in cloud environments (Azure), to ensure high availability, performance, and reliability.
  • Performance Tuning: Conduct performance tuning and optimization of database queries, indexes, and configurations to enhance efficiency.
  • Backup and Recovery: Develop and manage robust backup and recovery strategies, ensuring data integrity and availability in case of failures.
  • Monitoring and Troubleshooting: Implement and maintain monitoring solutions using tools such as Redgate, pganalyze, and New Relic to proactively identify and resolve database issues, ensuring minimal downtime and optimal performance.
  • Security: Implement and maintain database security measures, including user management, encryption, data masking and access controls, to ensure compliance with healthcare regulations.
  • Collaboration: Work closely with cross functional teams to support application development and deployment processes.
  • Documentation: Maintain comprehensive documentation of database configurations, procedures, scripts and best practices.
  • Standards and Procedures: Provide guidance in the creation and modification of standards and procedures including scripts for automation and reporting for adherence to standards.
  • On-call Support: Participate in on-call rotations as an opportunity to enhance the reliability and performance of our database systems, ensuring minimal disruptions and optimal performance without frequent interruptions.

What You’ll Need (Required Qualifications):

  • Experience:
    • Minimum of 10 years of operational experience at scale in database management with at least 6 years focused on PostgreSQL.
    • Extensive experience managing PostgreSQL databases in cloud environments, particularly Azure. Experience with GCP is a plus but not required.
    • Hands-on experience with Continuous Integration/Continuous Deployment pipelines, Database/Schema as Code tools, and Git workflows.
    • Expert level experience analyzing and tuning queries to improve application performance.
    • Experience with database recovery, including point-in-time recovery and transaction wraparound process remediation.
    • Experience with management of Postgres Vacuuming process, able to design and implement processes and standards to prevent production impacting Vacuuming events.
    • Experience with PostgreSQL upgrade processes.
  • Technical Skills:
    • Advanced knowledge of PostgreSQL architecture, performance tuning, and query optimization experience with Azure Query Performance Insight is a plus.
    • Advanced knowledge of operating databases at large scale in cloud-managed environments including optimizing for cpu, memory,IO usage and cost management.
    • Proficiency in automation tools such as Terraform, Ansible, and Flyway.
    • Strong understanding of security practices and compliance frameworks.
    • Extensive knowledge of PostgreSQL internals, index design, statistics and wait types.
  • Soft Skills:
    • Excellent written and verbal communication skills with the ability to convey complex technical concepts to both technical and non-technical stakeholders.
    • Strategic thinker with the ability to drive long-term database solutions aligned with business goals.
    • Collaborative and team-oriented, contributes ideas, suggestions, and effort to the group.
    • Ability to work effectively in a collaborative team environment or independently.

The base salary hiring range for this position is $108,900 to $189,700. Compensation offered will be determined by factors such as location, level, job-related knowledge, skills, and experience. Certain roles may be eligible for incentive compensation, equity, and benefits.
In addition to your compensation, enjoy the rewards of an organization that puts our heart into caring for our colleagues and our communities.  Eligible employees may enroll in a full range of medical, dental, and vision benefits, 401(k) retirement savings plan, and an Employee Stock Purchase Plan.  We also offer education assistance, free development courses, paid time off programs, paid holidays, a CVS store discount, and discount programs with participating partners.  

About Us:

Signify Health is helping build the healthcare system we all want to experience by transforming the home into the healthcare hub. We coordinate care holistically across individuals’ clinical, social, and behavioral needs so they can enjoy more healthy days at home. By building strong connections to primary care providers and community resources, we’re able to close critical care and social gaps, as well as manage risk for individuals who need help the most. This leads to better outcomes and a better experience for everyone involved.

Our high-performance networks are powered by more than 9,000 mobile doctors and nurses covering every county in the U.S., 3,500 healthcare providers and facilities in value-based arrangements, and hundreds of community-based organizations. Signify’s intelligent technology and decision-support services enable these resources to radically simplify care coordination for more than 1.5 million individuals each year while helping payers and providers more effectively implement value-based care programs.

To learn more about how we’re driving outcomes and making healthcare work better, please visit us at www.signifyhealth.com

Diversity and Inclusion are core values at Signify Health, and fostering a workplace culture reflective of that is critical to our continued success as an organization.

We are committed to equal employment opportunities for employees and job applicants in compliance with applicable law and to an environment where employees are valued for their differences.

See more jobs at Signify Health

Apply for this job

+30d

Director of Cloud Operations (US)

NexthinkBoston, MA, Remote
agileterraformansibledockerkubernetesjenkins

Nexthink is hiring a Remote Director of Cloud Operations (US)

Job Description

Nexthink is looking for a Director of Cloud / SRE Operations who is passionate about building and running a high-performance cloud platform and SRE and operations. This role will support US-based operations generally but will, in addition, focus on enabling Nexthink to deliver to the US Public Sector market, in particular a FedRAMP moderate offering. The candidate will drive the development of modern, cloud-native SRE processes and the management and operations for Nexthink’s multi-tenant, microservices-based cloud platform. The platform has multiple instances deployed across the globe.

You will also work closely with engineering to mature our CI/CD pipeline and ensure high-quality product releases. You should have demonstrated robust technical and organizational skills in managing large cloud platform engineering and operations for a SaaS product company.

This role reports to the Senior Vice President (SVP) of Cloud and Architecture. The person will collaborate with teams in Engineering, Security, Support, Product Management, and Sales. The main task is to lead the development and launch of our new digital platform for employees.
 

Key Responsibilities

You will bring a solid SRE mindset to your role and drive the adoption of SRE industry best practices. You will also bring a background in operations in a Security and compliance-centric delivery model, particularly for the US Public Sector market.

The Director of Operations responsibilities include but are not limited to:

  • In charge of all operations and SRE functions within the US organization, including incident response and forward-thinking monitoring.
  • Own and drive compliance and evidence-gathering activities for regulated deployments such as FedRAMP Moderate and equivalent environments.
  • Drive capacity forecasting and change management processes.
  • Automation for delivery and operations of platform services using infrastructure-as-code and monitoring-as-code.
  • Tasked with building and managing service availability, performance, and scalability in production environments to enable business-defined SLAs.
  • Collaborate with the development organization to manage micro-services at scale on the platform.
  • Set clear SLOs to meet or exceed our SLAs. Ensure our systems are always operational. Create alert systems to foresee potential issues. Monitor our dashboards. Prepare playbooks to address any anticipated problems.
  • Collaborate with application and business stakeholders to ensure a high-quality product is developed and deployed in production.
  • Work closely with the architecture and security teams to define and implement enterprise-grade practices.
  • Recruit, manage, and inspire a proficient cloud engineering and SRE team.

Qualifications

  • Degree in Computer Science or Engineering or equivalent professional experience
  • 10+ years’ in cloud operations engineering leadership roles in SaaS companies
  • 5+ years in a senior management/leadership role, leading large SRE and Cloud Operations teams
  • Experience operating workloads in a secured, highly regulated environment such as FedRAMP
  • Deep understanding and experience working with one of the three major Cloud Service Providers running native cloud technologies based on Docker, Kubernetes, Istio, Kafka at scale
  • Experience working with modern CI/CD and automation tools such as Jenkins, Ansible, Terraform, etc.
  • Experience building, scaling & monitoring infrastructure needed for SaaS-based application and services. Experience with APM and Infrastructure monitoring tools such as Datadog, NewRelic, SumoLogic, Splunk, Dynatrace, etc.
  • Managed on-call 24x7 rotation teams, to serve global customers
  • Experience creating a strong and passionate customer-focused SRE-driven operations culture
  • Excellent interpersonal and communication skills
  • Knowledge of lean and agile software engineering best practices
  • Excellent communication skills in English

See more jobs at Nexthink

Apply for this job

+30d

Kubernetes Admin

AristaBengaluru, India, Remote
DesignansiblemetalelasticsearchMySQLkuberneteslinuxjenkinspythonjavascript

Arista is hiring a Remote Kubernetes Admin

Job Description

Who You’ll Work With

Working in the Engineering Productivity (EngProd) group, you will collaborate and work with other engineers to design, build, scale, and operate the systems that the rest of Arista’s development teams use.  The EngProd team uses industry-standard systems like Ansible, Jenkins, Kubernetes, Grafana, Spinnaker, MySQL, ElasticSearch, Google Cloud, and Varnish and also internal systems that we’ve built from the ground-up to automate CI/CD, testing, analysis, and visualization.

What You’ll Do

Arista Networks is looking for world-class Kubernetes-aware engineers passionate about driving systems reliability and scalability to provide the best possible development experience for our 1400+ person engineering team. You will be part of a fast paced, high caliber team building the internal systems and infrastructure used to build the routing and switching products driving the industry's largest data center networks.

Arista’s Software Engineering team runs at a scale rarely found - TBs of source control, 60GB work trees with 1000s of developer branches in flight at any given time, over 400K daily build/test jobs and over 150 homegrown and cloud native services running on a 100 node on-prem bare metal kubernetes cluster.  Operating these systems takes vigilance, responsiveness to alerts, and a steady stream of updates and bug fixes to keep things running smoothly and efficiently as well as to increase our ability to monitor, understand and visualize them. The role will cover all aspects of our Kubernetes infrastructure, and may include monitoring, responding to, and enhancing alerts, working to unify and standardize our alerts, fine tuning code for scalability and performance, debugging problems, simplifying and securing developer experience with k8s etc. You will own your projects from definition to deployment, developer and vendor interactions, and you will be responsible for the quality of everything you deliver.

Responsibilities:

  • Work with existing k8s admin team to own different aspects of managing a production k8s cluster (eg: upgrades, monitoring, capacity planning, security, developer experience etc)
  • Proactively monitor, respond to, and enhance alerts and set up automated alert handling where applicable
  • Create and maintain the incident response runbooks working with the service dev teams
  • Debug and resolve issues impacting developer user experience and infrastructure stability around the k8s platform
  • Adopt current best practices in k8s cluster management. Evaluate and adopt OSS projects that simplify k8s cluster management. 
  • Set up guidelines and paved paths for service dev teams improving developer experience around the k8s platform.
  • Work with Arista’s software engineers to identify bottlenecks and limitations in our workflows, tooling, and infrastructure around k8s and provide fixes for those problems.
  • Engage with 3rd party vendor support as part of triage

Qualifications

  • At least BS Computer Science or Engineering + 4 years’ experience, MS Computer Science or Engineering + 5 years’ experience, or Ph.D.  in Computer Science or equivalent work experience.
  • Knowledge of one or more of Go, Python, Javascript, Shell Scripting to be able to implement medium complexity automation workflows
  • Knowledge of Linux (or UNIX).
  • Experience operating software systems at scale
  • Strong understanding of the fundamentals of storage and networking
  • Comfortable with Ansible and GitOps
  • Strong expertise with managing onprem / baremetal Kubernetes clusters
  • Applied understanding of software engineering principles.
  • Strong problem solving and software troubleshooting skills.
  • Ability to design a solution and implement features independently. Ability to work in small teams.
  • Comfortable with security principles and 
  • Able to study source code of OSS projects, conduct experiments as necessary to debug issues
  • Proven expertise with debugging complex issues that span the technology stack
  • Experience dealing with network proxies and containerized storage.

Apply for this job

+30d

Vaga Afirmativa Para Mulheres - Senior Data Center Network Engineer (27348)

Bosch GroupCampinas, Brazil, Remote
Bachelor's degreeDesignansiblelinuxpython

Bosch Group is hiring a Remote Vaga Afirmativa Para Mulheres - Senior Data Center Network Engineer (27348)

Descrição da vaga

Job Description:

  • Take on a key role in managing our central datacenter network, including our Bosch hybrid cloud activities and remote datacenter facilities. You´ll be collaborating in an international diverse team of experts - you'll be responsible for the full lifecycle — design, build, and run — of this critical environment, driving innovation, performance, and security.

What You’ll Do:

  • Design cutting-edge network architectures.
  • Build and deploy scalable, resilient solutions globally.
  • Run and optimize our network for peak performance.
  • Why Join Us? Work with the latest technologies, have a global impact, collaborate with top experts, and grow your career.

     

 

Qualificações

.

  • Personality: Creative, flexible, with strong communication skills; a true team player.
  • Working Style: Independent, structured, and goal oriented.
  • Experience and Know-How:
    • Software Skills (desired): Knowledge in Python, Ansible, CI/CD pipelines, DevTools like GIT.
    • Additional Skills (desired): Knowledge of Linux, container networking, cloud principles, and project management.
    • In-Depth Experience (expected): Strong background with Cisco ACI, Data Center Switching, and Software Defined Networking (SDN).
    • Network Skills (expected): Solid understanding of switching, routing (BGP, OSPF, etc.).
  • Enthusiasm: Passion for working in an international team environment.
  • Languages: Proficiency in English required.
  • Bachelor's degree in Computer Engineering, Computer Network Analyst and similar.

 

See more jobs at Bosch Group

Apply for this job

+30d

Lead Devops Engineer

Full TimeDevOPSremote-firstDesignansibleazuredockerkubernetesjenkinspythonAWS

Second Nature is hiring a Remote Lead Devops Engineer

Lead Devops Engineer - Second Nature - Career PageSee more jobs at Second Nature

Apply for this job

+30d

Site Reliability Engineer (Chile, All-Levels)

SezzleChile, Remote
SalesgolangBachelor's degreeterraformsqlDesignansiblec++dockerkuberneteslinuxpythonAWS

Sezzle is hiring a Remote Site Reliability Engineer (Chile, All-Levels)

Sezzle is a remote U.S.-based company listed on NASDAQ. Our salary ranges are as follows:

  • Junior: $2,000 - $5,200 USD per month
  • Mid: $5,400 - $8,750 USD per month
  • Senior: $7,000 - $9,200 USD per month

About Sezzle:

Sezzle is a cutting-edge fintech company dedicated to financially empowering the next generation. With only one in three millennials owning a credit card and the majority lacking their desired credit scores, Sezzle addresses these challenges through a payment platform that offers interest-free installment plans at online stores. By increasing consumers' purchasing power, Sezzle drives sales and basket sizes for thousands of eCommerce merchants that it partners with.

About the Role:

We are looking for a Site Reliability Engineer to work on our Infrastructure team, who will assist us in running and scaling our cloud infrastructure. Your duties will blend software development and operations in order to continuously automate our environments. We are seeking a talented and motivated Site Reliability Engineer who is best in class with a high IQ plus a high EQ. This role presents an exciting opportunity to thrive in a dynamic, fast-paced environment within a rapidly growing team, with abundant prospects for career advancement.

What You'll Do:

  • Be on a Pagerduty on-call rotation to respond to production incidents
  • Maintain and develop monitoring and alerting solutions to improve the on-call experience
  • Design, build and maintain scalable infrastructure for running our systems
  • Assist product developers in debugging and triaging production issues

What We Look For:

  • Bachelor's in computer science (preferred) or equivalent related experience

Ideal Skills & Experience:

  • Basic knowledge of a Microservice Architecture
  • Basic knowledge of AWS, Kubernetes, Docker
  • Familiarity with deployment/provisioning tools like Terraform, Helm, Ansible
  • Knowledge in linux platform
  • Comfortable working with Golang, Python and shell script
  • Knowledge of Relational Databases, SQL and ORM technologies
  • Close familiarity with software engineering tools, software development methodology, and release processes

About You:

  • You have relentlessly high standards - many people may think your standards are unreasonably high. You are continually raising the bar and driving those around you to deliver great results. You make sure that defects do not get sent down the line and that problems are fixed so they stay fixed.
  • You’re not bound by convention - your success—and much of the fun—lies in developing new ways to do things
  • You need action - speed matters in business. Many decisions and actions are reversible and do not need extensive study. We value calculated risk-taking.
  • You earn trust - you listen attentively, speak candidly, and treat others respectfully.
  • You have backbone; disagree, then commit - you can respectfully challenge decisions when you disagree, even when doing so is uncomfortable or exhausting. You have conviction and are tenacious. You do not compromise for the sake of social cohesion. Once a decision is determined, you commit wholly.
  • You deliver results - you focus on the key inputs and deliver them with the right quality and in a timely fashion. Despite setbacks, you rise to the occasion and never settle.

Sezzle’s Technology Stack:

  • Languages:Golang, Typescript, Python
  • Frontend:Typescript - React and React Native
  • Backend:Golang
  • Database:MySQL, Postgres, Elasticsearch
  • DevOps & Cloud:AWS, Kubernetes
  • Version Control:Git
  • CI/CD:Gitlab
  • Testing:Developer-driven, focus on automated unit, integration, and end-to-end tests
  • Sezzle is focused on using open source, and we build what we can before buying!

What Makes Working at Sezzle Awesome:

At Sezzle, we are more than just brilliant engineers, passionate data enthusiasts, out-of-the-box thinkers, and determined innovators. We believe in surrounding ourselves with only the best and the brightest individuals. Our culture is not defined by a certain set of perks designed to give the illusion of the traditional startup culture, but rather, it is the visible example living in every employee that we hire.

Compensation:

Our ranges are very broad to accommodate all types of candidates and encourage growth. Specific compensation offered to a candidate may be dependent on factors such as education, experience, qualifications, and alignment with market data. Exceptional candidates may receive salaries outside of the posted ranges.

Sezzle provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, creed, gender, sex (including pregnancy, childbirth, and related medical conditions), sexual orientation, gender identity, national origin, age, disability, genetic information or characteristics, marital status, familial status, veteran or military status, status regarding public assistance, membership or activity in a local commission, or any other protected status in accordance with applicable federal, state and local laws.

See more jobs at Sezzle

Apply for this job

+30d

Site Reliability Engineer (Brazil, All-Levels)

SezzleBrazil, Remote
SalesgolangBachelor's degreeterraformsqlDesignansiblec++dockerkuberneteslinuxpythonAWS

Sezzle is hiring a Remote Site Reliability Engineer (Brazil, All-Levels)

Sezzle is a remote U.S.-based company listed on NASDAQ. Our salary ranges are as follows:

  • Junior: $3,200 - $5,200 USD per month
  • Mid: $5,400 - $8,750 USD per month
  • Senior: $7,000 - $9,200 USD per month

About Sezzle:

Sezzle is a cutting-edge fintech company dedicated to financially empowering the next generation. With only one in three millennials owning a credit card and the majority lacking their desired credit scores, Sezzle addresses these challenges through a payment platform that offers interest-free installment plans at online stores. By increasing consumers' purchasing power, Sezzle drives sales and basket sizes for thousands of eCommerce merchants that it partners with.

About the Role:

We are looking for a Site Reliability Engineer to work on our Infrastructure team, who will assist us in running and scaling our cloud infrastructure. Your duties will blend software development and operations in order to continuously automate our environments. We are seeking a talented and motivated Site Reliability Engineer who is best in class with a high IQ plus a high EQ. This role presents an exciting opportunity to thrive in a dynamic, fast-paced environment within a rapidly growing team, with abundant prospects for career advancement.

What You'll Do:

  • Be on a Pagerduty on-call rotation to respond to production incidents
  • Maintain and develop monitoring and alerting solutions to improve the on-call experience
  • Design, build and maintain scalable infrastructure for running our systems
  • Assist product developers in debugging and triaging production issues

What We Look For:

  • Bachelor's in computer science (preferred) or equivalent related experience

Ideal Skills & Experience:

  • Basic knowledge of a Microservice Architecture
  • Basic knowledge of AWS, Kubernetes, Docker
  • Familiarity with deployment/provisioning tools like Terraform, Helm, Ansible
  • Knowledge in linux platform
  • Comfortable working with Golang, Python and shell script
  • Knowledge of Relational Databases, SQL and ORM technologies
  • Close familiarity with software engineering tools, software development methodology, and release processes

About You:

  • You have relentlessly high standards - many people may think your standards are unreasonably high. You are continually raising the bar and driving those around you to deliver great results. You make sure that defects do not get sent down the line and that problems are fixed so they stay fixed.
  • You’re not bound by convention - your success—and much of the fun—lies in developing new ways to do things
  • You need action - speed matters in business. Many decisions and actions are reversible and do not need extensive study. We value calculated risk-taking.
  • You earn trust - you listen attentively, speak candidly, and treat others respectfully.
  • You have backbone; disagree, then commit - you can respectfully challenge decisions when you disagree, even when doing so is uncomfortable or exhausting. You have conviction and are tenacious. You do not compromise for the sake of social cohesion. Once a decision is determined, you commit wholly.
  • You deliver results - you focus on the key inputs and deliver them with the right quality and in a timely fashion. Despite setbacks, you rise to the occasion and never settle.

What Makes Working at Sezzle Awesome:

At Sezzle, we are more than just brilliant engineers, passionate data enthusiasts, out-of-the-box thinkers, and determined innovators. We believe in surrounding ourselves with only the best and the brightest individuals. Our culture is not defined by a certain set of perks designed to give the illusion of the traditional startup culture, but rather, it is the visible example living in every employee that we hire.

Compensation:

Our ranges are very broad to accommodate all types of candidates and encourage growth. Specific compensation offered to a candidate may be dependent on factors such as education, experience, qualifications, and alignment with market data. Exceptional candidates may receive salaries outside of the posted ranges.

Sezzle provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, creed, gender, sex (including pregnancy, childbirth, and related medical conditions), sexual orientation, gender identity, national origin, age, disability, genetic information or characteristics, marital status, familial status, veteran or military status, status regarding public assistance, membership or activity in a local commission, or any other protected status in accordance with applicable federal, state and local laws.

See more jobs at Sezzle

Apply for this job

+30d

Site Reliability Engineer (Argentina, All-Levels)

SezzleArgentina, Remote
SalesgolangBachelor's degreeterraformsqlDesignansiblec++dockerkuberneteslinuxpythonAWS

Sezzle is hiring a Remote Site Reliability Engineer (Argentina, All-Levels)

Sezzle is a remote U.S.-based company listed on NASDAQ. Our salary ranges are as follows:

  • Junior: $2,000 - $5,200 USD per month
  • Mid: $5,400 - $8,750 USD per month
  • Senior: $7,000 - $9,200 USD per month

About Sezzle:

Sezzle is a cutting-edge fintech company dedicated to financially empowering the next generation. With only one in three millennials owning a credit card and the majority lacking their desired credit scores, Sezzle addresses these challenges through a payment platform that offers interest-free installment plans at online stores. By increasing consumers' purchasing power, Sezzle drives sales and basket sizes for thousands of eCommerce merchants that it partners with.

About the Role:

We are looking for a Site Reliability Engineer to work on our Infrastructure team, who will assist us in running and scaling our cloud infrastructure. Your duties will blend software development and operations in order to continuously automate our environments. We are seeking a talented and motivated Site Reliability Engineer who is best in class with a high IQ plus a high EQ. This role presents an exciting opportunity to thrive in a dynamic, fast-paced environment within a rapidly growing team, with abundant prospects for career advancement.

What You'll Do:

  • Be on a Pagerduty on-call rotation to respond to production incidents
  • Maintain and develop monitoring and alerting solutions to improve the on-call experience
  • Design, build and maintain scalable infrastructure for running our systems
  • Assist product developers in debugging and triaging production issues

What We Look For:

  • Bachelor's in computer science (preferred) or equivalent related experience

Ideal Skills & Experience:

  • Basic knowledge of a Microservice Architecture
  • Basic knowledge of AWS, Kubernetes, Docker
  • Familiarity with deployment/provisioning tools like Terraform, Helm, Ansible
  • Knowledge in linux platform
  • Comfortable working with Golang, Python and shell script
  • Knowledge of Relational Databases, SQL and ORM technologies
  • Close familiarity with software engineering tools, software development methodology, and release processes

Sezzle’s Technology Stack:

  • Languages:Golang, Typescript, Python
  • Frontend:Typescript - React and React Native
  • Backend:Golang
  • Database:MySQL, Postgres, Elasticsearch
  • DevOps & Cloud:AWS, Kubernetes
  • Version Control:Git
  • CI/CD:Gitlab
  • Testing:Developer-driven, focus on automated unit, integration, and end-to-end tests
  • Sezzle is focused on using open source, and we build what we can before buying!

About You:

  • You have relentlessly high standards - many people may think your standards are unreasonably high. You are continually raising the bar and driving those around you to deliver great results. You make sure that defects do not get sent down the line and that problems are fixed so they stay fixed.
  • You’re not bound by convention - your success—and much of the fun—lies in developing new ways to do things
  • You need action - speed matters in business. Many decisions and actions are reversible and do not need extensive study. We value calculated risk-taking.
  • You earn trust - you listen attentively, speak candidly, and treat others respectfully.
  • You have backbone; disagree, then commit - you can respectfully challenge decisions when you disagree, even when doing so is uncomfortable or exhausting. You have conviction and are tenacious. You do not compromise for the sake of social cohesion. Once a decision is determined, you commit wholly.
  • You deliver results - you focus on the key inputs and deliver them with the right quality and in a timely fashion. Despite setbacks, you rise to the occasion and never settle.

What Makes Working at Sezzle Awesome:

At Sezzle, we are more than just brilliant engineers, passionate data enthusiasts, out-of-the-box thinkers, and determined innovators. We believe in surrounding ourselves with only the best and the brightest individuals. Our culture is not defined by a certain set of perks designed to give the illusion of the traditional startup culture, but rather, it is the visible example living in every employee that we hire.

Compensation:

Our ranges are very broad to accommodate all types of candidates and encourage growth. Specific compensation offered to a candidate may be dependent on factors such as education, experience, qualifications, and alignment with market data. Exceptional candidates may receive salaries outside of the posted ranges.

Sezzle provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, creed, gender, sex (including pregnancy, childbirth, and related medical conditions), sexual orientation, gender identity, national origin, age, disability, genetic information or characteristics, marital status, familial status, veteran or military status, status regarding public assistance, membership or activity in a local commission, or any other protected status in accordance with applicable federal, state and local laws.

See more jobs at Sezzle

Apply for this job

+30d

Site Reliability Engineer (Colombia, All-Levels)

SezzleColombia, Remote
SalesgolangBachelor's degreeterraformsqlDesignansiblec++dockerkuberneteslinuxpythonAWS

Sezzle is hiring a Remote Site Reliability Engineer (Colombia, All-Levels)

Sezzle is a remote U.S.-based company listed on NASDAQ. Our salary ranges are as follows:

  • Junior: $2,000 - $5,200 USD per month
  • Mid: $5,400 - $8,750 USD per month
  • Senior: $7,000 - $9,200 USD per month

About Sezzle:

Sezzle is a cutting-edge fintech company dedicated to financially empowering the next generation. With only one in three millennials owning a credit card and the majority lacking their desired credit scores, Sezzle addresses these challenges through a payment platform that offers interest-free installment plans at online stores. By increasing consumers' purchasing power, Sezzle drives sales and basket sizes for thousands of eCommerce merchants that it partners with.

About the Role:

We are looking for a Site Reliability Engineer to work on our Infrastructure team, who will assist us in running and scaling our cloud infrastructure. Your duties will blend software development and operations in order to continuously automate our environments. We are seeking a talented and motivated Site Reliability Engineer who is best in class with a high IQ plus a high EQ. This role presents an exciting opportunity to thrive in a dynamic, fast-paced environment within a rapidly growing team, with abundant prospects for career advancement.

What You'll Do:

  • Be on a Pagerduty on-call rotation to respond to production incidents
  • Maintain and develop monitoring and alerting solutions to improve the on-call experience
  • Design, build and maintain scalable infrastructure for running our systems
  • Assist product developers in debugging and triaging production issues

What We Look For:

  • Bachelor's in computer science (preferred) or equivalent related experience

Ideal Skills & Experience:

  • Basic knowledge of a Microservice Architecture
  • Basic knowledge of AWS, Kubernetes, Docker
  • Familiarity with deployment/provisioning tools like Terraform, Helm, Ansible
  • Knowledge in linux platform
  • Comfortable working with Golang, Python and shell script
  • Knowledge of Relational Databases, SQL and ORM technologies
  • Close familiarity with software engineering tools, software development methodology, and release processes

About You:

  • You have relentlessly high standards - many people may think your standards are unreasonably high. You are continually raising the bar and driving those around you to deliver great results. You make sure that defects do not get sent down the line and that problems are fixed so they stay fixed.
  • You’re not bound by convention - your success—and much of the fun—lies in developing new ways to do things
  • You need action - speed matters in business. Many decisions and actions are reversible and do not need extensive study. We value calculated risk-taking.
  • You earn trust - you listen attentively, speak candidly, and treat others respectfully.
  • You have backbone; disagree, then commit - you can respectfully challenge decisions when you disagree, even when doing so is uncomfortable or exhausting. You have conviction and are tenacious. You do not compromise for the sake of social cohesion. Once a decision is determined, you commit wholly.
  • You deliver results - you focus on the key inputs and deliver them with the right quality and in a timely fashion. Despite setbacks, you rise to the occasion and never settle.

Sezzle’s Technology Stack:

  • Languages:Golang, Typescript, Python
  • Frontend:Typescript - React and React Native
  • Backend:Golang
  • Database:MySQL, Postgres, Elasticsearch
  • DevOps & Cloud:AWS, Kubernetes
  • Version Control:Git
  • CI/CD:Gitlab
  • Testing:Developer-driven, focus on automated unit, integration, and end-to-end tests
  • Sezzle is focused on using open source, and we build what we can before buying!

What Makes Working at Sezzle Awesome:

At Sezzle, we are more than just brilliant engineers, passionate data enthusiasts, out-of-the-box thinkers, and determined innovators. We believe in surrounding ourselves with only the best and the brightest individuals. Our culture is not defined by a certain set of perks designed to give the illusion of the traditional startup culture, but rather, it is the visible example living in every employee that we hire.

Compensation:

Our ranges are very broad to accommodate all types of candidates and encourage growth. Specific compensation offered to a candidate may be dependent on factors such as education, experience, qualifications, and alignment with market data. Exceptional candidates may receive salaries outside of the posted ranges.

Sezzle provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, creed, gender, sex (including pregnancy, childbirth, and related medical conditions), sexual orientation, gender identity, national origin, age, disability, genetic information or characteristics, marital status, familial status, veteran or military status, status regarding public assistance, membership or activity in a local commission, or any other protected status in accordance with applicable federal, state and local laws.

See more jobs at Sezzle

Apply for this job

+30d

Linux Systems Administrator

NextivaScottsdale, Arizona (Hybrid)
Full Timejiraoracleansiblec++linuxAWS

Nextiva is hiring a Remote Linux Systems Administrator

Redefine the future of customer experiences. One conversation at a time.

We’re changing the game with a first-of-its-kind, conversation-centric platform that unifies team collaboration and customer experience in one place. Powered by AI, built by amazing humans.

Our culture is forward-thinking, customer-obsessed and built on an unwavering belief that connection fuels business and life; connections to our customers with our signature Amazing Service®, our products and services, and most importantly, each other. Since 2008, 100,000+ companies and 1M+ users rely on Nextiva for customer and team communication.

If you’re ready to collaborate and create with amazing people, let your personality shine and be on the frontlines of helping businesses deliver amazing experiences, you’re in the right place. 

Build Amazing - Deliver Amazing - Live Amazing - Be Amazing

 

Nextiva is currently seeking bright and talented individuals for a Linux Systems Administrator position. The Linux Systems Administrator is responsible for supporting the day-to-day activities of Nextiva’s physical, virtual, and cloud servers running various flavors of the Linux operating system. The role is predominantly focused on problem resolution, enhancement request fulfillment, proactive monitoring, and preventative maintenance. In addition, the Linux SysAdmin will be responsible for automating common to complex tasks, overseeing patch management and deployment, multi-level troubleshooting and various other related activities.

Key Responsibilities

  • Ensure the availability and reliability of production, staging, and development servers and virtual machines meets the business needs
  • Troubleshoot issues and problems related to Linux OS, services and applications, networking, VMware, storage, and other related components
  • Collaborate with network engineering, software engineering, SRE/DevOps and others, to resolve incidents and outages
  • Maintain a robust patch and lifecycle management strategy to minimize security vulnerability exposure and maximize supportability
  • Manage and deploy virtualized instances running Linux and Windows
  • Write Ansible playbooks to facilitate automation
  • Monitor servers and infrastructure to proactively identify potential issues and outages, before they happen
  • Build, maintain, and promote strong technical documentation
  • Collaborating with and cross-train other administrators to guarantee transfer of knowledge

Qualifications

  • Bachelor’s degree in Information Technology, Computer Science, or related work experience
  • 8+ years of experience in Linux systems administration, including both Red Hat and Debian based operating systems
  • Strong experience with VMware virtualization platform and vSphere management
  • Strong experience writing automation with an emphasis on Ansible
  • Strong awareness of networking and internet protocols including TCP/IP, DNS, SMTP, HTTP and distributed networks
  • Working knowledge of physical and virtual storage concepts, including iSCSI
  • Working knowledge of common web services including Nginx, Apache, and Tomcat
  • Working knowledge of public clouds such as AWS, GCP, and Oracle Cloud
  • Experience with monitoring and logging tools like Datadog, Solarwinds, Kibana, Filebeat
  • Experience working with Atlassian products such as Jira, Confluence, and Bitbucket
  • Excellent time management skills with the proven ability to prioritize, handle multiple tasks, and work under tight deadlines while delivering high quality results
  • Demonstrate good judgment in solving problems, identifying problems in advance, and proposing solutions
  • Highly organized, detail oriented, adaptable and quick-thinking with a proactive approach, an enterprise focus, an understanding of shifting business needs and the ability to change priorities with ease
  • Excellent written and verbal communication skills with the ability to communicate knowledgeably and effectively
  • Ability to excel both individually and as a team player in a fast paced, self-directed, constantly evolving environment
  • Flexible availability for on-call support during non-peak hours

Nextiva Core Competencies / DNA:

  • Drives Results:  The successful candidate will be action oriented, with a passion for solving problems.  They will bring clarity and simplicity to ambiguous situations.  This individual will challenge the status quo; asking what we can do differently and finding ways to create and build more success.  They are a change agent, prepared to lead and drive changes as we transform. 
  • Critical Thinker:  The successful candidate is fact based and data driven, able to understand and articulate the “why,” identifying key drivers and learning from the past.  They are forward-thinking, anticipating problems before they arise.  They’ll recommend and action well thought out solutions, understanding the risks and dependencies. 
  • Right Attitude:  The successful candidate will be team-oriented, collaborative and competitive with a winning mindset; they’re resilient and able to easily bounce back from setbacks.  They will be able to zoom in / out, willing to be hands-on to help solve important problems while being a motivating figure for the team along the way.  They will embrace a culture of service and learning with a focus on caring, supporting and respecting our customers and team members.

Compensation, Rewards & Benefits:

The salary or hourly wage offered by Nextiva to external candidates considers a wide range of factors, including but not limited to skills sets, experience, training, licensure and certifications, etc. Our compensation decisions are dependent on the facts and circumstances of each case. A different level in the job hierarchy may apply to a specific candidate resulting in a different hiring range.

Nextiva provides a comprehensive employee benefits package that includes medical (including supplemental plans for accident, hospitalization and critical illness), telemedicine, dental, vision, disability, life insurance, legal assistance, an Employee Assistance Plan, paid parental bonding leave, PTO for hourly employees and Flexible Time Off (FTO) for salaried employees, an employee long-term savings plan (401k) through Fidelity with Nextiva matching, comprehensive employee wellness programs and loads of learning and development opportunities which are coupled with career paths to last a lifetime.

Interested in joining our amazing team at Nextiva HQ? Apply today as we launch the future of business conversations!????

Established in 2008 and headquartered in Scottsdale, Arizona, Nextiva secured $200M from Goldman Sachs in late 2021, valuing the company at $2.7B.To check out what’s going on at Nextiva, check us out on Instagram, Instagram (MX), YouTube, LinkedIn, and the Nextiva blog

Nextiva is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We prohibit discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.Nextiva participates in the E-Verify Program where and as required by law. For additional information about E-Verify visit USCIS

#LI-RQ1   

Apply for this job

+30d

Senior SRE Engineer

FortanixBengaluru,Karnataka,India, Remote Hybrid
DevOPSterraformDesignansibleazureqaAWS

Fortanix is hiring a Remote Senior SRE Engineer

As a Senior Site Reliability Engineer at Fortanix, you will be at the forefront of ensuring the reliability, scalability, and performance of our cutting-edge production environments. You’ll design and build operations as code, architecting automated solutions that enhance system stability. Partnering closely with our product engineering teams, you'll have a hands-on role in continuously improving the reliability of our platforms, ensuring our systems are robust and resilient. You'll develop and implement a comprehensive, actionable monitoring framework that detects and prevents issues before they impact our users.

In this role, you'll be a critical part of our production on-call rotation, responding to incidents with agility and executing post-incident reviews to drive continuous improvement. If you’re passionate about automation, enjoy tackling complex reliability challenges, and thrive in a fast-paced, high-impact environment, this role is for you!

Join us to shape the future of secure computing with a focus on building reliable, scalable, and secure production systems.

Key Responsibilities

  • System Architecture & Design
    • Collaborate with software development teams to design scalable, reliable, and secure systems.
    • Architect and build robust infrastructure to handle growth and ensure system uptime.
  • Automation & Infrastructure as Code (IaC)
    • Automate infrastructure deployment and management using tools like Terraform, Ansible, or CloudFormation.
    • Implement continuous integration and continuous deployment (CI/CD) pipelines for automated testing and deployment.
    • Write automation scripts and code for scaling and self-healing systems.
  • Monitoring & Incident Management
    • Design and implement comprehensive monitoring and alerting solutions to detect anomalies and issues before they impact users.
    • Implement logging and observability tools to gain insight into system health and performance (e.g., Prometheus, Grafana, ELK stack).
    • Manage on-call rotations, ensure timely responses to incidents, and perform root cause analysis and post-mortems.
  • Performance Tuning & Optimization
    • Perform load testing and system benchmarking to identify performance bottlenecks.
    • Optimize application and infrastructure performance, reducing latency and improving response times.
  • Security & Compliance
    • Ensure systems are secure by design, incorporating security best practices (e.g., encryption, firewalls, access controls).
    • Stay up-to-date with security vulnerabilities and patch systems accordingly.
    • Implement compliance standards (e.g., GDPR, HIPAA) where applicable.
  • Collaboration & Mentoring
    • Work closely with developers to ensure that applications are designed for reliability and scalability.
    • Serve as a mentor to junior engineers, fostering a culture of reliability and best practices.
    • Collaborate across teams (DevOps, Development, QA) to enhance system robustness.
  • Disaster Recovery & High Availability
    • Develop and maintain disaster recovery and business continuity plans.
    • Ensure systems are highly available, designing systems that can withstand failures without service disruptions.
  • Capacity Planning & Scalability
    • Forecast future system demand and plan for capacity increases as needed.
    • Design infrastructure that scales automatically to handle increased loads.
  • Continuous Improvement & Reliability Culture
    • Analyze incidents and failures to identify opportunities for improving system reliability.
    • Drive a culture of reliability across the engineering organization, advocating for best practices and SRE principles.
  • Cloud & Hybrid Infrastructure Management
    • Manage cloud infrastructure (AWS, GCP, Azure) and hybrid environments, ensuring optimal usage of cloud resources.
    • Implement cost optimization strategies for cloud resources while maintaining performance and reliability.

This role requires a deep understanding of both software engineering and infrastructure management, as well as strong collaboration and problem-solving skills

Technical Experience

Demonstrated expertise in modern enterprise Site Reliability Engineering is essential for this role. In addition, experience in the following areas is highly beneficial:

  • Proficiency in Programming/Scripting Languages -Strong coding skills in languages such as Python, Go, or similar. Familiarity with scripting languages like Bash or PowerShell is also important.
  • Problem Solving -Advanced experience with Linux administration and automation. Experience with production debugging and the ability to implement fast workarounds.
  • CI/CD & Devops - Advanced experience in managing software deployment on Cloud via pipelines (example: bitbucket/Gitlab). Understanding DevOps practices on how modern software is deployed, upgraded and monitored.
  • Containers & Orchestration - Strong hands-on experience with container technologies like Docker and Kubernetes, and other orchestration tools like Helm or OpenShift. Experience with both managed (AKS, EKS, GKE.) and unmanaged (on-prem) Kubernetes.
  • Monitoring & Observability - Expertise with monitoring, alerting, and logging tools such as Prometheus, Grafana, Datadog, ELK stack, or similar. Understanding of metrics collection and analysis.
  • Networking/Infra - Solid understanding of networking concepts (TCP/IP, DNS, VPN, load balancing, firewalls, etc.) and network performance tuning in cloud environments. Experience with high-level Network Fnfrastructure for Datacentre and Cloud

Key Requirements

  • Bachelors/Masters in Computer Science, Engineering or a related field.
  • Engineering: 8+ Years of engineering experience with 3+ Years of core Site reliability engineering experience.
  • Experience with managing and resolving high-severity incidents in production environments. Ability to lead post-mortems and implement improvements.
  • Solid understanding of Cloud technologies.
  • Strong experience with automation practices and principles to reduce manual work and improve efficiency.
  • Experience working in a cross-functional team environment, often collaborating with developers, QA, and security teams.
  • Must be a team player.

Certifications (Optional but Preferred)

  • Cloud Certifications: AWS Certified Solutions Architect, Google Cloud Certified - Professional Cloud Architect, Microsoft Certified: Azure Solutions Architect Expert.
  • DevOps Certifications: Certified Kubernetes Administrator (CKA), HashiCorp Terraform Associate, or similar certifications.
  • Top range of market compensation
  • A friendly culture that brings the best out of everybody
  • Mediclaim Insurance – Employees and their eligible dependents including dental coverage
  • Personal Accident Insurance
  • Internet Reimbursement
  • See more jobs at Fortanix

    Apply for this job

    +30d

    Engineer, DevOps

    DevOPSBachelor's degreeterraformDesignansiblegraphqlc++dockerkubernetesjenkinspythonAWSjavascriptfrontend

    hims & hers is hiring a Remote Engineer, DevOps

    Hims & Hers Health, Inc. (better known as Hims & Hers) is the leading health and wellness platform, on a mission to help the world feel great through the power of better health. We are revolutionizing telehealth for providers and their patients alike. Making personalized solutions accessible is of paramount importance to Hims & Hers and we are focused on continued innovation in this space. Hims & Hers offers nonprescription products and access to highly personalized prescription solutions for a variety of conditions related to mental health, sexual health, hair care, skincare, heart health, and more.

    Hims & Hers is a public company, traded on the NYSE under the ticker symbol “HIMS”. To learn more about the brand and offerings, you can visit hims.com and forhers.com, or visit our investor site. For information on the company’s outstanding benefits, culture, and its talent-first flexible/remote work approach, see below and visit www.hims.com/careers-professionals.

    ​​About the Role:

    We are seeking a DevOps Engineer to help build and maintain the infrastructure powering our ecommerce platform with a focus on supporting our Store Frontend teams. We believe that moving fast is our competitive advantage; that moving fast enables us to better serve our users. We also know that the faster we move, the more risk we face causing disruptions, so we invest heavily into observability and security tools.

    You Will:

    • Enable frontend/growth product teams to focus on developing features while DevOps takes care of reliably operating our platform and production sites with a focus on continuous improvement and developer enablement.
    • Actively seek and identify opportunities to improve the availability and performance of the system by applying the learnings from monitoring and observation.
    • Independently lead key projects in support of business and technology strategy.
    • Use automation extensively to design, configure, manage, and monitor systems in support of our product development teams
    • Manage Infrastructure through automation (Infrastructure as Code - terraform knowledge preferred)
    • Perform and run blameless RCAs on incidents and outages aggressively looking for answers that will prevent incident reoccurrence.
    • Define, Design, and implement DevOps practices ensuring availability, scalability and observability of production systems with a strong focus on excellent customer experience
    • Manage incidents and emergency response, track outages, ensure data integrity and engineer releases to promote secure, efficient and rapid deployments. 
    • There will be an On call rotation but it will be fairly distributed across the team, including the manager, who also takes a weekly rotation!
    • Troubleshoot various production or non-production issues and alerts and identify the root cause

    You Have:

    • 3+ years as a DevOps Engineer, Site Reliability Engineer or Platform Engineer
    • 5+ years of total experience supporting technical environments within Engineering domains
    • Bachelor's degree in Computer Science, Engineering, or related field, or relevant years of work experience
    • Exposure to Frontend technologies: NodeJS, Next.js, Javascript, GraphQL preferred
    • Experience with service-oriented architectures and microservices at scale
    • Strong proficiency with Public Cloud providers (AWS, GCP)
    • Experience with Terraform or other IaC tools such as Chef, Puppet or Ansible
    • Experience with CI/CD tools such as Jenkins, CircleCI or GitHub Actions
    • Ability to use containers and orchestration frameworks (Kubernetes, Docker, Container registries etc.) -EKS + Helm Experience is a plus.
    • Proficiency scripting in one or more languages such as Python, Bash, Go and/or others
    • Experience with configuring, customizing, and extending monitoring tools (Datadog, ELK, Prometheus, etc.)
    • Excellent debugging and troubleshooting skills
    • Strong technical competency, with a data-driven analytical approach towards solving complex challenges
    • Have a systematic problem-solving approach, coupled with strong and effective communication skills and a sense of drive
    • Knowledge of information security standards rules and regulations related to information security and data confidentiality (e.g. HIPAA, PCI DSS, NIST, ISO, etc.)

     

    Our Benefits (there are more but here are some highlights):

    • Competitive salary & equity compensation for full-time roles
    • Unlimited PTO, company holidays, and quarterly mental health days
    • Comprehensive health benefits including medical, dental & vision, and parental leave
    • Employee Stock Purchase Program (ESPP)
    • Employee discounts on hims & hers & Apostrophe online products
    • 401k benefits with employer matching contribution
    • Offsite team retreats

     

    #LI-Remote

     

    Outlined below is a reasonable estimate of H&H’s compensation range for this role for US-based candidates. If you're based outside of the US, your recruiter will be able to provide you with an estimated salary range for your location.

    The actual amount will take into account a range of factors that are considered in making compensation decisions including but not limited to skill sets, experience and training, licensure and certifications, and location. H&H also offers a comprehensive Total Rewards package that may include an equity grant.

    Consult with your Recruiter during any potential screening to determine a more targeted range based on location and job-related factors.

    An estimate of the current salary range for US-based employees is
    $90,000$115,000 USD

    We are focused on building a diverse and inclusive workforce. If you’re excited about this role, but do not meet 100% of the qualifications listed above, we encourage you to apply.

    Hims considers all qualified applicants for employment, including applicants with arrest or conviction records, in accordance with the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance, the California Fair Chance Act, and any similar state or local fair chance laws.

    Hims & Hers is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, please contact us at accommodations@forhims.com and describe the needed accommodation. Your privacy is important to us, and any information you share will only be used for the legitimate purpose of considering your request for accommodation. Hims & Hers gives consideration to all qualified applicants without regard to any protected status, including disability. Please do not send resumes to this email address.

    For our California-based applicants – Please see our California Employment Candidate Privacy Policy to learn more about how we collect, use, retain, and disclose Personal Information. 

    See more jobs at hims & hers

    Apply for this job

    +30d

    Site Reliability Engineer - II (SRE II)

    Live PersonHyderabad, Telangana, India (Remote)
    DevOPSterraformnosqlpostgressqlansiblemongodbazureelasticsearchMySQLkuberneteslinuxjenkinsAWS

    Live Person is hiring a Remote Site Reliability Engineer - II (SRE II)

    LivePerson (NASDAQ: LPSN) is the global leader in enterprise conversations. Hundreds of the world’s leading brands — including HSBC, Chipotle, and Virgin Media — use our award-winning Conversational Cloud platform to connect with millions of consumers. We power nearly a billion conversational interactions every month, providing a uniquely rich data set and safety tools to unlock the power of Conversational AI for better customer experiences.

    At LivePerson, we foster an inclusive workplace culture that encourages meaningful connection, collaboration, and innovation. Everyone is invited to ask questions, actively seek new ways to achieve success, nd reach their full potential. We are continually looking for ways to improve our products and make things better. This means spotting opportunities, solving ambiguities, and seeking effective solutions to the problems our customers care about.

    Overview:

    LivePerson is looking for a Site Reliability Engineer for the GPT (Global Product & Technology) Division. You will be part of the LiverPerson SRE team building and managing highly available, distributed systems. You will have the opportunity to be part of a strong team and enjoy the work environment of a start-up, with a robust product and the benefits of a leading company in its field.

    You will: 

    • Ensure product high uptime and reliability 24x7.
    • Manage Linux servers in a multi-cloud environment
    • Manage high availability Kubernetes resources using Helm charts
    • Assist with deploying upgrades and patches using Chef/Ansible/Puppet/Helm
    • Monitoring and troubleshooting warnings and alerts related to the reporting platform’s performance
    • Develop monitoring resources and alerting systems such as Grafana, Prometheus, Kibana, DataDog and PagerDuty
    • Coordinate with DBA and developers to manage SQL and NOSQL database systems, including MongoDB, ElasticSearch, Postgres, MySQL and others
    • Managing message bus systems such as Kafka and Pulsar
    • Build and maintain CI/CD pipelines using Jenkins/Gitlab/Teamcity

    You have:

    • Minimum 4+ years of experience of managing cloud based production environment (AWS, GCP, Azure, etc)
    • Highly experienced working in the Linux environment, good scripting in Bash / Python.
    • Highly experienced working configuration management systems like OpsCode Chef, Ansible, Puppet,  etc.
    • Strong experience in Terraform, CloudFormation or other IAC
    • Experienced in SQL, including DDL and complex queries
    • Experienced working in the Kubernetes platform
    • Experience working in a microservices architecture using a message bus
    • Good knowledge of CI/CD pipelines orchestrators like TeamCity, Jenkins, Gitlab
    • Ability to integrate security best practices into the SRE workflow.
    • Highly motivated and independent.
    • Team player and excellent interpersonal Skills.
    • Excellent written and verbal communication skills.
    • BS in Computer Science or a related field, or equivalent work experience.
    • A strong background in cloud, network and application security and compliance
    • Experience with GPT or other LLMs a strong advantage

    Benefits

    • Health: Medical, Dental, and Vision
    • Time away: Vacation and holidays
    • Development: Generous tuition reimbursement and access to internal professional development resources.
    • Equal opportunity employer

    Why You’ll Love Working Here

    As leaders in enterprise customer conversations, we celebrate diversity, empowering our team to forge impactful conversations globally. LivePerson is a place where uniqueness is embraced, growth is constant, and everyone is empowered to create their own success. And, we're very proud to have earned recognition from Fast Company, Newsweek, and BuiltIn for being a top innovative, beloved, and remote-friendly workplace.

    Belonging At LivePerson

    We are proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants with criminal histories, consistent with applicable federal, state, and local law.

    We are committed to the accessibility needs of applicants and employees. We provide reasonable accommodations to job applicants with physical or mental disabilities. Applicants with a disability who require reasonable accommodation for any part of the application or hiring process should inform their recruiting contact upon initial connection.

    Apply for this job

    +30d

    Manager, Site Reliability Engineering

    GeminiRemote (USA)
    DevOPSagileBachelor's degreeremote-firstDesignansibleazuredockerkuberneteslinuxjenkinspythonAWS

    Gemini is hiring a Remote Manager, Site Reliability Engineering

    About the Company

    Gemini is a global crypto and Web3 platform founded by Tyler Winklevoss and Cameron Winklevoss in 2014. Gemini offers a wide range of crypto products and services for individuals and institutions in over 70 countries.

    Crypto is about giving you greater choice, independence, and opportunity. We are here to help you on your journey. We build crypto products that are simple, elegant, and secure. Whether you are an individual or an institution, we help you buy, sell, and store your bitcoin and cryptocurrency. 

    At Gemini, our mission is to unlock the next era of financial, creative, and personal freedom.

    In the United States, we have a flexible hybrid work policy for employees who live within 30 miles of our office headquartered in New York City and our office in Seattle. Employees within the New York and Seattle metropolitan areas are expected to work from the designated office twice a week, unless there is a job-specific requirement to be in the office every workday. Employees outside of these areas are considered part of our remote-first workforce. We believe our hybrid approach for those near our NYC and Seattle offices increases productivity through more in-person collaboration where possible.

    The Department: Platform

    Our Platform organization’s purpose is to enable Gemini to scale effectively and empower our engineering teams to focus on building innovative financial products and experiences for individuals around the world. Platform focuses around building a scalable and secure foundations platform, enabling Engineering to deploy, validate, and operate their services in production, improve resiliency of the service and increase organizational efficiency by reducing operational toil and increase system efficiency through architectural evolution.

    The Site Reliability Engineering team engages directly with our other engineering teams to onboard them onto our platform systems, reviewing and recommending design and architectural decisions, and guiding our engineering teams on how to implement the tooling provided by the larger Platform organization required to ensure systems can scale and react to changing conditions, with continuous improvement loops.

    The Role: Manager, Site Reliability Engineering

    In this position, you will lead a team of skilled Site Reliability Engineers responsible for the design, deployment, and maintenance of our production systems. You will play a crucial role in ensuring the reliability, scalability, and performance of our infrastructure, as well as driving continuous improvement initiatives. Your expertise in SRE practices and experience with the listed technologies will enable you to effectively guide the team towards achieving operational excellence. 

    Responsibilities:

    • Lead, mentor and manage a team of Site Reliability Engineers, fostering a culture of collaboration, innovation, and operational excellence. Provide guidance and career development opportunities to team members.
    • Develop, communicate, and execute the SRE team's strategic goals, objectives, and roadmap in alignment with the overall business objectives.
    • Oversee the design, implementation, and maintenance of highly available and scalable production systems.
    • Drive continuous improvement initiatives by identifying areas for enhancement and implementing best practices, automation, and process improvements.
    • Collaborate with cross-functional teams and Departments to ensure smooth integration of applications and systems.
    • Define and enforce Service Level Objectives (SLOs) and Service Level Agreements (SLAs) to ensure system reliability and uptime.
    • Monitor system performance, troubleshoot issues, and ensure timely incident response, root cause analysis, and problem resolution.
    • Implement effective monitoring, logging, and alerting systems to proactively identify and mitigate potential issues.
    • Stay up-to-date with industry trends, emerging technologies, and best practices related to SRE and DevOps, and apply them to improve operational efficiency.
    • Identify potential risks to system reliability and implement strategies to mitigate them.
    • Ensure that all systems and processes comply with relevant regulations, standards, and best practices.

    Minimum Qualifications:

    • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience).
    • Proven experience as a Site Reliability Engineer or similar role, with at least 3-5 years of hands-on experience in managing production systems.
    • Strong expertise in the listed technologies: Ansible, Concourse CI, Jenkins, Github Actions, EKS (Kubernetes), Linux Administration, terraform.
    • Demonstrated experience in leading and managing a team of technical professionals for at least 2 years.
    • Solid understanding of SRE principles, including reliability, scalability, availability, and performance.
    • Proficient in scripting and automation (e.g., Python, Bash, or similar).
    • Experience with infrastructure-as-code (IaC) tools, configuration management, and CI/CD pipelines.
    • Knowledge of cloud platforms (e.g., AWS, Azure, or Google Cloud) and containerization technologies (e.g., Docker).
    • Excellent problem-solving skills and the ability to thrive in a fast-paced, dynamic environment.
    • Strong communication and leadership skills, with the ability to collaborate effectively with both technical and non-technical stakeholders.

    Preferred Qualifications:

    • Relevant certifications, such as Certified Kubernetes Administrator (CKA) or AWS Certified DevOps Engineer.
    • Experience with monitoring and observability tools (e.g., Datadog, New Relic, Prometheus, Grafana, ELK Stack).
    • Familiarity with agile methodologies and experience working in an Agile/Scrum environment.
    It Pays to Work Here
     
    The compensation & benefits package for this role includes:
    • Competitive starting salary
    • A discretionary annual bonus
    • Long-term incentive in the form of a new hire equity grant
    • Comprehensive health plans
    • 401K with company matching
    • Paid Parental Leave
    • Flexible time off

    Salary Range: The base salary range for this role is between $172,000 - $215,000 in the State of New York, the State of California and the State of Washington. This range is not inclusive of our discretionary bonus or equity package. When determining a candidate’s compensation, we consider a number of factors including skillset, experience, job scope, and current market data.

    At Gemini, we strive to build diverse teams that reflect the people we want to empower through our products, and we are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or Veteran status. Equal Opportunity is the Law, and Gemini is proud to be an equal opportunity workplace. If you have a specific need that requires accommodation, please let a member of the People Team know.

    Apply for this job

    +30d

    Principal Platform Engineer, SRE

    GeminiRemote (USA)
    DevOPSremote-firstterraformDesignansibleazuredockerpythonAWS

    Gemini is hiring a Remote Principal Platform Engineer, SRE

    About the Company

    Gemini is a global crypto and Web3 platform founded by Tyler Winklevoss and Cameron Winklevoss in 2014. Gemini offers a wide range of crypto products and services for individuals and institutions in over 70 countries.

    Crypto is about giving you greater choice, independence, and opportunity. We are here to help you on your journey. We build crypto products that are simple, elegant, and secure. Whether you are an individual or an institution, we help you buy, sell, and store your bitcoin and cryptocurrency. 

    At Gemini, our mission is to unlock the next era of financial, creative, and personal freedom.

    In the United States, we have a flexible hybrid work policy for employees who live within 30 miles of our office headquartered in New York City and our office in Seattle. Employees within the New York and Seattle metropolitan areas are expected to work from the designated office twice a week, unless there is a job-specific requirement to be in the office every workday. Employees outside of these areas are considered part of our remote-first workforce. We believe our hybrid approach for those near our NYC and Seattle offices increases productivity through more in-person collaboration where possible.

    The Department: Platform

    Our Platform organization’s purpose is to enable Gemini to scale effectively and empower our engineering teams to focus on building innovative financial products and experiences for individuals around the world. Platform focuses around building a scalable and secure foundations platform, enabling Engineering to deploy, validate, and operate their services in production, improve resiliency of the service and increase organizational efficiency by reducing operational toil and increase system efficiency through architectural evolution.

    The Site Reliability Engineering team engages directly with our other engineering teams to onboard them onto our platform systems, reviewing and recommending design and architectural decisions, and guiding our engineering teams on how to implement the tooling provided by the larger Platform organization required to ensure systems can scale and react to changing conditions, with continuous improvement loops.

    The Role: Principal Platform Engineer, Site Reliability Engineering

    You will be an integral part of leading Gemini’s engineering teams towards modern DevOps practices, both by developing and providing modern automation and operational tooling, and working cross-functionally across Gemini’s engineering teams to influence and shape our development practices and culture.

    Responsibilities:

    • Provide primary operational support and engineering for various Gemini services
    • Improve reliability, quality and time-to-market across all Gemini services and offerings
    • Guide engineering teams onto the various supported services provided by Platform
    • Run on-going performance evaluations and improvements for Gemini systems
    • Architecture recommendations and engagement as part of SDLC
    • Create “Production-ready Scorecards” to evaluate the health of systems pre-launch
    • Implement and teaching monitoring, alerting and automated resolution best practices
    • Define SLIs, SLOs with Engineering teams
    • Educate and guide Engineering teams on reliability and resiliency best practices, like statelessness, chaos testing, blue/green deployments, etc.
    • Design, build, and maintain operational tooling and automation that streamline processes and enhance system reliability

    Qualifications:

    • 10+ years using monitoring, alerting, and automation tooling to understand and remediate performance and health issues in systems at scale
    • Good knowledge for various cloud technology providers like AWS, GCP , or Azure
    • Expert in an infrastructure as code environment (Terraform), developing automated solutions to solve support and operational issues
    • Experience as a Technical Leader within a team, helping evaluating and making tech decisions for the team
    • Expert working with containerization such as Nomad, EKS (k8s), Docker, etc.
    • Expert working with Configuration Management such as Ansible, Chef, Puppet
    • Proficient at writing scripts or cli tools that help increase Developer Productivity in high-level languages like Python, Go, etc.
    • Expert analyzing system and application performance, identifying bottlenecks, and recommending architectural or systemic improvements
    • Experience working with Engineering teams, teaching, training, and mentoring on how to implement best-practice technical solutions
    It Pays to Work Here
     
    The compensation & benefits package for this role includes:
    • Competitive starting salary
    • A discretionary annual bonus
    • Long-term incentive in the form of a new hire equity grant
    • Comprehensive health plans
    • 401K with company matching
    • Paid Parental Leave
    • Flexible time off

    Salary Range: The base salary range for this role is between $198,000 - $247,000 in the State of New York, the State of California and the State of Washington. This range is not inclusive of our discretionary bonus or equity package. When determining a candidate’s compensation, we consider a number of factors including skillset, experience, job scope, and current market data.

    At Gemini, we strive to build diverse teams that reflect the people we want to empower through our products, and we are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or Veteran status. Equal Opportunity is the Law, and Gemini is proud to be an equal opportunity workplace. If you have a specific need that requires accommodation, please let a member of the People Team know.

    Apply for this job

    +30d

    Senior Database Engineer

    terraformpostgresDesignmobileansibleazuregitc++postgresql

    Signify Health is hiring a Remote Senior Database Engineer

    How will this role have an impact?

    As a Senior Database Reliability Engineer specializing in PostgreSQL, you will play a critical role in designing, managing, and optimizing our cloud based PostgreSQL database infrastructure. You will ensure the reliability, availability, and performance of our database systems while fostering a collaborative and innovative environment. This position requires deep operational expertise in PostgreSQL, experience with Azure, and a strategic mindset to drive database solutions that align with our commitment to reliability and sustainability.

    Key Responsibilities:

    • Database Management: Design, implement, and maintain PostgreSQL database systems, primarily managed instances in cloud environments (Azure), to ensure high availability, performance, and reliability.
    • Performance Tuning: Conduct performance tuning and optimization of database queries, indexes, and configurations to enhance efficiency.
    • Backup and Recovery: Develop and manage robust backup and recovery strategies, ensuring data integrity and availability in case of failures.
    • Monitoring and Troubleshooting: Implement and maintain monitoring solutions using tools such as Redgate, pganalyze, and New Relic to proactively identify and resolve database issues, ensuring minimal downtime and optimal performance.
    • Security: Implement and maintain database security measures, including user management, encryption, data masking and access controls, to ensure compliance with healthcare regulations.
    • Collaboration: Work closely with cross functional teams to support application development and deployment processes.
    • Documentation: Maintain comprehensive documentation of database configurations, procedures, scripts and best practices.
    • Standards and Procedures: Provide guidance in the creation and modification of standards and procedures including scripts for automation and reporting for adherence to standards.
    • On-call Support: Participate in on-call rotations as an opportunity to enhance the reliability and performance of our database systems, ensuring minimal disruptions and optimal performance without frequent interruptions.

    What You’ll Need (Required Qualifications):

    • Experience:
      • Minimum of 10 years of operational experience at scale in database management with at least 6 years focused on PostgreSQL.
      • Extensive experience managing PostgreSQL databases in cloud environments, particularly Azure. Experience with GCP is a plus but not required.
      • Hands-on experience with Continuous Integration/Continuous Deployment pipelines, Database/Schema as Code tools, and Git workflows.
      • Expert level experience analyzing and tuning queries to improve application performance.
      • Experience with database recovery, including point-in-time recovery and transaction wraparound process remediation.
      • Experience with management of Postgres Vacuuming process, able to design and implement processes and standards to prevent production impacting Vacuuming events.
      • Experience with PostgreSQL upgrade processes.
    • Technical Skills:
      • Advanced knowledge of PostgreSQL architecture, performance tuning, and query optimization experience with Azure Query Performance Insight is a plus.
      • Advanced knowledge of operating databases at large scale in cloud-managed environments including optimizing for cpu, memory,IO usage and cost management.
      • Proficiency in automation tools such as Terraform, Ansible, and Flyway.
      • Strong understanding of security practices and compliance frameworks.
      • Extensive knowledge of PostgreSQL internals, index design, statistics and wait types.
    • Soft Skills:
      • Excellent written and verbal communication skills with the ability to convey complex technical concepts to both technical and non-technical stakeholders.
      • Strategic thinker with the ability to drive long-term database solutions aligned with business goals.
      • Collaborative and team-oriented, contributes ideas, suggestions, and effort to the group.
      • Ability to work effectively in a collaborative team environment or independently.

    The base salary hiring range for this position is $108,900 to $189,700. Compensation offered will be determined by factors such as location, level, job-related knowledge, skills, and experience. Certain roles may be eligible for incentive compensation, equity, and benefits.
    In addition to your compensation, enjoy the rewards of an organization that puts our heart into caring for our colleagues and our communities.  Eligible employees may enroll in a full range of medical, dental, and vision benefits, 401(k) retirement savings plan, and an Employee Stock Purchase Plan.  We also offer education assistance, free development courses, paid time off programs, paid holidays, a CVS store discount, and discount programs with participating partners.  

    About Us:

    Signify Health is helping build the healthcare system we all want to experience by transforming the home into the healthcare hub. We coordinate care holistically across individuals’ clinical, social, and behavioral needs so they can enjoy more healthy days at home. By building strong connections to primary care providers and community resources, we’re able to close critical care and social gaps, as well as manage risk for individuals who need help the most. This leads to better outcomes and a better experience for everyone involved.

    Our high-performance networks are powered by more than 9,000 mobile doctors and nurses covering every county in the U.S., 3,500 healthcare providers and facilities in value-based arrangements, and hundreds of community-based organizations. Signify’s intelligent technology and decision-support services enable these resources to radically simplify care coordination for more than 1.5 million individuals each year while helping payers and providers more effectively implement value-based care programs.

    To learn more about how we’re driving outcomes and making healthcare work better, please visit us at www.signifyhealth.com

    Diversity and Inclusion are core values at Signify Health, and fostering a workplace culture reflective of that is critical to our continued success as an organization.

    We are committed to equal employment opportunities for employees and job applicants in compliance with applicable law and to an environment where employees are valued for their differences.

    See more jobs at Signify Health

    Apply for this job

    +30d

    Software Engineer, Infrastructure

    GeminiRemote (USA)
    golangagileremote-firstscalaDesignansibleazurerubyjavadockerjenkinspythonAWS

    Gemini is hiring a Remote Software Engineer, Infrastructure

    About the Company

    Gemini is a global crypto and Web3 platform founded by Tyler Winklevoss and Cameron Winklevoss in 2014. Gemini offers a wide range of crypto products and services for individuals and institutions in over 70 countries.

    Crypto is about giving you greater choice, independence, and opportunity. We are here to help you on your journey. We build crypto products that are simple, elegant, and secure. Whether you are an individual or an institution, we help you buy, sell, and store your bitcoin and cryptocurrency. 

    At Gemini, our mission is to unlock the next era of financial, creative, and personal freedom.

    In the United States, we have a flexible hybrid work policy for employees who live within 30 miles of our office headquartered in New York City and our office in Seattle. Employees within the New York and Seattle metropolitan areas are expected to work from the designated office twice a week, unless there is a job-specific requirement to be in the office every workday. Employees outside of these areas are considered part of our remote-first workforce. We believe our hybrid approach for those near our NYC and Seattle offices increases productivity through more in-person collaboration where possible.

    The Department: Onchain

    The Role: Software Engineer, Infrastructure

    The infrastructure team at Gemini creates and manages tools and platforms, automates the creation and support of this infrastructure, helps integrate complex processes, and supports secure data access. We take a software-first mindset to use code to make and maintain our software infrastructure. We seek engineers who think of themselves as a software engineer first, with correlated expertise in back-end engineering. 

    Our team builds and operates environments for the purpose of digital asset access. There are three main pillars of work including building and running network nodes, building and running validators, and supporting our next generation wallet infrastructure. In these pillars, we use and implement tools and software to support our cloud-based infrastructure. Given the need to build and integrate more of our software in the cloud, the ideal engineer will have extensive experience in developing, automating, and building software with associated cloud expertise (e.g., AWS or GCP). This engineer will also work closely with various teams including various teams such as Protocols, Product Security, On-chain, and Custody. 

    We are a dynamic group with both entrepreneurial spirit and security engineering experience. We have incredibly high aspirations, and we are looking for like-minded individuals who want to guide the transition to a new more decentralized world where access to digital assets is normalized and ubiquitous.

    Responsibilities:

    • Design, build, and deploy infrastructure in our three areas of focus 1) building and running network nodes, 2) building and running validators, and 3) building and running our next generation wallet infrastructure
    • Develop tools and automation that integrate these systems in a secure way
    • With a focus on our next generation wallet infrastructure, improve the capabilities of the existing infrastructure with a mindset towards infrastructure as code
    • Improve availability and reliability while maintaining acceptable security 
    • Integrate the use of cloud-based security mechanisms into the build infrastructure. Example security mechanisms include identity and access management and key management
    • Participate in disaster recovery (DR) scenarios to validate operability of physical and digital material

    Minimum Qualifications:

    • 5+ years implementing software 
    • Experience in at least one area of software development, operating systems or device driver development, hardware, secure protocols, encryption, authentication, key management, or applied cryptography – has expertise beyond automation
    • Hands-on experience in at least one or more cloud platforms (e.g., AWS, GCP, Azure, or others)
    • Hands-on expertise with one or more of the following including ansible, puppet, docker, KMS, IAM, jenkins
    • Experience implementing software automation processes applied in a cloud environment
    • Proficiency in a common scripting language including but not limited to Python, Ruby, etc.
    • Able to troubleshoot and debug issues, and demonstrate a methodical approach to root cause analysis
    • Strong written and verbal communication skills; attentive to details

    Preferred Qualifications:

    • Previous experience in one of the three focus areas of blockchain node operations, validators as a service, and wallet infrastructure
    • 1+ year Golang experience
    • 2+ years implementing software in the cloud (e.g., AWS)
    • 1+ years using monitoring, alerting, and automation tooling 
    • Experience in a code-first environment, developing automated solutions to solve support and operational issues
    • Experience working with engineering teams, teaching, training, and mentoring on how to implement best-practice technical solutions
    • Ability to read and write code written in Python, Java, Scala, C/C++, and Golang
    • Demonstrated ability to convert theoretical security concepts into production
    • Solid understanding of Product Management and Product Ownership, Agile practices and methodologies
    It Pays to Work Here
     
    The compensation & benefits package for this role includes:
    • Competitive starting salary
    • A discretionary annual bonus
    • Long-term incentive in the form of a new hire equity grant
    • Comprehensive health plans
    • 401K with company matching
    • Paid Parental Leave
    • Flexible time off

    Salary Range: The base salary range for this role is between $120,000 - $150,000 in the State of New York, the State of California and the State of Washington. This range is not inclusive of our discretionary bonus or equity package. When determining a candidate’s compensation, we consider a number of factors including skillset, experience, job scope, and current market data.

    At Gemini, we strive to build diverse teams that reflect the people we want to empower through our products, and we are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or Veteran status. Equal Opportunity is the Law, and Gemini is proud to be an equal opportunity workplace. If you have a specific need that requires accommodation, please let a member of the People Team know.

    Apply for this job