Meest populaire vacatures

1596Banen gevonden

1596 Banen gevonden 

M
M

Service Reliability Engineer

Mambu

Amsterdam
11 dagen geleden
Amsterdam
11 dagen geleden
Mambu is the SaaS banking engine powering innovative loan and deposit products, the lean alternative to cumbersome core banking systems. Helping clients to successfully start up business ventures, transform existing operations, launch new products and expand into new markets. Mambu provides financial institutions of all sizes with the agility to rapidly design, launch, service and scale their banking and lending portfolio. We believe that a great company is built on great people. We are proud to have brought together incredibly bright minds to help make financial services ready for the 21st century. Our clients understand what it takes to succeed in a fully digital world and our team is a trusted partner in their endeavours. We are looking for a passionate, skilled and enthusiastic Site Reliability Engineer to join our team in Amsterdam. As a SRE you will build, operate and improve a highly available, performant, scalable, cost-effective, reliable and secure Mambu Cloud Platform using latest tools and technologies. More about us: To stay on top of the latest Fin-Tech trends and our success stories, please follow us on LinkedIn For more details regarding our global career opportunities, please visit Career Site
E
E

Maintenance Engineer, Zaandam of Dordrecht

Engie Nederland

Zaandam, UT
3 dagen geleden
Zaandam, UT
3 dagen geleden
FUNCTIEOMSCHRIJVING
Als maintenance engineer focus je je op de cyclus van onderhoudsoptimalisatie van technische installaties van infrastructurele werken. De belangrijkste bijdragen zijn het opzetten, standaardiseren en optimaliseren van onderhoudsconcepten. Je bent van de tenderfase tot en met de daadwerkelijke invulling van het contract betrokken. Hiernaast zorg je voor het ontwikkelen en borgen van onderhoudskennis. 

De werkzaamheden bestaan uit:  
  • Je stelt onderhoudsprogramma’s op, op basis van methodieken zoals reliability centered maintenance en je voert verbeteringen door in het beheer en onderhoud van de diverse infra-installaties Dit doe je d.m.v. methodieken zoals root cause analysis en FMECA’s om tot de meest ideale oplossing te komen 
  • Je haalt informatie op bij jouw collega’s in het werkveld, door middel van gesprekken en bezoeken aan de engineers en monteurs op locatie 
  • Je vertaalt deze informatie in heldere en optimale onderhoudsstrategieën 
  • Je bezoekt de diverse objecten in lopende en nieuwe beheer- en onderhoudscontracten, en je onderhoudt contact met de opdrachtgever zodat je over alle relevante informatie beschikt om het proces te optimaliseren 

FUNCTIE-EISEN 
  • Je beschikt over een HBO/ WO diploma binnen de techniek Je hebt minimaal drie jaar werkervaring in een soortgelijke functie 
  • Ervaring met FMECA/RCM/Faulttree methodieken en basiskennis van RAMS 
  • Je kunt werken met Onderhouds Management Systemen 
  • Je bezit kennis m.b.t. onderhoudsprocessen en continu verbeteren d.m.v. PDCA 
  • Communicatief
  • Initiatiefrijk 
  • Verantwoordelijkheid gevoel 
  • Je bent woonachtig in de randstad vanwege de standplaats Dordrecht of Zaandam 

WAT BIEDEN WIJ JE 
Persoonlijke begeleiding, vrijheid, afwisselend werk en een werkomgeving waar jouw mening telt. Je maakt bovendien onderdeel uit van het wereldwijde ENGIE concern, één van de meest innovatieve bedrijven ter wereld. 

Naast aandacht voor persoonlijke groei bieden we een goed salaris en aantrekkelijke secundaire arbeidsvoorwaarden. Afhankelijk van je functie horen daar een laptop, telefoon, en een leaseauto bij. Maar ook een goede pensioenregeling, een collectieve zorgverzekering en personeelskortingen op o.a. uitjes en producten. En wist je dat je bij een dienstverband van 40 uur per week maar liefst 38 vrije dagen per jaar hebt? Zo kun je werk en privé extra goed in balans houden. 

BEDRIJFSINFORMATIE 
ENGIE Nederland is onderdeel van de beursgenoteerde ENGIE Groep. ENGIE is actief in 70 landen, met wereldwijd 150.000 medewerkers. Als groep is het onze missie om bij te dragen aan de verduurzaming van de wereld. ENGIE Nederland bestaat uit ENGIE Services en ENGIE Energie. Samen voorzien we onze klanten van energie en technische oplossingen. Van B2B en B2C, van kleine tot grote klanten, van Terneuzen tot Delfzijl. 

Bij ENGIE Infra & Mobility werken we aan omvangrijke infrastructurele projecten door heel Nederland. Denk hier aan complexe technische opdrachten voor tunnels en bruggen, sluizen en verkeerssignalering. Maar ook aan verkeerscentrales en energieprojecten zoals oplossingen voor elektrisch vervoer, Smart Grids en windparken. We zetten innovatieve techniek in op een efficiënte en duurzame manier. Teamwork en uitwisseling van kennis en kunde vormen de basis van ons succes. 

CONTACTINFORMATIE 
Spreekt deze vacature je aan? Reageer dan direct via de button ‘solliciteren’ en solliciteer eenvoudig via de Chatbot. Voor vragen kun je contact opnemen met Michelle Hartmann via

michelle.hartmann@engie.com

of telefonisch op 06- 13 28 63 72 
 Engineers_
Additional Information
  • Main Tile: #Engineers_NL
M
M

Senior Service Reliability Engineer - Observability

Mambu

Amsterdam
11 dagen geleden
Amsterdam
11 dagen geleden
Mambu is the leading SaaS core banking engine. If you’re a customer of the largest digital bank in the EU, then you’ve probably interacted with our platform and didn't even know it. We are at the heart of what makes digital banks and lenders work - the system that processes banking transactions and updates accounts and other financial records from deposits to loans and credit balances. But Mambu is different.  We are not just cloud-native, lean and flexible - we are helping to revolutionise financial services globally. We are in a growth phase and we’ve only just begun. To help us on our mission, we bring together people with the best skills and attitude. It doesn’t matter where you are from, what matters is the impact you have and your passion to make a difference. We are looking for a passionate, skilled and enthusiastic Service Reliability Engineer - Observability to join our team. As a Monitoring Engineer, you will build, operate and improve monitoring of Mambu core banking services, across all product engineering tribes and enable engineering teams by improving observability of their services. More about us: To stay on top of the latest Fin-Tech trends and our success stories, please follow us on LinkedIn For more details regarding our global career opportunities, please visit Career Site
T
T

Principal Site Reliability Engineer

TomTom

Amsterdam
30+ dagen geleden
Amsterdam
30+ dagen geleden

At TomTom…
You’ll move the world forward. Every day, we create the most innovative mapping and location technologies to shape tomorrow’s mobility for the better.

We are proud to be one team of more than 5,000 unique, curious, passionate problem-solvers spread across the world. We bring out the best in each other. And together, we help the automotive industry, businesses, developers, drivers, citizens and cities move towards a safe, autonomous world that is free of congestion and emissions.

The SRE team at TomTom brings software and system engineering skills together. We code our way out of operational problems, working with internal and external teams to build resilient, scalable and reliable systems in order to deliver services of the highest quality to our customers.

This is a unique chance to solve challenging problems, dig deep and troubleshoot complex systems, learn from incidents, work across the stack to drive the reliability of our services, have a real impact on global mobility every day.


What you’ll do

  • Work with partners to shape the architecture, design, and implementation of new and existing systems and ensure their reliability.
  • Join the Incident Commander rotation in high priority incidents, getting hands-on when required to improve the TTR.
  • Apply resiliency engineering and drive incident response, analysis and remediation to prevent future occurrence.
  • Ensure that critical services have an adequate monitoring and alerting setup and that operational hygiene is applied to guarantee their continuity.
  • Deliver software to improve reliability, performance and scalability across the stack.
  • Collaborate with the team to define the SRE strategy and roadmap.

What you’ll need

  • 10+ years of working experience in a production environment, covering software and system engineering.
  • 5+ years of production experience operating Linux systems on cloud or bare metal, covering infrastructure as code, configuration management and monitoring.
  • Extensive experience designing, developing, operating and troubleshooting mission-critical distributed systems at scale.
  • Knowledge of algorithms and data structures, proficiency in one or more modern programming language, such as: Java, Go, C++, Scala or Python.
  • Knowledge of Linux systems internals.
  • Knowledge of networking.
  • Excellent written and oral communication skills, ability to collaborate successfully with technical and non-technical stakeholders.
  • Track record of establishing successful mentorship relationships with colleagues, expressing technical leadership without "pulling rank" and role modeling the SRE principles.
  • Business acumen, ability to prioritize high ROI work, strong sense of ownership.

Nice to have

  • Experience working with Kubernetes and Prometheus in production.
  • Experience working with AWS, Azure or a similar cloud environment at scale.

Meet your team

Our team is in the core TomTom live services. We connect with all DevOps teams and make sure that there is a good as possible customer experience when there is an incident and minimize the MTTR (Mean time to resolve). We also focus on reducing the number of incidents as we participate in improvement actions with a focus on automation and reliability setup.

Our Site Reliability Engineers (SRE) are a hybrid of software and systems engineers. We code our way out of operational problems. We are responsible for reliability, scalability, and automation while keeping an eye on latency, performance and capacity as well as other KPI’s.

Achieve more
We are self-starters who play well with others. Every day, we solve new problems with creativity, meet new people and learn rapidly at our offices around the world. We will invest in your growth and are committed to supporting you. In everything we do, we’re guided by six values: We care, putting our heart into what we do; we build trust (you can count on us); we create – driven to make a difference; we are confident, but don’t boast; we keep it simple, since life is complex enough; and we have fun because life’s too short to be boring.
After you apply
Our recruitment team will work hard to give you a meaningful experience throughout the process, no matter the outcome. Your application will be screened closely and you can rest assured that all follow-up actions will be thorough, from assessments and interviews through your onboarding.
TomTom is an equal opportunity employer
We celebrate diversity, thrive on each other’s differences and are committed to creating an inclusive environment at our offices around the world. Naturally, we do not discriminate against any employee or job applicant because of race, religion, color, sexual orientation, gender, gender identity or expression, marital status, disability, national origin, genetics, or age.
Ready to move the world forward?

At TomTom…
You’ll move the world forward. Every day, we create the most innovative mapping and location technologies to shape tomorrow’s mobility for the better.

We are proud to be one team of more than 5,000 unique, curious, passionate problem-solvers spread across the world. We bring out the best in each other. And together, we help the automotive industry, businesses, developers, drivers, citizens and cities move towards a safe, autonomous world that is free of congestion and emissions.

The SRE team at TomTom brings software and system engineering skills together. We code our way out of operational problems, working with internal and external teams to build resilient, scalable and reliable systems in order to deliver services of the highest quality to our customers.

This is a unique chance to solve challenging problems, dig deep and troubleshoot complex systems, learn from incidents, work across the stack to drive the reliability of our services, have a real impact on global mobility every day.


What you’ll do

  • Work with partners to shape the architecture, design, and implementation of new and existing systems and ensure their reliability.
  • Join the Incident Commander rotation in high priority incidents, getting hands-on when required to improve the TTR.
  • Apply resiliency engineering and drive incident response, analysis and remediation to prevent future occurrence.
  • Ensure that critical services have an adequate monitoring and alerting setup and that operational hygiene is applied to guarantee their continuity.
  • Deliver software to improve reliability, performance and scalability across the stack.
  • Collaborate with the team to define the SRE strategy and roadmap.

What you’ll need

  • 10+ years of working experience in a production environment, covering software and system engineering.
  • 5+ years of production experience operating Linux systems on cloud or bare metal, covering infrastructure as code, configuration management and monitoring.
  • Extensive experience designing, developing, operating and troubleshooting mission-critical distributed systems at scale.
  • Knowledge of algorithms and data structures, proficiency in one or more modern programming language, such as: Java, Go, C++, Scala or Python.
  • Knowledge of Linux systems internals.
  • Knowledge of networking.
  • Excellent written and oral communication skills, ability to collaborate successfully with technical and non-technical stakeholders.
  • Track record of establishing successful mentorship relationships with colleagues, expressing technical leadership without "pulling rank" and role modeling the SRE principles.
  • Business acumen, ability to prioritize high ROI work, strong sense of ownership.

Nice to have

  • Experience working with Kubernetes and Prometheus in production.
  • Experience working with AWS, Azure or a similar cloud environment at scale.

Meet your team

Our team is in the core TomTom live services. We connect with all DevOps teams and make sure that there is a good as possible customer experience when there is an incident and minimize the MTTR (Mean time to resolve). We also focus on reducing the number of incidents as we participate in improvement actions with a focus on automation and reliability setup.

Our Site Reliability Engineers (SRE) are a hybrid of software and systems engineers. We code our way out of operational problems. We are responsible for reliability, scalability, and automation while keeping an eye on latency, performance and capacity as well as other KPI’s.

Achieve more
We are self-starters who play well with others. Every day, we solve new problems with creativity, meet new people and learn rapidly at our offices around the world. We will invest in your growth and are committed to supporting you. In everything we do, we’re guided by six values: We care, putting our heart into what we do; we build trust (you can count on us); we create – driven to make a difference; we are confident, but don’t boast; we keep it simple, since life is complex enough; and we have fun because life’s too short to be boring.
After you apply
Our recruitment team will work hard to give you a meaningful experience throughout the process, no matter the outcome. Your application will be screened closely and you can rest assured that all follow-up actions will be thorough, from assessments and interviews through your onboarding.
TomTom is an equal opportunity employer
We celebrate diversity, thrive on each other’s differences and are committed to creating an inclusive environment at our offices around the world. Naturally, we do not discriminate against any employee or job applicant because of race, religion, color, sexual orientation, gender, gender identity or expression, marital status, disability, national origin, genetics, or age.
Ready to move the world forward?

P
P

Senior Site Reliability Engineer

Palo Alto Networks

Amsterdam
14 dagen geleden
Amsterdam
14 dagen geleden
Company Description

Our Mission
At Palo Alto Networks® everything starts and ends with our mission:
Being the cybersecurity partner of choice, protecting our digital way of life.
We have the vision of a world where each day is safer and more secure than the one before. These aren’t easy goals to accomplish – but we’re not here for easy. We’re here for better. We are a company built on the foundation of challenging and disrupting the way things are done, and we’re looking for innovators who are as committed to shaping the future of cybersecurity as we are. 

Job Description

Your Career
Palo Alto Networks has been rapidly moving towards the future where cloud-based applications are increasingly common. Our Site Reliability Engineers are a hybrid of software and systems engineers. Our mission is to design the next version of the Prisma Access. We code our way out of operational problems. We are responsible for reliability, scalability, and automation while keeping an eye on latency, performance, and capacity.
Your Impact

  • Design, write and maintain software to improve the availability, scalability, latency, and efficiency of the services, incorporating third-party open-source tools when available.
  • Create new designs for a growing number of distributed systems.
  • Design and implement the tools and processes used for deployment and change management.
  • Plan and execute configuration management.
  • Own, maintain, and continuously improve all systems provided as a service, such as monitoring and datastores.
  • Engage in service capacity planning and demand forecasting, anticipating performance bottlenecks.
  • Automate resource provisioning and allocation process.
  • Run software performance analysis and system tuning.
  • Plan and execute disaster recovery drills
  • Participate in rotating on-call duties
Qualifications

Your Experience

  • Fluent in one or more of: Python, Go
  • Minimum of 4 years of industry experience in engineering
  • Familiarity with algorithms, data structures, and complexity analysis
  • In-depth knowledge of operating systems (processes, threads, concurrency, etc)
  • Experience working with Unix/Linux systems from kernel to shell and beyond, with experience
  • working with system libraries, file systems, and client-server protocols
  • Experience with network protocols and theory (TCP/IP, UDP, ICMP, MAC addresses, IP
  • packets, DNS, and load balancing, etc.)
  • Experience with Kubernetes, Terraform, ansible
  • Systematic problem solving approach
  • Strong sense of ownership and drive

Nice-to-Have

  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
  • Experience with Amazon Web Services & Google Cloud Platform
  • Experience with InfluxDB, Graphite, NoSQL tuning and performance
  • Bachelor in Computer Science / Engineering or Equivalent

Additional Information

The Team
As a member of the SRE team, you will work on producing mission-critical platforms, tools, and processes that will ensure the highest levels of availability and reliability of all our applications. We need creative and innovative problem solvers who can partner with our Application development teams to make their services more usable. Our SRE team is furnished with a standout opportunity to build tools, frameworks, and cloud platforms that will support our company’s growth over the next decade. If you are a self-starter and jump on new ideas to make the platform more stable, secure and feature-rich, this is your new career.
Our Commitment
We’re trailblazers that dream big, take risks, and challenge cybersecurity’s status quo. It’s simple: we can’t accomplish our mission without diverse teams innovating, together.
We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at

accommodations@paloaltonetworks.com

.
Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.

M
M

Senior Service Reliability Engineer

Mambu

Amsterdam
11 dagen geleden
Amsterdam
11 dagen geleden
Mambu is the leading SaaS core banking engine. If you’re a customer of the largest digital bank in the EU, then you’ve probably interacted with our platform and didn't even know it. We are at the heart of what makes digital banks and lenders work - the system that processes banking transactions and updates accounts and other financial records from deposits to loans and credit balances. But Mambu is different.  We are not just cloud-native, lean and flexible - we are helping to revolutionise financial services globally. We are in a growth phase and we’ve only just begun. We are looking for a passionate, skilled and enthusiastic Senior Site Reliability Engineer to join our team. As a SRE you will build, operate and improve a highly available, performant, scalable, cost-effective, reliable and secure Mambu Cloud Platform using latest tools and technologies. More about us: To stay on top of the latest Fin-Tech trends and our success stories, please follow us on LinkedIn For more details regarding our global career opportunities, please visit Career Site
F
F

Site Reliability Engineer (Remote)

FreshBooks

Amsterdam, North Holland, Netherlands, NH
30+ dagen geleden
Amsterdam, North Holland, Netherlands, NH
30+ dagen geleden

FreshBooks has a big vision. We launched in 2003 but we’re just getting started and there’s a lot left to do. We're a high performing team working towards a common goal: building an elite online accounting application to help small businesses better handle their finances. Known for extraordinary customer service and based in Toronto, Canada, FreshBooks serves paying customers in over 120 countries.

The Opportunity - Site Reliability Engineer (Remote)

The Shared Services team at FreshBooks is looking for talented & experienced engineers to help us build and support our cloud infrastructure. Join our growing organization and you will get a chance to be in the driving seat of innovation and change at FreshBooks.

As a Site Reliability Engineer, you’ll be joining a team of mix background technologists. Our mandate is to provide secure, flexible and stable platform solutions that empower our feature development teams to create the highest quality services for our customers. This team is also responsible for Reliability Engineering at FreshBooks.

What you'll do:

  • Develop, deploy and operate cloud native infrastructure, on GCP, for the FreshBooks’ SaaS accounting platform.
  • Taking a data-driven approach to operations you will drive a culture of automation, both within the team and throughout the organization, to scale FreshBooks efficiently and reliably.
  • Partner with developer teams to establish Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to the existing FreshBooks’ services that make up our product.
  • Performing service criticality and reliability analysis of each subsystem of FreshBooks’ systems.
  • Developing & Improving application & Infrastructure monitoring and alerting standards
  • Helping raise the bar on our incident response management process and joining a periodic on-call rotation.
  • You will help harden & operate various technologies used at FreshBooks like: RabbitMQ Service, DNS, Kubernetes, GCP networking, etc. 
  • Create cutting edge cloud infrastructure through Infrastructure-as-Code and automation 
  • Build and improve cloud infrastructure self-service tooling that powers our internal platform used by hundreds of developers. 
  • Lead and participate in technical discussions to aid system design, analysis, and troubleshooting. 

We think you'll be an amazing fit for this position if your application can demonstrate:

  • 5+ years as a software engineer and writing Infrastructure as a code.
  • You have spent at least 2 years as a Site Reliability Engineer.
  • You have a natural instinct to drive for operational maturity with partner teams.
  • You have demonstrated experience supporting at least one of the predominant public clouds (AWS, Azure, or GCP).  Bonus points for experience on GCP.
  • You have a deep understanding of Cloud computing concepts and solutions.
  • You have hands-on experience with container technologies, like Docker runtime and Kubernetes orchestration. 
  • You have hands-on experience working with Infrastructure-as-Code tools such as Terraform. 
  • You have expertise with Linux Operating system administration and a good understanding of Linux fundamentals like signals, scheduling, filesystems. 
  • You have strong TCP/IP networking knowledge preferably across the cloud deployment stack - DNS, LDAP, network routing, L3/L7 protocols etc. 
  • Very Strong problem solving & troubleshooting skills including ability to perform root cause analysis and preventative analysis.  
  • Experience using and/or implementing modern observability tooling such as Prometheus, InfluxDB, Grafana, Logstash, Kibana or Jaeger.
  • Experience with APM and monitoring tools such as DataDog and New Relic.
  • You are available after hours for planned activities and support as required.

It’s a bonus if you have:

  • Experience with continuous integration pipelines (you’ll work closely with our Engineering Effectiveness team to help us get quality code in front of customers quickly).
  • Experience with web applications developed in Python.
  • A degree in Computer Science/Engineering or Information Technology.

Why Join Us

We're a motivated bunch, with our eyes laser-focused on shipping extraordinary experiences to businesses. You will be surrounded by hardworking team members who share a common vision for what an amazing software company could be, and have the opportunity to help build an elite one, right here in downtown Toronto.

Apply Now

Have we got your attention? Submit your application today and a member of our recruitment team will be in touch with you shortly!

FreshBooks is an equal opportunity employer. We do not discriminate based on gender, religion, race, mental disability, sexual orientation, age, or any other status. All applicants are considered based on their qualifications and merits. At FreshBooks, we inspire an environment of mutual respect and we believe diversity and inclusion are crucial to our success.

FreshBooks provides employment accommodation during the recruitment process. Should you require any accommodation, please indicate this on your application and we will work with you to meet your accessibility needs. For any questions, suggestions or required documents regarding accessibility in a different format, please contact us at phone 416-780-2700 and/or accessibility@freshbooks.com.

L
L

Site Reliability Engineer - Akka Serverless

Lightbend

Amsterdam
30+ dagen geleden
Amsterdam
30+ dagen geleden

Lightbend is developing a cloud platform that makes distributed systems and design patterns consumable as a service. Our mission is to take care of the complexities of running distributed systems, allowing developers to focus on their business logic while delivering resilient and scalable systems. We are taking the traditional stateless FaaS model, and turning it on its head, pushing into the uncharted territory of managing stateful application code, built on the solid foundations of tried and tested distributed computing principles that we have successfully delivered over more than a decade.

We are looking for an experienced Site Reliability Engineer in the European timezone to join our Cloud Services team who is excited to leverage leading SRE practices to operate highly resilient and scalable systems. 

Responsibilities:

  • Develop and extend software to monitor and improve end-to-end platform performance, identify runtime deficiencies, find potential failures, and fix production issues in a fully managed Cloud environment.
  • Participate in on-call rotation and incident-resolution.
  • Build deep, full-stack knowledge of our platforms and applications. 
  • Work to simplify and automate deployment processes, run-time operations, and provide non-disruptive releases.
  • Help create and maintain an environment that provides security and privacy for our customers data.
  • Maintain application reliability and uptime SLAs throughout the application lifecycle using programmatic self-healing and software automation.
  • Travel occasionally to meet with the rest of Lightbend’s technical team, as safely permitted.

Candidates can potentially live anywhere in Europe, as this is a fully remote position. This is not a full-time firefighting role requiring super heros. Site reliability is the entire team’s responsibility. We are looking for an operations expert to be a part of building and running our new offerings as we expand our platform to Europe.

Qualifications:

You

  • Are an SRE who understands how to operate modern distributed data systems on Kubernetes to be extremely reliable with predictable performance.
  • Have experience with Google’s Cloud service offerings, GCP, GKE and related services, specifically from an operational perspective.
  • Have a passion for automating the complexities of orchestrating and running multi-tenant cloud application services.
  • Are accustomed to collaborating with business owners and understanding diverse business requirements.
  • Have two or more years of experience in distributed systems architecture and runtime requirements.
  • Are a voracious learner, ready to take on new technologies and techniques quickly and constantly.
  • Have excellent written and verbal communication skills in at least English.
  • Are skillful at interacting and working with people; working with a self-organized lean and agile team to mitigate project risks, manage effort and ensure quality.
  • Are dedicated to best practices such as infrastructure as code, automated testing, code reviews, and continuous integration, deployment & testing.
  • Are biased towards action on tough problems and issues, and focused on your customer’s success.
  • Are an agent of change, constantly learning and seeking better outcomes.
  • Are familiar with many of the supporting technologies we use, including Terraform, Prometheus, Grafana, Actors, Service Mesh frameworks, etc.
  • Are experienced with complex and secure networking environments, including Encryption Keys, and TLS.

Ideally, you also...

  • Have knowledge of the Lightbend technologies and distributed systems, including Akka clustering.
  • Have supported SaaS/PaaS systems.
  • Have an awareness of Serverless/Functions-as-a-service Platforms.

What we offer:

Lightbend is a welcoming, transparent, and highly distributed company dedicated to creating high-performance systems that bring success to all who use them.  With a strong focus on work-life balance, our company offers a fast-paced, collaborative environment mixed with challenging and engaging work. This combination has attracted and retained some of the brightest minds in our technology communities.

Lightbend is an Equal Opportunity Employer.

 

L
L

Site Reliability Engineer - Akka Serverless

Lightbend

Amsterdam
30+ dagen geleden
Amsterdam
30+ dagen geleden

Lightbend is developing a cloud platform that makes distributed systems and design patterns consumable as a service. Our mission is to take care of the complexities of running distributed systems, allowing developers to focus on their business logic while delivering resilient and scalable systems. We are taking the traditional stateless FaaS model, and turning it on its head, pushing into the uncharted territory of managing stateful application code, built on the solid foundations of tried and tested distributed computing principles that we have successfully delivered over more than a decade.

We are looking for an experienced Site Reliability Engineer in the European timezone to join our Cloud Services team who is excited to leverage leading SRE practices to operate highly resilient and scalable systems. 

Responsibilities:

  • Develop and extend software to monitor and improve end-to-end platform performance, identify runtime deficiencies, find potential failures, and fix production issues in a fully managed Cloud environment.
  • Participate in on-call rotation and incident-resolution.
  • Build deep, full-stack knowledge of our platforms and applications. 
  • Work to simplify and automate deployment processes, run-time operations, and provide non-disruptive releases.
  • Help create and maintain an environment that provides security and privacy for our customers data.
  • Maintain application reliability and uptime SLAs throughout the application lifecycle using programmatic self-healing and software automation.
  • Travel occasionally to meet with the rest of Lightbend’s technical team, as safely permitted.

Candidates can potentially live anywhere in Europe, as this is a fully remote position. This is not a full-time firefighting role requiring super heros. Site reliability is the entire team’s responsibility. We are looking for an operations expert to be a part of building and running our new offerings as we expand our platform to Europe.

Qualifications:

You

  • Are an SRE who understands how to operate modern distributed data systems on Kubernetes to be extremely reliable with predictable performance.
  • Have experience with Google’s Cloud service offerings, GCP, GKE and related services, specifically from an operational perspective.
  • Have a passion for automating the complexities of orchestrating and running multi-tenant cloud application services.
  • Are accustomed to collaborating with business owners and understanding diverse business requirements.
  • Have two or more years of experience in distributed systems architecture and runtime requirements.
  • Are a voracious learner, ready to take on new technologies and techniques quickly and constantly.
  • Have excellent written and verbal communication skills in at least English.
  • Are skillful at interacting and working with people; working with a self-organized lean and agile team to mitigate project risks, manage effort and ensure quality.
  • Are dedicated to best practices such as infrastructure as code, automated testing, code reviews, and continuous integration, deployment & testing.
  • Are biased towards action on tough problems and issues, and focused on your customer’s success.
  • Are an agent of change, constantly learning and seeking better outcomes.
  • Are familiar with many of the supporting technologies we use, including Terraform, Prometheus, Grafana, Actors, Service Mesh frameworks, etc.
  • Are experienced with complex and secure networking environments, including Encryption Keys, and TLS.

Ideally, you also...

  • Have knowledge of the Lightbend technologies and distributed systems, including Akka clustering.
  • Have supported SaaS/PaaS systems.
  • Have an awareness of Serverless/Functions-as-a-service Platforms.

What we offer:

Lightbend is a welcoming, transparent, and highly distributed company dedicated to creating high-performance systems that bring success to all who use them.  With a strong focus on work-life balance, our company offers a fast-paced, collaborative environment mixed with challenging and engaging work. This combination has attracted and retained some of the brightest minds in our technology communities.

Lightbend is an Equal Opportunity Employer.

 

Powered by JazzHR

N
N

Senior Site Reliability Engineer

Nylas

Amsterdam
11 dagen geleden
Amsterdam
11 dagen geleden
Nylas is a pioneer and leading provider of universal communications APIs that allow developers to quickly connect their applications to every email, calendar, or contacts provider in the world. Over 40,000 developers around the globe use the Nylas communications platform to process over 1.2 billion API requests and 20TB of data per day from providers such as Gmail, Microsoft Exchange, Outlook, Yahoo! and more.  Who We Are Nylas was founded in 2013 by a couple of MIT graduates who were passionate about making complex systems simpler. Co-founder and CTO Christine Spang saw that email use was growing at a steady rate, yet there wasn’t a simple way to unify this data-rich tool in a way that developers could easily integrate with this data. She and a small team (at the time) set out to fix this. Fast-forward to 2020, Nylas has successfully raised funding from Spark Capital, 8VC, ScaleUP, Round13, Citi Ventures, Slack Fund, Data Collective, Fuel Capital, and SV Angel. Nylas customers span from large enterprises such as Hyundai, Fox News Corp, Hubspot and Move.com to high-growth start-ups like Dialpad, Pipedrive, Lexicata, and Sparkpost. Our Work Philosophy Nylas is also big believers in the safety and well-being of our employees and society, which is why we are onboarding all new Nylanauts remotely during this global pandemic until there is a vaccine and it is safe for humans to resume their pre-COVID lifestyles. AND once COVID-19 pandemic eventually comes to an end, we will continue to embrace Remote First philosophy, with a minor twist: Remote First, Office Second. That's right! It's about how you work, not where you work. Nylanauts can choose any workspace or environment that will result in more ideas, engagement, creativity, focus, collaboration, and productivity. It's true. Nylanauts can work from the slopes of Missoula or beaches of San Diego. Wherever motivates them; inspires them to be better versions of themselves. And if Nylanauts wish to work from an office in one of our hubs(San Francisco, Denver, New York City, Toronto, and London), they can! The workspace is there to be utilize. Why Remote First, Office Second? Because we not only believe in respecting individual working styles, disabilities, and personal schedules, but also ensuring everyone has a better work-life balance. The outcomes will always be more important than the physical location. So, if you’re looking to join a fast-growing company with a beloved, daily-use product, and an authentic mission that puts people first, we want to meet you. Want to know more? Check us out on Comparably and Great Place to Work!   About the Role: You'll help build and scale the infrastructure our platform runs on and the tools our developers need to get work done. Our SRE team is responsible for the infrastructure layer of our API platform—the base operating system (including security), CI/CD & deployment tools, monitoring and observability tools, and our horizontally sharded data storage layer which stores tens of terabytes of data. Right now, our open-source Python sync engine regularly archives terabytes of data across a massive SQL cluster, and our Flask APIs handle tens of millions of requests a day. We aim to scale that several times over in the next year.   At Nylas,“DevOps” is a part of our engineering culture, not a role we’re looking to fill. Our development team shares the pager with operations and makes their own deploys. We’re always looking for ways in which we can have specialists who delight in really knowing different parts of systems but still avoid being siloed away.   We keep our code and infrastructure automation in the same repo, and you’ll be empowered to make application changes necessary for scaling and reliability in collaboration with our development team. Our stack includes Python, MySQL, Redis, AWS, Debian GNU/Linux. Nylas is registered as an employer in many, but not all, states/provinces. If you are not located in or able to work from a state/province where Nylas is registered, you will not be eligible for employment. Visa sponsorship may not be available in certain remote locations. Nylas is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also EEO is the Law.

Geplaatst op

11 dagen geleden

Beschrijving

Mambu is the SaaS banking engine powering innovative loan and deposit products, the lean alternative to cumbersome core banking systems. Helping clients to successfully start up business ventures, transform existing operations, launch new products and expand into new markets. Mambu provides financial institutions of all sizes with the agility to rapidly design, launch, service and scale their banking and lending portfolio. We believe that a great company is built on great people. We are proud to have brought together incredibly bright minds to help make financial services ready for the 21st century. Our clients understand what it takes to succeed in a fully digital world and our team is a trusted partner in their endeavours. We are looking for a passionate, skilled and enthusiastic Site Reliability Engineer to join our team in Amsterdam. As a SRE you will build, operate and improve a highly available, performant, scalable, cost-effective, reliable and secure Mambu Cloud Platform using latest tools and technologies. More about us: To stay on top of the latest Fin-Tech trends and our success stories, please follow us on LinkedIn For more details regarding our global career opportunities, please visit Career Site
Source: Mambu