J on the Beach - Speakers

Speakers

No unicorns, no caticorns, just software development

Aarushi Kansal

@aarushikansal

Senior Backend Engineer at Netlify

Building Machine Learning Pipelines for Computer Vision

Adam Paszke

@apaszke

Main author of PyTorch

PyTorch: a modern flexible HPC environment

Alex Soto

@alexsotob

Software Engineer at Red Hat

Kubernetes-Native Java with Quarkus

Chaos is taking over my Kubernetes cluster

Alex Soto

@alexsotob

Software Engineer at Red Hat

Alex is a Director of Developer Experience at Red Hat. He is passionate about Java world, software automation and he believes in the open source software model.

Alex is the creator of NoSQLUnit project, member of JSR374 (Java API for JSON Processing) Expert Group, the co-author of Testing Java Microservices book for Manning and contributor of several open source projects. A Java Champion since 2017, international speaker and teacher at Salle URL University, he has talked about new testing techniques for microservices and continuous delivery in the 21st century.

Talk

Kubernetes-Native Java with Quarkus

Kubernetes is becoming the de-facto platform to deploy our application nowadays. But this movement also implies some changes in the way we code our applications. Before this change, we just developed a monolith application where everything was up and running up front, now we are breaking down this monolith into (micro)services architecture and everything is interconnected with the network. Although it might seem easy, done properly is not an easy as there are some challenges to address that was not in a monolith architecture. In this session, we’re going to start discussing what are these challenges (ie fault tolerance, service discovery, open tracing, or health checks) and demonstrate how they can be solved using Eclipse MicroProfile specification.

Come to this session to learn how to develop a successful Kubernetes Native application using Quarkus, a Java ecosystem way to develop cloud-first, container-native, serverless focused and Kubernetes optimized.

Talk

Chaos is taking over my Kubernetes cluster

Chaos Engineering is used in a distributed system to test integrally all the application by simulating error conditions within the system and observes how the application reacts to that errors. With all this information and analyzing it correctly, you can write applications more resilient to the failures. This talk will provide an introduction to the principles of Chaos Engineering, how to perform experiments, identify the weakness of the architecture and fix these problems.

Come to this session to learn different tools like Istio, Chaos Toolkit or Glooshot to run Chaos Engineering in Kubernetes and what strategies you can use to prevent chaos from taking over your system.

Alexis Duque

@alexis0duque

Director of R&D at Rtone

AI at the edge with Tensorflow Lite to Design the Future of Vertical Farming

Ana Lebrón Moreno

Site Reliability Engineer Manager at Enova International

SRE Role in a mature DevOps organization

Ana Valdivia

@ana_valdi

Data Scientist at Trilateral Research

When the AI hype comes to punish the opressed

Andrew Betts

Web Developer and Developer Advocate for Fastly

Andrew is a web developer and developer advocate for Fastly, working with developers across the world to help make the web faster, more secure, more reliable and easier to work with. He founded a web consultancy which was ultimately acquired by the Financial Times, led the team that created their pioneering HTML5 web app, and founded the FT’s Labs division. He has also been an elected member of the W3C Technical Architecture Group, a committee of nine people who guide the development of the World Wide Web.

Antonio Fernández Anta

@afdezanta

Research Professor at IMDEA Networks

Blockchains, Micro-ledgers and Other Creatures

Antonio Fernández Anta

@afdezanta

Research Professor at IMDEA Networks

Dr. Antonio Fernández Anta is a Research Professor at IMDEA Networks. Previously he was a Full Professor at the Universidad Rey Juan Carlos (URJC) and was on the Faculty of the Universidad Politécnica de Madrid (UPM), where he received an award for his research productivity. He was a postdoc at MIT from 1995 to 1997, and spent sabbatical years at Bell Labs Murray Hill and MIT Media Lab. He has been awarded the Premio Nacional de Informática "Aritmel" in 2019 and is a Mercator Fellow of the SFB MAKI in Germany since 2018. He has more than 25 years of research experience, and more than 200 scientific publications. He was the Chair of the Steering Committee of DISC and has served in the TPC of numerous conferences and workshops. He received his M.Sc. and Ph.D. from the University of SW Louisiana in 1992 and 1994, respectively. He completed his undergraduate studies at the UPM, having received awards at the university and national level for his academic performance. He is a Senior Member of ACM and IEEE.

Talk

Blockchains, Micro-ledgers and Other Creatures

There is a big hype about blockchains and distributed ledger technologies (DTL), and their potential to reshape many aspects of society. Unfortunately, currently, these terms include many different aspects of these technologies in such a way that it is difficult to identify what is really needed and what is not in a particular set up. In this talk we try to analyze blockchains and DTL from a formalized and abstract point of view, separating these different aspects. We will especially explore scenarios in which the most convenient solution is having multiple ledgers, having lightweight micro-ledgers, or even ledgers that do not guarantee order.

Antón Rodríguez

@antonmry

Data Engineer at Inditex Group

Anton is a Software Engineer focused on Data Pipelines (Spark) and Event Streaming (Kafka Streams, Flink). Nowadays he works for Inditex Group (Zara, Massimo Dutti, etc.) as Data Engineer. In the past, he worked as a specialist in Deployment Pipelines, API Management and Advanced Orchestration in distributed systems. He enjoys building Data Pipelines with Java/Scala and deploying them to the Cloud or Kubernetes. He co-organizes the Vigo and Coruña Java User Groups (VigoJUG & CoruñaJUG). He also likes to speak at technical conferences and contribute to open source projects.

Antonio Vilches

@avilches

Principal Software Engineer at Shapelets

Antonio is one of the core software engineers at Shapelets, holding BSc and MSc in Computer Sciences from the University of Malaga. He is a software performance gangster. Thus, he went further and pursued a Ph.D. in Parallel Computing, where he developed parallel patterns for CPU-GPU systems. Antonio is the lead team developer of the company, always keen to share his knowledge and help others with their stuff.

Ara Pulido

@arapulido

Developer Relations at Datadog

Navigating the Sea of Kubernetes Development Tooling

Arno Schots

Cloud Solutions Engineering Director

Arno Schots is a Cloud Solution Engineering Director at Oracle, leading a team of 30+ Cloud Architects in France, Italy, Spain and Portugal. These teams are responsible for helping customers move to the cloud. Combining Enterprise Architecture background with hands-on engineering and development experience, he is passionate about sharing this knowledge with customers and technology audiences around the world. In his free time, he can be found outdoor doing sports like trail running, cycling and surfing. Last but not least, he is co-organiser of the great Jonthebeach event.

David G. Simmons

@davidgsIoT

Head of Development Relations at QuestDB

Using Cross-measurement Math to Synthesize Sensor Data for Digital Twins

David Rey

Chief Data Officer at Idealista

Data, is it oil or soil?

Félix López

@flopezluis

Senior Software Engineering Manager at Eventbrite

Félix López is a senior software engineering manager at Eventbrite, previously an engineering manager at Google, with more than 18 years of experience. During his career, he has worked on web development, video games, distributed systems and fin-tech companies. He holds a Research Master in Intelligent Systems (including neural networks, speech processing, data mining, etc.). He is interested in Distributed Systems, Machine Learning and psychology.

Jaroslaw Rzepecki

@JarekRzepecki

Senior Research Engineer at Microsoft

Towards more Human-like Video Game Agents

Jinal Parikh

@Jins__p

Technology Analyst at Goldman Sachs

Jinal Parikh is currently a Technology Analyst. She loves handling scale and researching about distributed systems in her spare time. Having worked previously with Morgan Stanley and a few startups, she is also a Google Women Techmakers Scholar 2017, performing outreach activities to foster a local community that compels women to persevere in Tech.

Joerg Gablonsky

Technical Fellow at Boeing Research and Technology

Joerg is the Chair of the Boeing Enterprise High-Performance Computing Council, and the technical lead for numerical optimization inside Boeing’s Research and Technology Group. He spent part of his career in Boeing’s IT organization, helping to establish a centralized High-Performance Computing Service as well as the Digital Transformation Environment, transforming how Boeing develops software. Now back in Boeing Research and Technology where he started his career, he is developing mathematical optimization methods and exposing those methods via modern software technologies.

Jörg Schad

@joerg_schad

Head of Engineering & Machine Learning at ArangoDB

The case for a common Graph-Based Metadata Layer for Machine Learning Platforms

Jörg Schad

@joerg_schad

Head of Engineering & Machine Learning at ArangoDB

Jörg Schad is Head of Engineering and Machine Learning at ArangoDB. In a previous life, he has worked on or built machine learning pipelines in healthcare, distributed systems at Mesosphere, and in-memory databases. He received his Ph.D. for research around distributed databases and data analytics. He’s a frequent speaker at meetups, international conferences, and lecture halls.

Jörg Schad is Head of Machine Learning at ArangoDB. In a previous life, he has worked on or built container infrastructure and distributed systems at Mesosphere, and in-memory databases. He received his Ph.D. for research around distributed databases and data analytics. He’s a frequent speaker at meetups, international conferences, and lecture halls.

Talk

The case for a common Graph-Based Metadata Layer for Machine Learning Platforms

We all know data is important for Machine Learning, but as it turns out for operating Machine Learning Platforms Metadata is equally important. With the rapid and recent rise of data science, the Machine Learning Platforms being built are becoming more complex. For example, consider the various Kubeflow components: Distributed Training, Jupyter Notebooks, CI/CD, Hyperparameter Optimization, Feature store, and more. Each of these components is producing metadata: Different (versions) Datasets, different versions of Jupyter notebooks, different training parameters, test/training accuracy, different features, model serving statistics, and many more. For production use, it is critical to have a common view across all these metadata as we have to ask questions such as: Which Jupyter notebook has been used to build Model xyz currently running in production? If there is new data for a given dataset, which models (currently serving in production) have to be updated?

Juan Carlos Rico

@_JCRico

Cloud Solutions Architect

Juan Carlos Rico is a Cloud Solutions Architect at Oracle, where he supports customers in Oracle’s Cloud adoption across EMEA. JC was born, and graduated in Computer Science, in Malaga, and joined the company in 2013 after some time working as a Software Engineer. Alongside technology, his main passion is spending time with his family, friends and love public speaking in international technology events like Oracle Open World or J On The Beach amongst others.

Lucas Bernardi

Principal Data Scientist at Booking.com

The 7 Powers of Machine Learning

Luis Vaquero

Global Head of Data Science at Dyson

Trees in your Contact Centre: Streamlining Text Analytics to Delight our Customers

Łukasz Gebel

@rauluka7

Software Engineer at TomTom

Do Developers Dream of Stateless Apps?

Łukasz Gebel

@rauluka7

Software Engineer at TomTom

Łukasz Gebel: Software engineer at TomTom by day, machine learning enthusiast at night. My leading technology is Java and Java-based frameworks. On a daily basis, I work on designing, implementing and deploying distributed systems that work in cloud environments, such as Microsoft Azure and AWS. I’m interested in classification problems and multi-agent systems. I love to learn, read books and play football – in no particular order.

Talk

Do Developers Dream of Stateless Apps?

In Blade Runner by P. K. Dick, trained hunters had to retire problematic Androids. We, Developers, are similar to those hunters. Our job is to solve problems. State brings complexity and troubles. Getting rid of it is not always possible. How to make our stateful distributed system highly available?

It’s a story based on the experience that I gained while working on stateful distributed systems deployed in cloud environments (Azure, AWS). It includes what went well and what is more important, what went wrong. I’ll start with defining state and explain differences between stateful and stateless apps (it’s not so obvious!).

Then I’ll discuss the strategies that we can use in cloud environments to ensure high availability our or systems. We’ll go through scaling, multi-region deployments, and why sometimes we need to care where our machines are located.

In the third part of this talk, I’ll focus on tools that help us to deal with the state and their high availability features provided by cloud. I’ll show you the live demo of Azure SQL failover and compare it to Cosmos DB. I’ll also discuss Storage and Queues. Understanding the limitations of tools we use is as important as being aware of what happens under the hood. It is needed to build reliable architecture.

I’ll sum up the talk by explaining what is SLA and how to calculate it for your system (yes, there will be some math). So, are we problem hunters or we are haunted by problems? Join my presentation, make your system highly available and dream peaceful dreams.

Marta Rivera

@MartaRiveraAlba

Lead Data Scientist at Clarity AI

Measuring Social Impact: domesticating big data streams with Airflow and Machine Learning

Marta Rivera

@MartaRiveraAlba

Lead Data Scientist at Clarity AI

Marta has always been interested in understanding the underlying mathematical basis of dynamical processes. During her career, she has had the privilege to focus on the discovery and understanding of the mathematical rules of nature. She studied Physics at the Universidad Autónoma at Madrid and finished a Masters and PhD in Biophysics in between Madrid and USA. During her PhD she developed mathematical models to test the optimality of the visual system of fruit flies. As a postdoctoral researcher, she worked in Spain, Portugal and USA on larval behaviour across species using mathematical modelling, machine learning and computer vision. Since 2016 she develops algorithms and mathematical models based on machine learning to unravel the mysteries of the market and to measure the social impact of companies towards a fairer world.

Talk

Measuring Social Impact: domesticating big data streams with Airflow and Machine Learning

In this talk, I will explain how we developed and continuously improve our Big Data pipeline to measure the social impact that can be attributed to every company or government in the world. In a giant effort to fully characterize social impact, we integrate aggregated transactional data from individual bank accounts, government budgets, company supplier disclosures, consumption surveys, product composition databases, United Nations consumption reports and worldwide industry to industry relationships together with proprietary cutting-edge ML algorithms. We take advantage of Airflow to integrate a heterogenic repertoire of python microservices that can be independently updated. This architecture allows us to keep improving the methodology implemented and adding extra data sources while continuously serving our clients.

Max Neunhoffer

@neunhoef

Senior Developer & Architect at ArangoDB

The case for a common Graph-Based Metadata Layer for Machine Learning Platforms

Miro Cupak

@mirocupak

Co-founder and VP Engineering at DNAstack

What's new in concurrency: threads and fibers for everyday Java developer

Nadieh Bremer

@NadiehBremer

Freelance Data Visualization Designer

Visualizing Connections

Natan Silnitsky

@NSilnitsky

Backend infra engineer at Wix.com

Greyhound - Powerful Pure Functional Kafka library

Nicolas Fränkel

@nicolas_frankel

Developer Advocate at Hazelcast

Introduction to data streaming

Nicolas Fränkel

@nicolas_frankel

Developer Advocate at Hazelcast

Developer Advocate with 15+ years' experience consulting for many different customers, in a wide range of contexts (such as telecoms, banking, insurances, large retail and public sector). Usually working on Java/Java EE and Spring technologies, but with focused interests like Rich Internet Applications, Testing, CI/CD and DevOps. Currently working for Hazelcast. Also double as a teacher in universities and higher education schools, a trainer and triples as a book author.

Talk

Introduction to data streaming

While “software is eating the world”, those who are able to best manage the huge mass of data will emerge out on the top.

The batch processing model has been faithfully serving us for decades. However, it might have reached the end of its usefulness for all but some very specific use-cases. As the pace of businesses increases, most of the time, decision-makers prefer slightly wrong data sooner, than 100% accurate data later. Stream processing - or data streaming - exactly matches this usage: instead of managing the entire bulk of data, manage pieces of them as soon as they become available.

In this talk, I’ll define the context in which the old batch processing model was born, the reasons that are behind the new stream processing one, how they compare, what are their pros and cons, and a list of existing technologies implementing the latter with their most prominent characteristics. I’ll conclude by describing in detail one possible use-case of data streaming that is not possible with batches: display in (near) real-time all trains in Switzerland and their position on a map. I’ll go through all the requirements and the design. Finally, using an OpenData endpoint and the Hazelcast platform, I’ll try to impress attendees with a working demo implementation of it.

Nuno Preguiça

@nunopreguica

Associate Professor at DI FCT NOVA

Reconciling availability and safety in distributed databases

Patrick Debois

@patrickdebois

Director of Dev❤️Ops Relations at Snyk.io

How secure is your build/server?

Patrick Debois

@patrickdebois

Director of Dev❤️Ops Relations at Snyk.io

In order to understand current IT organizations, Patrick has taken a habit of changing both his consultancy role and the domain which he works in: sometimes as a developer, manager, sysadmin, tester and even as the customer.

He first presented concepts on Agile Infrastructure at Agile 2008 in Toronto, and in 2009 he organized the first devopsdays. Since then he has been promoting the notion of ‘devops’ to exchange ideas between these groups and show how they can help each other to achieve better results in business.

Talk

How secure is your build/server?

Development has changed over the years, from doing everything yourself to a 3rd party package for every function. Operations has changed too, running your own servers is now considered an exception. To the cloud! We have learned that we need to trust others, but as our parents used to say - don’t trust strangers. So we secure our production server more than ever.

Yet, in the middle sits this no man's land: “the build server”. We think it’s time to take a closer look at some of the good practices around securing builds & artifacts to improve our day to day level of trust.

With Marked Sherman statement “Development is now assembly” in mind, the talk will focus more on the package/artifact/repository aspect. Less on the app security inside the code itself or at the OS/Machine level.

This talk I will go into detail on:

How to verify trust of your dependencies: from metadata, binaries and repositories
How to provide trust to others that build upon your software
How this ties into the concept of “reproducible builds”
How a practical “Software Bill of Material” looks
How the concepts of the “The Update Framework” (TUF) relate
How you can implement secure packaging policies

It will explain these topics using practical/code examples from the Nodejs and Docker ecosystems. All this will be presented from the different viewpoints from “dev”, “sec” and “ops”.

Let’s take ownership of your trust, we are already responsible when things go wrong anyway.