May 27, 2024


Unlimited Technology

Big data infrastructure internship | Adaltas

Big data infrastructure internship | Adaltas

Occupation description

Large Details and distributed computing are at the core of Adaltas. We accompagny our associates in the deployment, routine maintenance, and optimization of some of the most significant clusters in France. Due to the fact not too long ago we also deliver aid for day-day operations.

As a great defender and active contributor of open supply, we are at the forefront of the knowledge platform initiative TDP (TOSIT Info Platform).

Through this internship, you will add to the development of TDP, its industrialization, and the integration of new open supply parts and new functionalities. You will be accompanied by the Alliage qualified staff in cost of TDP editor assistance.

You will also perform with the Kubernetes ecosystem and the automation of datalab deployments Onyxia, which we want to make offered to our clients as very well as to students as section of our instructing modules (devops, big facts, etcetera.).

Your skills will enable to develop the companies of Alliage’s open up source assist giving. Supported open supply components contain TDP, Onyxia, ScyllaDB, … For individuals who would like to do some website do the job in addition to huge facts, we by now have a very functional intranet (ticket administration, time management, innovative search, mentions and similar content, …) but other nice features are anticipated.

You will apply GitOps release chains and create article content.

You will function in a team with senior advisors as mentor.

Company presentation

Adaltas is a consulting company led by a staff of open resource professionals focusing on knowledge management. We deploy and work the storage and computing infrastructures in collaboration with our shoppers.

Spouse with Cloudera and Databricks, we are also open source contributors. We invite you to search our web site and our numerous technical publications to master much more about the company.

Techniques demanded and to be acquired

Automating the deployment of the Onyxia datalab calls for information of Kubernetes and Cloud native. You must be relaxed with the Kubernetes ecosystem, the Hadoop ecosystem, and the dispersed computing product. You will grasp how the standard elements (HDFS, YARN, object storage, Kerberos, OAuth, and many others.) perform collectively to meet the takes advantage of of massive details.

A great expertise of employing Linux and the command line is expected.

During the internship, you will study:

  • The Kubernetes/Hadoop ecosystem in get to add to the TDP venture
  • Securing clusters with Kerberos and SSL/TLS certificates
  • High availability (HA) of solutions
  • The distribution of methods and workloads
  • Supervision of providers and hosted applications
  • Fault tolerant Hadoop cluster with recoverability of dropped data on infrastructure failure
  • Infrastructure as Code (IaC) via DevOps applications these kinds of as Ansible and [Vagrant](/en/tag/hashicorp- vagrant/)
  • Be comfy with the architecture and operation of a information lakehouse
  • Code collaboration with Git, Gitlab and Github


  • Become common with the architecture and configuration techniques of the TDP distribution
  • Deploy and take a look at protected and hugely out there TDP clusters
  • Contribute to the TDP understanding foundation with troubleshooting guides, FAQs and articles or blog posts
  • Actively contribute concepts and code to make iterative improvements to the TDP ecosystem
  • Investigate and evaluate the dissimilarities between the main Hadoop distributions
  • Update Adaltas Cloud applying Nikita
  • Contribute to the growth of a instrument to gather purchaser logs and metrics on TDP and ScyllaDB
  • Actively lead thoughts to produce our help solution

Extra facts

  • Site: Boulogne Billancourt, France
  • Languages: French or English
  • Commencing date: March 2023
  • Length: 6 months

Significantly of the electronic world runs on Open Resource program and the Big Facts marketplace is booming. This internship is an possibility to get precious knowledge in both of those domains. TDP is now the only actually Open Supply Hadoop distribution. This is a excellent momentum. As element of the TDP workforce, you will have the possibility to find out 1 of the main large facts processing types and take part in the advancement and the long term roadmap of TDP. We think that this is an interesting opportunity and that on completion of the internship, you will be prepared for a effective job in Major Data.

Devices available

A laptop with the next properties:

  • 32GB RAM
  • 1TB SSD
  • 8c/16t CPU

A cluster created up of:

  • 3x 28c/56t Intel Xeon Scalable Gold 6132
  • 3x 192TB RAM DDR4 ECC 2666MHz
  • 3x 14 SSD 480GB SATA Intel S4500 6Gbps

A Kubernetes cluster and a Hadoop cluster.


  • Income 1200 € / thirty day period
  • Cafe tickets
  • Transportation pass
  • Participation in one particular intercontinental convention

In the past, the conferences which we attended consist of the KubeCon arranged by the CNCF basis, the Open Supply Summit from the Linux Basis and the Fosdem.

For any ask for for supplemental details and to submit your software, be sure to make contact with David Worms: