Perception Data Engineer (Visual Data Pipeline)

ANYbotics

Software Engineering, Data Science
Barcelona, Spain
Posted on Nov 11, 2025
ANYbotics is a fast-growing tech company dedicated to shaping the future of mobile robotics across multiple industries. Join our highly talented and motivated team of more than 200 people and work on cutting-edge robot technology.
The Opportunity
You’ll build and operate the data plumbing that our perception models need: ingestion, versioned storage, ETL, labeling integration, and reliable production pipelines for training and inference.
Market & Technology
ANYbotics transforms industrial plants in the (renewable) energy, process, and utility sectors by introducing robotics to a wide range of novel applications that were previously beyond reach. Our mobile robot ANYmal uses legs for extreme mobility in complex environments and camera- and LIDAR-based sensing for full autonomy and obstacle avoidance, allowing it to perform jobs and deliver high-quality, consistent inspection results. We develop numerous customized hardware systems, including the entire robotic platform, actuators, sensors, inspection payloads, charging systems, and all related ANYbotics electrical hardware.
About Us
ANYbotics is a leading robotics company specializing in advanced autonomous systems. With a successful Series B financing round recently closed, we are poised for rapid growth and international expansion. Our mission is to revolutionize the robotics industry through cutting-edge technology and innovation. As we embark on this exciting journey, we are seeking a dynamic and experienced person to join our team and help us shape the future of autonomous robotic inspections.

Your contributions

  • Design, build and maintain scalable data pipelines and ETL workflows that ingest raw images, sensor metadata, and labels (both real and synthetic).
  • Implement dataset versioning, schema management, and reproducible data snapshots to support experiments and audits.
  • Integrate annotation tools (CVAT / Label Studio), manage labeling workflows and quality-control tooling, and support label QA processes.
  • Build data validation and monitoring checks (file integrity, label sanity, distribution drift alerts) and automate remediation where possible.
  • Provide clean, ready-to-use datasets and data loaders for ML engineers; optimize data access patterns for training (sharding, caching, prefetching).
  • Collaborate with MLOps to automate scheduled retraining triggers and with the Synthetic Data Engineer to merge synthetic data streams.

Your profile

  • 3+ years of engineering experience building production data pipelines or ETL systems.
  • Strong Python scripting and engineering skills (pandas, pyarrow, boto3 or equivalent).
  • Experience with dataset versioning or large-file management (DVC, Git-LFS, or similar) and cloud object storage (S3).
  • Familiarity with annotation tooling and workflows for image data (CVAT / Label Studio).
  • Basic understanding of ML training data needs (batching, sharding, augmentation integration).
  • Prior work supporting computer-vision teams (image pipelines, preprocessing, TFRecord or custom dataset formats).

Bonus points

  • Experience with big-data tooling (Spark, Airflow/Prefect) or columnar formats (Parquet).
  • Knowledge of data privacy/compliance practices and tooling.
  • Cloud infra know-how (AWS/GCP) and experience setting up reproducible data pipelines.
We’re an international robotics company with the A-team spread across the globe. This role gives you the opportunity to help grow our EU presence while staying connected to our global team. To be eligible, you’ll need to have the legal right to live and work in Spain. Ideally, you reside in Barcelona or are open to relocating. This is not a remote position.