Schmidt Ocean Institute – Science and Data Division / Full-Time
Location: Remote
Application Deadline: June 2, 2025
Schmidt Ocean Institute (SOI) was founded by Eric and Wendy Schmidt in 2009 with a goal to advance the frontiers of ocean exploration and research through technological advancement, intelligent observation, and open sharing of information. This is achieved by making ship time and associated assets freely available to scientists globally. In exchange, SOI supports public sharing of acquired scientific observations, data, and other information about the ocean to stimulate the growth of data applications and user community and to amplify and enable further exploration, discovery, deeper understanding, and effective conservation and management of our environment.
Data Solutions Engineer
The Schmidt Ocean Institute (SOI)’s Data Solutions Engineer is part of the Data Department within the Science and Data Division and will be creating data solutions for the distribution of high-quality oceanic, atmospheric, and video data captured by the Institute. The Data Solutions Engineer will report to the Head of Data Solutions and will develop and manage systems and applications to increase accessibility and usability of scientic data and video collected by SOI. This involves developing data-centric applications and platforms, overseeing projects and collaborative initiatives, providing support to scientists, establishing performance tracking metrics, and ensuring data integrity.
Technical Development & Data Management Responsibilities:
- Ensure data integrity, control, and accessibility after collection, overseeing both internal and curated external data access for scientific purposes.
- Use scientific programming to develop analysis-ready products.
- Collaborate with external organizations to make data available at data centers with appropriate metadata.
- In collaboration with the data team, develop, build, and implement new data-focused applications, systems, and APIs for post-expedition data access.
- Lead the ongoing development and management of the scientific annotation platform to facilitate access to and distribution of video and labeled imagery.
- Analyze and assess technical methods to enhance the visibility and utilization of SOI data platforms.
- Develop and maintain standardized metrics and automated systems to track and report on project metrics.
- Research and assess new technologies in data accessibility, machine learning, and new advanced strategies for visual data utilization.
- Propose new platforms, systems, data products and application development of strategic importance to SOI’s goals to Head of Data Solutions.
- Other Duties as assigned.
Management and Collaboration Responsibilities:
- Manage projects from start to finish, ensuring timely delivery within scope and budget.
- Assist Head of Data Solutions with drafting and managing contracts.
- Identify and manage partnerships and collaborations to increase data utilization.
- Identify partner/collaborator requirements for optimal system functionality.
- Work with outside contractors to support application development.
- Representing the organization and its data work in public/scientific forums, including presentations at conferences.
- Research and contribute to reports and data journal publications, in collaboration with SOI colleagues.
- Travel to and occasionally sail with the vessel.
Required Qualifications:
- A Bachelor’s degree or higher in a STEM field such as Computing, Oceanography, Mathematics, or equivalent experience.
- At least 5 years (post-undergraduate degree) of work experience in data management, scientific computing, or technical project management.
- Proven ability to plan, organize, and execute projects, with specific experience in data management and technical project management.
- Strong organizational and time-management skills
- Excellent problem-solving skills and a keen attention to detail.
- Excellent communication skills, including the ability to explain complex technical concepts to diverse audiences.
- Proven track record of overseeing data systems or platforms, including software or API development.
- Experience with data pipeline development, data transformation, and automation.
- Strong understanding of data quality, distribution, and access methodologies.
- Proficiency in scientific coding languages, preferably Python.
- Experience with cloud computing platforms, preferably Google Cloud.
- Familiarity with scientific data and image/video annotation platforms or workflows.
- Proficient in technical applications of machine learning, particularly object detection in image and video data
Preferred Qualifications:
- A Master’s degree in a STEM field or Oceanography
- Experience working with metadata standards and scientific data repositories.
- Familiarity with FAIR data principles and Open Source standards
- Experience contributing to scientific publications or data journals
- Experience working with earth and space data sets
- Experience with working with geospatial/temporal datasets.
- Experience creating data visualizations via coding tools
- Experience developing or managing APIs or web-based applications.
Application Form Available Here
Schmidt Ocean Institute is an equal opportunity employer and we strive to create an atmosphere where diversity of identity, experience, and background are welcomed, valued, and supported. We believe that diversity brings about greater results on all levels and we aim to use our resources to generate greater impact through our work. Candidates who contribute to this diversity are strongly encouraged to apply.
Schmidt Ocean Institute (SOI) was founded by Eric and Wendy Schmidt in 2009 with a goal to advance the frontiers of ocean exploration and research through technological advancement, intelligent observation, and open sharing of information. This is achieved by making ship time and associated assets freely available to scientists globally. In exchange, SOI supports public sharing of acquired scientific observations, data, and other information about the ocean to stimulate the growth of data applications and user community and to amplify and enable further exploration, discovery, deeper understanding, and effective conservation and management of our environment.