RemoteOtter

Data Engineer Intern - Web Crawling - Remote

Posted 2 days ago
Software Development
Internship
USA

Overview

Sayari is seeking a Data Engineer Intern specializing in web crawling to help maintain and improve its web crawling framework, which is central to collecting and analyzing global corporate and trade data.

In Short

  • Remote paid internship.
  • Expected commitment of 20-30 hours per week.
  • Focus on web crawling infrastructure maintenance and improvement.
  • Collaborate with Product and Software Engineering teams.
  • Investigate and implement web crawlers for new sources.
  • Improve metrics and reporting for web crawling.
  • Enhance ETL processes.
  • Contribute to the development of Sayari’s data product.

Requirements

  • Experience with Python.
  • Experience managing web crawling at scale; Scrapy is a plus.
  • Familiarity with Kubernetes.
  • Experience with git for collaborative work.
  • Knowledge of selectors such as XPath, CSS, JMESPath.
  • Experience with browser developer tools (Chrome/Firefox).

Benefits

  • Opportunity to work in a supportive and innovative team.
  • Gain experience in a cutting-edge data engineering environment.
  • Possibility of future employment opportunities.

Sayari

Sayari is a leading provider of counterparty and supply chain risk intelligence, serving government agencies, multinational corporations, and financial institutions. With its headquarters in Washington, D.C., Sayari offers an intuitive network analysis platform that uncovers hidden risks through integrated data on corporate ownership, supply chains, and trade transactions from over 250 jurisdictions. The company is committed to enhancing visibility into global commercial and financial networks using open data, fostering a culture of collaboration, innovation, and diversity. Sayari's solutions are utilized by thousands of analysts across more than 35 countries, reflecting its global reach and impact.

Similar Jobs:

Data Engineer Intern - Remote

Bynder

8 weeks ago

Join Bynder as a Data Engineer Intern to gain hands-on experience in data engineering and analytics.

Worldwide
Internship
Software Development

Data Engineer Intern - Remote

SQLI

73 weeks ago

Join SQLI as a Data Engineer Intern and contribute to innovative data strategies and projects.

France
Internship
Software Development

Data Engineering Intern - Remote

Sayari

6 days ago

Join Sayari as a Data Engineering Intern to work on data collection and ETL pipeline development in a remote setting.

USA
Internship
Software Development

Data Plane Engineer Intern - Remote

SonicWall

Yesterday

Join SonicWall as a Data Plane Engineer Intern to develop scalable distributed systems and enhance network security.

USA
Internship
Software Development

Data Engineer (Web Scraping) - Remote

CXG

4 weeks ago

We are seeking a Data Engineer to manage web scraping configurations and data pipelines remotely.

Tunisia
Full-time
Data Analysis