Data Engineer
Headhunty is looking for a Data Engineer on behalf of our client, innovative AI startup. Data Engineer: This Engineer focuses on pulling data into (GCP) using Google APIs, ingesting flat files, and Kaggle dataset: Text, images, attachments and video serve as the sources of input data.
We need someone to start within a month from 1st of December 2024. Candidate location should be in Western Balkans, best option would be in North Macedonia, but we are open to Montenegro, Serbia, Bosnia and Herzegovina, Albania, Kosovo candidates.
Mandatory requirements:
- Comfortable developing the pipeline from idea to deployment
- Working with Apache Beam and Apache Spark
- Production grade project experience in Google Cloud
Responsibilities:
- Build data Ingestion pipeline:
- Build data pipelines for user data and structured files for developer data.
- A data pipeline to ingest Google Drive data for users using Google API.
- A data pipeline to ingest developer data as flat files.
- Set up data storage options that can address security, embedding, and latency requirements.
- Implement SQL schema design to store the serving data.
- Implement post-processing requirements
Qualifications:
- Solid education with 4+ years experience in data engineering;
- Strong programming skills in SQL and Python.
- Proficiency in database administration
- Proved experience with Google Cloud
- Strong problem-solving skills and ability to troubleshoot complex data issues
- Excellent communication skills and ability to work collaboratively in a team environment
- A proactive mindset with a willingness to take ownership and drive projects to completion.
- Strong organisational skills along with a desire to continually be challenged
- Experience with monitoring, CI/CD
The ideal candidate is someone who has been working as a Data Engineer for 6-8 years, he has utilised Google Cloud Platform on a production grade project for at least 2 years.
Candidate should be is self-reliant, reliable and understands the fast paced nature of startups.
Candidate should be capable of doing everything from initiating the project up until setting the deployment environment.
If you are interested, feel free to send your CV to [email protected]