Job Information
BYTEDANCE INC. Lead Software Engineer of Cloud Storage, Infrastructure (Multiple Positions) in Bellevue, Washington
Responsibilities
- Lead the team to design, develop, and build new and innovative large-scale distributed storage systems and work on Research and Development (R&D) of state-of-art distributed storage systems by understanding and analyzing recent advancements in academic research as well as industry around the topic of distributed storage.
- Build prototypes with the C++ programming language.
- Make changes to add new features to existing distributed storage systems running in production.
- Analyze the performance of distributed storage systems.
- Analyze the cost of operating distributed storage systems.
- Play a key role in driving project objectives and execution management, architecture design and code reviews, defining long-term technical roadmap for the team, and demonstrate technical leadership.
- Formulate development goals according to the project schedule, write detailed design documents, and be responsible for module implementation, performance tuning, and functional testing.
- Provide timely technical support for online applications, extract potential needs and points of optimization from it, and continuously optimize the system.
- Continuously improve the storage system in terms of stability, scalability, functionality, and performance according to business requirement.
- Work closely with stakeholder teams to identify business pain points, continuously optimize and improve customer experience, team reputation, and impact.
Participate in writing papers on prototypes and systems built in the interest of advancement of state-of-art practices in distributed storage systems.
Qualifications
Must have a Master's degree in Computer Science, Engineering (any), Information Technology, Mathematics, Statistics, Physics, or a related field, and 4 years of related work experience; OR a Bachelor's degree in Computer Science, Engineering (any), Information Technology, Mathematics, Statistics, Physics, or a related field, and 6 years of related work experience, out of
which 5 years must be post-bachelor's, progressive related work experience.
Of the required experience, must have 4 years of experience in each of the following:
Delivering and maintaining a reliable, available, scalable, and manageable large-scale distributed storage system, including deploying a large-scale distributed storage system to Linux server using Shell scripting, C, or C++;
Optimizing the performance of distributed system using Agile method;
Implementing design features and functionality using C or C++;
Verifying and monitoring of distributed system using Python or C++, including end-to-end testing, compatibility testing, integration testing, stress testing, and failure ingestion testing;
Version control of codebase using Pipeline, Bash, or Git;
Storage engine architecture design and development including Software framework design; and
Critical algorithm design including the following: (1) mapping information compression; (2) Garbage collection policy, and (3) performance or Quality of Service (QoS) optimizatio