Manager, Data Center
NVIDIA
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!
NVIDIA is looking for a Data Center Team Lead with strong systems and networking knowledge to lead the team that builds and supports the supercomputers and HPC-AI clusters of the networking clusters solutions group.
What you’ll be doing:
Lead and coordinate the planning and build of complex clusters and supercomputers across multiple data centers and labs
Manage rack-and-stack, cabling, and space optimization efforts to ensure efficiency, maintainability, and standard processes
Lead all aspects of power and cooling efficiency strategies while ensuring optimal rack space utilization
Coordinate daily functions and maintenance of data facilities and test environments, ensuring seamless operations and timely problem resolution
Install and integrate diverse infrastructure and solutions, including Cloud, VMs, Storage, Network, HPC, and AI
Manage debugging activities across network, optical cabling, bare metal, and operating systems
Collaborate closely with Research & Development teams to support evolving project needs and experimental setups
Mentor and develop team members, ensuring knowledge sharing, standard methodologies, and professional growth
What we need to see:
MCSE or MCITP / CCNA certification
8+ years of overall experience, including 3+ years as a team lead in large and complex data center environments
Demonstrated practical experience in operating systems with strong problem identification and resolution skills
In-depth knowledge of Linux & Windows Core Services: DHCP, DNS, NIS, AD, etc.
Strong leadership skills with ability to organize, prioritize, and guide a team
Passionate about delivering excellent service with strong collaboration and interpersonal skills
Ways to stand out from the crowd:
Hands-on with Python and configuration management tools (e.g., Ansible, Puppet)
Experience with CI tools and job schedulers (e.g., Jenkins, SLURM)
Knowledge of virtualization technologies: KVM, VMware, Hyper-V
Experience with storage solutions like NetApp, Lustre, GPFS, ZFS
Skilled in L2 & L3 network protocols and resolving technical issues
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.