What you'll be doing
Performance analysis / bottleneck analysis of complex, high performance GPUs and System-on-Chips (SoCs).
Work on hardware models of different levels of extraction, including performance models, RTL test benches and emulators to find performance bottlenecks in the system.
Work closely with the architecture and design teams to explore architecture trade-offs related to system performance, area, and power consumption.
Understand key performance usecases or the product. Develop workloads and test suites targeting graphics, machine learning, automotive, video, compute vision applications running on these products.
Drive methodologies for improving turnaround time, finding representative data-sets and enabling performance analysis early in the product development cycle.
Develop required infrastructure including performance simulators, testbench components and analysis tools.
What we need to see :
BE / BTech or MS / MTech in relevant area or equivalent experience, PhD is a plus.
5+ years of relevant experience dealing with system level architecture and performance issues.
Strong understanding of System-on-Chip (SoC) architecture, graphics pipeline, memory subsystem architecture and Network-on-Chip (NoC) / Interconnect architecture.
Strong programming (C / C++) and scripting (Perl / Python) skills. Exposure to Verilog / System Verilog, SystemC / TLM is a strong plus.
Strong debugging and analysis (including data and statistical analysis) skills, including use for rtl dumps to debug failures.
Exposure to performance simulators, cycle accurate / approximate models or emulators for pre-silicon performance analysis is a plus.
Excellent communication and organization skills.
Ability to work in a global team environment.