AI Infrastructure Architect, R&D
- Developed a custom non-autoregressive language model for air-gapped CPU inference, achieving sub-20ms response latency and up to 1,200 t/s on 8GB consumer hardware.
- Built a vertically integrated software supply chain and digital workforce spanning development, deployment, operations, marketing, and commercialization. Enabled full lifecycle ownership and rapid product launches reducing SaaS costs by 100%.