Atul Vaish

Principal Architect | Engineering Leader

Personal technical site documenting independent applied engineering work in GPU optimization, edge AI inference, secure systems, and performance-critical platforms across x86 and ARM environments.

Hardware Platforms Used in Independent Prototyping & Research

Personal hardware testbed used for hands-on prototyping, benchmarking, and system-level experimentation.

Selected Technical Prototypes & Experiments

Independent technical work (2025–2026) focused on GPU acceleration, system-level AI engineering, and performance benchmarking.

1. GPU Acceleration for Visual Search

High-performance visual search and object detection pipelines optimized using CUDA.

View on GitHub →

2. Agentic Compute on GPU

Parallelizing AI agent workflows and decision-making loops on GPUs.

GitHub / PyPI →

3. SentinelChange AI

Deep learning models for high-accuracy change detection in satellite imagery.

View on GitHub →

4. CUDA Softmax Benchmark

Benchmarking softmax kernel implementations for LLM inference.

View on GitHub →

5. CUDA GEMM Benchmark

Performance analysis of GEMM operations across GPU architectures.

View on GitHub →

Autonomous Local Research Agent Architecture

6. Autonomous Local Research Agent

Local agent for technical research automation and document synthesis.

View on GitHub →

7. GeoAnnotator

Precision annotation tooling for geospatial datasets.

Watch Demo →

8. Voice LLM Deployment on Jetson Orin

End-to-end voice-to-LLM inference on constrained devices.

GitHub Repo →

9. SmartPurchase Prototype

Autonomous checkout experiments using real-time computer vision.

Watch Demo →

10. Urban Digital Twin (Unity / Cesium)

3D Tiles integration and simulation for large-scale urban models.

Watch Demo →

Cryptography & CloudHSM — Prior Training Work

Sample recording from hands-on cryptography and AWS CloudHSM training delivered as part of prior professional engagements.

Technical Blogs & Insights

Deep-dive write-ups and implementation notes from applied systems work.