Conference
2025
- arXivBlockDialect: Block-wise Fine-grained Mixed Format for Energy-Efficient LLM InferenceIn arXiv.2501.01144, 2025
2024
- ISLPEDAccelerating DNN Execution with Adaptive N:M Pruning on Both Weight and DataIn ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED), 2024
- HPCACAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device LearningIn International Symposium on High-Performance Computer Architecture (HPCA), 2024
- ISSCCA 12nm Linux-SMP-Capable RISC-V SoC with 14 Accelerator Types, Distributed Hardware Power Management and Flexible NoC-based Data OrchestrationIn 2024 IEEE International Solid- State Circuits Conference (ISSCC), 2024
2023
- IROSVaPr: Variable-Precision Tensors to Accelerate Robot Motion PlanningIn IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023
2022
- ICSASAP: Automatic Synthesis of Area-Efficient and Precision-Aware CGRAsIn Proceedings of the 36th ACM International Conference on Supercomputing, 2022
2021
2020
- HotChipsA Scalable Bayesian Inference Accelerator for Unsupervised LearningIn 2020 IEEE Hot Chips 32 Symposium (HCS), 2020
- VLSI SympA 3mm2 Programmable Bayesian Inference Accelerator for Unsupervised Machine Perception using Parallel Gibbs Sampling in 16nmIn 2020 IEEE Symposium on VLSI Circuits, 2020
2019
Journal
2024
- TODAESApplication-level Validation of Accelerator Designs Using a Formal Software/Hardware InterfaceACM Trans. Des. Autom. Electron. Syst., Feb 2024
2023
Technical Report
2022
- LATTE
2019
- ArXiv
Preprint
2022
- ArXivSpecialized Accelerators and Compiler Flows: Replacing Accelerator APIs with a Formal Software/Hardware InterfaceFeb 2022