YFlows: Systematic Dataflow Exploration and Code Generation for Efficient Neural Network Inference using SIMD Architectures on CPUs
Published in Proceedings of the 33rd ACM SIGPLAN International Conference on Compiler Construction (CC 2024), 2024
Explores dataflow choices and code generation to accelerate CPU inference, achieving strong speedups across NN workloads.
Recommended citation: Zhou, C., Hassman, Z., Shah, D., Richard, V., & Li, Y. (2024). “YFlows: Systematic Dataflow Exploration and Code Generation for Efficient Neural Network Inference using SIMD Architectures on CPUs.” CC 2024, pp. 212–226.
Recommended citation: Zhou, C., Hassman, Z., Shah, D., Richard, V., & Li, Y. (2024). "YFlows: Systematic Dataflow Exploration and Code Generation for Efficient Neural Network Inference using SIMD Architectures on CPUs." CC 2024, pp. 212–226.
Download Paper