sse41
Here are 19 public repositories matching this topic...
Math library using HLSL syntax with multiplatform SIMD support
Updated Jan 2, 2026 - C++
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Updated Jul 15, 2019
Node.js implementation of HighwayHash, Google's fast and strong hash function
Updated Oct 8, 2021 - JavaScript
High performance blur in pure Rust using SIMD
Updated Dec 25, 2025 - Rust
Bilinear image filtering implemented with SSE4, AVX2 and AVX512.
Updated Jul 8, 2023 - C++
Minify JSON files fast! Supports comments. Uses D, C, and AVX2/SSE4_1 SIMD.
Updated Jul 2, 2025 - D
IAA03_fast_math is a single-header math kernel (Atan2 only for now) designed to eliminate the "Trigonometry Tax" in high-throughput systems (physics engines, audio DSP, and ML pre-processing). Using branchless code, ILP, and SIMD (AVX2/SSE4.1), it achieves up to a ~186x per-element throughput speedup over std::atan2 while being IEEE 754 compliant.
Updated Jan 14, 2026 - C++
⚡ Accelerate your Go applications with a high-performance SIMD library for vectorized operations on various data types.
Updated Jan 18, 2026 - Go
Tiny, optimized, game-oriented math framework
Updated Jan 9, 2025 - C++
Implementation of GPU Gems 38 using OpenGL Compute Shaders
Updated Jul 26, 2025 - C++