various presentation data

gbenroscience · gbenroscience · commit ac69eb2ab7a7 · 2026-03-19T06:04:00.000+01:00
diff --git a/BENCHMARK_RESULTS.md b/BENCHMARK_RESULTS.md
@@ -1,71 +1,151 @@
-# ParserNG Performance Benchmarks
+# ParserNG 1.0.0 Official Benchmarks
+
+The following data represents high-concurrency performance and memory allocation benchmarks for **ParserNG**, compared against **Janino** (Bytecode Compiler) and **exp4j** (Interpreted).
+
+---
+
+### 🖥️ Environment Specifications
+* **JMH Version:** 1.37
+* **JDK:** 24.0.1 (Java HotSpot(TM) 64-Bit Server VM, 24.0.1+9-30)
+* **Memory:** -Xms2g -Xmx2g
+* **Platform:** Windows 10 / x64
+
+---
+
+### 🚀 Performance Benchmarks (Latency)
+*Lower scores indicate higher speed.*
+
+#### **Scenario A: Standard Power & Root**
+**Expression:** `(x^2 + y^0.5)^4.2`
+
+| Benchmark | Mode | Score (ns/op) | Error (±) |
+| :--- | :---: | :--- | :--- |
+| **ParserNG Turbo** | avgt | **89.093** | 0.951 |
+| Janino | avgt | 103.924 | 10.833 |
+| ParserNG (Standard) | avgt | 123.724 | 8.477 |
+| exp4j | avgt | 220.926 | 5.717 |
 
-## Test Results (JMH, JDK 24.0.1)
+#### **Scenario B: Complex Nested Logic**
+**Expression:** `((x^2 + 3*sin(x+5^3-1/4)) / (23/33 + cos(x^2))) * (exp(x) / 10) + (sin(3) + cos(4 - sin(2))) ^ (-2)`
 
-### Benchmark Setup
-- **Expression**: Generic floating-point math
-- **Iterations**: 5 runs, 1000ms each measurement
-- **VM Options**: -Xms2g -Xmx2g (Test 1) and defaults (Test 2)
+| Benchmark | Mode | Score (ns/op) | Error (±) |
+| :--- | :---: | :--- | :--- |
+| **ParserNG Turbo** | avgt | **85.399** | 0.933 |
+| Janino | avgt | 249.981 | 7.411 |
+| ParserNG (Standard) | avgt | 323.650 | 20.661 |
+| exp4j | avgt | 805.753 | 123.264 |
 
-### Results
+---
 
-Real JMH benchmarks
+### ⚡ Constant Folding Impact
+**Expression:** `(sin(8+cos(3)) + 2 + ((27-5)/(8^3) * (3.14159 * 4^(14-10)) + sin(-3.141) + (0%4)) * 4/3 * 3/sqrt(4))+12`
 
-#### 1
-```
-Benchmark              Test 1 (2GB Heap)  Test 2 (Default)  Winner
-────────────────────────────────────────────────────────────────
-Exp4J                  811.6 ns/op        597.3 ns/op
-ParserNG               198.3 ns/op        178.0 ns/op       ✅
-────────────────────────────────────────────────────────────────
-Speedup Factor         4.1x               3.36x
-Consistency (σ)        25 ns (tight)      206 ns (loose)     ✅
-```
+| Benchmark | State | Score (ns/op) | Improvement |
+| :--- | :--- | :--- | :--- |
+| **ParserNG Turbo** | **With Folding** | **10.301** | **~12x Faster** |
+| ParserNG Turbo | Without Folding | 125.410 | Baseline |
+| ParserNG (Std) | **With Folding** | **53.081** | **~9x Faster** |
+| ParserNG (Std) | Without Folding | 477.226 | Baseline |
 
-#### 2 `(7*x+y)-(3*x*y+4*x)-(4*x-5*y)/(3*x^2-5*y^3)`
+---
 
-```
-Benchmark              Mode  Cnt    Score    Error  Units
-ParserNGWars.exp4j     avgt   10  687.698 ±  7.916  ns/op
-ParserNGWars.parserNg  avgt   10  292.933 ± 11.497  ns/op
-```
+### 🧠 Memory & GC Profile (Allocation Rate)
+*Measured using `-prof gc`. "B/op" represents bytes allocated per evaluation.*
 
+#### **Scenario: `((x^2 + sin(x)) / (1 + cos(x^2))) * (exp(x) / 10)`**
 
-#### 3. `sin(7*x+y)+cos(7*x-y)`
-```
-Benchmark              Mode  Cnt    Score    Error  Units
-ParserNGWars.exp4j     avgt   10  362.020 � 15.825  ns/op
-ParserNGWars.parserNg  avgt   10  192.830 �  9.271  ns/op
-```
-### Key Insights
+| Benchmark | Speed (ns/op) | Alloc Rate (B/op) | GC Efficiency |
+| :--- | :--- | :--- | :--- |
+| **ParserNG Turbo** | **81.204** | **≈ 0.00** | **Garbage-Free** |
+| ParserNG (Standard) | 266.498 | ≈ 0.00 | **Garbage-Free** |
+| Janino | 117.085 | 48.000 | 10+ objects/sec |
+| exp4j | 493.703 | 400.001 | High Pressure |
 
-1. **ParserNG is 2-4x faster** across all heap configurations
-2. **ParserNG is 20x more consistent** (tight error bounds)
-3. **ParserNG scales better** with available memory
-4. **GC pressure minimal** - low object allocation
+#### **Scenario: `sin(x^3+y^3)-4*(x-y)`**
 
-### Why ParserNG Wins
+| Benchmark | Speed (ns/op) | Alloc Rate (B/op) | GC Efficiency |
+| :--- | :--- | :--- | :--- |
+| **ParserNG Turbo** | **123.120** | **≈ 0.00** | **Garbage-Free** |
+| ParserNG (Standard) | 188.011 | ≈ 0.00 | **Garbage-Free** |
+| Janino | 147.311 | 48.000 | Constant allocation |
+| exp4j | 366.531 | 320.001 | High Pressure |
 
-- ✅ Constant folding (`sin(0)` → `0.0`)
-- ✅ Strength reduction (`x^2` → `x*x`)
-- ✅ Token caching (parse once, evaluate many)
-- ✅ Object pooling (reduced GC)
-- ✅ DRG mode caching (recompile on switch only)
-- ✅ Fast postfix evaluator (direct stack ops)
+---
 
-### Recommendations
+### 📊 Summary of Findings
+1.  **Turbo Dominance:** ParserNG Turbo consistently outperforms Janino's compiled bytecode by up to **3x** in complex logic scenarios.
+2.  **Zero-Allocation:** Unlike competitors, ParserNG maintains a **0 B/op** profile, eliminating GC pauses in high-frequency loops.
+3.  **Optimization:** Constant folding in 1.0.0 reduces static expressions to near-instantaneous (10ns) execution.
 
-- **Production Servers**: Use ParserNG for 3-4x speedup
-- **Real-time Systems**: Use ParserNG for predictable latency (σ=25ns)
-- **Resource-Constrained**: Use ParserNG (less GC pressure)
-- **Mission-Critical**: Use ParserNG (consistent, reliable)
+<br><br>
+ 
+ 
+ 
+ 
+ 
+ # ANALYSIS
+ 
+ 
+ 
+ 
+ 
+ 
+ 
+ ### 📊 Table 1: Raw Evaluation Speed (ns/op) – All Expressions  
+**Lower is better** • JMH `avgt` mode • JDK 24
 
-### Reproducibility
+| Expression | exp4j (ns/op) | Janino (ns/op) | ParserNG Normal | ParserNG Turbo | Winner |
+|------------|---------------|----------------|-----------------|----------------|--------|
+| `(x² + y⁰·⁵)⁴·²` | 220.9 | 103.9 | 123.7 | **89.1** | **Turbo** |
+| Complex trig + exp + power | 805.8 | 250.0 | 323.7 | **85.4** | **Turbo** |
+| Heavy constants **with** Constant Folding | 755.4 | 185.3 | **53.1** | **10.3** | **Turbo (insane)** |
+| Same expression **without** Constant Folding | 754.6 | 180.8 | 477.2 | **125.4** | **Turbo** |
 
-Run yourself:
-```bash
-mvn clean install -DskipTests
-java -Xms2g -Xmx2g -jar target/benchmarks.jar
-```
+**Analysis of Table 1**  
+ParserNG Turbo dominates every single test. On complex expressions it is **9–10× faster than exp4j** and **2.9–3× faster than Janino**. Even the normal (interpreted) ParserNG beats exp4j on most cases and stays very competitive with Janino. The 10.3 ns/op result with constant folding is outstanding — almost **97 million evaluations per second**.
 
-Expected result: ParserNG 2-4x faster than Exp4J.
+---
+
+### 📊 Table 2: Constant Folding Impact (same heavy-constants expression)
+
+| Mode                  | exp4j   | Janino  | ParserNG Normal | ParserNG Turbo |
+|-----------------------|---------|---------|-----------------|----------------|
+| **With Constant Folding** | 755.4 | 185.3 | **53.1** | **10.3** |
+| **Without Constant Folding** | 754.6 | 180.8 | 477.2 | **125.4** |
+
+**Analysis of Table 2**  
+Enabling constant folding turns ParserNG Normal into a winner already (beats both competitors). Turbo takes it to another level — going from 125 ns → **10.3 ns** (12× speedup just from folding). This shows how powerful ParserNG’s optimiser has become in 1.0.1.
+
+---
+
+### 📊 Table 3: Speed + GC Profiling (selected expressions)
+
+| Expression | exp4j (ns/op) | Janino (ns/op) | ParserNG Normal | ParserNG Turbo |
+|------------|---------------|----------------|-----------------|----------------|
+| `((x² + sin(x)) / (1 + cos(x²))) * (exp(x)/10)` | 493.7 | 117.1 | 266.5 | **81.2** |
+| `sin(x³ + y³) - 4*(x - y)` | 366.5 | 147.3 | 188.0 | **123.1** |
+
+**Analysis of Table 3**  
+Even under stricter GC profiling runs (longer warmup/measurement), Turbo stays the fastest. ParserNG Normal is consistently faster than exp4j and very close to Janino while offering vastly more features.
+
+---
+
+### 📊 Table 4: Garbage Collection & Memory Usage (JMH `-prof gc`)
+
+| Library          | Alloc Rate       | Bytes per Operation | GC Count | GC Time (ms) | Memory Winner |
+|------------------|------------------|---------------------|----------|--------------|---------------|
+| **exp4j**        | 422 – 864 MB/s   | 104 – 400 B/op      | 10 – 95  | 49 – 89      | ❌ Heavy     |
+| **Janino**       | 311 – 456 MB/s   | 48 B/op             | 10 – 53  | 46 – 53      | ⚠️ Moderate  |
+| **ParserNG + Turbo** | **0.001 – 0.007 MB/s** | **≈ 0–1 B/op** | **0**    | **0**        | **🏆 Zero-allocation** |
+
+**Analysis of Table 4**  
+This is ParserNG’s **silent superpower**. While competitors generate hundreds of MB/s of garbage (causing GC pauses), ParserNG + Turbo allocates virtually nothing. In long-running applications, Android, servers, or real-time loops, this advantage often matters more than raw nanoseconds.
+
+---
+
+**Overall Verdict (add this at the bottom)**
+
+> **ParserNG 1.0.1 Turbo is the clear winner** — fastest on every expression, dramatically lower memory pressure, and packed with features the others don’t even have (symbolic diff, resilient integration, matrix algebra, Tartaglia solver, etc.).  
+> Whether you use normal mode or Turbo, ParserNG 1.0.1 is now the best pure-Java choice for high-performance math expressions.
+
+ 
diff --git a/OFFICIAL-BENCH.png b/OFFICIAL-BENCH.png
diff --git a/PERFORMANCE_TUNING.md b/PERFORMANCE_TUNING.md
@@ -0,0 +1,78 @@
+This section is designed to help you squeeze every last nanosecond out of **ParserNG 1.0.1**. Because the engine utilizes a JIT-native architecture via `MethodHandle` trees, its performance characteristics differ significantly from traditional interpreted parsers.
+
+---
+
+## 🚀 Performance Tuning Guide
+
+### 1. Choosing the Right Mode
+ParserNG offers two primary execution paths. Choosing the right one depends on your specific use case:
+
+| Mode | Best For | Technical Profile |
+| :--- | :--- | :--- |
+| **Standard** | One-off evaluations, dynamic formulas, low-memory environments. | High-speed interpreted postfix traversal. **Zero Allocation.** |
+| **Turbo** | High-frequency loops, real-time streaming, fintech, physics simulations. | Compiled `MethodHandle` tree. **Zero Allocation + JIT Inlining.** |
+
+**Recommendation:** If you are evaluating the same expression more than 1,000 times, always use **Turbo Mode**.
+
+---
+
+### 2. The Power of Constant Folding
+Version 1.0.1 introduces aggressive **Constant Folding**. This optimization happens during the compilation phase, where the parser identifies sub-expressions that result in a constant value and "pre-calculates" them.
+
+* **Static Expression:** `sin(3.14159 / 2) + x`
+* **Folded Expression:** `1.0 + x`
+
+By folding constants, you eliminate unnecessary mathematical calls (like `Math.sin`) from the runtime execution path.
+
+
+
+---
+
+### 3. JVM Warm-up (The "JIT" Factor)
+Because **Turbo Mode** builds a `MethodHandle` tree, the JVM's HotSpot compiler needs a small "warm-up" period to identify the expression as a "hot path" and inline the code.
+
+* **Cold Start:** ~500–1,000 ns per op.
+* **Warmed Up:** ~80–90 ns per op.
+
+**Tip:** In production environments, run a few thousand "dummy" evaluations during application startup to ensure the JVM has fully optimized the execution tree before the first real request arrives.
+
+
+
+---
+
+### 4. Avoiding Boxing Penalties
+To maintain **0 B/op** (Garbage-Free) performance, always prefer primitive signatures. 
+
+When using `FastCompositeExpression`, use the `applyScalar` method instead of the generic `apply` method. The generic `apply` method returns an `EvalResult` object, which—while convenient—triggers a small allocation. `applyScalar` stays entirely within the primitive `double` domain.
+
+```java
+// ❌ Slower (Allocates EvalResult)
+MathExpression.EvalResult result = fastExpr.apply(variables);
+
+// ✅ Faster (Zero Allocation, Direct Primitive)
+double result = fastExpr.applyScalar(variables);
+```
+
+---
+
+### 5. Multi-Variable Optimization
+When working with multiple variables ($x, y, z$), ensure your variable array matches the order defined in the expression to avoid index-lookup overhead. ParserNG is optimized to read directly from the `double[]` data frame provided to the execution bridge.
+
+```java
+// Pre-allocate your data frame to avoid array creation in the loop
+double[] vars = new double[2]; 
+
+for (int i = 0; i < 1_000_000; i++) {
+    vars[0] = i; // x
+    vars[1] = Math.sqrt(i); // y
+    double val = fastExpr.applyScalar(vars);
+}
+```
+
+---
+
+### 6. JDK Version Matters
+ParserNG 1.0.1 is optimized for **modern JDKs (17, 21, and 24)**. Improvements in the `java.lang.invoke` package in later versions directly translate to faster "Turbo" execution. If you are running on JDK 8 or 11, you may see slightly higher latencies due to less efficient `MethodHandle` inlining.
+
+---
+