Leaderboard

Best benchmark result for each model + quantization + GPU combination, ranked by throughput. Data refreshes automatically.
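
The selection rule above can be sketched in Python. This is a minimal illustration, not the site's actual implementation: it assumes "best" means the highest throughput among repeated runs of the same model + quantization + GPU combination, and the `Run` type and `build_leaderboard` function are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Run:
    """One benchmark run (hypothetical record type for illustration)."""
    model: str
    quant: str
    gpu: str
    throughput_tok_s: float


def build_leaderboard(runs):
    """Keep the best run per (model, quant, GPU), then rank by throughput.

    Assumption: "best" means highest throughput; the page does not say so.
    """
    best = {}
    for run in runs:
        key = (run.model, run.quant, run.gpu)
        current = best.get(key)
        if current is None or run.throughput_tok_s > current.throughput_tok_s:
            best[key] = run
    # Highest throughput first, matching the ordering of the table.
    return sorted(best.values(), key=lambda r: r.throughput_tok_s, reverse=True)


# Two runs of the same combination (the 1400.0 value is made up) plus one other row.
runs = [
    Run("gpt-oss-20b-Q4_1", "Q4_1", "NVIDIA GeForce RTX 5090", 1400.0),
    Run("gpt-oss-20b-Q4_1", "Q4_1", "NVIDIA GeForce RTX 5090", 1491.1),
    Run("gemma-4-E2B-it-Q8_0", "Q8_0", "NVIDIA GeForce RTX 5090", 1334.7),
]
board = build_leaderboard(runs)
for rank, run in enumerate(board, start=1):
    print(rank, run.model, run.gpu, run.throughput_tok_s)
```

Keying on the full (model, quant, GPU) tuple is why the same model can appear several times in the table, once per GPU it was measured on.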

| Model | Quant | GPU | Context | Throughput (tok/s) | Avg TTFT (ms) | P99 TTFT (ms) | Avg ITL (ms) | P99 ITL (ms) | Avg W | tok/W |
|---|---|---|---|---|---|---|---|---|---|---|
| gpt-oss-20b-Q4_1 | Q4_1 | NVIDIA GeForce RTX 5090 | 32,768 | 1491.1 | 199.8 | 349.3 | 20.74 | 22.13 | 192 | 7.75 |
| gpt-oss-20b-Q5_K_M | Q5_K_M | NVIDIA GeForce RTX 5090 | 130,064 | 1421.2 | 199.5 | 349.6 | 21.81 | 23.04 | 213 | 6.67 |
| gpt-oss-20b-Q4_K_S | Q4_K_S | NVIDIA GeForce RTX 5090 | 32,768 | 1374.2 | 214.4 | 369.6 | 22.44 | 35.08 | 217 | 6.32 |
| gpt-oss-20b-Q3_K_M | Q3_K_M | NVIDIA GeForce RTX 5090 | 130,064 | 1370.1 | 338.1 | 1079.6 | 22.02 | 37.27 | 202 | 6.77 |
| gpt-oss-20b-UD-Q6_K_XL | Q6_K_XL | NVIDIA GeForce RTX 5090 | 16,384 | 1369.4 | 208.4 | 357.7 | 22.51 | 35.92 | 215 | 6.37 |
| gpt-oss-20b-Q5_K_S | Q5_K_S | NVIDIA GeForce RTX 5090 | 16,384 | 1367.4 | 325.9 | 1200.9 | 22.16 | 29.70 | 164 | 8.34 |
| gpt-oss-20b-Q2_K | Q2_K | NVIDIA GeForce RTX 5090 | 32,768 | 1367.1 | 207.3 | 347.1 | 22.56 | 28.54 | 212 | 6.45 |
| gpt-oss-20b-Q3_K_S | Q3_K_S | NVIDIA GeForce RTX 5090 | 16,384 | 1361.4 | 327.1 | 1081.5 | 22.10 | 36.33 | 203 | 6.69 |
| gpt-oss-20b-UD-Q8_K_XL | Q8_K_XL | NVIDIA GeForce RTX 5090 | 130,064 | 1358.9 | 223.0 | 387.9 | 22.70 | 35.41 | 215 | 6.31 |
| gpt-oss-20b-Q4_K_M | Q4_K_M | NVIDIA GeForce RTX 5090 | 32,768 | 1357.8 | 207.6 | 340.6 | 22.70 | 34.92 | 217 | 6.27 |
| gpt-oss-20b-F16 | F16 | NVIDIA GeForce RTX 5090 | 16,384 | 1355.0 | 212.1 | 348.4 | 22.73 | 34.78 | 223 | 6.08 |
| gpt-oss-20b-UD-Q4_K_XL | Q4_K_XL | NVIDIA GeForce RTX 5090 | 16,384 | 1344.6 | 220.2 | 352.9 | 22.90 | 35.20 | 214 | 6.27 |
| gpt-oss-20b-Q6_K | Q6_K | NVIDIA GeForce RTX 5090 | 32,768 | 1338.6 | 220.9 | 386.9 | 22.91 | 44.13 | 217 | 6.17 |
| gemma-4-E2B-it-Q8_0 | Q8_0 | NVIDIA GeForce RTX 5090 | 32,768 | 1334.7 | 101.3 | 184.4 | 20.96 | 56.79 | 189 | 7.07 |
| gpt-oss-20b-Q4_0 | Q4_0 | NVIDIA GeForce RTX 5090 | 130,064 | 1334.3 | 222.4 | 385.5 | 22.98 | 44.37 | 214 | 6.24 |
| gemma-4-E2B-it-Q4_1 | Q4_1 | NVIDIA GeForce RTX 5090 | 32,768 | 1326.1 | 101.1 | 186.6 | 20.99 | 50.08 | 175 | 7.56 |
| gpt-oss-20b-Q8_0 | Q8_0 | NVIDIA GeForce RTX 5090 | 32,768 | 1322.6 | 249.4 | 480.0 | 23.16 | 45.37 | 219 | 6.04 |
| gpt-oss-20b-Q2_K_L | Q2_K_L | NVIDIA GeForce RTX 5090 | 16,384 | 1316.7 | 247.4 | 423.4 | 23.09 | 41.03 | 219 | 6.02 |
| gemma-4-E2B-it-Q5_K_S | Q5_K_S | NVIDIA GeForce RTX 5090 | 32,768 | 1311.5 | 97.7 | 187.9 | 21.29 | 52.30 | 195 | 6.74 |
| gemma-4-E2B-it-UD-Q2_K_XL | Q2_K_XL | NVIDIA GeForce RTX 5090 | 130,064 | 1308.0 | 129.9 | 355.2 | 20.91 | 55.61 | 147 | 8.88 |
| gemma-4-E2B-it-Q5_K_M | Q5_K_M | NVIDIA GeForce RTX 5090 | 130,064 | 1307.8 | 100.8 | 185.8 | 21.23 | 54.57 | 190 | 6.88 |
| gemma-4-E2B-it-IQ4_XS | IQ4_XS | NVIDIA GeForce RTX 5090 | 16,384 | 1302.3 | 102.8 | 194.7 | 20.84 | 50.44 | 175 | 7.45 |
| gemma-4-E2B-it-Q4_K_S | Q4_K_S | NVIDIA GeForce RTX 5090 | 32,768 | 1290.1 | 122.3 | 299.6 | 21.00 | 65.33 | 148 | 8.73 |
| gemma-4-E2B-it-UD-Q5_K_XL | Q5_K_XL | NVIDIA GeForce RTX 5090 | 130,064 | 1283.1 | 99.3 | 189.0 | 21.45 | 50.89 | 197 | 6.53 |
| gemma-4-E2B-it-Q6_K | Q6_K | NVIDIA GeForce RTX 5090 | 16,384 | 1280.2 | 107.1 | 191.1 | 21.33 | 55.41 | 201 | 6.38 |
| gemma-4-E2B-it-Q3_K_S | Q3_K_S | NVIDIA GeForce RTX 5090 | 130,064 | 1270.4 | 94.5 | 206.9 | 21.26 | 48.25 | 189 | 6.74 |
| gemma-4-E2B-it-UD-Q8_K_XL | Q8_K_XL | NVIDIA GeForce RTX 5090 | 16,384 | 1265.5 | 114.0 | 339.3 | 21.65 | 54.16 | 156 | 8.11 |
| gemma-4-E2B-it-UD-Q4_K_XL | Q4_K_XL | NVIDIA GeForce RTX 5090 | 130,064 | 1262.6 | 102.8 | 200.0 | 21.83 | 53.13 | 191 | 6.60 |
| gemma-4-E2B-it-Q3_K_M | Q3_K_M | NVIDIA GeForce RTX 5090 | 130,064 | 1246.8 | 115.8 | 312.8 | 22.08 | 59.39 | 154 | 8.10 |
| gemma-4-E2B-it-UD-Q6_K_XL | Q6_K_XL | NVIDIA GeForce RTX 5090 | 130,064 | 1241.8 | 101.5 | 188.0 | 21.48 | 51.09 | 200 | 6.22 |
| gemma-4-E2B-it-Q4_0 | Q4_0 | NVIDIA GeForce RTX 5090 | 16,384 | 1241.7 | 110.8 | 306.5 | 21.84 | 52.55 | 142 | 8.76 |
| gemma-4-E2B-it-Q4_K_M | Q4_K_M | NVIDIA GeForce RTX 5090 | 32,768 | 1233.9 | 105.5 | 191.3 | 20.98 | 57.08 | 196 | 6.30 |
| gemma-4-E4B-it-UD-IQ2_M | IQ2_M | NVIDIA GeForce RTX 5090 | 32,768 | 1227.6 | 230.7 | 387.6 | 23.34 | 49.42 | 170 | 7.24 |
| gemma-4-E2B-it-IQ4_NL | IQ4_NL | NVIDIA GeForce RTX 5090 | 130,064 | 1192.7 | 100.8 | 186.0 | 21.48 | 51.99 | 180 | 6.62 |
| gemma-4-E2B-it-BF16 | BF16 | NVIDIA GeForce RTX 5090 | 16,384 | 1188.0 | 114.7 | 239.0 | 22.44 | 59.66 | 176 | 6.73 |
| gemma-4-E2B-it-UD-IQ2_M | IQ2_M | NVIDIA GeForce RTX 5090 | 130,064 | 1169.9 | 97.6 | 204.4 | 23.35 | 47.79 | 183 | 6.39 |
| gemma-4-E2B-it-UD-IQ3_XXS | IQ3_XXS | NVIDIA GeForce RTX 5090 | 32,768 | 1161.6 | 93.6 | 178.9 | 22.30 | 48.50 | 186 | 6.24 |
| gemma-4-E2B-it-UD-Q3_K_XL | Q3_K_XL | NVIDIA GeForce RTX 5090 | 32,768 | 1147.2 | 99.5 | 187.7 | 22.48 | 53.00 | 195 | 5.89 |
| gemma-4-E4B-it-Q4_1 | Q4_1 | NVIDIA GeForce RTX 5090 | 130,064 | 1135.2 | 154.3 | 401.7 | 25.13 | 61.84 | 167 | 6.81 |
| gemma-4-E4B-it-UD-IQ3_XXS | IQ3_XXS | NVIDIA GeForce RTX 5090 | 32,768 | 1131.2 | 149.9 | 386.8 | 24.85 | 57.60 | 171 | 6.61 |
| gemma-4-E4B-it-UD-Q2_K_XL | Q2_K_XL | NVIDIA GeForce RTX 5090 | 16,384 | 1118.0 | 283.1 | 848.7 | 24.69 | 63.96 | 149 | 7.52 |
| gemma-4-E4B-it-Q4_0 | Q4_0 | NVIDIA GeForce RTX 5090 | 130,064 | 1106.7 | 173.0 | 373.7 | 25.51 | 69.75 | 144 | 7.69 |
| gemma-4-E4B-it-IQ4_XS | IQ4_XS | NVIDIA GeForce RTX 5090 | 16,384 | 1103.5 | 137.8 | 376.3 | 25.13 | 55.64 | 222 | 4.96 |
| gemma-4-E4B-it-Q5_K_S | Q5_K_S | NVIDIA GeForce RTX 5090 | 130,064 | 1091.2 | 155.9 | 327.7 | 26.46 | 81.92 | 156 | 7.01 |
| gemma-4-E4B-it-UD-Q4_K_XL | Q4_K_XL | NVIDIA GeForce RTX 5090 | 16,384 | 1082.1 | 150.4 | 398.4 | 25.94 | 63.99 | 185 | 5.87 |
| gemma-4-E4B-it-UD-Q5_K_XL | Q5_K_XL | NVIDIA GeForce RTX 5090 | 16,384 | 1050.8 | 159.1 | 394.8 | 26.97 | 69.29 | 194 | 5.43 |
| gemma-4-E4B-it-IQ4_NL | IQ4_NL | NVIDIA GeForce RTX 5090 | 16,384 | 1047.7 | 147.6 | 395.5 | 26.28 | 62.79 | 222 | 4.73 |
| gemma-4-E4B-it-Q8_0 | Q8_0 | NVIDIA GeForce RTX 5090 | 32,768 | 1046.6 | 154.9 | 383.2 | 27.15 | 67.36 | 188 | 5.58 |
| gemma-4-E4B-it-Q5_K_M | Q5_K_M | NVIDIA GeForce RTX 5090 | 130,064 | 1044.9 | 162.3 | 403.8 | 26.89 | 71.90 | 194 | 5.39 |
| gemma-4-E4B-it-UD-Q6_K_XL | Q6_K_XL | NVIDIA GeForce RTX 5090 | 32,768 | 1036.2 | 161.3 | 417.4 | 27.43 | 68.97 | 195 | 5.32 |
| gemma-4-E4B-it-Q4_K_S | Q4_K_S | NVIDIA GeForce RTX 5090 | 130,064 | 1020.0 | 159.6 | 402.1 | 26.73 | 72.77 | 189 | 5.40 |
| gemma-4-E4B-it-Q6_K | Q6_K | NVIDIA GeForce RTX 5090 | 32,768 | 998.8 | 157.6 | 410.0 | 27.70 | 68.97 | 202 | 4.95 |
| gemma-4-E4B-it-UD-Q3_K_XL | Q3_K_XL | NVIDIA GeForce RTX 5090 | 130,064 | 997.2 | 162.5 | 381.8 | 27.61 | 78.41 | 159 | 6.27 |
| gemma-4-E4B-it-Q4_K_M | Q4_K_M | NVIDIA GeForce RTX 5090 | 32,768 | 996.7 | 157.4 | 434.5 | 28.28 | 71.55 | 159 | 6.28 |
| gemma-4-E4B-it-UD-Q8_K_XL | Q8_K_XL | NVIDIA GeForce RTX 5090 | 32,768 | 990.1 | 159.0 | 387.1 | 28.07 | 67.52 | 197 | 5.04 |
| gemma-4-E4B-it-Q3_K_S | Q3_K_S | NVIDIA GeForce RTX 5090 | 32,768 | 986.2 | 191.9 | 469.8 | 27.52 | 66.89 | 191 | 5.17 |
| gemma-4-E4B-it-Q3_K_M | Q3_K_M | NVIDIA GeForce RTX 5090 | 32,768 | 980.9 | 163.3 | 397.1 | 27.22 | 65.41 | 253 | 3.88 |
| gemma-4-26B-A4B-it-UD-IQ2_XXS | IQ2_XXS | NVIDIA GeForce RTX 5090 | 32,768 | 928.9 | 804.8 | 2379.4 | 30.94 | 88.81 | 225 | 4.14 |
| gemma-4-E4B-it-BF16 | BF16 | NVIDIA GeForce RTX 5090 | 130,064 | 898.3 | 170.7 | 410.9 | 31.83 | 91.81 | 231 | 3.88 |
| gemma-4-26B-A4B-it-UD-IQ2_M | IQ2_M | NVIDIA GeForce RTX 5090 | 32,768 | 897.1 | 992.1 | 2482.4 | 31.25 | 80.27 | 264 | 3.40 |
| gemma-4-26B-A4B-it-MXFP4_MOE | | NVIDIA GeForce RTX 5090 | 130,064 | 844.9 | 520.4 | 1694.2 | 34.37 | 134.19 | 231 | 3.66 |
| gemma-4-26B-A4B-it-UD-Q3_K_M | Q3_K_M | NVIDIA GeForce RTX 5090 | 65,536 | 816.6 | 668.7 | 2136.1 | 35.11 | 125.15 | 236 | 3.46 |
| gemma-4-26B-A4B-it-UD-Q4_K_M | Q4_K_M | NVIDIA GeForce RTX 5090 | 32,768 | 815.9 | 431.5 | 1280.4 | 35.92 | 140.31 | 238 | 3.43 |
| gemma-4-26B-A4B-it-UD-Q4_K_XL | Q4_K_XL | NVIDIA GeForce RTX 5090 | 65,536 | 811.5 | 536.6 | 1284.4 | 36.01 | 138.95 | 275 | 2.95 |
| gemma-4-26B-A4B-it-UD-Q3_K_XL | Q3_K_XL | NVIDIA GeForce RTX 5090 | 32,768 | 810.6 | 726.6 | 1767.9 | 35.18 | 102.75 | 269 | 3.02 |
| gemma-4-26B-A4B-it-UD-Q4_K_S | Q4_K_S | NVIDIA GeForce RTX 5090 | 65,536 | 810.5 | 486.3 | 1314.8 | 36.24 | 143.94 | 275 | 2.94 |
| Qwen3.5-0.8B-Q4_1 | Q4_1 | NVIDIA GeForce RTX 5090 | 32,768 | 808.5 | 117.2 | 158.0 | 4.50 | 6.40 | | |
| gemma-4-26B-A4B-it-UD-Q2_K_XL | Q2_K_XL | NVIDIA GeForce RTX 5090 | 65,536 | 807.4 | 638.1 | 2023.6 | 35.31 | 135.42 | 237 | 3.41 |
| gemma-4-26B-A4B-it-UD-IQ3_S | IQ3_S | NVIDIA GeForce RTX 5090 | 65,536 | 795.9 | 580.9 | 1900.0 | 36.27 | 130.75 | 241 | 3.31 |
| Qwen3.5-0.8B-Q4_0 | Q4_0 | NVIDIA GeForce RTX 5090 | 130,064 | 788.6 | 84.9 | 122.3 | 4.65 | 6.50 | 158 | 4.99 |
| gemma-4-26B-A4B-it-UD-IQ4_NL | IQ4_NL | NVIDIA GeForce RTX 5090 | 32,768 | 786.9 | 306.2 | 1004.7 | 37.95 | 157.86 | 239 | 3.30 |
| Qwen3.5-0.8B-IQ4_NL | IQ4_NL | NVIDIA GeForce RTX 5090 | 8,192 | 786.2 | 1205.5 | 1398.0 | 4.82 | 6.59 | 169 | 4.65 |
| Qwen3.5-0.8B-Q8_0 | Q8_0 | NVIDIA GeForce RTX 5090 | 16,384 | 777.3 | 1285.5 | 1435.0 | 4.75 | 6.50 | 174 | 4.48 |
| gemma-4-26B-A4B-it-UD-IQ4_XS | IQ4_XS | NVIDIA GeForce RTX 5090 | 32,768 | 776.6 | 310.7 | 1007.3 | 38.51 | 144.25 | 241 | 3.23 |
| Qwen3.5-0.8B-Q6_K | Q6_K | NVIDIA GeForce RTX 5090 | 130,064 | 775.4 | 113.9 | 159.9 | 4.63 | 6.51 | | |
| Qwen3.5-0.8B-UD-Q2_K_XL | Q2_K_XL | NVIDIA GeForce RTX 5090 | 130,064 | 775.1 | 1367.3 | 1470.5 | 4.73 | 6.62 | 172 | 4.51 |
| Qwen3.5-0.8B-IQ4_XS | IQ4_XS | NVIDIA GeForce RTX 5090 | 16,384 | 774.5 | 1299.6 | 1433.0 | 4.81 | 6.57 | | |
| Qwen3.5-0.8B-UD-IQ2_XXS | IQ2_XXS | NVIDIA GeForce RTX 5090 | 16,384 | 773.7 | 1369.5 | 1473.9 | 4.75 | 6.55 | | |
| Qwen3.5-0.8B-IQ4_NL | IQ4_NL | | 32,768 | 768.8 | 200.1 | 547.1 | 39.86 | 59.03 | | |
| Qwen3.5-0.8B-Q3_K_S | Q3_K_S | NVIDIA GeForce RTX 5090 | 8,192 | 766.9 | 115.0 | 158.1 | 4.78 | 6.68 | 170 | 4.51 |
| gemma-4-26B-A4B-it-UD-IQ3_XXS | IQ3_XXS | NVIDIA GeForce RTX 5090 | 65,536 | 766.2 | 533.0 | 1636.3 | 37.64 | 141.80 | 241 | 3.18 |
| gemma-4-26B-A4B-it-UD-Q5_K_S | Q5_K_S | NVIDIA GeForce RTX 5090 | 32,768 | 763.6 | 395.5 | 1364.9 | 37.71 | 141.74 | 243 | 3.14 |
| Qwen3.5-2B-IQ4_NL | IQ4_NL | NVIDIA GeForce RTX 5090 | 65,551 | 759.5 | 118.3 | 163.5 | 4.82 | 6.70 | 246 | 3.09 |
| Qwen3.5-0.8B-Q3_K_M | Q3_K_M | NVIDIA GeForce RTX 5090 | 8,192 | 752.6 | 115.7 | 156.9 | 4.88 | 6.78 | 171 | 4.41 |
| Qwen3.5-0.8B-UD-IQ2_M | IQ2_M | NVIDIA GeForce RTX 5090 | 130,064 | 746.2 | 1323.8 | 1473.9 | 5.01 | 6.72 | | |
| Qwen3.5-2B-Q4_0 | Q4_0 | NVIDIA GeForce RTX 5090 | 32,768 | 744.2 | 1369.1 | 1499.4 | 5.03 | 6.73 | 205 | 3.63 |
| Qwen3.5-0.8B-UD-Q8_K_XL | Q8_K_XL | NVIDIA GeForce RTX 5090 | 8,192 | 741.1 | 115.3 | 159.0 | 4.96 | 6.78 | 168 | 4.41 |
| gemma-4-26B-A4B-it-UD-Q5_K_XL | Q5_K_XL | NVIDIA GeForce RTX 5090 | 32,768 | 739.9 | 419.9 | 1133.4 | 38.26 | 150.63 | 287 | 2.58 |
| Qwen3.5-0.8B-UD-Q6_K_XL | Q6_K_XL | NVIDIA GeForce RTX 5090 | 130,064 | 739.8 | 1306.4 | 1486.4 | 5.14 | 7.50 | 171 | 4.34 |
| Qwen3.5-0.8B-UD-Q5_K_XL | Q5_K_XL | NVIDIA GeForce RTX 5090 | 130,064 | 737.3 | 116.7 | 155.5 | 4.99 | 6.85 | 166 | 4.44 |
| gemma-4-26B-A4B-it-UD-Q5_K_M | Q5_K_M | NVIDIA GeForce RTX 5090 | 32,768 | 732.3 | 479.6 | 1108.4 | 37.90 | 148.31 | 284 | 2.58 |
| Qwen3.5-0.8B-Q5_K_M | Q5_K_M | NVIDIA GeForce RTX 5090 | 16,384 | 732.2 | 85.1 | 123.9 | 5.08 | 6.87 | 174 | 4.21 |
| Qwen3.5-0.8B-BF16 | BF16 | NVIDIA GeForce RTX 5090 | 130,064 | 730.0 | 116.4 | 155.8 | 5.04 | 6.82 | | |
| Qwen3.5-2B-Q4_1 | Q4_1 | NVIDIA GeForce RTX 5090 | 130,064 | 724.2 | 3886.3 | 4373.9 | 5.31 | 29.02 | 234 | 3.09 |
| Qwen3.5-0.8B-Q4_K_S | Q4_K_S | NVIDIA GeForce RTX 5090 | 8,192 | 723.5 | 113.5 | 158.8 | 5.03 | 6.95 | | |
| Qwen3.5-2B-UD-Q2_K_XL | Q2_K_XL | NVIDIA GeForce RTX 5090 | 65,551 | 714.0 | 121.8 | 171.9 | 5.13 | 7.06 | 250 | 2.86 |
| Qwen3.5-0.8B-UD-IQ3_XXS | IQ3_XXS | NVIDIA GeForce RTX 5090 | 8,192 | 713.5 | 116.3 | 153.8 | 5.17 | 7.06 | 166 | 4.30 |
| Qwen3.5-0.8B-UD-Q3_K_XL | Q3_K_XL | NVIDIA GeForce RTX 5090 | 16,384 | 713.4 | 1402.7 | 1544.8 | 5.23 | 6.97 | 175 | 4.07 |
| Qwen3.5-0.8B-Q4_K_M | Q4_K_M | NVIDIA GeForce RTX 5090 | 8,192 | 712.4 | 1345.1 | 1517.7 | 5.40 | 23.21 | | |
| Qwen3.5-0.8B-Q5_K_S | Q5_K_S | NVIDIA GeForce RTX 5090 | 16,384 | 708.9 | 1336.7 | 1559.6 | 5.36 | 7.13 | 183 | 3.88 |
| Qwen3.5-2B-Q4_0 | Q4_0 | NVIDIA GeForce RTX 5060 Ti | 32,768 | 705.7 | 253.0 | 683.7 | 43.27 | 65.93 | 83 | 8.49 |
| Qwen3.5-2B-Q3_K_M | Q3_K_M | NVIDIA GeForce RTX 5060 Ti | 130,064 | 699.5 | 296.5 | 775.1 | 43.88 | 65.03 | 84 | 8.36 |
| Qwen3.5-2B-Q3_K_S | Q3_K_S | NVIDIA GeForce RTX 5060 Ti | 32,768 | 699.4 | 295.9 | 772.3 | 43.79 | 74.26 | 84 | 8.34 |
| Qwen3.5-2B-Q4_1 | Q4_1 | NVIDIA GeForce RTX 5060 Ti | 32,768 | 694.7 | 264.5 | 755.4 | 43.74 | 77.70 | 84 | 8.30 |
| Qwen3.5-0.8B-UD-Q4_K_XL | Q4_K_XL | NVIDIA GeForce RTX 5090 | 130,064 | 694.2 | 1338.0 | 1576.5 | 5.42 | 8.81 | | |
| Qwen3.5-2B-Q6_K | Q6_K | NVIDIA GeForce RTX 5090 | 16,384 | 690.9 | 117.0 | 169.9 | 5.28 | 9.64 | | |
| Qwen3.5-2B-IQ4_XS | IQ4_XS | NVIDIA GeForce RTX 5090 | 65,551 | 690.1 | 1424.7 | 1599.3 | 5.58 | 25.90 | 162 | 4.26 |
| Qwen3.5-2B-Q4_K_S | Q4_K_S | NVIDIA GeForce RTX 5060 Ti | 32,768 | 690.0 | 229.3 | 737.7 | 44.11 | 70.40 | 85 | 8.12 |
| Qwen3.5-0.8B-BF16 | BF16 | | 130,064 | 688.2 | 212.9 | 539.7 | 43.25 | 71.46 | | |
| Qwen3.5-2B-IQ4_NL | IQ4_NL | NVIDIA GeForce RTX 5060 Ti | 16,384 | 688.0 | 236.4 | 698.0 | 43.88 | 79.79 | 86 | 8.02 |
| Qwen3.5-2B-IQ4_XS | IQ4_XS | NVIDIA GeForce RTX 5060 Ti | 32,768 | 686.8 | 254.7 | 764.5 | 43.89 | 73.86 | 85 | 8.11 |
| Qwen3.5-2B-Q3_K_S | Q3_K_S | NVIDIA GeForce RTX 5090 | 65,551 | 686.7 | 120.8 | 170.4 | 5.37 | 7.21 | 277 | 2.48 |
| Qwen3.5-2B-IQ4_NL | IQ4_NL | | 16,384 | 682.6 | 259.9 | 822.2 | 44.33 | 88.12 | | |
| Qwen3.5-2B-UD-IQ2_XXS | IQ2_XXS | NVIDIA GeForce RTX 5090 | 16,384 | 679.8 | 1547.1 | 1663.8 | 5.45 | 7.35 | 227 | 3.00 |
| Qwen3.5-2B-Q8_0 | Q8_0 | NVIDIA GeForce RTX 5090 | 130,064 | 678.0 | 1451.4 | 1624.2 | 5.67 | 15.28 | 187 | 3.63 |
| Qwen3.5-2B-UD-Q5_K_XL | Q5_K_XL | NVIDIA GeForce RTX 5060 Ti | 16,384 | 677.7 | 264.6 | 758.6 | 44.74 | 79.67 | 87 | 7.79 |
| Qwen3.5-2B-UD-Q4_K_XL | Q4_K_XL | NVIDIA GeForce RTX 5060 Ti | 130,064 | 673.7 | 223.2 | 747.0 | 44.83 | 74.28 | 87 | 7.78 |
| Qwen3.5-2B-Q4_K_M | Q4_K_M | NVIDIA GeForce RTX 5060 Ti | 130,064 | 672.4 | 225.5 | 737.7 | 45.38 | 84.46 | 76 | 8.82 |
| Qwen3.5-2B-Q5_K_S | Q5_K_S | NVIDIA GeForce RTX 5060 Ti | 32,768 | 671.6 | 243.7 | 740.4 | 45.19 | 78.06 | 88 | 7.65 |
| Qwen3.5-2B-Q6_K | Q6_K | NVIDIA GeForce RTX 5060 Ti | 32,768 | 667.9 | 257.5 | 785.8 | 45.53 | 81.73 | 88 | 7.63 |
| Qwen3.5-2B-UD-IQ2_M | IQ2_M | NVIDIA GeForce RTX 5090 | 65,551 | 664.4 | 1581.3 | 1700.3 | 5.58 | 7.51 | 233 | 2.86 |
| Qwen3.5-2B-Q5_K_M | Q5_K_M | NVIDIA GeForce RTX 5060 Ti | 16,384 | 663.6 | 232.4 | 792.2 | 45.53 | 82.61 | 90 | 7.41 |
| Qwen3.5-2B-Q3_K_M | Q3_K_M | NVIDIA GeForce RTX 5090 | 32,768 | 657.6 | 116.5 | 167.1 | 5.57 | 7.33 | 279 | 2.35 |
| Qwen3.5-2B-Q8_0 | Q8_0 | NVIDIA GeForce RTX 5060 Ti | 16,384 | 649.8 | 229.8 | 756.5 | 46.88 | 96.73 | 74 | 8.77 |
| Qwen3.5-2B-UD-Q6_K_XL | Q6_K_XL | NVIDIA GeForce RTX 5090 | 8,192 | 643.3 | 10282.8 | 11378.5 | 6.04 | 30.12 | 270 | 2.38 |
| gemma-4-26B-A4B-it-UD-Q6_K_XL | Q6_K_XL | NVIDIA GeForce RTX 5090 | 8,192 | 634.9 | 211.8 | 517.9 | 23.46 | 92.57 | 237 | 2.67 |
| Qwen3.5-2B-UD-IQ3_XXS | IQ3_XXS | NVIDIA GeForce RTX 5090 | 16,384 | 631.9 | 120.6 | 162.6 | 5.88 | 7.79 | 255 | 2.48 |
| Qwen3.5-2B-UD-Q3_K_XL | Q3_K_XL | NVIDIA GeForce RTX 5090 | 65,551 | 629.7 | 1581.5 | 1732.8 | 6.08 | 7.78 | 222 | 2.83 |
| DeepSeek-R1-Distill-Qwen-32B-Q2_K_L | Q2_K_L | NVIDIA GeForce RTX 5090 | 16,384 | 628.0 | 1036.0 | 3426.8 | 42.56 | 142.11 | 360 | 1.75 |
| DeepSeek-R1-Distill-Qwen-32B-Q2_K | Q2_K | NVIDIA GeForce RTX 5090 | 16,384 | 627.4 | 1182.0 | 3372.2 | 42.56 | 146.71 | 408 | 1.54 |
| Qwen3.5-2B-UD-Q6_K_XL | Q6_K_XL | NVIDIA GeForce RTX 5060 Ti | 16,384 | 625.9 | 278.3 | 845.6 | 47.90 | 88.11 | 88 | 7.10 |
| gemma-4-26B-A4B-it-UD-Q6_K | Q6_K | NVIDIA GeForce RTX 5090 | 8,192 | 624.9 | 205.0 | 482.9 | 23.94 | 97.48 | 237 | 2.64 |
| Qwen3.5-2B-Q5_K_M | Q5_K_M | NVIDIA GeForce RTX 5090 | 16,384 | 622.1 | 1530.0 | 1756.1 | 6.16 | 27.22 | 227 | 2.74 |
| gemma-4-26B-A4B-it-UD-Q8_K_XL | Q8_K_XL | NVIDIA GeForce RTX 5090 | 8,192 | 616.4 | 263.3 | 661.0 | 23.85 | 86.61 | 302 | 2.04 |
| gemma-4-26B-A4B-it-Q8_0 | Q8_0 | NVIDIA GeForce RTX 5090 | 8,192 | 615.6 | 270.7 | 659.6 | 24.04 | 101.78 | 297 | 2.07 |
| Qwen3.5-2B-UD-Q5_K_XL | Q5_K_XL | NVIDIA GeForce RTX 5090 | 8,192 | 610.9 | 91.2 | 132.7 | 6.12 | 7.97 | 260 | 2.35 |
| Qwen3.5-2B-Q4_K_S | Q4_K_S | NVIDIA GeForce RTX 5090 | 130,064 | 608.8 | 4637.7 | 5177.3 | 6.37 | 33.27 | 265 | 2.30 |
| Qwen3.5-2B-Q4_K_M | Q4_K_M | NVIDIA GeForce RTX 5090 | 8,192 | 608.3 | 1579.2 | 1791.6 | 6.37 | 29.63 | 231 | 2.64 |
| gemma-4-E2B-it-BF16 | BF16 | NVIDIA GeForce RTX 5060 Ti | 130,064 | 607.4 | 174.1 | 329.3 | 43.50 | 114.19 | 72 | 8.42 |
| Qwen3.5-2B-Q5_K_S | Q5_K_S | NVIDIA GeForce RTX 5090 | 32,768 | 606.8 | 4725.5 | 5188.9 | 6.34 | 8.04 | | |
| Qwen3.5-2B-UD-Q8_K_XL | Q8_K_XL | NVIDIA GeForce RTX 5090 | 32,768 | 597.3 | 117.8 | 167.2 | 6.26 | 8.07 | | |
| Qwen3.5-2B-UD-Q4_K_XL | Q4_K_XL | NVIDIA GeForce RTX 5090 | 16,384 | 596.2 | 113.9 | 164.3 | 6.19 | 8.05 | 264 | 2.26 |
| Qwen3.5-2B-UD-Q8_K_XL | Q8_K_XL | NVIDIA GeForce RTX 5060 Ti | 130,064 | 590.0 | 283.7 | 860.2 | 50.70 | 90.05 | 87 | 6.78 |
| Qwen3.5-2B-UD-IQ2_XXS | IQ2_XXS | NVIDIA GeForce RTX 5060 Ti | 130,064 | 569.7 | 181.0 | 439.0 | 47.31 | 84.08 | 80 | 7.17 |
| Qwen3.6-35B-A3B-MXFP4_MOE | | NVIDIA GeForce RTX 5090 | 32,768 | 568.4 | 573.2 | 1650.3 | 51.42 | 196.69 | 257 | 2.22 |
| Qwen3.6-35B-A3B-UD-Q4_K_M | Q4_K_M | NVIDIA GeForce RTX 5090 | 130,064 | 558.5 | 673.1 | 1541.0 | 52.12 | 178.14 | 299 | 1.87 |
| Qwen3.5-9B-UD-IQ2_XXS | IQ2_XXS | NVIDIA GeForce RTX 5090 | 16,384 | 557.7 | 424.1 | 1151.0 | 54.13 | 187.56 | 306 | 1.82 |
| Qwen3.5-9B-UD-Q3_K_XL | Q3_K_XL | NVIDIA GeForce RTX 5090 | 16,384 | 555.9 | 260.3 | 994.5 | 54.69 | 159.15 | 321 | 1.73 |
| Qwen3.5-9B-UD-Q4_K_XL | Q4_K_XL | NVIDIA GeForce RTX 5090 | 16,384 | 555.2 | 195.5 | 496.2 | 27.36 | 109.29 | 304 | 1.83 |
| Qwen3.6-35B-A3B-UD-IQ2_M | IQ2_M | NVIDIA GeForce RTX 5090 | 130,064 | 554.2 | 632.9 | 1584.8 | 52.72 | 194.54 | 297 | 1.86 |
| Qwen3.5-9B-UD-IQ3_XXS | IQ3_XXS | NVIDIA GeForce RTX 5090 | 32,768 | 553.6 | 303.2 | 996.5 | 54.73 | 162.90 | 319 | 1.74 |
| Qwen3.5-9B-IQ4_XS | IQ4_XS | NVIDIA GeForce RTX 5090 | 32,768 | 553.5 | 267.4 | 1007.8 | 55.41 | 190.06 | 310 | 1.79 |
| Qwen3.6-35B-A3B-UD-IQ1_M | IQ1_M | NVIDIA GeForce RTX 5090 | 32,768 | 553.4 | 341.5 | 1089.4 | 54.20 | 195.60 | 265 | 2.08 |
| Qwen3.6-35B-A3B-UD-IQ2_XXS | IQ2_XXS | NVIDIA GeForce RTX 5090 | 16,384 | 552.7 | 458.5 | 1447.8 | 54.24 | 204.46 | 299 | 1.85 |
| Qwen3.5-9B-Q4_0 | Q4_0 | NVIDIA GeForce RTX 5090 | 130,064 | 552.5 | 251.1 | 1003.0 | 55.75 | 164.88 | 310 | 1.78 |
| Qwen3.5-9B-UD-Q5_K_XL | Q5_K_XL | NVIDIA GeForce RTX 5090 | 130,064 | 552.1 | 332.1 | 999.0 | 54.65 | 147.89 | 321 | 1.72 |
| Qwen3.5-9B-Q4_1 | Q4_1 | NVIDIA GeForce RTX 5090 | 8,192 | 551.2 | 286.3 | 982.0 | 55.42 | 183.03 | 303 | 1.82 |
| Qwen3.6-35B-A3B-UD-IQ4_NL_XL | IQ4_NL_XL | NVIDIA GeForce RTX 5090 | 130,064 | 549.8 | 445.0 | 1108.4 | 53.76 | 192.66 | 268 | 2.05 |
| Qwen3.5-2B-BF16 | BF16 | | 130,064 | 548.6 | 246.9 | 636.3 | 54.08 | 99.43 | | |
| Qwen3.5-9B-UD-IQ2_M | IQ2_M | NVIDIA GeForce RTX 5090 | 32,768 | 548.3 | 665.0 | 1726.1 | 54.65 | 136.66 | 325 | 1.69 |
| DeepSeek-R1-Distill-Qwen-32B-Q3_K_M | Q3_K_M | NVIDIA GeForce RTX 5090 | 32,768 | 547.8 | 737.4 | 1697.4 | 47.76 | 187.86 | 417 | 1.31 |
| Qwen3.6-35B-A3B-UD-Q2_K_XL | Q2_K_XL | NVIDIA GeForce RTX 5090 | 130,064 | 547.4 | 390.8 | 1086.0 | 55.04 | 171.43 | 274 | 2.00 |
| DeepSeek-R1-Distill-Qwen-32B-Q4_K_M | Q4_K_M | NVIDIA GeForce RTX 5090 | 32,768 | 547.3 | 1196.3 | 3107.8 | 46.25 | 181.54 | 416 | 1.32 |
| Qwen3.6-35B-A3B-UD-Q3_K_S | Q3_K_S | NVIDIA GeForce RTX 5090 | 130,064 | 546.9 | 429.7 | 1094.9 | 54.60 | 167.03 | 267 | 2.05 |
| Qwen3.5-9B-Q3_K_M | Q3_K_M | NVIDIA GeForce RTX 5090 | 16,384 | 546.8 | 262.6 | 1019.9 | 55.92 | 185.02 | 310 | 1.76 |
| Qwen3.6-35B-A3B-UD-Q5_K_M | Q5_K_M | NVIDIA GeForce RTX 5090 | 16,384 | 546.0 | 495.1 | 1539.9 | 54.55 | 158.02 | 273 | 2.00 |
| Qwen3.6-35B-A3B-UD-Q3_K_XL | Q3_K_XL | NVIDIA GeForce RTX 5090 | 130,064 | 545.9 | 524.0 | 1397.7 | 54.10 | 186.43 | 294 | 1.86 |
| Qwen3.6-35B-A3B-UD-Q4_K_XL | Q4_K_XL | NVIDIA GeForce RTX 5090 | 130,064 | 545.4 | 484.8 | 1234.1 | 53.99 | 171.12 | 268 | 2.04 |
| Qwen3.5-9B-Q4_K_S | Q4_K_S | NVIDIA GeForce RTX 5090 | 32,768 | 544.7 | 255.4 | 1019.8 | 56.34 | 186.72 | 312 | 1.75 |
| Qwen3.5-9B-UD-Q2_K_XL | Q2_K_XL | NVIDIA GeForce RTX 5090 | 130,064 | 543.7 | 499.4 | 1558.2 | 55.48 | 137.49 | 313 | 1.74 |
| Qwen3.6-35B-A3B-UD-Q4_K_S | Q4_K_S | NVIDIA GeForce RTX 5090 | 130,064 | 543.6 | 413.7 | 1107.2 | 53.84 | 176.13 | 266 | 2.05 |
| Qwen3.6-35B-A3B-UD-Q5_K_S | Q5_K_S | NVIDIA GeForce RTX 5090 | 32,768 | 542.1 | 558.6 | 1759.7 | 53.91 | 165.26 | 268 | 2.02 |
| Qwen3.5-9B-Q4_K_M | Q4_K_M | NVIDIA GeForce RTX 5090 | 130,064 | 536.3 | 264.5 | 1021.8 | 56.94 | 160.12 | 311 | 1.72 |
| Qwen3.6-35B-A3B-UD-IQ4_NL | IQ4_NL | NVIDIA GeForce RTX 5090 | 130,064 | 535.1 | 571.9 | 1678.8 | 54.06 | 184.48 | 268 | 1.99 |
| Qwen3.6-35B-A3B-UD-Q3_K_M | Q3_K_M | NVIDIA GeForce RTX 5090 | 130,064 | 534.9 | 368.1 | 1091.1 | 55.31 | 181.40 | 264 | 2.02 |
| Qwen3.6-35B-A3B-UD-IQ4_XS | IQ4_XS | NVIDIA GeForce RTX 5090 | 130,064 | 534.7 | 406.6 | 1078.9 | 55.23 | 142.24 | 269 | 1.99 |
| Qwen3.6-35B-A3B-UD-IQ3_XXS | IQ3_XXS | NVIDIA GeForce RTX 5090 | 16,384 | 534.3 | 459.5 | 1155.7 | 55.47 | 144.18 | 276 | 1.94 |
| Qwen3.5-9B-Q5_K_M | Q5_K_M | NVIDIA GeForce RTX 5090 | 16,384 | 532.5 | 355.6 | 1175.2 | 57.74 | 185.89 | 327 | 1.63 |
| Qwen3.5-9B-Q5_K_S | Q5_K_S | NVIDIA GeForce RTX 5090 | 16,384 | 530.9 | 369.4 | 1187.8 | 57.11 | 201.20 | 328 | 1.62 |
| Qwen3.5-2B-UD-IQ3_XXS | IQ3_XXS | NVIDIA GeForce RTX 5060 Ti | 130,064 | 530.7 | 139.0 | 357.6 | 48.58 | 81.21 | 99 | 5.34 |
| Qwen3.5-9B-Q8_0 | Q8_0 | NVIDIA GeForce RTX 5090 | 32,768 | 529.6 | 256.3 | 1005.8 | 58.03 | 174.28 | 307 | 1.73 |
| Qwen3.5-9B-Q6_K | Q6_K | NVIDIA GeForce RTX 5090 | 16,384 | 529.1 | 264.9 | 1022.2 | 57.33 | 183.22 | 313 | 1.69 |
| Qwen3.6-35B-A3B-UD-IQ3_S | IQ3_S | NVIDIA GeForce RTX 5090 | 130,064 | 526.5 | 494.1 | 1459.2 | 55.92 | 187.16 | 277 | 1.90 |
| Qwen3.5-9B-UD-Q6_K_XL | Q6_K_XL | NVIDIA GeForce RTX 5090 | 130,064 | 524.4 | 280.7 | 1019.1 | 57.36 | 158.28 | 326 | 1.61 |
| Qwen3.5-2B-BF16 | BF16 | NVIDIA GeForce RTX 5090 | 16,384 | 518.8 | 12601.7 | 14093.3 | 7.51 | 33.84 | 193 | 2.69 |
| Qwen3.5-9B-UD-Q8_K_XL | Q8_K_XL | NVIDIA GeForce RTX 5090 | 32,768 | 513.1 | 353.8 | 1176.4 | 59.20 | 178.78 | 338 | 1.52 |
| Qwen3.5-0.8B-UD-IQ2_XXS | IQ2_XXS | NVIDIA GeForce RTX 4060 Ti | 8,192 | 483.3 | 217.8 | 638.9 | 32.19 | 34.32 | 42 | 11.51 |
| Qwen3.5-9B-BF16 | BF16 | NVIDIA GeForce RTX 5090 | 16,384 | 482.0 | 376.5 | 1185.4 | 62.78 | 205.51 | 344 | 1.40 |
| Qwen3.5-0.8B-Q3_K_M | Q3_K_M | NVIDIA GeForce RTX 4060 Ti | 8,192 | 478.6 | 268.7 | 378.8 | 32.33 | 35.05 | 45 | 10.63 |
| Qwen3.5-0.8B-Q3_K_S | Q3_K_S | NVIDIA GeForce RTX 4060 Ti | 16,384 | 477.2 | 392.1 | 806.8 | 65.37 | 73.76 | 43 | 11.12 |
| Qwen3.5-0.8B-UD-IQ3_XXS | IQ3_XXS | NVIDIA GeForce RTX 4060 Ti | 8,192 | 476.6 | 258.4 | 445.1 | 32.15 | 36.68 | 44 | 10.83 |
| Qwen3.5-0.8B-UD-IQ2_M | IQ2_M | NVIDIA GeForce RTX 4060 Ti | 16,384 | 474.8 | 326.8 | 670.1 | 64.53 | 75.41 | 43 | 11.04 |
| Qwen3.5-0.8B-UD-Q2_K_XL | Q2_K_XL | NVIDIA GeForce RTX 4060 Ti | 16,384 | 473.4 | 352.6 | 783.7 | 65.16 | 79.18 | 44 | 10.88 |
| Qwen3.5-0.8B-IQ4_XS | IQ4_XS | NVIDIA GeForce RTX 4060 Ti | 16,384 | 471.9 | 302.9 | 687.4 | 65.10 | 88.93 | 43 | 10.95 |
| Qwen3.5-0.8B-Q4_0 | Q4_0 | NVIDIA GeForce RTX 4060 Ti | 16,384 | 471.4 | 269.0 | 609.0 | 65.05 | 88.67 | 44 | 10.84 |
| Qwen3.5-0.8B-Q4_1 | Q4_1 | NVIDIA GeForce RTX 4060 Ti | 16,384 | 470.5 | 302.1 | 747.7 | 65.61 | 85.66 | 44 | 10.82 |
| Qwen3.5-0.8B-Q4_K_S | Q4_K_S | NVIDIA GeForce RTX 4060 Ti | 16,384 | 469.3 | 263.2 | 619.9 | 65.53 | 90.25 | 44 | 10.74 |
| Qwen3.5-0.8B-IQ4_NL | IQ4_NL | NVIDIA GeForce RTX 4060 Ti | 16,384 | 467.4 | 256.9 | 620.5 | 65.53 | 88.91 | 43 | 10.82 |
| Qwen3.5-0.8B-UD-Q5_K_XL | Q5_K_XL | NVIDIA GeForce RTX 4060 Ti | 16,384 | 466.9 | 239.1 | 619.5 | 66.21 | 91.87 | 44 | 10.64 |
| Qwen3.5-0.8B-Q4_K_M | Q4_K_M | NVIDIA GeForce RTX 4060 Ti | 16,384 | 466.7 | 232.8 | 555.7 | 65.73 | 94.04 | 44 | 10.73 |
| Qwen3.5-0.8B-UD-Q4_K_XL | Q4_K_XL | NVIDIA GeForce RTX 4060 Ti | 32,768 | 464.8 | 338.6 | 733.8 | 65.76 | 88.91 | 44 | 10.64 |
| Qwen3.5-2B-UD-IQ2_M | IQ2_M | NVIDIA GeForce RTX 5060 Ti | 32,768 | 464.7 | 140.3 | 306.7 | 52.76 | 96.32 | 82 | 5.65 |
| Qwen3.5-0.8B-UD-Q3_K_XL | Q3_K_XL | NVIDIA GeForce RTX 4060 Ti | 16,384 | 463.6 | 252.0 | 645.2 | 66.02 | 89.27 | 45 | 10.35 |
| Qwen3.5-0.8B-Q5_K_M | Q5_K_M | NVIDIA GeForce RTX 4060 Ti | 16,384 | 462.8 | 263.1 | 632.5 | 66.11 | 89.85 | 45 | 10.40 |
| Qwen3.5-0.8B-Q5_K_S | Q5_K_S | NVIDIA GeForce RTX 4060 Ti | 32,768 | 462.5 | 261.8 | 633.1 | 66.38 | 89.96 | 44 | 10.49 |
| Qwen3.5-0.8B-Q6_K | Q6_K | NVIDIA GeForce RTX 4060 Ti | 16,384 | 460.7 | 233.9 | 542.6 | 65.90 | 90.54 | 44 | 10.42 |
| DeepSeek-R1-Distill-Qwen-32B-Q5_K_M | Q5_K_M | NVIDIA GeForce RTX 5090 | 16,384 | 459.3 | 858.3 | 1901.6 | 53.84 | 203.96 | 433 | 1.06 |
| Qwen3.5-0.8B-Q8_0 | Q8_0 | NVIDIA GeForce RTX 4060 Ti | 16,384 | 457.8 | 252.5 | 615.1 | 66.88 | 94.62 | 45 | 10.17 |
| Qwen3.6-35B-A3B-UD-Q5_K_XL | Q5_K_XL | NVIDIA GeForce RTX 5090 | 16,384 | 457.6 | 428.9 | 1098.5 | 64.00 | 216.36 | 252 | 1.82 |
| Qwen3.5-0.8B-UD-Q6_K_XL | Q6_K_XL | NVIDIA GeForce RTX 4060 Ti | 16,384 | 451.7 | 266.0 | 615.0 | 67.73 | 91.78 | 45 | 10.04 |
| Qwen3.5-2B-UD-IQ2_XXS | IQ2_XXS | NVIDIA GeForce RTX 4060 Ti | 32,768 | 450.9 | 470.6 | 971.7 | 69.35 | 73.53 | 45 | 10.02 |
| Qwen3.5-4B-Q4_0 | Q4_0 | NVIDIA GeForce RTX 5090 | 32,768 | 446.6 | 2260.6 | 2525.2 | 8.35 | 10.88 | | |
| Qwen3.5-2B-UD-IQ2_M | IQ2_M | NVIDIA GeForce RTX 4060 Ti | 32,768 | 446.5 | 465.8 | 954.3 | 70.00 | 74.78 | 46 | 9.73 |
| Qwen3.5-2B-Q3_K_M | Q3_K_M | NVIDIA GeForce RTX 4060 Ti | 32,768 | 438.7 | 366.1 | 845.0 | 70.53 | 92.93 | 47 | 9.35 |
| Qwen3.5-4B-Q4_1 | Q4_1 | NVIDIA GeForce RTX 5090 | 8,192 | 437.7 | 2208.7 | 2511.0 | 8.76 | 29.14 | | |
| Qwen3.5-2B-UD-Q2_K_XL | Q2_K_XL | NVIDIA GeForce RTX 4060 Ti | 16,384 | 437.6 | 448.7 | 972.2 | 70.32 | 85.67 | 47 | 9.25 |
| Qwen3.5-0.8B-UD-Q8_K_XL | Q8_K_XL | NVIDIA GeForce RTX 4060 Ti | 32,768 | 435.3 | 254.1 | 504.8 | 70.05 | 99.03 | 46 | 9.50 |
| Qwen3.5-4B-IQ4_NL | IQ4_NL | NVIDIA GeForce RTX 5090 | 8,192 | 434.5 | 2236.7 | 2490.2 | 8.78 | 58.74 | | |
| Qwen3.5-4B-IQ4_XS | IQ4_XS | NVIDIA GeForce RTX 5090 | 3,055 | 433.1 | 200.8 | 294.3 | 8.45 | 10.96 | 253 | 1.71 |
| Qwen3.5-2B-Q4_K_M | Q4_K_M | NVIDIA GeForce RTX 4060 Ti | 16,384 | 432.9 | 285.0 | 804.8 | 71.36 | 96.11 | 47 | 9.15 |
| Qwen3.5-2B-UD-IQ3_XXS | IQ3_XXS | NVIDIA GeForce RTX 4060 Ti | 32,768 | 432.5 | 378.0 | 878.3 | 71.72 | 93.16 | 48 | 9.11 |
| Qwen3.5-2B-IQ4_XS | IQ4_XS | NVIDIA GeForce RTX 4060 Ti | 32,768 | 431.9 | 324.7 | 844.1 | 71.24 | 102.37 | 47 | 9.19 |
| Qwen3.5-2B-Q4_0 | Q4_0 | NVIDIA GeForce RTX 4060 Ti | 16,384 | 431.1 | 307.5 | 879.1 | 71.31 | 99.36 | 47 | 9.15 |
| Qwen3.5-2B-Q4_K_S | Q4_K_S | NVIDIA GeForce RTX 4060 Ti | 130,064 | 430.9 | 355.8 | 783.0 | 71.41 | 101.27 | 47 | 9.17 |
| Qwen3.5-2B-IQ4_NL | IQ4_NL | NVIDIA GeForce RTX 4060 Ti | 16,384 | 430.3 | 277.4 | 805.9 | 71.70 | 109.25 | 46 | 9.36 |
| Qwen3.5-2B-UD-Q3_K_XL | Q3_K_XL | NVIDIA GeForce RTX 4060 Ti | 32,768 | 429.4 | 317.4 | 827.7 | 72.19 | 100.46 | 48 | 9.00 |
| Qwen3.5-4B-UD-IQ2_XXS | IQ2_XXS | NVIDIA GeForce RTX 5090 | 8,192 | 428.7 | 2472.8 | 2676.4 | 8.57 | 11.17 | | |
| Qwen3.5-2B-Q4_1 | Q4_1 | NVIDIA GeForce RTX 4060 Ti | 16,384 | 428.5 | 301.9 | 897.5 | 71.81 | 100.85 | 45 | 9.48 |
| Qwen3.5-2B-Q3_K_S | Q3_K_S | NVIDIA GeForce RTX 4060 Ti | 8,192 | 428.1 | 270.3 | 343.6 | 35.98 | 43.20 | 47 | 9.13 |
| Qwen3.5-4B-UD-Q2_K_XL | Q2_K_XL | NVIDIA GeForce RTX 5090 | 16,384 | 427.3 | 2392.6 | 2636.8 | 8.56 | 13.88 | | |
| Qwen3.5-0.8B-BF16 | BF16 | NVIDIA GeForce RTX 4060 Ti | 16,384 | 425.1 | 312.8 | 905.4 | 71.58 | 111.96 | 42 | 10.10 |
| Qwen3.5-2B-UD-Q4_K_XL | Q4_K_XL | NVIDIA GeForce RTX 4060 Ti | 32,768 | 424.8 | 300.9 | 739.1 | 72.20 | 111.07 | 48 | 8.92 |
| Qwen3.5-2B-UD-Q5_K_XL | Q5_K_XL | NVIDIA GeForce RTX 4060 Ti | 16,384 | 422.2 | 310.2 | 946.6 | 72.92 | 106.31 | 48 | 8.87 |
| Qwen3.5-2B-Q5_K_S | Q5_K_S | NVIDIA GeForce RTX 4060 Ti | 16,384 | 421.3 | 294.8 | 862.7 | 72.83 | 106.24 | 48 | 8.70 |
| Qwen3.5-2B-Q5_K_M | Q5_K_M | NVIDIA GeForce RTX 4060 Ti | 16,384 | 419.0 | 344.4 | 967.4 | 73.34 | 100.59 | 49 | 8.52 |
| Qwen3.5-4B-Q3_K_S | Q3_K_S | NVIDIA GeForce RTX 5090 | 8,192 | 417.4 | 207.2 | 305.4 | 8.79 | 11.23 | | |
| Qwen3.5-2B-Q6_K | Q6_K | NVIDIA GeForce RTX 4060 Ti | 16,384 | 417.3 | 330.4 | 883.2 | 73.78 | 101.97 | 49 | 8.60 |
| Qwen3.5-35B-A3B-UD-Q4_K_L | Q4_K_L | NVIDIA GeForce RTX 5090 | 32,768 | 415.6 | 575.1 | 1559.6 | 72.18 | 214.21 | 338 | 1.23 |
| Qwen3.5-4B-UD-IQ2_M | IQ2_M | NVIDIA GeForce RTX 5090 | 130,064 | 407.6 | 2344.9 | 2698.7 | 9.36 | 11.63 | | |
| Qwen3.5-2B-Q8_0 | Q8_0 | NVIDIA GeForce RTX 4060 Ti | 130,064 | 406.8 | 328.2 | 955.2 | 75.61 | 115.53 | 46 | 8.94 |
| Qwen3.5-35B-A3B-MXFP4_MOE | | NVIDIA GeForce RTX 5090 | 16,384 | 405.6 | 518.6 | 1478.6 | 74.06 | 209.11 | 337 | 1.20 |
| Qwen3.5-4B-Q6_K | Q6_K | NVIDIA GeForce RTX 5090 | 130,064 | 403.6 | 2348.2 | 2701.7 | 9.51 | 14.26 | | |
| Qwen3.5-35B-A3B-APEX-I-Compact | | NVIDIA GeForce RTX 5090 | 32,768 | 402.9 | 565.3 | 1617.9 | 75.05 | 242.18 | 335 | 1.20 |
| Qwen3.5-35B-A3B-APEX-Quality | Quality | NVIDIA GeForce RTX 5090 | 130,064 | 401.9 | 505.8 | 1504.5 | 75.37 | 179.71 | 336 | 1.20 |
| Qwen3.5-2B-UD-Q6_K_XL | Q6_K_XL | NVIDIA GeForce RTX 4060 Ti | 16,384 | 401.1 | 304.5 | 756.8 | 76.61 | 120.74 | 50 | 8.10 |
| Qwen3.5-35B-A3B-APEX-I-Quality | Quality | NVIDIA GeForce RTX 5090 | 130,064 | 400.8 | 519.2 | 1624.8 | 75.42 | 214.32 | 336 | 1.19 |
| Qwen3.5-4B-UD-IQ3_XXS | IQ3_XXS | NVIDIA GeForce RTX 5090 | 32,768 | 400.4 | 2396.9 | 2733.1 | 9.59 | 41.83 | | |
| Qwen3.5-35B-A3B-UD-IQ2_M | IQ2_M | NVIDIA GeForce RTX 5090 | 16,384 | 400.4 | 523.1 | 1373.6 | 75.70 | 248.97 | 354 | 1.13 |
| Qwen3.5-35B-A3B-Q4_K_M | Q4_K_M | NVIDIA GeForce RTX 5090 | 130,064 | 400.1 | 561.6 | 1963.8 | 75.46 | 208.36 | 326 | 1.23 |
| Qwen3.5-35B-A3B-UD-Q4_K_XL | Q4_K_XL | NVIDIA GeForce RTX 5090 | 32,768 | 399.8 | 570.2 | 1620.8 | 75.34 | 190.59 | 326 | 1.23 |
| Qwen3.5-35B-A3B-UD-Q3_K_XL | Q3_K_XL | NVIDIA GeForce RTX 5090 | 32,768 | 399.7 | 537.3 | 1554.2 | 75.70 | 225.77 | 327 | 1.22 |
| Qwen3.5-35B-A3B-APEX-Compact | | NVIDIA GeForce RTX 5090 | 130,064 | 399.1 | 483.6 | 1585.2 | 75.84 | 186.23 | 333 | 1.20 |
| Qwen3.5-35B-A3B-Q3_K_M | Q3_K_M | NVIDIA GeForce RTX 5090 | 16,384 | 398.1 | 583.0 | 2062.3 | 75.91 | 179.72 | 347 | 1.15 |
| Qwen3.5-2B-UD-Q2_K_XL | Q2_K_XL | NVIDIA GeForce RTX 5060 Ti | 130,064 | 397.6 | 147.6 | 269.0 | 59.47 | 108.67 | 78 | 5.11 |
| Qwen3.5-35B-A3B-Q4_K_S | Q4_K_S | NVIDIA GeForce RTX 5090 | 16,384 | 397.6 | 427.3 | 1291.2 | 76.04 | 238.72 | 345 | 1.15 |
| Qwen3.5-35B-A3B-Q5_K_S | Q5_K_S | NVIDIA GeForce RTX 5090 | 32,768 | 397.1 | 425.7 | 1315.5 | 76.36 | 245.60 | 328 | 1.21 |
| Qwen3.5-0.8B-Q4_0 | Q4_0 | NVIDIA GB10 | 2,048 | 395.6 | 123.7 | 236.3 | 9.57 | 12.14 | | |
| Qwen3.5-35B-A3B-Q3_K_S | Q3_K_S | NVIDIA GeForce RTX 5090 | 130,064 | 395.2 | 457.1 | 1482.0 | 76.99 | 190.91 | 326 | 1.21 |
| Qwen3.5-35B-A3B-UD-IQ2_XXS | IQ2_XXS | NVIDIA GeForce RTX 5090 | 16,384 | 394.6 | 370.2 | 1123.6 | 77.30 | 214.38 | 350 | 1.13 |
| Qwen3.5-4B-Q8_0 | Q8_0 | NVIDIA GeForce RTX 5090 | 32,768 | 393.8 | 7314.2 | 8046.0 | 9.80 | 18.15 | | |
| Qwen3.5-0.8B-UD-IQ2_XXS | IQ2_XXS | NVIDIA GB10 | 32,768 | 393.3 | 135.2 | 234.3 | 9.67 | 12.44 | | |
| Qwen3.5-35B-A3B-UD-IQ4_XS | IQ4_XS | NVIDIA GeForce RTX 5090 | 130,064 | 393.0 | 541.4 | 1675.2 | 76.86 | 202.32 | 326 | 1.21 |
| Qwen3.5-0.8B-Q4_1 | Q4_1 | NVIDIA GB10 | 2,048 | 393.0 | 119.9 | 237.7 | 9.62 | 12.26 | | |
| Qwen3.5-35B-A3B-UD-Q2_K_XL | Q2_K_XL | NVIDIA GeForce RTX 5090 | 32,768 | 391.6 | 592.8 | 2003.4 | 77.00 | 189.95 | 330 | 1.19 |
| Qwen3.5-35B-A3B-UD-IQ4_NL | IQ4_NL | NVIDIA GeForce RTX 5090 | 32,768 | 390.9 | 508.8 | 1556.5 | 77.14 | 225.04 | 327 | 1.20 |
| Qwen3.5-35B-A3B-UD-IQ3_XXS | IQ3_XXS | NVIDIA GeForce RTX 5090 | 32,768 | 390.6 | 702.0 | 2320.4 | 77.19 | 191.16 | 331 | 1.18 |
| Qwen3.5-35B-A3B-UD-IQ3_S | IQ3_S | NVIDIA GeForce RTX 5090 | 32,768 | 390.5 | 538.2 | 1652.8 | 77.68 | 190.04 | 332 | 1.18 |
| Qwen3.5-35B-A3B-APEX-I-Balanced | Balanced | NVIDIA GeForce RTX 5090 | 16,384 | 390.2 | 417.8 | 1250.9 | 77.59 | 231.99 | 335 | 1.16 |
| Qwen3.5-35B-A3B-APEX-Mini | | NVIDIA GeForce RTX 5090 | 130,064 | 389.5 | 392.2 | 1162.7 | 78.40 | 223.97 | 333 | 1.17 |
| Qwen3.5-35B-A3B-APEX-Balanced | Balanced | NVIDIA GeForce RTX 5090 | 32,768 | 389.0 | 653.3 | 1597.2 | 77.13 | 213.84 | 362 | 1.08 |
| Qwen3.5-0.8B-UD-Q2_K_XL | Q2_K_XL | NVIDIA GB10 | 65,536 | 386.5 | 137.7 | 239.6 | 9.84 | 12.74 | | |
| Qwen3.5-0.8B-IQ4_NL | IQ4_NL | NVIDIA GB10 | 32,768 | 386.2 | 110.0 | 194.9 | 9.65 | 15.20 | | |
| Qwen3.5-4B-Q3_K_M | Q3_K_M | NVIDIA GeForce RTX 5090 | 32,768 | 386.1 | 16752.7 | 18880.4 | 10.07 | 59.59 | | |
| Qwen3.5-0.8B-Q3_K_S | Q3_K_S | NVIDIA GB10 | 8,192 | 385.0 | 133.8 | 235.5 | 9.90 | 12.43 | | |
| Qwen3.5-4B-UD-Q3_K_XL | Q3_K_XL | NVIDIA GeForce RTX 5090 | 16,384 | 384.3 | 7361.5 | 8196.9 | 10.07 | 48.91 | | |
| Qwen3.5-0.8B-UD-IQ2_M | IQ2_M | NVIDIA GB10 | 8,192 | 381.6 | 137.2 | 238.0 | 9.98 | 12.83 | | |
| Qwen3.5-9B-IQ4_NL | IQ4_NL | NVIDIA GeForce RTX 5090 | 130,064 | 380.4 | 2512.7 | 2851.5 | 10.15 | 56.49 | 218 | 1.75 |
| Qwen3.5-4B-IQ4_XS | IQ4_XS | NVIDIA GeForce RTX 5060 Ti | 32,768 | 379.4 | 584.9 | 1920.4 | 79.46 | 172.02 | 97 | 3.92 |
| Qwen3.5-4B-Q4_0 | Q4_0 | NVIDIA GeForce RTX 5060 Ti | 16,384 | 378.9 | 530.4 | 1473.1 | 79.97 | 186.95 | 103 | 3.69 |
| Qwen3.5-4B-IQ4_NL | IQ4_NL | NVIDIA GeForce RTX 5060 Ti | 130,064 | 377.1 | 517.5 | 1669.7 | 79.75 | 180.16 | 97 | 3.91 |
| Qwen3.5-4B-Q3_K_S | Q3_K_S | NVIDIA GeForce RTX 5060 Ti | 16,384 | 376.8 | 664.1 | 2165.2 | 80.52 | 178.56 | 103 | 3.64 |
| Qwen3.5-2B-UD-Q8_K_XL | Q8_K_XL | NVIDIA GeForce RTX 4060 Ti | 16,384 | 376.7 | 351.5 | 915.7 | 81.13 | 122.87 | 47 | 8.07 |
| Qwen3.5-0.8B-IQ4_XS | IQ4_XS | NVIDIA GB10 | 65,536 | 376.1 | 126.9 | 227.7 | 10.04 | 12.76 | | |
| Qwen3.5-0.8B-UD-IQ3_XXS | IQ3_XXS | NVIDIA GB10 | 2,048 | 374.5 | 127.5 | 233.1 | 10.21 | 13.06 | | |
| Qwen3.5-4B-UD-Q6_K_XL | Q6_K_XL | NVIDIA GeForce RTX 5090 | 32,768 | 373.0 | 17685.5 | 19603.0 | 10.39 | 53.32 | 267 | 1.40 |
| Qwen3.5-4B-Q4_1 | Q4_1 | NVIDIA GeForce RTX 5060 Ti | 32,768 | 372.6 | 521.2 | 1756.3 | 80.69 | 171.40 | 96 | 3.87 |
| Qwen3.5-35B-A3B-Q5_K_M | Q5_K_M | NVIDIA GeForce RTX 5090 | 16,384 | 371.3 | 685.2 | 2126.1 | 81.15 | 201.86 | 332 | 1.12 |
| Qwen3.5-4B-Q3_K_M | Q3_K_M | NVIDIA GeForce RTX 5060 Ti | 32,768 | 371.1 | 646.1 | 1894.1 | 80.55 | 190.10 | 104 | 3.56 |
| Qwen3.5-4B-Q4_K_M | Q4_K_M | NVIDIA GeForce RTX 5060 Ti | 32,768 | 368.7 | 607.1 | 2012.8 | 81.30 | 185.04 | 98 | 3.75 |
| Qwen3.5-0.8B-Q3_K_M | Q3_K_M | NVIDIA GB10 | 65,536 | 368.4 | 112.0 | 201.7 | 10.19 | 14.66 | | |
| Qwen3.5-0.8B-Q4_K_S | Q4_K_S | NVIDIA GB10 | 32,768 | 364.8 | 2583.8 | 2964.5 | 10.63 | 44.87 | | |
| Qwen3.5-0.8B-Q4_K_M | Q4_K_M | NVIDIA GB10 | 130,064 | 364.1 | 2640.7 | 2986.4 | 10.57 | 13.57 | | |
| Qwen3.5-35B-A3B-UD-Q5_K_XL | Q5_K_XL | NVIDIA GeForce RTX 5090 | 8,192 | 363.8 | 331.7 | 919.9 | 41.70 | 120.87 | 323 | 1.13 |
| Qwen3.5-4B-UD-Q5_K_XL | Q5_K_XL | NVIDIA GeForce RTX 5090 | 8,192 | 363.1 | 7839.8 | 8633.4 | 10.61 | 43.61 | | |
| Qwen3.5-4B-Q4_K_M | Q4_K_M | NVIDIA GeForce RTX 5090 | 32,768 | 360.2 | 2755.1 | 3037.4 | 10.62 | 12.97 | | |
| DeepSeek-R1-Distill-Qwen-32B-Q6_K | Q6_K | NVIDIA GeForce RTX 5090 | 8,192 | 359.4 | 385.3 | 1001.3 | 38.06 | 138.92 | 436 | 0.83 |
| Qwen3.5-0.8B-Q5_K_M | Q5_K_M | NVIDIA GB10 | 32,768 | 358.6 | 2666.2 | 3037.4 | 10.78 | 13.85 | | |
| Qwen3.5-4B-Q5_K_M | Q5_K_M | NVIDIA GeForce RTX 5090 | 16,384 | 358.4 | 138.3 | 218.7 | 10.55 | 12.84 | | |
| Qwen3.5-2B-BF16 | BF16 | NVIDIA GeForce RTX 4060 Ti | 16,384 | 357.6 | 382.6 | 1137.6 | 85.23 | 135.69 | 50 | 7.22 |
| Qwen3.5-0.8B-Q5_K_S | Q5_K_S | NVIDIA GB10 | 2,048 | 356.6 | 2612.0 | 3052.6 | 10.90 | 14.67 | | |
| Qwen3.5-4B-UD-Q4_K_XL | Q4_K_XL | NVIDIA GeForce RTX 5090 | 8,192 | 356.3 | 2685.1 | 3053.3 | 10.79 | 44.57 | | |
| Qwen3.5-0.8B-UD-Q4_K_XL | Q4_K_XL | NVIDIA GB10 | 65,536 | 355.5 | 111.5 | 194.9 | 10.63 | 13.73 | | |
| Qwen3.5-0.8B-UD-Q3_K_XL | Q3_K_XL | NVIDIA GB10 | 130,064 | 355.1 | 2451.6 | 3005.3 | 10.87 | 47.80 | | |
| Qwen3.5-4B-Q4_K_S | Q4_K_S | NVIDIA GeForce RTX 5090 | 32,768 | 354.3 | 7983.5 | 8846.7 | 10.90 | 49.01 | | |
| Qwen3.5-0.8B-UD-Q5_K_XL | Q5_K_XL | NVIDIA GB10 | 2,048 | 350.7 | 2667.5 | 3086.1 | 11.09 | 14.48 | | |
| gemma-4-E2B-it-IQ4_NL | IQ4_NL | Apple M4 Pro | 32,768 | 349.7 | 393.2 | 940.8 | 80.29 | 182.73 | 28 | 12.58 |
| Qwen3.6-27B-IQ4_XS | IQ4_XS | NVIDIA GeForce RTX 5090 | 16,384 | 349.1 | 1110.7 | 3378.9 | 83.69 | 345.09 | 380 | 0.92 |
| Qwen3.5-4B-UD-Q8_K_XL | Q8_K_XL | NVIDIA GeForce RTX 5090 | 32,768 | 349.1 | 8065.5 | 9006.7 | 11.11 | 52.61 | 209 | 1.67 |
| Qwen3.5-4B-Q5_K_S | Q5_K_S | NVIDIA GeForce RTX 5090 | 8,192 | 347.9 | 18868.0 | 20994.9 | 11.15 | 49.70 | | |
| gemma-4-26B-A4B-it-UD-IQ2_M | IQ2_M | NVIDIA GB10 | 65,536 | 341.9 | 1306.0 | 4706.0 | 86.83 | 185.16 | 49 | 6.94 |
| gemma-4-E2B-it-UD-Q4_K_XL | Q4_K_XL | Apple M4 Pro | 16,384 | 341.1 | 465.8 | 968.6 | 84.53 | 243.70 | 28 | 12.10 |
| Qwen3.5-0.8B-Q6_K | Q6_K | NVIDIA GB10 | 130,064 | 341.1 | 93.5 | 178.1 | 11.08 | 17.92 | | |
| gemma-4-E2B-it-Q8_0 | Q8_0 | Apple M4 Pro | 32,768 | 339.4 | 401.3 | 955.1 | 78.42 | 187.70 | 30 | 11.16 |
| gemma-4-E2B-it-UD-Q8_K_XL | Q8_K_XL | Apple M4 Pro | 130,064 | 338.2 | 399.3 | 983.2 | 79.88 | 198.39 | 30 | 11.24 |
| Qwen3.5-0.8B-UD-Q6_K_XL | Q6_K_XL | NVIDIA GB10 | 130,064 | 337.1 | 2799.9 | 3148.4 | 11.48 | 17.62 | | |
| gemma-4-26B-A4B-it-UD-IQ2_XXS | IQ2_XXS | NVIDIA GB10 | 32,768 | 336.8 | 1001.3 | 3842.2 | 89.35 | 226.12 | 46 | 7.26 |
| Qwen3.5-0.8B-Q8_0 | Q8_0 | NVIDIA GB10 | 130,064 | 335.4 | 2761.9 | 3214.3 | 11.59 | 47.50 | | |
| Qwen3.6-35B-A3B-UD-Q6_K | Q6_K | NVIDIA GeForce RTX 5090 | 8,192 | 333.4 | 343.9 | 708.9 | 44.63 | 140.03 | 233 | 1.43 |
| gemma-4-E2B-it-IQ4_XS | IQ4_XS | Apple M4 Pro | 130,064 | 333.3 | 418.6 | 995.4 | 81.38 | 194.05 | 30 | 11.30 |
| Qwen3.5-9B-Q3_K_S | Q3_K_S | NVIDIA GeForce RTX 5090 | 32,768 | 332.6 | 2881.3 | 3280.6 | 11.65 | 62.40 | 216 | 1.54 |
| gemma-4-E2B-it-UD-Q3_K_XL | Q3_K_XL | Apple M4 Pro | 16,384 | 330.4 | 424.1 | 935.5 | 82.83 | 213.39 | 28 | 11.72 |
| Qwen3.5-9B-IQ4_XS | IQ4_XS | NVIDIA GeForce RTX 5060 Ti | 130,064 | 329.0 | 459.8 | 1001.9 | 92.77 | 253.36 | 109 | 3.01 |
| gemma-4-E2B-it-Q4_1 | Q4_1 | Apple M4 Pro | 130,064 | 328.8 | 469.4 | 1037.4 | 82.67 | 227.10 | 33 | 9.99 |
| Qwen3.5-9B-IQ4_NL | IQ4_NL | NVIDIA GeForce RTX 5060 Ti | 16,384 | 328.3 | 498.8 | 1255.2 | 93.56 | 248.88 | 116 | 2.82 |
| Qwen3.5-9B-Q4_0 | Q4_0 | NVIDIA GeForce RTX 5060 Ti | 130,064 | 328.1 | 411.5 | 1016.5 | 93.65 | 215.59 | 109 | 3.02 |
| Qwen3.6-27B-Q5_K_S | Q5_K_S | NVIDIA GeForce RTX 5090 | 32,768 | 327.6 | 971.5 | 2643.1 | 89.78 | 454.18 | 387 | 0.85 |
| gemma-4-E2B-it-Q4_0 | Q4_0 | Apple M4 Pro | 32,768 | 327.6 | 444.1 | 900.8 | 83.03 | 288.46 | 30 | 10.92 |
| gemma-4-E2B-it-UD-Q6_K_XL | Q6_K_XL | Apple M4 Pro | 32,768 | 327.3 | 434.2 | 1006.2 | 83.14 | 212.35 | 30 | 10.98 |
| Qwen3.5-27B-IQ4_XS | IQ4_XS | NVIDIA GeForce RTX 5090 | 32,768 | 326.2 | 906.2 | 2616.6 | 90.92 | 451.66 | 289 | 1.13 |
| Qwen3.5-9B-UD-IQ2_XXS | IQ2_XXS | NVIDIA GeForce RTX 5060 Ti | 32,768 | 326.0 | 1019.1 | 3721.3 | 90.12 | 190.94 | 107 | 3.05 |
| Qwen3.5-2B-UD-Q3_K_XL | Q3_K_XL | NVIDIA GeForce RTX 5060 Ti | 32,768 | 325.7 | 157.3 | 259.5 | 59.87 | 118.07 | 90 | 3.61 |
| Qwen3.6-27B-Q4_K_M | Q4_K_M | NVIDIA GeForce RTX 5090 | 32,768 | 321.4 | 957.6 | 2498.6 | 91.04 | 465.86 | 316 | 1.02 |
| Qwen3.5-9B-Q4_1 | Q4_1 | NVIDIA GeForce RTX 5060 Ti | 32,768 | 321.3 | 561.6 | 1325.9 | 94.27 | 231.01 | 115 | 2.79 |
| gemma-4-E2B-it-Q6_K | Q6_K | Apple M4 Pro | 16,384 | 320.9 | 454.1 | 1021.2 | 86.01 | 236.56 | 29 | 11.07 |
| gemma-4-E2B-it-UD-Q2_K_XL | Q2_K_XL | Apple M4 Pro | 130,064 | 320.3 | 442.1 | 1012.0 | 84.11 | 210.51 | 28 | 11.56 |
| Qwen3.5-2B-UD-IQ2_XXS | IQ2_XXS | NVIDIA GB10 | 130,064 | 320.1 | 3174.6 | 3538.0 | 12.02 | 14.87 | 62 | 5.15 |
| Qwen3.5-9B-Q4_K_S | Q4_K_S | NVIDIA GeForce RTX 5060 Ti | 32,768 | 319.4 | 536.9 | 1633.0 | 94.73 | 239.94 | 110 | 2.91 |
| Qwen3.5-9B-UD-Q4_K_XL | Q4_K_XL | NVIDIA GeForce RTX 5060 Ti | 130,064 | 319.4 | 562.1 | 1978.5 | 94.75 | 232.04 | 111 | 2.89 |
| gemma-4-E2B-it-Q5_K_S | Q5_K_S | Apple M4 Pro | 130,064 | 318.9 | 495.2 | 1079.3 | 86.59 | 265.72 | 29 | 11.07 |
| Qwen3.5-9B-Q4_K_M | Q4_K_M | NVIDIA GeForce RTX 5060 Ti | 130,064 | 318.7 | 564.4 | 1296.1 | 95.71 | 274.25 | 119 | 2.67 |
| Qwen3.5-27B-IQ4_NL | IQ4_NL | NVIDIA GeForce RTX 5090 | 32,768 | 318.4 | 1304.6 | 4160.4 | 90.78 | 338.05 | 291 | 1.09 |
| Qwen3.5-35B-A3B-UD-Q6_K_S | Q6_K_S | NVIDIA GeForce RTX 5090 | 8,192 | 318.1 | 408.0 | 1036.0 | 47.26 | 147.07 | 324 | 0.98 |
| Qwen3.5-9B-Q3_K_M | Q3_K_M | NVIDIA GeForce RTX 5060 Ti | 16,384 | 317.7 | 435.6 | 972.5 | 96.87 | 236.44 | 110 | 2.90 |
| gemma-4-E2B-it-Q4_K_M | Q4_K_M | Apple M4 Pro | 130,064 | 314.9 | 477.5 | 1093.5 | 87.83 | 242.40 | 33 | 9.57 |
| gemma-4-E2B-it-UD-Q5_K_XL | Q5_K_XL | Apple M4 Pro | 130,064 | 314.8 | 448.4 | 1031.1 | 86.70 | 216.51 | 29 | 10.89 |
| Qwen3.5-9B-Q3_K_S | Q3_K_S | NVIDIA GeForce RTX 5060 Ti | 32,768 | 314.4 | 547.3 | 1349.3 | 96.46 | 251.58 | 117 | 2.68 |
| Qwen3.5-4B-BF16 | BF16 | NVIDIA GeForce RTX 5090 | 130,064 | 313.2 | 8847.9 | 9966.8 | 12.37 | 62.68 | | |
| gemma-4-E2B-it-Q3_K_S | Q3_K_S | Apple M4 Pro | 130,064 | 313.1 | 479.4 | 1096.5 | 88.77 | 243.28 | 30 | 10.30 |
| gemma-4-E2B-it-BF16 | BF16 | Apple M4 Pro | 130,064 | 312.0 | 424.2 | 949.9 | 84.78 | 213.44 | 32 | 9.66 |
| Qwen3.6-27B-Q4_K_S | Q4_K_S | NVIDIA GeForce RTX 5090 | 32,768 | 311.6 | 1403.6 | 3455.7 | 91.36 | 429.63 | 333 | 0.94 |
| Qwen3.5-27B-Q3_K_S | Q3_K_S | NVIDIA GeForce RTX 5090 | 32,768 | 311.5 | 875.6 | 2650.3 | 94.89 | 440.04 | 295 | 1.06 |
| gemma-4-E2B-it-Q3_K_M | Q3_K_M | Apple M4 Pro | 32,768 | 311.2 | 458.1 | 1020.8 | 87.78 | 229.36 | 29 | 10.73 |
| Qwen3.5-9B-UD-Q5_K_XL | Q5_K_XL | NVIDIA GeForce RTX 5060 Ti | 130,064 | 311.1 | 485.5 | 999.4 | 97.97 | 272.59 | 113 | 2.76 |
| Qwen3.5-2B-UD-IQ2_XXS | IQ2_XXS | Apple M4 Pro | 8,192 | 309.0 | 597.2 | 1535.5 | 101.21 | 121.51 | 28 | 11.08 |
| Qwen3.5-2B-UD-IQ2_M | IQ2_M | NVIDIA GB10 | 8,192 | 308.7 | 144.9 | 284.8 | 12.43 | 15.22 | 62 | 5.01 |
| Qwen3.5-27B-Q3_K_M | Q3_K_M | NVIDIA GeForce RTX 5090 | 16,384 | 307.4 | 1013.5 | 2720.9 | 94.91 | 383.23 | 294 | 1.05 |
| Qwen3.5-9B-Q5_K_S | Q5_K_S | NVIDIA GeForce RTX 5060 Ti | 130,064 | 306.5 | 479.7 | 1027.5 | 99.61 | 278.49 | 115 | 2.67 |
| Qwen3.6-27B-Q3_K_S | Q3_K_S | NVIDIA GeForce RTX 5090 | 130,064 | 306.4 | 965.3 | 2518.7 | 96.51 | 452.15 | 323 | 0.95 |
| Qwen3.5-35B-A3B-Q6_K | Q6_K | NVIDIA GeForce RTX 5090 | 8,192 | 303.0 | 310.1 | 689.4 | 49.85 | 172.74 | 310 | 0.98 |
| gemma-4-E2B-it-Q4_K_S | Q4_K_S | Apple M4 Pro | 16,384 | 302.6 | 446.6 | 1089.5 | 89.34 | 218.73 | 32 | 9.46 |
| gemma-4-E2B-it-Q5_K_M | Q5_K_M | Apple M4 Pro | 130,064 | 302.5 | 497.4 | 946.9 | 90.20 | 338.92 | 28 | 10.96 |
| Qwen3.5-9B-Q6_K | Q6_K | NVIDIA GeForce RTX 5060 Ti | 32,768 | 302.0 | 544.9 | 1358.8 | 100.60 | 266.50 | 122 | 2.48 |
| gemma-4-E2B-it-UD-IQ2_M | IQ2_M | Apple M4 Pro | 130,064 | 301.7 | 433.3 | 1023.2 | 87.74 | 202.45 | 29 | 10.26 |
| Qwen3.5-9B-Q5_K_M | Q5_K_M | NVIDIA GeForce RTX 5060 Ti | 32,768 | 300.9 | 479.7 | 1110.5 | 100.29 | 247.31 | 118 | 2.56 |
| Qwen3.5-27B-Q4_0 | Q4_0 | NVIDIA GeForce RTX 5090 | 32,768 | 300.2 | 970.9 | 2738.6 | 98.67 | 490.11 | 278 | 1.08 |
| Qwen3.6-27B-Q5_K_M | Q5_K_M | NVIDIA GeForce RTX 5090 | 16,384 | 299.9 | 934.0 | 2530.4 | 97.40 | 434.12 | 386 | 0.78 |
| Qwen3.5-27B-Q4_1 | Q4_1 | NVIDIA GeForce RTX 5090 | 32,768 | 297.3 | 1498.3 | 4344.9 | 97.56 | 337.04 | 278 | 1.07 |
| Qwen3.6-27B-Q2_K | Q2_K | NVIDIA GeForce RTX 5090 | 130,064 | 297.0 | 887.9 | 3341.8 | 94.69 | 351.55 | 326 | 0.91 |
| Qwen3.5-0.8B-UD-Q8_K_XL | Q8_K_XL | NVIDIA GB10 | 130,064 | 296.9 | 3119.1 | 3564.2 | 13.07 | 49.17 | | |
| Qwen3.6-27B-Q3_K_M | Q3_K_M | NVIDIA GeForce RTX 5090 | 16,384 | 296.5 | 848.3 | 2508.7 | 99.37 | 365.19 | 322 | 0.92 |
| Qwen3.5-2B-UD-IQ2_M | IQ2_M | Apple M4 Pro | 8,192 | 295.7 | 646.2 | 1479.2 | 105.58 | 135.14 | 28 | 10.41 |
| Qwen3.5-2B-UD-IQ3_XXS | IQ3_XXS | NVIDIA GB10 | 65,536 | 295.6 | 141.5 | 273.5 | 12.96 | 15.82 | 60 | 4.93 |
| Qwen3.5-9B-Q8_0 | Q8_0 | NVIDIA GeForce RTX 5060 Ti | 32,768 | 294.7 | 525.0 | 1282.3 | 103.56 | 256.47 | 114 | 2.59 |
| Qwen3.5-2B-UD-Q2_K_XL | Q2_K_XL | NVIDIA GB10 | 130,064 | 294.6 | 146.4 | 289.7 | 13.05 | 15.88 | 60 | 4.95 |
| gemma-4-E2B-it-UD-IQ3_XXS | IQ3_XXS | Apple M4 Pro | 16,384 | 294.3 | 423.8 | 975.9 | 88.55 | 214.32 | 30 | 9.84 |
| Qwen3.5-2B-Q4_0 | Q4_0 | NVIDIA GB10 | 130,064 | 293.1 | 144.4 | 273.4 | 13.13 | 16.00 | 53 | 5.50 |
| Qwen3.5-27B-Q4_K_S | Q4_K_S | NVIDIA GeForce RTX 5090 | 32,768 | 290.4 | 1096.8 | 3548.7 | 101.05 | 422.53 | 285 | 1.02 |
| Qwen3.5-27B-Q4_K_M | Q4_K_M | NVIDIA GeForce RTX 5090 | 16,384 | 288.5 | 1570.4 | 4749.0 | 99.98 | 387.41 | 288 | 1.00 |
| Qwen3.6-27B-Q3_K_L | Q3_K_L | NVIDIA GeForce RTX 5090 | 32,768 | 286.9 | 879.4 | 2487.7 | 101.32 | 394.56 | 326 | 0.88 |
| Qwen3.5-2B-Q3_K_S | Q3_K_S | NVIDIA GB10 | 2,048 | 283.6 | 3472.8 | 3868.9 | 13.56 | 16.52 | 63 | 4.50 |
| Qwen3.5-2B-Q3_K_M | Q3_K_M | NVIDIA GB10 | 130,064 | 283.5 | 143.3 | 283.4 | 13.60 | 16.59 | 60 | 4.69 |
| Qwen3.5-2B-IQ4_NL | IQ4_NL | NVIDIA GB10 | 2,048 | 283.4 | 3319.1 | 3787.1 | 13.75 | 49.96 | 57 | 5.00 |
| Qwen3.5-2B-Q3_K_S | Q3_K_S | Apple M4 Pro | 32,768 | 283.3 | 750.9 | 2701.4 | 108.55 | 185.93 | 28 | 9.97 |
| Qwen3.5-2B-UD-IQ3_XXS | IQ3_XXS | Apple M4 Pro | 32,768 | 282.7 | 735.2 | 3176.0 | 108.56 | 170.36 | 30 | 9.52 |
| Qwen3.5-2B-Q3_K_M | Q3_K_M | Apple M4 Pro | 8,192 | 282.3 | 637.2 | 1456.2 | 108.09 | 181.40 | 29 | 9.80 |
| Qwen3.5-4B-BF16 | BF16 | NVIDIA GeForce RTX 5060 Ti | 32,768 | 281.6 | 728.4 | 2537.1 | 103.87 | 259.15 | 101 | 2.80 |
| Qwen3.5-2B-Q4_1 | Q4_1 | NVIDIA GB10 | 2,048 | 278.9 | 3377.3 | 3809.6 | 13.89 | 48.13 | 55 | 5.04 |
| Qwen3.5-2B-IQ4_NL | IQ4_NL | Apple M4 Pro | 130,064 | 277.7 | 517.5 | 2378.3 | 110.43 | 218.43 | 29 | 9.51 |
Qwen3.5-2B-Q4_0Q4_0Apple M4 Pro8,192277.4557.41308.9109.27184.02299.56
Qwen3.5-2B-Q8_0Q8_0Apple M4 Pro8,192276.3457.8833.3110.18203.17309.30
Qwen3.5-4B-UD-IQ2_XXSIQ2_XXSNVIDIA GeForce RTX 4060 Ti32,768275.9982.02899.3110.56173.71525.27
gemma-4-26B-A4B-it-UD-IQ3_SIQ3_SNVIDIA GB1032,768274.7835.13381.8106.45291.62485.71
Qwen3.5-2B-UD-Q3_K_XLQ3_K_XLNVIDIA GB1065,536274.3146.4278.414.0617.15584.74
Qwen3.5-2B-UD-Q3_K_XLQ3_K_XLApple M4 Pro8,192273.8766.91468.7110.22197.07299.34
Qwen3.5-2B-IQ4_XSIQ4_XSNVIDIA GB108,192273.73464.73917.614.1517.61555.00
Qwen3.5-27B-Q5_K_SQ5_K_SNVIDIA GeForce RTX 509016,384273.41364.73654.7106.41510.923090.88
Qwen3.5-2B-Q4_1Q4_1Apple M4 Pro8,192273.3648.71511.4110.34190.13299.39
Qwen3.5-2B-Q4_K_SQ4_K_SNVIDIA GB108,192273.0114.7230.113.9520.31614.48
Qwen3.5-2B-UD-Q2_K_XLQ2_K_XLApple M4 Pro16,384272.5896.04434.5111.37184.212710.13
Qwen3.5-2B-UD-Q6_K_XLQ6_K_XLApple M4 Pro32,768271.4551.72447.2112.57214.84309.11
Qwen3.5-2B-UD-Q8_K_XLQ8_K_XLApple M4 Pro32,768270.7642.83033.8111.04185.95318.85
Qwen3.6-35B-A3B-UD-Q6_K_XLQ6_K_XLNVIDIA GeForce RTX 50908,192270.3434.21118.455.77206.191571.72
Qwen3.5-2B-IQ4_XSIQ4_XSApple M4 Pro8,192270.3551.01139.8111.81240.63299.29
Qwen3.5-4B-IQ4_XSIQ4_XSNVIDIA GeForce RTX 4060 Ti130,064270.1588.41959.4114.04226.52554.94
Qwen3.5-0.8B-BF16BF16NVIDIA GB102,048269.93229.23895.114.1717.40
Qwen3.5-4B-Q3_K_SQ3_K_SNVIDIA GeForce RTX 4060 Ti32,768269.8543.41774.6113.80189.64564.84
Qwen3.5-4B-Q3_K_MQ3_K_MNVIDIA GeForce RTX 4060 Ti16,384269.1547.61427.7113.54190.35564.84
Qwen3.5-2B-Q6_KQ6_KApple M4 Pro130,064268.9665.73230.7112.55209.33309.11
Qwen3.5-4B-IQ4_NLIQ4_NLNVIDIA GeForce RTX 4060 Ti16,384268.9580.41860.2113.91193.46564.84
gemma-4-26B-A4B-it-UD-IQ3_XXSIQ3_XXSNVIDIA GB1032,768267.81263.95146.6104.10234.57525.18
Qwen3.5-4B-Q4_K_SQ4_K_SNVIDIA GeForce RTX 4060 Ti16,384267.7456.01214.8114.53218.78564.81
Qwen3.5-4B-UD-Q2_K_XLQ2_K_XLNVIDIA GeForce RTX 4060 Ti16,384267.2750.93159.8115.08233.09535.05
Qwen3.5-2B-Q4_K_MQ4_K_MNVIDIA GB10130,064267.13498.24008.214.6152.39644.19
Qwen3.5-4B-Q4_0Q4_0NVIDIA GeForce RTX 4060 Ti16,384266.7666.91625.5114.93223.04574.70
Qwen3.5-4B-Q4_1Q4_1NVIDIA GeForce RTX 4060 Ti32,768266.4895.42686.1114.54214.29564.75
gemma-4-26B-A4B-it-UD-IQ4_NLIQ4_NLNVIDIA GB10130,064266.2773.81881.9112.24332.47485.52
Qwen3.5-4B-Q4_K_MQ4_K_MNVIDIA GeForce RTX 4060 Ti16,384265.4535.81758.4115.47242.33564.76
Qwen3.5-2B-Q4_K_SQ4_K_SApple M4 Pro16,384264.9557.82682.7114.67226.06318.54
Qwen3.5-2B-UD-Q4_K_XLQ4_K_XLApple M4 Pro16,384264.8555.71892.7115.50293.48289.63
Qwen3.5-2B-BF16BF16Apple M4 Pro130,064264.5473.81671.6113.87213.15328.21
Qwen3.5-2B-Q4_K_MQ4_K_MApple M4 Pro8,192264.2548.91266.9114.10202.39299.08
Qwen3.5-2B-Q5_K_SQ5_K_SApple M4 Pro16,384264.1691.73181.6113.74244.53308.95
Qwen3.5-2B-Q5_K_MQ5_K_MApple M4 Pro130,064263.9654.43225.2114.42218.01299.01
Qwen3.5-2B-UD-Q5_K_XLQ5_K_XLApple M4 Pro8,192263.8500.81111.5114.89212.72308.94
gemma-4-26B-A4B-it-MXFP4_MOENVIDIA GB10130,064263.11038.53831.8109.29263.56416.45
Qwen3.5-4B-UD-Q4_K_XLQ4_K_XLNVIDIA GeForce RTX 4060 Ti32,768262.0623.01570.3116.97224.13534.95
gemma-4-26B-A4B-it-UD-IQ4_XSIQ4_XSNVIDIA GB1065,536261.3607.11392.0114.86344.41455.81
Qwen3.5-2B-Q5_K_SQ5_K_SNVIDIA GB108,192260.2120.2238.914.9017.83614.27
Qwen3.5-4B-UD-Q5_K_XLQ5_K_XLNVIDIA GeForce RTX 4060 Ti32,768257.6635.51664.6118.39249.83564.62
Qwen3.5-4B-UD-Q3_K_XLQ3_K_XLNVIDIA GeForce RTX 4060 Ti130,064257.3475.71402.1118.90238.28554.70
Qwen3.5-4B-UD-IQ3_XXSIQ3_XXSNVIDIA GeForce RTX 4060 Ti130,064257.3694.61842.0119.41254.34564.60
Qwen3.5-2B-UD-Q4_K_XLQ4_K_XLNVIDIA GB102,048257.23503.94228.515.0319.10624.17
gemma-4-E4B-it-UD-IQ2_MIQ2_MApple M4 Pro16,384257.11025.72801.7111.74293.50308.49
Qwen3.5-2B-UD-Q5_K_XLQ5_K_XLNVIDIA GB10130,064257.0146.8279.715.0418.01594.37
Qwen3.5-4B-Q5_K_SQ5_K_SNVIDIA GeForce RTX 4060 Ti32,768256.6539.31663.4119.52224.13564.56
Qwen3.5-4B-Q6_KQ6_KNVIDIA GeForce RTX 4060 Ti16,384256.4638.92286.8119.07227.66564.59
Qwen3.5-4B-Q5_K_MQ5_K_MNVIDIA GeForce RTX 4060 Ti16,384256.11030.43185.9118.56201.66584.39
Qwen3.5-9B-UD-IQ3_XXSIQ3_XXSNVIDIA GeForce RTX 5060 Ti8,192253.9399.6833.357.23124.741242.05
Qwen3.5-2B-Q5_K_MQ5_K_MNVIDIA GB10130,064251.23716.24188.415.3554.27634.02
Qwen3.5-9B-UD-IQ2_XXSIQ2_XXSNVIDIA GeForce RTX 4060 Ti16,384248.3696.32251.4124.04222.86584.25
Qwen3.5-4B-Q8_0Q8_0NVIDIA GeForce RTX 4060 Ti16,384247.7657.81904.9122.40197.11564.44
Qwen3.5-9B-UD-Q6_K_XLQ6_K_XLNVIDIA GeForce RTX 5060 Ti16,384244.8628.41410.3124.22308.621192.07
Qwen3.5-2B-Q6_KQ6_KNVIDIA GB10130,064244.4120.2248.615.6618.76564.37
Qwen3.5-27B-Q5_K_MQ5_K_MNVIDIA GeForce RTX 50908,192243.61292.22906.358.49278.243130.78
Qwen3.5-27B-Q6_KQ6_KNVIDIA GeForce RTX 50908,192242.8945.41931.660.25294.063120.78
Qwen3.5-9B-Q3_K_MQ3_K_MNVIDIA GeForce RTX 4060 Ti32,768238.5512.61296.5129.35280.36603.95
Qwen3.5-4B-UD-Q6_K_XLQ6_K_XLNVIDIA GeForce RTX 4060 Ti16,384238.4666.21608.6128.03264.23574.21
Qwen3.5-9B-UD-IQ2_MIQ2_MNVIDIA GeForce RTX 4060 Ti32,768238.4828.33132.9128.59258.15613.93
Qwen3.6-35B-A3B-UD-IQ1_MIQ1_MNVIDIA GeForce RTX 5060 Ti16,384237.9744.12151.2126.94307.521102.16
Qwen3.5-9B-Q4_0Q4_0NVIDIA GeForce RTX 4060 Ti130,064236.7459.51404.2131.59238.66603.96
Qwen3.5-9B-IQ4_XSIQ4_XSNVIDIA GeForce RTX 4060 Ti32,768236.5494.21281.0130.12270.36603.95
Qwen3.5-9B-Q3_K_SQ3_K_SNVIDIA GeForce RTX 4060 Ti32,768236.2510.31294.6130.35274.24603.91
Qwen3.5-9B-Q4_K_SQ4_K_SNVIDIA GeForce RTX 4060 Ti16,384235.9490.61273.3130.20271.94603.91
Qwen3.5-9B-UD-Q2_K_XLQ2_K_XLNVIDIA GeForce RTX 4060 Ti32,768235.1836.12497.8130.47285.04613.83
Qwen3.5-9B-IQ4_NLIQ4_NLNVIDIA GeForce RTX 4060 Ti16,384234.7472.01263.0131.08259.81593.95
gemma-4-E4B-it-Q4_1Q4_1Apple M4 Pro16,384233.3689.01898.5121.80336.82337.07
Qwen3.5-9B-Q4_K_MQ4_K_MNVIDIA GeForce RTX 4060 Ti130,064232.8490.21443.4132.72271.54593.95
Qwen3.5-9B-UD-Q4_K_XLQ4_K_XLNVIDIA GeForce RTX 4060 Ti16,384232.2502.21280.6133.16261.76603.90
gemma-4-E4B-it-Q4_K_MQ4_K_MApple M4 Pro130,064232.2738.81901.0122.50331.87327.35
Qwen3.5-9B-Q4_1Q4_1NVIDIA GeForce RTX 4060 Ti32,768232.1512.11299.6132.36277.63593.95
Qwen3.5-27B.Q4_K_MQ4_K_MNVIDIA GeForce RTX 509016,384231.72417.46723.5128.32161.204240.55
Qwen3.5-35B-A3B-UD-Q6_K_XLQ6_K_XLNVIDIA GeForce RTX 50908,192230.3485.91667.866.01229.162410.95
Qwen3.5-2B-Q8_0Q8_0NVIDIA GB102,048229.6132.4282.716.7719.74504.63
Qwen3.5-27B.Q2_KQ2_KNVIDIA GeForce RTX 50908,192227.91304.83617.764.56101.704410.52
gemma-4-E4B-it-IQ4_NLIQ4_NLApple M4 Pro32,768227.0672.61717.0124.53309.26327.16
Qwen3.5-27B.Q4_K_SQ4_K_SNVIDIA GeForce RTX 509016,384226.92083.35505.4131.17270.214240.53
Qwen3.5-9B-UD-IQ3_XXSIQ3_XXSNVIDIA GeForce RTX 4060 Ti16,384226.3510.01276.1136.63249.74623.67
Qwen3.5-9B-UD-Q3_K_XLQ3_K_XLNVIDIA GeForce RTX 4060 Ti32,768225.8594.71884.3135.93256.25623.62
Qwen3.5-4B-UD-Q8_K_XLQ8_K_XLNVIDIA GeForce RTX 4060 Ti130,064224.8786.72338.9134.76265.09544.17
Qwen3.5-27B.Q3_K_SQ3_K_SNVIDIA GeForce RTX 509016,384224.82622.37334.5132.38136.724340.52
Qwen3.5-9B-UD-Q5_K_XLQ5_K_XLNVIDIA GeForce RTX 4060 Ti32,768224.2598.41606.7138.01271.03613.67
gemma-4-E4B-it-Q8_0Q8_0Apple M4 Pro16,384223.6749.71812.3123.73358.49346.58
Qwen3.5-2B-UD-Q6_K_XLQ6_K_XLNVIDIA GB108,192223.34184.64734.817.3958.86544.13
gemma-4-E4B-it-Q4_K_SQ4_K_SApple M4 Pro32,768223.2715.81865.6127.48321.90327.06
gemma-4-E4B-it-UD-IQ3_XXSIQ3_XXSApple M4 Pro130,064221.4892.32450.8128.67653.84317.14
Qwen3.5-27B.Q3_K_MQ3_K_MNVIDIA GeForce RTX 50908,192220.51419.43565.067.1172.124540.49
Qwen3.5-9B-Q5_K_SQ5_K_SNVIDIA GeForce RTX 4060 Ti32,768220.0556.31406.7139.42265.14623.54
gemma-4-E4B-it-UD-Q2_K_XLQ2_K_XLApple M4 Pro16,384219.9842.42062.2126.81345.92317.19
Qwen3.6-27B-Q6_KQ6_KNVIDIA GeForce RTX 50908,192219.6774.91800.166.63293.953640.60
Qwen3.5-27B-UD-Q4_K_XLQ4_K_XLNVIDIA GeForce RTX 509032,768217.21377.13273.2136.91383.773930.55
Qwen3.5-9B-Q5_K_MQ5_K_MNVIDIA GeForce RTX 4060 Ti16,384216.9483.31312.7142.26265.73623.52
Qwen3.5-27B-UD-IQ3_XXSIQ3_XXSNVIDIA GeForce RTX 509032,768216.6937.62619.0139.10382.943760.58
Qwen3.5-9B-Q6_KQ6_KNVIDIA GeForce RTX 4060 Ti32,768216.3629.61582.9142.40285.49633.46
gemma-4-E4B-it-IQ4_XSIQ4_XSApple M4 Pro130,064215.1708.11776.1129.54338.06326.66
Qwen3.6-35B-A3B-UD-IQ2_XXSIQ2_XXSNVIDIA GeForce RTX 5060 Ti16,384214.91509.75806.7137.40216.851091.97
gemma-4-E4B-it-UD-Q8_K_XLQ8_K_XLApple M4 Pro16,384213.4738.01869.1128.57364.94346.29
Qwen3.5-27B-UD-Q5_K_XLQ5_K_XLNVIDIA GeForce RTX 509016,384212.71096.83034.3139.95353.933810.56
Qwen3.5-27B-UD-Q3_K_XLQ3_K_XLNVIDIA GeForce RTX 509032,768212.4923.02632.6140.61423.043810.56
Qwen3.5-27B-UD-Q2_K_XLQ2_K_XLNVIDIA GeForce RTX 5090130,064211.41087.53201.3141.71381.303700.57
gemma-4-E4B-it-UD-Q4_K_XLQ4_K_XLApple M4 Pro130,064209.3769.01944.1134.90339.98326.60
gemma-4-E4B-it-Q6_KQ6_KApple M4 Pro32,768206.4769.51995.5132.24398.25336.27
Qwen3.5-4B-BF16BF16NVIDIA GeForce RTX 4060 Ti16,384204.8925.22692.3146.69281.40573.58
gemma-4-26B-A4B-it-Q8_0Q8_0NVIDIA GB1065,536204.5908.43043.6145.96481.94405.18
gemma-4-E4B-it-Q5_K_SQ5_K_SApple M4 Pro130,064204.3801.91984.1135.49413.95336.27
gemma-4-E4B-it-UD-Q3_K_XLQ3_K_XLApple M4 Pro32,768203.4874.62225.1134.23374.71326.38
Qwen3.5-9B-Q8_0Q8_0NVIDIA GeForce RTX 4060 Ti16,384203.2684.51609.3149.41298.37593.43
gemma-4-E4B-it-UD-Q6_K_XLQ6_K_XLApple M4 Pro130,064201.9801.61940.5137.42398.91336.19
gemma-4-E4B-it-Q3_K_MQ3_K_MApple M4 Pro32,768200.0833.21942.2136.27382.00326.23
gemma-4-E4B-it-BF16BF16Apple M4 Pro32,768199.0814.51941.9135.18390.12365.47
Qwen3.5-9B-UD-Q6_K_XLQ6_K_XLNVIDIA GeForce RTX 4060 Ti16,384196.8682.81658.9157.66314.24643.06
gemma-4-E4B-it-Q5_K_MQ5_K_MApple M4 Pro16,384196.0773.52053.9141.14421.28335.97
gemma-4-E4B-it-UD-Q5_K_XLQ5_K_XLApple M4 Pro32,768189.0868.22202.7145.51427.08325.89
Qwen3.6-35B-A3B-UD-IQ1_MIQ1_MNVIDIA GeForce RTX 4060 Ti130,064188.1637.31480.7163.67338.87613.07
Qwen3.6-35B-A3B-UD-IQ1_MIQ1_MNVIDIA GB1032,768186.9665.81807.7163.55373.03503.78
gemma-4-E4B-it-Q3_K_SQ3_K_SApple M4 Pro130,064186.7921.82299.5142.25382.12325.85
Qwen3.5-2B-UD-Q8_K_XLQ8_K_XLNVIDIA GB1032,768186.4132.3249.820.4423.77444.23
Qwen3.6-35B-A3B-UD-IQ2_XXSIQ2_XXSNVIDIA GeForce RTX 4060 Ti32,768186.1628.21397.1165.86341.65632.98
Qwen3.6-35B-A3B-UD-IQ2_MIQ2_MNVIDIA GeForce RTX 4060 Ti16,384185.9657.61452.5164.37365.50622.99
Qwen3.6-35B-A3B-UD-Q4_K_SQ4_K_SNVIDIA GeForce RTX 4060 Ti16,384185.0806.01913.3163.88330.62603.07
Qwen3.6-35B-A3B-UD-IQ2_MIQ2_MNVIDIA GB1065,536184.2811.11686.3163.98378.97533.46
Qwen3.6-35B-A3B-UD-Q4_K_MQ4_K_MNVIDIA GeForce RTX 4060 Ti16,384183.2911.92600.1166.34369.06613.00
Qwen3.6-35B-A3B-UD-Q4_K_XLQ4_K_XLNVIDIA GeForce RTX 4060 Ti32,768182.7685.21927.1168.90379.66603.05
Qwen3.6-35B-A3B-UD-IQ2_XXSIQ2_XXSNVIDIA GB1065,536182.7770.11543.3166.13416.75543.40
Qwen3.6-35B-A3B-UD-Q3_K_MQ3_K_MNVIDIA GeForce RTX 4060 Ti32,768182.1746.41929.5168.48343.16613.00
Qwen3.6-35B-A3B-UD-Q2_K_XLQ2_K_XLNVIDIA GeForce RTX 4060 Ti32,768181.5670.61446.2168.81355.93632.89
Qwen3.6-35B-A3B-UD-IQ3_XXSIQ3_XXSNVIDIA GeForce RTX 4060 Ti130,064181.4773.52103.7170.05356.00632.87
Qwen3.6-35B-A3B-UD-Q3_K_XLQ3_K_XLNVIDIA GeForce RTX 4060 Ti130,064180.5722.21653.2168.59378.16612.97
Qwen3.6-35B-A3B-Q8_0Q8_0NVIDIA GeForce RTX 50908,192180.5672.11941.082.78340.001451.25
Qwen3.6-35B-A3B-UD-IQ4_NL_XLIQ4_NL_XLNVIDIA GeForce RTX 4060 Ti32,768180.3766.92134.3170.34345.29612.97
Qwen3.6-35B-A3B-MXFP4_MOENVIDIA GeForce RTX 4060 Ti130,064179.71048.63215.1168.61379.27612.96
Qwen3.5-9B-UD-Q8_K_XLQ8_K_XLNVIDIA GeForce RTX 4060 Ti16,384179.7612.01274.9172.03337.52603.01
Qwen3.6-35B-A3B-UD-IQ2_MIQ2_MNVIDIA GeForce RTX 5060 Ti8,192179.5746.414514.282.66195.51951.89
Qwen3.6-35B-A3B-UD-Q3_K_SQ3_K_SNVIDIA GeForce RTX 4060 Ti16,384179.3667.91519.2170.77344.76622.91
Qwen3.6-35B-A3B-UD-IQ3_SIQ3_SNVIDIA GeForce RTX 4060 Ti130,064178.5650.91628.9171.01385.86652.76
Qwen3.6-35B-A3B-UD-Q5_K_SQ5_K_SNVIDIA GeForce RTX 4060 Ti32,768178.2770.12244.6171.65382.30612.95
Qwen3.6-35B-A3B-UD-IQ4_NLIQ4_NLNVIDIA GeForce RTX 4060 Ti32,768178.0858.32623.8171.82374.43612.91
Qwen3.6-35B-A3B-UD-Q2_K_XLQ2_K_XLNVIDIA GB1032,768177.71221.74127.7168.56325.02543.30
Qwen3.6-35B-A3B-UD-IQ4_XSIQ4_XSNVIDIA GeForce RTX 4060 Ti16,384177.5776.62095.9171.62334.71612.89
Qwen3.6-35B-A3B-UD-IQ3_XXSIQ3_XXSNVIDIA GB10130,064174.61002.33597.4173.23406.04523.38
Qwen3.5-9B-UD-Q8_K_XLQ8_K_XLNVIDIA GeForce RTX 5060 Ti8,192174.5416.4763.387.56218.651121.55
Qwen3.6-35B-A3B-UD-Q3_K_SQ3_K_SNVIDIA GB1032,768174.0988.63132.7173.78393.40493.52
Qwen3.6-35B-A3B-APEX-CompactNVIDIA GB10130,064173.61167.93658.8173.68410.94523.31
Qwen3.6-35B-A3B-APEX-I-MiniNVIDIA GB1065,536172.8883.42585.8174.22403.83513.38
Qwen3.6-35B-A3B-MXFP4_MOENVIDIA GB1032,768172.71148.33648.6173.74389.71463.79
Qwen3.6-35B-A3B-APEX-I-CompactNVIDIA GB1032,768172.31025.83289.6174.29407.67513.40
Qwen3.6-35B-A3B-UD-Q3_K_MQ3_K_MNVIDIA GB1032,768172.1827.72071.1176.50402.27513.39
Qwen3.6-35B-A3B-UD-IQ3_SIQ3_SNVIDIA GB1032,768170.8811.61697.8177.53411.92553.13
Qwen3.6-35B-A3B-UD-Q4_K_SQ4_K_SNVIDIA GB1032,768169.9833.31984.4178.12405.99473.64
Qwen3.6-35B-A3B-UD-Q3_K_XLQ3_K_XLNVIDIA GB1032,768169.5989.23113.1177.47401.13493.46
Qwen3.6-35B-A3B-UD-IQ4_NL_XLIQ4_NL_XLNVIDIA GB1065,536168.7896.51880.5177.84405.94493.42
Qwen3.6-35B-A3B-UD-IQ4_XSIQ4_XSNVIDIA GB1032,768168.51085.03928.4178.01393.31493.44
Qwen3.6-35B-A3B-UD-Q4_K_XLQ4_K_XLNVIDIA GB1065,536168.2883.51784.4180.94433.75483.52
Qwen3.6-35B-A3B-UD-Q4_K_MQ4_K_MNVIDIA GB1032,768166.7860.11794.3182.26415.16493.44
Qwen3.5-9B-UD-IQ2_MIQ2_MNVIDIA GeForce RTX 5060 Ti130,064165.9459.71356.9138.09297.091481.12
Qwen3.6-35B-A3B-APEX-I-QualityQualityNVIDIA GB10130,064165.61066.93407.9183.30436.74513.23
Qwen3.6-35B-A3B-UD-IQ4_NLIQ4_NLNVIDIA GB1065,536164.2842.72067.8182.37419.76503.32
Qwen3.6-35B-A3B-APEX-QualityQualityNVIDIA GB1065,536164.0909.72343.2184.77430.85513.20
Qwen3.5-35B-A3B-Q8_0Q8_0NVIDIA GeForce RTX 509032,768162.5969.53696.293.20183.071770.92
Qwen3.6-35B-A3B-APEX-BalancedBalancedNVIDIA GB1065,536160.81045.72827.2187.71437.93513.13
Qwen3.5-2B-BF16BF16NVIDIA GB1065,536160.3153.7287.124.1827.15413.94
Qwen3.6-35B-A3B-UD-Q5_K_SQ5_K_SNVIDIA GB10130,064158.4942.92562.1190.03433.30463.44
Qwen3.6-35B-A3B-APEX-I-BalancedBalancedNVIDIA GB1065,536157.31179.84036.4189.87437.54513.10
Qwen3.6-35B-A3B-UD-Q5_K_XLQ5_K_XLNVIDIA GB1065,536156.3884.12286.9193.84442.09473.32
Qwen3.6-35B-A3B-UD-Q5_K_MQ5_K_MNVIDIA GB1065,536155.01109.23952.2192.39444.24473.32
Qwen3.5-9B-BF16BF16NVIDIA GeForce RTX 4060 Ti16,384155.0709.91322.1197.24399.90602.58
Qwen3.6-35B-A3B-UD-Q5_K_XLQ5_K_XLNVIDIA GeForce RTX 4060 Ti8,192152.4507.01075.2100.09183.41602.53
Qwen3.6-35B-A3B-UD-Q5_K_MQ5_K_MNVIDIA GeForce RTX 4060 Ti8,192151.8526.01185.0100.02209.87612.51
Qwen3.5-0.8B-Q8_0Q8_0Apple M4 Pro8,192151.542495.849174.526.2047.55
Qwen3.5-4B-UD-IQ2_XXSIQ2_XXSNVIDIA GB1032,768150.1266.0568.025.7130.09692.18
Qwen3.5-0.8B-UD-Q8_K_XLQ8_K_XLApple M4 Pro8,192149.842573.849894.026.4748.96
Qwen3.5-27B-UD-IQ2_XXSIQ2_XXSNVIDIA GeForce RTX 509032,768149.0529.0853.424.8830.573700.40
Qwen3.6-35B-A3B-UD-Q6_KQ6_KNVIDIA GB1065,536148.51188.03671.6201.09469.03483.07
gemma-4-26B-A4B-it-UD-IQ2_XXSIQ2_XXSApple M4 Pro32,768148.22422.715602.3203.08422.19314.86
Qwen3.5-0.8B-UD-IQ2_XXSIQ2_XXSApple M4 Pro32,768146.9118.2411.226.8629.21
Qwen3.5-0.8B-UD-IQ3_XXSIQ3_XXSApple M4 Pro32,768146.8118.0405.326.8828.99
Qwen3.5-0.8B-IQ4_NLIQ4_NLApple M4 Pro16,384146.0119.6409.126.8829.07
Qwen3.6-35B-A3B-UD-Q6_K_XLQ6_K_XLNVIDIA GB1065,536146.01184.83345.7204.44473.44473.12
gemma-4-26B-A4B-it-UD-IQ2_MIQ2_MApple M4 Pro130,064146.02321.613433.6203.12386.16324.62
Qwen3.5-0.8B-Q4_1Q4_1Apple M4 Pro8,192144.8117.2407.127.2629.32
Qwen3.5-0.8B-UD-IQ2_MIQ2_MApple M4 Pro8,192144.06872.17849.127.4229.74265.52
Qwen3.5-0.8B-Q4_0Q4_0Apple M4 Pro32,768144.019735.022433.127.5047.19
Qwen3.5-9B-UD-IQ2_XXSIQ2_XXSNVIDIA GB10130,064143.4607.81495.5215.76386.47542.67
Qwen3.5-0.8B-UD-Q2_K_XLQ2_K_XLApple M4 Pro130,064143.0119.9415.427.6029.68
Qwen3.5-4B-UD-IQ2_MIQ2_MNVIDIA GB1065,536142.3263.0600.626.7831.44702.04
Qwen3.6-35B-A3B-Q8_0Q8_0NVIDIA GB1065,536141.71125.33109.1210.87498.27443.20
Qwen3.5-0.8B-IQ4_XSIQ4_XSApple M4 Pro130,064141.4106.7346.627.7129.83
Qwen3.5-0.8B-UD-Q6_K_XLQ6_K_XLApple M4 Pro16,384141.4112.7366.627.2729.65
gemma-4-26B-A4B-it-UD-Q3_K_MQ3_K_MApple M4 Pro32,768141.41747.89828.5212.13505.26324.38
Qwen3.5-9B-UD-IQ2_MIQ2_MNVIDIA GB102,048141.3837.41476.6216.46321.79562.53
Qwen3.5-0.8B-Q4_K_SQ4_K_SApple M4 Pro8,192141.2115.9342.727.4229.55
Qwen3.5-9B-IQ4_XSIQ4_XSNVIDIA GB102,048140.9625.91447.2220.36391.25542.60
Qwen3.5-9B-Q4_0Q4_0NVIDIA GB108,192140.4645.11469.9219.86421.54542.61
Qwen3.5-0.8B-UD-Q3_K_XLQ3_K_XLApple M4 Pro16,384140.419989.522869.528.1630.39
Qwen3.5-9B-Q3_K_SQ3_K_SNVIDIA GB102,048139.8722.41910.5220.24393.59562.49
Qwen3.5-9B-Q3_K_MQ3_K_MNVIDIA GB102,048139.8610.61472.5221.07445.21552.55
Qwen3.5-0.8B-UD-Q4_K_XLQ4_K_XLApple M4 Pro8,192139.894.2241.527.8630.38
Qwen3.5-9B-Q4_1Q4_1NVIDIA GB102,048139.4609.51296.0221.53406.28522.68
Qwen3.5-9B-IQ4_NLIQ4_NLNVIDIA GB108,192139.3647.61457.3220.74411.77552.55
Qwen3.5-0.8B-Q3_K_MQ3_K_MApple M4 Pro8,192139.36756.88028.928.1048.73
Qwen3.5-27B-UD-IQ2_MIQ2_MNVIDIA GeForce RTX 509016,384139.346973.552639.827.85164.104080.34
Qwen3.6-35B-A3B-UD-Q8_K_XLQ8_K_XLNVIDIA GB1065,536138.91169.84154.2214.65498.22433.25
gemma-4-26B-A4B-it-UD-Q4_K_SQ4_K_SApple M4 Pro16,384138.81940.510591.8216.85500.12324.38
gemma-4-26B-A4B-it-UD-Q3_K_XLQ3_K_XLApple M4 Pro130,064138.71755.99140.4216.48637.48334.27
Qwen3.5-0.8B-Q4_K_MQ4_K_MApple M4 Pro130,064138.546912.253959.128.6047.80
Qwen3.5-9B-UD-Q2_K_XLQ2_K_XLNVIDIA GB1032,768138.3903.33314.7223.26388.25552.53
gemma-4-26B-A4B-it-UD-Q4_K_MQ4_K_MApple M4 Pro130,064138.31120.94029.3219.70740.99344.08
Qwen3.5-9B-Q4_K_SQ4_K_SNVIDIA GB102,048138.2776.92060.6221.14455.83552.51
Qwen3.5-4B-UD-IQ3_XXSIQ3_XXSNVIDIA GB108,192137.26850.77897.428.3333.42721.91
gemma-4-26B-A4B-it-UD-IQ3_XXSIQ3_XXSApple M4 Pro130,064136.81901.310078.3215.73529.76324.30
Qwen3.5-0.8B-Q5_K_SQ5_K_SApple M4 Pro16,384136.447723.954522.529.1150.06
Qwen3.5-9B-Q4_K_MQ4_K_MNVIDIA GB102,048136.2672.01488.8225.51430.90542.53
Qwen3.5-0.8B-Q3_K_SQ3_K_SApple M4 Pro130,064136.0122.9424.229.0431.15
Qwen3.5-0.8B-Q5_K_MQ5_K_MApple M4 Pro8,192135.298.8245.228.8931.18
gemma-4-26B-A4B-it-UD-IQ4_NLIQ4_NLApple M4 Pro130,064135.01070.03874.7221.06720.57324.23
gemma-4-26B-A4B-it-UD-Q4_K_XLQ4_K_XLApple M4 Pro16,384134.91450.87524.6223.81675.05314.31
Qwen3.5-4B-UD-Q2_K_XLQ2_K_XLNVIDIA GB10130,064134.87055.98040.828.6933.93701.91
Qwen3.5-9B-UD-IQ3_XXSIQ3_XXSNVIDIA GB1065,536134.5712.62137.5230.06420.25562.42
Qwen3.5-9B-UD-Q3_K_XLQ3_K_XLNVIDIA GB10130,064134.5681.92237.6228.86423.40552.47
gemma-4-26B-A4B-it-MXFP4_MOEApple M4 Pro130,064134.31795.710368.5215.53513.84334.02
gemma-4-26B-A4B-it-UD-Q2_K_XLQ2_K_XLApple M4 Pro16,384134.31235.44891.6224.53723.87314.33
Qwen3.5-0.8B-Q6_KQ6_KApple M4 Pro8,192134.248291.255414.229.5849.11
Qwen3.5-0.8B-UD-Q5_K_XLQ5_K_XLApple M4 Pro130,064134.07011.08085.429.3231.40
Qwen3.5-4B-Q4_0Q4_0NVIDIA GB10130,064133.8194.3336.129.0841.59592.26
Qwen3.5-9B-UD-Q4_K_XLQ4_K_XLNVIDIA GB10130,064133.8641.61766.6229.11429.58532.53
gemma-4-26B-A4B-it-UD-IQ4_XSIQ4_XSApple M4 Pro16,384132.9958.92118.6223.10745.25324.14
Qwen3.5-4B-IQ4_NLIQ4_NLNVIDIA GB108,192132.97132.18047.029.4295.31642.08
gemma-4-26B-A4B-it-UD-IQ3_SIQ3_SApple M4 Pro130,064132.61878.09793.5219.79585.76324.14
Qwen3.5-4B-Q4_1Q4_1NVIDIA GB108,192132.17348.08227.929.4533.88622.12
gemma-4-26B-A4B-it-Q8_0Q8_0Apple M4 Pro32,768131.91157.92880.3230.64782.27343.88
Qwen3.5-9B-Q5_K_MQ5_K_MNVIDIA GB102,048130.9681.51554.3233.66431.81542.41
Qwen3.5-4B-IQ4_XSIQ4_XSNVIDIA GB1065,536130.47197.38260.129.9588.78622.09
Qwen3.5-9B-UD-Q5_K_XLQ5_K_XLNVIDIA GB1065,536130.4713.51948.8235.10496.52542.41
Qwen3.5-9B-Q5_K_SQ5_K_SNVIDIA GB1065,536130.1717.51894.9237.05481.74542.40
Qwen3.5-4B-Q3_K_SQ3_K_SNVIDIA GB108,192129.97232.28279.029.9535.74701.86
gemma-4-26B-A4B-it-UD-Q6_K_XLQ6_K_XLApple M4 Pro16,384129.81216.44131.1236.66794.69333.99
Qwen3.5-4B-Q3_K_MQ3_K_MNVIDIA GB1065,536129.0219.0504.429.9134.52671.92
Qwen3.5-9B-Q6_KQ6_KNVIDIA GB102,048128.5685.71326.4237.46482.46542.40
Qwen3.5-4B-UD-Q3_K_XLQ3_K_XLNVIDIA GB102,048127.8258.5584.730.2134.25681.87
gemma-4-26B-A4B-it-UD-Q6_KQ6_KApple M4 Pro130,064127.71333.04945.6239.58807.02323.98
gemma-4-26B-A4B-it-UD-Q5_K_XLQ5_K_XLApple M4 Pro32,768126.21383.34340.6241.24825.33353.64
Qwen3.5-4B-Q4_K_SQ4_K_SNVIDIA GB1032,768125.87239.38548.331.0273.27721.74
gemma-4-26B-A4B-it-UD-Q5_K_SQ5_K_SApple M4 Pro32,768125.31136.63603.2244.55815.21323.89
Qwen3.5-9B-UD-Q3_K_XLQ3_K_XLNVIDIA GeForce RTX 5060 Ti16,384125.2448.81322.5141.75313.991500.84
Qwen3.5-0.8B-BF16BF16Apple M4 Pro8,192124.052224.859107.632.0253.91
Qwen3.5-9B-Q8_0Q8_0NVIDIA GB102,048123.9845.72047.5244.01482.00522.36
Qwen3.5-4B-Q4_K_MQ4_K_MNVIDIA GB1032,768122.522965.225664.732.1537.88721.71
gemma-4-26B-A4B-it-UD-Q5_K_MQ5_K_MApple M4 Pro16,384121.11487.76119.2247.70833.04323.77
Qwen3.5-4B-Q5_K_SQ5_K_SNVIDIA GB102,048120.17770.28913.332.6299.59721.67
Qwen3.5-4B-UD-Q4_K_XLQ4_K_XLNVIDIA GB108,192119.97842.48875.832.27100.00691.74
gemma-4-26B-A4B-it-UD-Q8_K_XLQ8_K_XLApple M4 Pro32,768119.61219.92314.6256.23851.36304.03
Qwen3.5-4B-Q5_K_MQ5_K_MNVIDIA GB102,048118.78175.79068.832.4736.62711.66
Qwen3.5-4B-UD-Q5_K_XLQ5_K_XLNVIDIA GB10130,064116.08251.09253.333.60104.30681.70
Qwen3.6-35B-A3B-UD-Q6_KQ6_KNVIDIA GeForce RTX 4060 Ti8,192113.7768.92171.1132.59289.80542.11
Qwen3.5-4B-Q6_KQ6_KNVIDIA GB108,192111.98514.49515.334.7939.73661.70
Qwen3.5-27B-Q8_0Q8_0NVIDIA GeForce RTX 50908,192106.11066.02154.8142.24462.011680.63
Qwen3.5-4B-Q8_0Q8_0NVIDIA GB102,048105.0261.9631.836.9841.31551.90
Qwen3.5-4B-UD-Q6_K_XLQ6_K_XLNVIDIA GB1032,768103.3228.6385.837.6957.61601.72
Qwen3.5-9B-UD-IQ2_XXSIQ2_XXSApple M4 Pro8,192102.72276.46056.8290.91592.74333.09
Qwen3.5-9B-Q8_0Q8_0Apple M4 Pro8,192102.01488.63216.9295.60785.92362.86
Qwen3.5-9B-UD-Q2_K_XLQ2_K_XLNVIDIA GeForce RTX 5060 Ti130,064102.0499.51391.4166.73406.591510.68
Qwen3.5-9B-Q3_K_MQ3_K_MApple M4 Pro8,192100.01768.03606.8303.67959.69332.99
Qwen3.5-35B-A3B-UD-Q8_K_XLQ8_K_XLNVIDIA GeForce RTX 50908,19299.31493.87358.5152.26266.401400.71
Qwen3.6-35B-A3B-APEX-I-CompactNVIDIA GeForce RTX 509032,76898.7200.1467.79.3512.711590.62
Qwen3.6-35B-A3B-APEX-CompactNVIDIA GeForce RTX 509032,76898.6200.6467.69.3912.561590.62
Qwen3.5-9B-IQ4_NLIQ4_NLApple M4 Pro8,19298.61501.23516.8311.15727.10342.87
Qwen3.5-9B-UD-IQ3_XXSIQ3_XXSApple M4 Pro32,76898.31915.66182.4310.771069.33333.00
Qwen3.5-9B-Q4_1Q4_1Apple M4 Pro16,38497.91533.34295.7315.69921.00342.85
Qwen3.5-9B-Q4_0Q4_0Apple M4 Pro8,19297.01444.83339.3316.65723.35352.80
Qwen3.5-9B-Q4_K_SQ4_K_SApple M4 Pro8,19297.01748.33977.9313.36776.21352.80
Qwen3.5-9B-UD-Q4_K_XLQ4_K_XLApple M4 Pro8,19296.61868.74002.7312.44907.41342.88
Qwen3.6-35B-A3B-APEX-I-MiniNVIDIA GeForce RTX 509032,76896.5197.7406.39.6312.451540.63
Qwen3.5-9B-UD-Q8_K_XLQ8_K_XLApple M4 Pro32,76896.31507.03529.3314.65994.82362.66
Qwen3.5-9B-Q6_KQ6_KApple M4 Pro8,19296.21657.13401.7317.58853.15352.72
Qwen3.5-9B-UD-Q6_K_XLQ6_K_XLApple M4 Pro8,19295.91533.03547.3316.34811.18352.71
Qwen3.5-9B-IQ4_XSIQ4_XSApple M4 Pro8,19295.81575.13298.7319.50860.72342.81
Qwen3.5-9B-UD-IQ2_MIQ2_MApple M4 Pro130,06495.61980.38685.5317.911027.83332.86
Qwen3.5-9B-UD-Q3_K_XLQ3_K_XLApple M4 Pro8,19295.41729.63604.4316.21838.64342.84
Qwen3.5-9B-BF16BF16Apple M4 Pro130,06495.21757.26149.1317.06993.41372.55
Qwen3.5-9B-Q3_K_SQ3_K_SApple M4 Pro8,19295.01827.94503.2316.06938.13332.85
Qwen3.5-9B-UD-Q6_K_XLQ6_K_XLNVIDIA GB1032,76894.8905.02354.9326.09569.35472.00
Qwen3.5-9B-UD-Q5_K_XLQ5_K_XLApple M4 Pro16,38494.81792.95849.1321.741133.23352.71
Qwen3.5-9B-UD-Q2_K_XLQ2_K_XLApple M4 Pro8,19294.51986.35022.6319.19784.08332.83
Qwen3.5-9B-Q5_K_MQ5_K_MApple M4 Pro8,19293.31767.43987.6327.35850.66342.75
Qwen3.5-9B-Q5_K_SQ5_K_SApple M4 Pro8,19293.21803.43868.8328.35991.41332.82
Qwen3.5-9B-Q4_K_MQ4_K_MApple M4 Pro32,76892.81718.66038.4331.241127.39342.77
Qwen3.6-35B-A3B-UD-Q6_K_XLQ6_K_XLNVIDIA GeForce RTX 4060 Ti8,19292.5810.92237.2163.72528.50481.95
Qwen3.5-9B-UD-Q8_K_XLQ8_K_XLNVIDIA GB1032,76890.0986.32314.6344.00719.49471.90
Qwen3.5-4B-UD-Q8_K_XLQ8_K_XLNVIDIA GB10130,06488.010487.012018.844.39121.85511.73
Qwen3.6-35B-A3B-UD-IQ4_NLIQ4_NLApple M4 Pro8,19286.6895.93428.6175.22278.37362.43
Qwen3.6-35B-A3B-UD-Q3_K_MQ3_K_MApple M4 Pro8,19286.3776.71931.7177.34361.02352.48
Qwen3.6-35B-A3B-UD-IQ4_NL_XLIQ4_NL_XLApple M4 Pro8,19285.5725.01753.7179.65385.76362.41
Qwen3.6-35B-A3B-UD-Q3_K_XLQ3_K_XLApple M4 Pro8,19285.4978.43153.6177.65385.27352.46
Qwen3.6-35B-A3B-UD-IQ4_XSIQ4_XSApple M4 Pro8,19285.3782.82088.6177.23300.55362.40
Qwen3.6-35B-A3B-UD-Q4_K_SQ4_K_SApple M4 Pro8,19284.9830.52156.0177.51292.29352.43
Qwen3.6-27B-IQ4_XSIQ4_XSNVIDIA GB1032,76884.62182.65730.7351.24871.88561.51
Qwen3.6-35B-A3B-UD-Q3_K_SQ3_K_SApple M4 Pro8,19284.3702.32068.2183.34416.05352.39
Qwen3.6-35B-A3B-APEX-QualityQualityNVIDIA GeForce RTX 509032,76884.2232.1581.011.0114.581520.55
Qwen3.6-35B-A3B-APEX-I-QualityQualityNVIDIA GeForce RTX 50908,19284.2231.5575.411.0214.671520.56
Qwen3.6-35B-A3B-UD-IQ2_MIQ2_MApple M4 Pro8,19284.01008.44504.0182.36363.62342.50
Qwen3.6-35B-A3B-UD-IQ2_XXSIQ2_XXSApple M4 Pro8,19283.2770.52187.0185.21473.13332.51
Qwen3.6-35B-A3B-UD-IQ3_XXSIQ3_XXSApple M4 Pro8,19283.01079.73460.1182.93416.71342.46
Qwen3.5-9B-BF16BF16NVIDIA GB1065,53682.7959.61997.9369.07760.01461.79
Qwen3.6-35B-A3B-UD-IQ1_MIQ1_MApple M4 Pro8,19282.3890.13093.4186.29455.39342.44
Qwen3.6-35B-A3B-UD-IQ3_SIQ3_SApple M4 Pro8,19282.1853.62260.5188.43467.04352.37
Qwen3.6-35B-A3B-UD-Q2_K_XLQ2_K_XLApple M4 Pro32,76881.61154.82142.9380.33881.87342.42
Qwen3.6-27B-Q3_K_SQ3_K_SNVIDIA GB1065,53681.42445.28520.1362.65888.33611.33
Qwen3.6-27B-Q2_KQ2_KNVIDIA GB1065,53681.12006.54659.3353.68893.19621.31
Qwen3.6-27B-Q3_K_LQ3_K_LNVIDIA GB1065,53680.42699.510574.5368.90952.70611.32
Qwen3.6-27B-Q3_K_MQ3_K_MNVIDIA GB1065,53680.32890.210165.3366.871092.69611.32
Qwen3.6-35B-A3B-UD-Q6_KQ6_KApple M4 Pro8,19279.91038.03067.0190.64383.86362.21
Qwen3.6-35B-A3B-MXFP4_MOEApple M4 Pro130,06479.61425.85483.3385.23882.65352.26
Qwen3.6-27B-Q4_K_SQ4_K_SNVIDIA GB10130,06479.62343.97674.4363.45872.45541.46
Qwen3.6-35B-A3B-UD-Q4_K_XLQ4_K_XLApple M4 Pro130,06479.51555.15484.8385.70891.42352.30
Qwen3.6-35B-A3B-UD-Q4_K_MQ4_K_MApple M4 Pro8,19279.41045.73042.3191.23339.58352.28
Qwen3.6-35B-A3B-Q8_0Q8_0Apple M4 Pro16,38478.61652.24598.5389.37916.73352.28
Qwen3.6-35B-A3B-UD-Q6_K_XLQ6_K_XLApple M4 Pro16,38478.51624.14927.4390.48926.09352.22
Qwen3.6-35B-A3B-APEX-BalancedBalancedNVIDIA GeForce RTX 509032,76878.5250.6626.811.8115.211500.52
Qwen3.6-27B-Q4_K_MQ4_K_MNVIDIA GB1032,76878.42138.14717.7375.03973.18561.41
Qwen3.6-35B-A3B-APEX-I-BalancedBalancedNVIDIA GeForce RTX 509032,76878.1247.3630.111.8715.281500.52
Qwen3.6-35B-A3B-UD-Q5_K_MQ5_K_MApple M4 Pro8,19277.5831.11774.2199.37437.98362.14
Qwen3.6-35B-A3B-UD-Q5_K_XLQ5_K_XLApple M4 Pro32,76876.91650.35907.9397.05917.48352.19
Qwen3.6-35B-A3B-UD-Q5_K_SQ5_K_SApple M4 Pro16,38475.62124.59159.2405.05952.74342.22
Qwen3.6-35B-A3B-UD-Q8_K_XLQ8_K_XLApple M4 Pro130,06473.52541.310081.5410.74993.91
Qwen3.5-4B-BF16BF16NVIDIA GB10130,06472.5304.9557.553.6357.62441.65
Qwen3.6-27B-Q8_0Q8_0NVIDIA GeForce RTX 50908,19272.21879.83967.1198.94544.611390.52
Qwen3.6-27B-Q5_K_MQ5_K_MNVIDIA GB1032,76871.22497.38374.4410.53978.79541.32
Qwen3.5-4B-Q8_0Q8_0Apple M4 Pro8,19270.891662.9102865.155.7297.24381.88
Qwen3.5-4B-UD-Q8_K_XLQ8_K_XLApple M4 Pro8,19268.240647.746032.257.67110.83381.80
Qwen3.5-4B-UD-IQ2_XXSIQ2_XXSApple M4 Pro130,06467.2415.01735.658.1359.74341.96
Qwen3.5-4B-Q4_1Q4_1Apple M4 Pro130,06466.542444.647515.959.3194.11351.93
Qwen3.5-4B-Q4_0Q4_0Apple M4 Pro8,19266.342796.347621.759.4792.92351.92
Qwen3.6-35B-A3B-Q8_0Q8_0NVIDIA GeForce RTX 4060 Ti8,19264.71345.04905.5232.17587.06421.56
Qwen3.5-9B-BF16BF16NVIDIA GeForce RTX 5060 Ti8,19264.71319.13324.4237.37752.84501.30
Qwen3.5-4B-UD-IQ2_MIQ2_MApple M4 Pro16,38463.9101118.4114129.961.98102.93361.79
Qwen3.5-4B-UD-IQ3_XXS | IQ3_XXS | Apple M4 Pro | 8,192 | 63.2 | 102853.0 | 115371.0 | 62.62 | 102.17 | 36 | 1.77
Qwen3.5-4B-Q4_K_S | Q4_K_S | Apple M4 Pro | 32,768 | 62.1 | 269.9 | 982.9 | 30.86 | 38.52 | 37 | 1.67
Qwen3.5-4B-IQ4_XS | IQ4_XS | Apple M4 Pro | 130,064 | 61.2 | 298.8 | 691.6 | 63.53 | 99.88 | 36 | 1.71
Qwen3.5-4B-UD-Q2_K_XL | Q2_K_XL | Apple M4 Pro | 130,064 | 60.8 | 426.8 | 1782.3 | 64.36 | 65.87 | 33 | 1.84
Qwen3.6-35B-A3B-UD-Q2_K_XL | Q2_K_XL | NVIDIA GeForce RTX 5060 Ti | 32,768 | 60.6 | 247.0 | 642.1 | 15.57 | 18.04 | 69 | 0.88
Qwen3.5-4B-UD-Q3_K_XL | Q3_K_XL | Apple M4 Pro | 130,064 | 60.3 | 105937.7 | 120731.8 | 65.56 | 104.19 | 35 | 1.75
Qwen3.5-4B-Q3_K_M | Q3_K_M | Apple M4 Pro | 32,768 | 60.2 | 107140.8 | 121084.9 | 65.76 | 107.02 | 34 | 1.80
Qwen3.5-4B-Q4_K_M | Q4_K_M | Apple M4 Pro | 130,064 | 60.1 | 109129.1 | 121271.3 | 65.85 | 100.23 | 34 | 1.77
Qwen3.5-4B-UD-Q4_K_XL | Q4_K_XL | Apple M4 Pro | 16,384 | 59.8 | 111004.3 | 121887.0 | 66.19 | 105.68 | 34 | 1.76
Qwen3.5-4B-Q3_K_S | Q3_K_S | Apple M4 Pro | 16,384 | 58.7 | 111911.0 | 124051.4 | 67.30 | 117.02 | 33 | 1.76
Qwen3.5-4B-UD-Q6_K_XL | Q6_K_XL | Apple M4 Pro | 32,768 | 58.2 | 111881.1 | 125038.3 | 68.00 | 110.23 | 36 | 1.62
Qwen3.6-35B-A3B-UD-IQ3_XXS | IQ3_XXS | NVIDIA GeForce RTX 5060 Ti | 32,768 | 58.2 | 245.8 | 650.4 | 16.21 | 18.09 | 75 | 0.78
Qwen3.5-4B-Q5_K_S | Q5_K_S | Apple M4 Pro | 130,064 | 57.8 | 113006.4 | 126019.9 | 68.32 | 122.07 | 34 | 1.69
DeepSeek-R1-Distill-Qwen-32B-Q8_0 | Q8_0 | NVIDIA GeForce RTX 5090 | 8,192 | 56.5 | 1640.1 | 3964.0 | 251.23 | 626.67 | 119 | 0.48
Qwen3.5-4B-Q5_K_M | Q5_K_M | Apple M4 Pro | 130,064 | 56.4 | 115470.8 | 129146.4 | 70.16 | 119.78 | 34 | 1.65
Qwen3.5-4B-UD-Q5_K_XL | Q5_K_XL | Apple M4 Pro | 130,064 | 56.3 | 50872.8 | 56228.1 | 70.35 | 113.69 | 34 | 1.66
Qwen3.6-35B-A3B-UD-IQ3_S | IQ3_S | NVIDIA GeForce RTX 5060 Ti | 16,384 | 55.8 | 265.7 | 682.0 | 16.95 | 19.13 | 74 | 0.75
Qwen3.5-27B-UD-Q6_K_XL | Q6_K_XL | NVIDIA GeForce RTX 5090 | 8,192 | 55.3 | 2606.2 | 4481.2 | 542.39 | 1517.24 | 157 | 0.35
Qwen3.6-35B-A3B-UD-Q8_K_XL | Q8_K_XL | NVIDIA GeForce RTX 4060 Ti | 8,192 | 54.9 | 1703.9 | 6414.9 | 276.14 | 591.31 | 39 | 1.39
Qwen3.6-27B-Q5_K_S | Q5_K_S | NVIDIA GB10 | 8,192 | 54.4 | 1674.5 | 3510.2 | 259.15 | 574.89 | 53 | 1.02
Qwen3.5-4B-Q6_K | Q6_K | Apple M4 Pro | 16,384 | 53.7 | 298.9 | 1022.5 | 36.23 | 38.60 | 38 | 1.42
Qwen3.5-4B-IQ4_NL | IQ4_NL | Apple M4 Pro | 8,192 | 50.5 | 424.5 | 1612.6 | 76.93 | 79.88 | 34 | 1.48
Qwen3.5-27B-UD-IQ2_XXS | IQ2_XXS | NVIDIA GB10 | 65,536 | 50.5 | 3412.5 | 16283.0 | 614.16 | 840.26 | 59 | 0.86
Qwen3.5-27B-IQ4_NL | IQ4_NL | NVIDIA GB10 | 130,064 | 48.8 | 2468.5 | 5767.8 | 624.14 | 1203.60 | 59 | 0.82
Qwen3.5-27B-Q4_0 | Q4_0 | NVIDIA GB10 | 32,768 | 48.6 | 2665.5 | 8717.5 | 629.31 | 1157.56 | 59 | 0.82
Qwen3.5-27B-IQ4_XS | IQ4_XS | NVIDIA GB10 | 32,768 | 48.6 | 3167.8 | 11401.0 | 626.64 | 1095.45 | 59 | 0.82
Qwen3.5-27B-Q3_K_S | Q3_K_S | NVIDIA GB10 | 65,536 | 48.1 | 2579.4 | 8242.8 | 634.82 | 1283.38 | 61 | 0.79
Qwen3.5-27B-Q3_K_M | Q3_K_M | NVIDIA GB10 | 32,768 | 47.9 | 2118.6 | 4828.6 | 630.30 | 1289.45 | 59 | 0.81
Qwen3.5-27B.Q8_0 | Q8_0 | NVIDIA GeForce RTX 5090 | 8,192 | 47.2 | 2007.1 | 4736.8 | 330.05 | 438.24 | 146 | 0.32
Qwen3.5-27B-Q4_1 | Q4_1 | NVIDIA GB10 | 32,768 | 46.9 | 2605.2 | 9689.9 | 655.03 | 1183.81 | 56 | 0.83
Qwen3.5-27B-UD-IQ2_M | IQ2_M | NVIDIA GB10 | 130,064 | 46.8 | 3119.1 | 9485.0 | 651.32 | 1235.57 | 61 | 0.77
Qwen3.5-27B-Q4_K_S | Q4_K_S | NVIDIA GB10 | 130,064 | 46.0 | 1828.5 | 8177.7 | 664.45 | 1369.03 | 56 | 0.82
Qwen3.5-27B-UD-Q2_K_XL | Q2_K_XL | NVIDIA GB10 | 65,536 | 46.0 | 3259.2 | 14575.0 | 660.29 | 1064.35 | 59 | 0.79
Qwen3.5-27B-Q4_K_M | Q4_K_M | NVIDIA GB10 | 130,064 | 45.8 | 1907.6 | 4815.3 | 669.62 | 1300.67 | 57 | 0.80
Qwen3.5-4B-BF16 | BF16 | Apple M4 Pro | 16,384 | 45.0 | 20415.0 | 25350.6 | 87.84 | 91.96 | 41 | 1.09
Qwen3.5-27B-UD-IQ3_XXS | IQ3_XXS | NVIDIA GB10 | 65,536 | 43.2 | 2535.3 | 7640.9 | 710.41 | 1298.77 | 58 | 0.74
Qwen3.5-27B-UD-Q3_K_XL | Q3_K_XL | NVIDIA GB10 | 65,536 | 42.0 | 2992.7 | 9410.5 | 723.99 | 1260.95 | 58 | 0.72
Qwen3.5-27B-Q5_K_S | Q5_K_S | NVIDIA GB10 | 32,768 | 41.6 | 2874.5 | 9020.0 | 730.72 | 1411.63 | 57 | 0.73
Qwen3.5-27B-Q5_K_M | Q5_K_M | NVIDIA GB10 | 32,768 | 40.5 | 3218.5 | 11798.4 | 751.70 | 1433.91 | 58 | 0.70
Qwen3.5-27B-UD-Q4_K_XL | Q4_K_XL | NVIDIA GB10 | 32,768 | 36.8 | 2869.8 | 9114.7 | 834.67 | 1491.65 | 52 | 0.71
Qwen3.5-27B-UD-Q5_K_XL | Q5_K_XL | NVIDIA GB10 | 65,536 | 35.9 | 2655.0 | 6776.1 | 851.26 | 1545.75 | 52 | 0.69
Qwen3.5-27B-Q6_K | Q6_K | NVIDIA GB10 | 65,536 | 35.3 | 2673.0 | 6067.8 | 868.06 | 1664.08 | 52 | 0.68
Qwen3.5-27B-UD-IQ2_XXS | IQ2_XXS | Apple M4 Pro | 16,384 | 34.7 | 14236.9 | 80386.7 | 849.96 | 1764.87 | 34 | 1.03
Qwen3.5-27B-Q8_0 | Q8_0 | NVIDIA GB10 | 32,768 | 34.0 | 3047.7 | 10803.9 | 898.77 | 1723.05 | 49 | 0.69
Qwen3.5-27B-Q8_0 | Q8_0 | Apple M4 Pro | 32,768 | 33.2 | 6858.4 | 30413.1 | 907.77 | 3148.99 | 36 | 0.93
Qwen3.5-27B-UD-IQ3_XXS | IQ3_XXS | Apple M4 Pro | 32,768 | 32.2 | 10089.5 | 56781.7 | 922.32 | 2570.41 | 34 | 0.95
Qwen3.5-27B-Q4_1 | Q4_1 | Apple M4 Pro | 16,384 | 32.1 | 10049.3 | 59686.7 | 931.06 | 2120.49 | 35 | 0.92
Qwen3.5-27B-Q3_K_M | Q3_K_M | Apple M4 Pro | 32,768 | 32.0 | 6672.6 | 29383.9 | 936.93 | 3794.48 | 35 | 0.92
Qwen3.5-27B-UD-Q4_K_XL | Q4_K_XL | Apple M4 Pro | 32,768 | 32.0 | 10363.3 | 56049.7 | 930.34 | 2724.96 | 33 | 0.96
Qwen3.5-27B-Q6_K | Q6_K | Apple M4 Pro | 130,064 | 31.9 | 7812.6 | 39473.6 | 938.05 | 3710.58 | 35 | 0.90
Qwen3.5-27B-Q4_0 | Q4_0 | Apple M4 Pro | 32,768 | 31.9 | 6838.4 | 34624.7 | 949.33 | 2978.34 | 35 | 0.91
Qwen3.5-27B-IQ4_NL | IQ4_NL | Apple M4 Pro | 130,064 | 31.7 | 6917.9 | 39969.3 | 954.36 | 3147.05 | 34 | 0.93
Qwen3.5-27B-IQ4_XS | IQ4_XS | Apple M4 Pro | 130,064 | 31.4 | 6686.3 | 30170.3 | 964.56 | 3146.97 | 35 | 0.89
Qwen3.5-27B-UD-Q5_K_XL | Q5_K_XL | Apple M4 Pro | 16,384 | 31.3 | 8919.1 | 57197.1 | 948.79 | 3052.71 | 34 | 0.93
Qwen3.5-27B-Q3_K_S | Q3_K_S | Apple M4 Pro | 32,768 | 31.3 | 6694.8 | 28410.0 | 955.81 | 3847.84 | 36 | 0.88
Qwen3.5-27B-UD-Q3_K_XL | Q3_K_XL | Apple M4 Pro | 32,768 | 30.9 | 10181.6 | 52862.3 | 951.99 | 2685.96 | 34 | 0.91
Qwen3.5-27B-UD-IQ2_M | IQ2_M | Apple M4 Pro | 16,384 | 30.9 | 8284.5 | 43437.7 | 972.29 | 2854.47 | 33 | 0.93
Qwen3.5-27B-Q4_K_M | Q4_K_M | Apple M4 Pro | 130,064 | 30.7 | 9252.4 | 49827.6 | 976.33 | 2974.58 | 34 | 0.90
Qwen3.5-27B-Q4_K_S | Q4_K_S | Apple M4 Pro | 16,384 | 30.4 | 8605.2 | 47639.8 | 989.40 | 2774.14 | 34 | 0.90
Qwen3.5-27B-UD-Q2_K_XL | Q2_K_XL | Apple M4 Pro | 130,064 | 30.4 | 9541.8 | 51287.4 | 983.68 | 3277.40 | 34 | 0.90
Qwen3.5-27B-UD-Q8_K_XL | Q8_K_XL | Apple M4 Pro | 16,384 | 30.3 | 9614.3 | 43726.1 | 973.79 | 3628.40 | 35 | 0.86
Qwen3.5-27B-UD-Q6_K_XL | Q6_K_XL | Apple M4 Pro | 32,768 | 30.2 | 9154.7 | 53033.8 | 981.24 | 3109.00 | 35 | 0.87
Qwen3.5-27B-Q5_K_M | Q5_K_M | Apple M4 Pro | 16,384 | 30.1 | 9683.3 | 49735.4 | 986.98 | 3356.98 | 34 | 0.88
Qwen3.5-27B-Q5_K_S | Q5_K_S | Apple M4 Pro | 32,768 | 29.8 | 10936.4 | 57252.9 | 993.81 | 2773.56 | 33 | 0.91
Qwen3.5-27B-UD-Q6_K_XL | Q6_K_XL | NVIDIA GB10 | 130,064 | 29.1 | 3507.8 | 9507.1 | 1050.01 | 1841.51 | 49 | 0.59
Qwen3.5-27B-UD-Q8_K_XL | Q8_K_XL | NVIDIA GB10 | 65,536 | 28.1 | 3855.9 | 13394.5 | 1088.63 | 1923.82 | 47 | 0.60
Qwen3.5-27B-UD-Q8_K_XL | Q8_K_XL | NVIDIA GeForce RTX 5090 | 8,192 | 22.9 | 5271.4 | 10138.0 | 1302.32 | 3347.34 | 105 | 0.22

791 of 791 entries

Deduplication policy: For each model + quantization + GPU combination, the run with the highest throughput (tok/s) is shown. Repeated runs are preserved in the raw dataset for further analysis.
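
The deduplication policy above can be sketched in a few lines of Python. This is a minimal illustration, not the leaderboard's actual implementation: the function name `best_runs` and the record field names (`model`, `quant`, `gpu`, `throughput_tok_s`) are assumptions made for the example, and the repeated run in the sample data is hypothetical.

```python
from typing import Dict, List, Tuple


def best_runs(runs: List[dict]) -> List[dict]:
    """Keep the highest-throughput run per (model, quant, gpu), ranked by throughput.

    Field names are assumed for illustration; the raw dataset's schema may differ.
    """
    best: Dict[Tuple[str, str, str], dict] = {}
    for run in runs:
        key = (run["model"], run["quant"], run["gpu"])
        current = best.get(key)
        # Replace the stored run only when the new one is strictly faster.
        if current is None or run["throughput_tok_s"] > current["throughput_tok_s"]:
            best[key] = run
    # Rank the surviving runs from fastest to slowest.
    return sorted(best.values(), key=lambda r: r["throughput_tok_s"], reverse=True)


runs = [
    {"model": "Qwen3.5-27B-Q8_0", "quant": "Q8_0", "gpu": "NVIDIA GB10", "throughput_tok_s": 34.0},
    # Hypothetical slower repeat of the same combination (not from the table).
    {"model": "Qwen3.5-27B-Q8_0", "quant": "Q8_0", "gpu": "NVIDIA GB10", "throughput_tok_s": 30.0},
    {"model": "Qwen3.5-27B-Q8_0", "quant": "Q8_0", "gpu": "Apple M4 Pro", "throughput_tok_s": 33.2},
]

leaderboard = best_runs(runs)
for row in leaderboard:
    print(row["model"], row["gpu"], row["throughput_tok_s"])
```

Because the input list is never modified, the repeated runs stay available for further analysis, in line with the policy.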