Fix small performance regression versus v2.9 by tameware · Pull Request #214 · dds-bridge/dds

tameware · 2026-06-28T14:00:55Z

Regains roughly 2% of performance.

Compare is the speedup-opus branch. Branch is this branch.

solver file           compare_avg   branch_avg cmp/branch note
------ ------------- ------------ ------------ ---------- ---------------
solve  list1000.txt          1.96         1.93      1.02x branch faster
solve  list100.txt           1.97         1.95      1.01x branch faster
solve  list10.txt            4.18         4.16      1.00x equal
solve  list1.txt            12.20        12.00      1.02x branch faster
calc   list1000.txt         10.68        10.58      1.01x branch faster
calc   list100.txt           8.88         8.80      1.01x branch faster
calc   list10.txt           15.50        14.56      1.06x branch faster
calc   list1.txt            39.60        39.00      1.02x branch faster

Copilot

Pull request overview

This PR aims to regain a small (~2%) performance regression vs v2.9 by removing redundant/narrowing casts and simplifying a hot-path heuristic branch, reducing unnecessary conversions and work in frequently executed code paths.

Changes:

Removes redundant static_cast<unsigned char>(...) / double-cast patterns when indexing rank/relative-rank tables.
Simplifies part of weight_alloc_trump_void1() for the “void in trump when trump is led” discard case.
Removes unnecessary casts when copying winner/second-best ranks and hands from RelRanksType into Pos.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated no comments.

File	Description
library/src/quick_tricks.cpp	Simplifies rank-to-index conversion when setting `win_ranks` from `AbsRankType::rank`.
library/src/heuristic_sorting/heuristic_sorting.cpp	Removes redundant casts in `rel_rank` indexing and simplifies a discard-weighting branch in a hot heuristic function.
library/src/ab_search.cpp	Removes unnecessary casts when updating cached winner/second-best info from `thrp->rel[...]`.

zzcgumn · 2026-06-29T19:19:07Z

  int suitAdd;

-  if (suit == trump)
+  if (lead_suit == trump) // We pitch


This confuses me, is this not the same fix as the previous pull request?

Would you like me to investigate and address, or can this be dealt with in merge?

Can you try to rebase or merge this branch with latest develop?

tameware · 2026-06-29T20:41:46Z

Entirely possible! I sometimes get my branches confused.

…

On Mon, Jun 29, 2026 at 2:19 PM Martin Nygren ***@***.***> wrote: ***@***.**** commented on this pull request. ------------------------------ In library/src/heuristic_sorting/heuristic_sorting.cpp <#214 (comment)>: > @@ -676,49 +676,17 @@ void weight_alloc_trump_void1(HeuristicContext& ctx) unsigned short suitCount = tpos.length[curr_hand][suit]; int suitAdd; - if (suit == trump) + if (lead_suit == trump) // We pitch This confuses me, is this not the same fix as the previous pull request? — Reply to this email directly, view it on GitHub <#214?email_source=notifications&email_token=ABC4PYHJOQ2O32PTZTFJNFD5CK6MFA5CNFSNUABKM5UWIORPF5TWS5BNNB2WEL2QOVWGYUTFOF2WK43UKJSXM2LFO4XTINJZGQ3TGMJVGEYKM4TFMFZW63VGMFZXG2LHN2SWK5TFNZ2KYZTPN52GK4S7MNWGSY3L#pullrequestreview-4594731510>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ABC4PYBWNK44RE4QNRS4XGT5CK6MFAVCNFSNUABEKJSXA33TNF2G64TZHMZDMOJYHAZDANR3JFZXG5LFHM2DONRSGQ3DINRTGWQXMAQ> . You are receiving this because you were assigned.Message ID: ***@***.***>

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

dds-bridge#171

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Pass the resolved dds_mvp.js path via DDS_MVP_JS for Bazel runfiles, and add focus() to mock DOM elements so clearTestData and pageLoad work when merged with web changes. Co-authored-by: Cursor <cursoragent@cursor.com>

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Add upload-artifact@v6 steps to Linux, macOS, Windows, and WASM workflows using bazel-testlogs/ with if-no-files-found: ignore. Co-authored-by: Cursor <cursoragent@cursor.com>

…string.

Co-authored-by: Cursor <cursoragent@cursor.com>

docs/BUILD_SYSTEM.md — new section “AddressSanitizer and ThreadSanitizer (macOS)” covering: The three-way clang major coupling (MODULE.bazel, Xcode, .bazelrc TSAN rpath) Why ASAN uses the full Xcode toolchain vs TSAN using LLVM compile + Xcode runtime Steps when upgrading LLVM or Xcode Smoke-test commands Also corrected the outdated LLVM version note (darwin was listed as 20.1.8; it’s 21.1.8). .bazelrc — short pointers to BUILD_SYSTEM.md; TSAN rpath comment notes the hardcoded clang/21 must stay in sync. MODULE.bazel — comment on llvm_versions to update the TSAN rpath and verify Xcode when bumping LLVM.

…arking global arrays as static. Commented out unused SORT_SOLVE_STRENGTH and SORT_SOLVE_STRENGTH_CUTOFF

…ilure.

The BUILD change ensures a stats test links only system_util_stats, not system + system_util_stats. One system variant → one ThreadMgr → one set of mutexes. That matches the intent: a process-wide thread manager.

Wire a Bazel config_setting into DDS_LOCAL_DEFINES so AB stats can be turned on without --cxxopt=-DDDS_AB_STATS; refresh MODULE.bazel.lock. Co-authored-by: Cursor <cursoragent@cursor.com>

Pass dtest_effective_threads through to calc batches so calc threading matches solve and responds to the -n flag. Co-authored-by: Cursor <cursoragent@cursor.com>

SetResources ignores maxThreads; threading is handled via dtest_effective_threads and the *N batch APIs. Passing -n only triggered a misleading library warning. itest.cpp will be removed in a future PR.

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

The file was excluded from the dtest build, had no Bazel target, and referenced a nonexistent realMain entry point. Co-authored-by: Cursor <cursoragent@cursor.com>

Dereference the transposition-table NodeCards pointer to match the DumpRetrieved(const NodeCards&) signature when DDS_AB_HITS is enabled. Co-authored-by: Cursor <cursoragent@cursor.com>

Re-add the DDS_AB_HITS dump helpers in dump.cpp so debug_all links succeed when ab_search logs transposition-table hits. Co-authored-by: Cursor <cursoragent@cursor.com>

The heuristic extraction refactor changed weight_alloc_trump_void1's first branch from `lead_suit == trump` to `suit == trump`. Since that is exhaustive with the following `else if (suit != trump)`, the three ruffing branches (using the `24 - rank + ...` formula) became dead code, and trump ruffs were scored with side-suit discard weights instead. This mis-ordered ruffs, costing alpha-beta cutoffs. The effect is small for solve but compounds heavily in calc's warm-TT iterative deepening: calc explored ~34% more nodes than v2.9. Restoring the original `lead_suit == trump` pitch branch makes the ruffing branches reachable again and cuts calc time ~25% (gap to v2.9: 1.37x -> 1.02x). Ordering-only change; double-dummy results are unchanged. Co-authored-by: Cursor <cursoragent@cursor.com>

Per Copilot. Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

The heuristic/quick-tricks refactor introduced static_cast<unsigned char> wrappers on values that v2.9 used as signed, changing search behavior: - make_3 / make_3_ctx: winner[]/second_best[] .hand and .rank were cast to unsigned char, turning the -1 "no card" sentinel into 255. This broke winner[trump].hand == -1 style checks in QuickTricks, losing cutoffs. - weight_alloc_trump_void2 / _void3: rel_rank[aggr[suit]][...] indexed through static_cast<unsigned char>(aggr[suit]), truncating the 13-bit aggregate holding to 8 bits and reading the wrong rel_rank row. - QuickTricksPartnerHand{Trump,NT}: bit_map_rank index cast the signed rank through unsigned char. With these reverted to v2.9's signed handling, the per-move-generation ordering trace now matches v2.9 exactly (0 divergences on list1), closing the residual calc gap to parity. Ordering/pruning-only change; double-dummy results are unchanged and all library tests pass. Co-authored-by: Cursor <cursoragent@cursor.com>

Whitespace-only cleanup of misindented if/else-if chains and wrapped conditions left over from the v2.9 port; no logic change. Co-authored-by: Cursor <cursoragent@cursor.com>

tameware · 2026-07-03T04:59:42Z

The rebase was a disaster. I made a new PR #223 instead.

tameware requested a review from Copilot June 28, 2026 14:01

Copilot started reviewing on behalf of tameware June 28, 2026 14:01 View session

Copilot AI reviewed Jun 28, 2026

View reviewed changes

tameware requested a review from zzcgumn June 28, 2026 14:09

tameware marked this pull request as ready for review June 28, 2026 14:09

tameware self-assigned this Jun 28, 2026

zzcgumn reviewed Jun 29, 2026

View reviewed changes

tameware and others added 22 commits July 2, 2026 22:06

First commit for calc_dd.cpp and related.

9a3e933

Read from stdin if available. Fix BUILD.bazel target name.

9d4c025

Add two pbn files for manual testing.

7e6d338

renamed to calc_par

075277b

renamed to calc_par

5a9a61e

Renamed calc_par example to dd_table_for_deal

d8c6978

Added read_pbn_file_workspace_relative

901751d

Added dd_table_for_deal.py

7e78900

Added usage for Python dd_table_for_deal

d3f9b18

Added newlines at the end of each file

0a83349

Replace //python:install_dds3_so with //python:_dds3

66b8ede

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Use modern C++20 instead of C-style I/O and string handling.

c42179b

Added tests and a test runner for the Javascript functions, per

8220c41

dds-bridge#171

Trigger CI workflow

da07b90

Trigger CI workflow - 2nd try

aabf60c

Updated comment to fix typo and add alternative test runners.

d327356

Fix typos in comment

2e37174

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Fix dds_mvp_js_test on Linux CI and with focusNorthSpades.

870b500

Pass the resolved dds_mvp.js path via DDS_MVP_JS for Bazel runfiles, and add focus() to mock DOM elements so clearTestData and pageLoad work when merged with web changes. Co-authored-by: Cursor <cursoragent@cursor.com>

Applied Copilot's suggestions

06c470f

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Upload Bazel test logs as CI artifacts on all platforms.

e5a80a1

Add upload-artifact@v6 steps to Linux, macOS, Windows, and WASM workflows using bazel-testlogs/ with if-no-files-found: ignore. Co-authored-by: Cursor <cursoragent@cursor.com>

Initalt commit

6839716

First take on a cross platfor dds wrapper

0a0fb9d

zzcgumn and others added 26 commits July 2, 2026 22:07

fix: remove mention of the guard on user_threads from the python doc …

9e12ddc

…string.

fix: removes incorrect mention of value error.

a89fae8

fix: renames InitialiseStaticMemory to use the American misspelling.

9f5adac

Per Cursor: Allow ASAN and TSAN tests to run on macOS.

efb7e7a

Update MODULE.bazel.lock after adding apple_support for macOS ASAN.

e426b95

Co-authored-by: Cursor <cursoragent@cursor.com>

Update docs to remove the --define=asan=true option.

7eb3819

Add ASan and TSan jobs for CI.

d8c0ec1

Add a CI timeout on all platforms.

bf3f99c

Fix issues identified by local ASan test.

66d823d

Fix AddressSanitizer (ASAN) ODR (One Definition Rule) violations by m…

f348664

…arking global arrays as static. Commented out unused SORT_SOLVE_STRENGTH and SORT_SOLVE_STRENGTH_CUTOFF

Moved mtx and mtxPrint into an anonymous namespace to address ASan fa…

73faa5b

…ilure.

Remove duplicate ThreadMgr to address an ASan error. Per Cursor:

a5e29e9

The BUILD change ensures a stats test links only system_util_stats, not system + system_util_stats. One system variant → one ThreadMgr → one set of mutexes. That matches the intent: a process-wide thread manager.

Enable DDS_AB_STATS via --define=ab_stats=true.

e6bfbee

Wire a Bazel config_setting into DDS_LOCAL_DEFINES so AB stats can be turned on without --cxxopt=-DDDS_AB_STATS; refresh MODULE.bazel.lock. Co-authored-by: Cursor <cursoragent@cursor.com>

Honor dtest -n in loop_calc via CalcAllTablesPBNN.

cfceffa

Pass dtest_effective_threads through to calc batches so calc threading matches solve and responds to the -n flag. Co-authored-by: Cursor <cursoragent@cursor.com>

Stop passing -n to SetResources in dtest drivers.

5188c46

SetResources ignores maxThreads; threading is handled via dtest_effective_threads and the *N batch APIs. Passing -n only triggered a misleading library warning. itest.cpp will be removed in a future PR.

Simplify

66e297c

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Use named constant

34f8d66

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Remove unused itest.cpp test driver.

7724718

The file was excluded from the dtest build, had no Bazel target, and referenced a nonexistent realMain entry point. Co-authored-by: Cursor <cursoragent@cursor.com>

Fix DumpRetrieved call under debug_all builds.

8ec7a47

Dereference the transposition-table NodeCards pointer to match the DumpRetrieved(const NodeCards&) signature when DDS_AB_HITS is enabled. Co-authored-by: Cursor <cursoragent@cursor.com>

Restore DumpRetrieved and DumpStored for debug_all builds.

3462d8c

Re-add the DDS_AB_HITS dump helpers in dump.cpp so debug_all links succeed when ab_search logs transposition-table hits. Co-authored-by: Cursor <cursoragent@cursor.com>

Fix incorrect comment

4cc0073

Per Copilot. Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Rebased against develop.

72d08da

Fix indentation in weight_alloc_trump_void1 and trump_void3

b8143cd

Whitespace-only cleanup of misindented if/else-if chains and wrapped conditions left over from the v2.9 port; no logic change. Co-authored-by: Cursor <cursoragent@cursor.com>

tameware force-pushed the opus-two-percent branch from 6234595 to b8143cd Compare July 3, 2026 03:56

tameware closed this Jul 3, 2026

tameware deleted the opus-two-percent branch July 3, 2026 04:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix small performance regression versus v2.9#214

Fix small performance regression versus v2.9#214
tameware wants to merge 157 commits into
dds-bridge:developfrom
tameware:opus-two-percent

tameware commented Jun 28, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

zzcgumn Jun 29, 2026

Uh oh!

tameware Jun 30, 2026

Uh oh!

zzcgumn Jul 2, 2026

Uh oh!

tameware commented Jun 29, 2026 via email

Uh oh!

tameware commented Jul 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

Conversation

tameware commented Jun 28, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

zzcgumn Jun 29, 2026

Choose a reason for hiding this comment

Uh oh!

tameware Jun 30, 2026

Choose a reason for hiding this comment

Uh oh!

zzcgumn Jul 2, 2026

Choose a reason for hiding this comment

Uh oh!

tameware commented Jun 29, 2026 via email

Uh oh!

tameware commented Jul 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants