Skip to content

Commit fcd0526

Browse files
committed
Day 25: Cache by blocks of ints
block size 128 (0.size (8) * 16) gave good results on one computer... block size | hits | misses | miss rate | runtime / original ----|---------|-----|----------|------ 8 | 1073198 | 64 | 0.000061 | 1.33 16 | 533709 | 102 | 0.00019 | 0.67 32 | 263990 | 120 | 0.00045 | 0.4 64 | 129163 | 169 | 0.0013 | 0.25 128 | 61726 | 280 | 0.0045 | 0.22 256 | 27929 | 494 | 0.017 | 0.8 However, on a different computer, 8 * 7 = 56 was the best. Haven't investigated why. Previously I had tried the individual approaches (may still be available on branches unless I decide to delete them): * Pack N bits into blocks without running multiple iterations at once: Slowdown by about 1.5x * Run fixed N iterations without packing bits into blocks: Slowdown by about 2x because we do 2N-1 reads for every N iterations. * Run for every N distance without packing bits into blocks: Slowdown by about 1.2x because we tend to take fewer than 2N-1 cycles to move N distance. Need to combine the first and last bullet points, plus add a cache.
1 parent f2f62ee commit fcd0526

File tree

2 files changed

+50
-7
lines changed

2 files changed

+50
-7
lines changed

25_turing_machine.rb

+48-7
Original file line numberDiff line numberDiff line change
@@ -36,18 +36,59 @@ def parse_state(line)
3636
end
3737
STATES = parse_input(ARGF)
3838

39-
ticker = [0] * (check_after ** 0.5).ceil * 2
39+
# Experimentally, this gave a good runtime,
40+
# but didn't investigate to explain why this is good.
41+
# I suppose performance gets worse at 63 because some numbers become bigints,
42+
# such as new_bit << block_pos
43+
BLOCK_SIZE = 62
44+
45+
CACHE = {}
46+
def run_block(left, right, state, steps_left)
47+
pos = BLOCK_SIZE
48+
steps_taken = 0
49+
# Leftmost bit of left doesn't matter.
50+
left &= ~1
51+
cache_key = [left, right, state].freeze
52+
if (cached = CACHE[cache_key]) && cached[-1] <= steps_left
53+
return cached
54+
end
55+
56+
while steps_taken < steps_left && (1...(2 * BLOCK_SIZE)).cover?(pos)
57+
block_i, block_pos = pos.divmod(BLOCK_SIZE)
58+
current_bit = (block_i == 0 ? left : right)[block_pos]
59+
new_bit, where_to_go, state = STATES[state][current_bit]
60+
if current_bit != new_bit
61+
mask = ~(1 << block_pos)
62+
if block_i == 0
63+
left = (left & mask) | (new_bit << block_pos)
64+
else
65+
right = (right & mask) | (new_bit << block_pos)
66+
end
67+
end
68+
pos += where_to_go
69+
steps_taken += 1
70+
end
71+
CACHE[cache_key] = [left, right, state, pos == 0 ? -1 : 1, steps_taken].freeze
72+
end
73+
74+
ticker = [0] * [(check_after ** 0.5) * 4 / BLOCK_SIZE, 2].max
4075
pos = ticker.size / 2
76+
steps_left = check_after
4177

42-
check_after.times {
43-
ticker[pos], where_to_go, state = STATES[state][
44-
ticker[pos] || 0.tap { ticker.concat([0] * ticker.size) }
45-
]
78+
while steps_left > 0
79+
new_left, new_right, state, where_to_go, steps_taken = run_block(
80+
ticker[pos - 1],
81+
ticker[pos] || 0.tap { ticker.concat([0] * ticker.size) },
82+
state, steps_left,
83+
)
84+
steps_left -= steps_taken
85+
ticker[pos - 1] = (ticker[pos - 1] & 1) | (new_left & ~1)
86+
ticker[pos] = new_right
4687
pos += where_to_go
4788
if pos < 0
4889
pos += ticker.size
4990
ticker.unshift(*[0] * ticker.size)
5091
end
51-
}
92+
end
5293

53-
puts ticker.sum
94+
puts ticker.sum { |t| t.to_s(2).count(?1) }

README.md

+2
Original file line numberDiff line numberDiff line change
@@ -66,6 +66,8 @@ Solutions with interesting algorithmic choices:
6666
A few ideas can be found in the [can it be done more efficiently?](https://www.reddit.com/r/adventofcode/comments/7lunzu/2017_day_24_so_can_it_be_done_more_efficiently/) thread.
6767
The only one I needed to use was the one that optimises out any `[X, X]` dominoes.
6868
There are only 6 `[X, X]` dominoes in my input of 57, but this decreases the number of bridges found from 288414 to 21783.
69+
* 25 (Turing Machine):
70+
Pack into blocks of N bits (experimentally determined that 128 is good), and cache what happens every time the head moves N steps away as seen in [askalski's solutions](https://www.reddit.com/r/adventofcode/comments/7q6s80/2017_optimized_solutions_in_c_195_ms_total/).
6971

7072
Solutions notable for good leaderboard-related reasons:
7173

0 commit comments

Comments
 (0)