Skip to content

Latest commit

 

History

History
13 lines (7 loc) · 346 Bytes

depshingle.md

File metadata and controls

13 lines (7 loc) · 346 Bytes

without dependency shingling

  • total lines in shingles.congress.sorted = 64300

  • total lines after single pass over file to remove singletons = 35940

  • there are 6000 possible pairs before reduction

  • there are 600 pairs after reducing to only pairs from different files

  • there are 720313 pairs to check

  • there are 231971 unique pairs