Highlights
-
pgvectorscale Public
Forked from timescale/pgvectorscale
A complement to pgvector for high performance, cost efficient vector search on large workloads.
Rust PostgreSQL License Updated Mar 14, 2025 -
pdftext Public
Forked from VikParuchuri/pdftext
Extract structured text from pdfs quickly
Python Apache License 2.0 Updated Feb 26, 2025 -
edgartools Public
Forked from dgunning/edgartools
Navigate SEC Edgar data in Python
Python MIT License Updated Feb 14, 2025 -
tantivy-py Public
Forked from quickwit-oss/tantivy-py
Python bindings for Tantivy
Rust MIT License Updated Feb 6, 2025 -
fancy-regex Public
Forked from fancy-regex/fancy-regex
Rust library for regular expressions using "fancy" features like look-around and backreferences
Rust MIT License Updated Jan 31, 2025 -
surya Public
Forked from VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
Python GNU General Public License v3.0 Updated Jan 25, 2025 -
pyvespa Public
Forked from vespa-engine/pyvespa
Python API for https://vespa.ai, the open big data serving engine
Python Apache License 2.0 Updated Dec 12, 2024 -
vidore-benchmark Public
Forked from illuin-tech/vidore-benchmark
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
Python MIT License Updated Oct 21, 2024 -
mdm4-splicing Public
Computational analysis of RPL22 alterations and impact on MDM4 splicing
-
transformers Public
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 Updated Aug 3, 2024 -
text-embeddings-inference Public
Forked from huggingface/text-embeddings-inference
A blazing fast inference solution for text embeddings models
Rust Apache License 2.0 Updated May 12, 2024 -
many Public
Frequently-used methods for exploratory analysis
-
cancer_data Public
A unified downloader+preprocessor for cancer genomics datasets
-
gaoya Public
Forked from serega/gaoya
Locality Sensitive Hashing
Rust MIT License Updated Mar 1, 2024 -
paradedb Public
Forked from paradedb/paradedb
Postgres for Search and Analytics
Rust GNU Affero General Public License v3.0 Updated Mar 1, 2024 -
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 Updated Feb 24, 2024 -
schema-infer Public
Forked from triggerdotdev/schema-infer
Infers JSON Schemas and Type Definitions from example JSON
TypeScript MIT License Updated Feb 6, 2024 -
json-schema-fns Public
Forked from triggerdotdev/json-schema-fns
Modern utility library and typescript typings for building JSON Schema documents
TypeScript MIT License Updated Feb 6, 2024 -
json-infer-types Public
Forked from triggerdotdev/json-infer-types
Infers the type and format of JSON values
TypeScript MIT License Updated Feb 6, 2024 -
imagededup Public
Forked from idealo/imagededup
😎 Finding duplicate images made easy!
Python Apache License 2.0 Updated Jan 2, 2024 -
minify-html Public
Forked from wilsonzlin/minify-html
Extremely fast and smart HTML + JS + CSS minifier, available for Rust, Deno, Java, Node.js, Python, Ruby, and WASM
Rust MIT License Updated Dec 14, 2023 -
flash-attention Public
Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License Updated Sep 18, 2023 -