view post Post 2834 text-generation-inference (TGI) is now fully open-source again!Along with text-embeddings-inference.We just switched both of those repos' license back to Apache 2. 🔥
view post Post 2728 Very glad to welcome @josefprusa , pioneer of 3D printing and open source hardware, founder of https://www.prusa3d.com/, to the HF Hub 👋AI applied to 3D printing could be big.
Canonical models This collection lists all the historical (pre-"Hub") canonical model checkpoints, i.e. repos that were not under an org or user namespace albert/albert-base-v1 Fill-Mask • Updated Feb 19 • 15.3k • 6 albert/albert-base-v2 Fill-Mask • Updated Feb 19 • 2.65M • 88 albert/albert-large-v1 Fill-Mask • Updated Feb 19 • 1.64k • 2 albert/albert-large-v2 Fill-Mask • Updated Feb 19 • 74.5k • 13
Papers about model merging referenced in the mergekit repo: https://github.com/cg123/mergekit Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time Paper • 2203.05482 • Published Mar 10, 2022 • 5 Editing Models with Task Arithmetic Paper • 2212.04089 • Published Dec 8, 2022 • 4 Resolving Interference When Merging Models Paper • 2306.01708 • Published Jun 2, 2023 • 10 Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch Paper • 2311.03099 • Published Nov 6, 2023 • 27
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time Paper • 2203.05482 • Published Mar 10, 2022 • 5
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch Paper • 2311.03099 • Published Nov 6, 2023 • 27