大模型笔记
Home
0.inbox
[Read]2025 04
Alg interview faq
1.[基建]数据
Data
3.[基建]效率
Inference
Train
4.[模型]文本
Embedding
PostTraining
PreTraining
5.[模型]多模态
MultiModalEmbedding
开源资源
T2V
VLA
VLM
6.[模型]评测
Benchmark
7.[应用]产品
Agent
Product
VibeCoding
大模型笔记
5.[模型]多模态
MultiModalEmbedding
多模态Embedding
#
开源资源
#
VGG
[2014.09]
Very Deep Convolutional Networks for Large-Scale Image Recognition
https://pytorch.org/vision/stable/models/vgg.html
CLIP
[2021.02]
Learning Transferable Visual Models From Natural Language Supervision
文本图片对比训练,
« Previous
Next »