
课程目录:
├── 基于LangChain和知识图谱的大模型医疗问答机器人项目
│ ├── 源代码
│ ├── 大模型实战P25LangChain之给Agent加Memory.mp4
│ ├── 大模型实战P11LangChain之Prompt和LLMChain.mp4
│ ├── 大模型实战P45问答机器人项目面试考点总结.mp4
│ ├── 大模型实战P36从用户问题中抽取命名实体词槽.mp4
│ ├── 大模型实战P37CQL词槽填充和相关问题筛选.mp4
│ ├── 大模型实战P1LangChain与知识图谱问答机器人项目.mp4
│ ├── 大模型实战P13LangChain之FewShotPrompt.mp4
│ ├── 大模型实战P41用户消息的补全和归纳总结.mp4
│ ├── 大模型实战P48快速接入百川和Claude大模型.mp4
│ ├── Neo4j实战P7-1Windows和Mac本地安装Neo4j数据库.mp4
│ ├── 大模型实战P44LangChain框架版本升级.mp4
│ ├── 大模型实战P24LangChain之多Agent协作.mp4
│ ├── 大模型实战P12LangChain之多参数与LCEL.mp4
│ ├── 大模型实战P32定义环境变量和模型获取函数.mp4
│ ├── 大模型实战P47一种解决Agent响应慢的方法.mp4
│ ├── 大模型实战P19LangChain之FAISS文档召回.mp4
│ ├── 大模型实战P28LangChain之GraphCypherQAChain.mp4
│ ├── 大模型实战P31项目LangChainAgent架构简介.mp4
│ ├── 大模型实战P40用Agent串联业务处理函数.mp4
│ ├── 大模型实战P43LangSmith监控大模型应用程序.mp4
│ ├── 大模型实战P20LangChain之文档加载和分割.mp4
│ ├── 大模型实战P30Gradio之ChatInterface对话界面.mp4
│ ├── 大模型实战P15LangChain之ConversationChain.mp4
│ ├── 大模型实战P27LangChain之输出提示词重写.mp4
│ ├── 大模型实战P8OpenAI接口实现TextEmbeddings.mp4
│ ├── 大模型实战P18LangChain之问答QAChain.mp4
│ ├── 大模型实战P9根据OpenAI句向量召回相似文本.mp4
│ ├── 大模型实战P26LangChain之命名实体识别.mp4
│ ├── 大模型实战P6OpenAI接口调用Token计算.mp4
│ ├── 大模型实战P46共性问题修复和统一答疑.mp4
│ ├── 大模型实战P39Google搜索回答非在库问题.mp4
│ ├── 大模型实战P10LangChain简介与初体验.mp4
│ ├── 大模型实战P16LangChain之Memory.mp4
│ ├── 大模型实战P17LangChain之LLMRequestsChain.mp4
│ ├── 大模型实战P2基础课和项目课的内容概述.mp4
│ ├── 大模型实战P21LangChain之文档检索问答.mp4
│ ├── 大模型实战P7OpenAI接口实现多轮对话.mp4
│ ├── Neo4j实战P7-2Windows和Mac本地安装Neo4j数据库.mp4
│ ├── 医疗问答P7CSV文件导入到Neo4j数据库.mp4
│ ├── 大模型实战P22LangChain之向量保存和加载.mp4
│ ├── 大模型实战P5OpenAI对话接口代码优化.mp4
│ ├── 大模型实战P3大语言模型通识和课前准备.mp4
│ ├── 大模型实战P42Gradio对话窗口修改和测试.mp4
│ ├── 大模型实战P29Gradio简介与初体验.mp4
│ ├── 大模型实战P14LangChain之SequentialChain.mp4
│ ├── 大模型实战P38查询Neo4j回答医疗相关问题.mp4
│ ├── 大模型实战P35Chroma召回数据回答公司相关问题.mp4
│ ├── 大模型实战P34通用大模型回答日常交际问题.mp4
│ ├── 大模型实战P33公司相关文档向量化和存储.mp4
│ ├── 大模型实战P4OpenAI对话接口简单使用方法.mp4
│ ├── 大模型实战P23LangChain之Agent和自定义Tool.mp4
├── 大模型面试笔记书籍
│ ├── 大模型论文
│ │ ├── CVPR 2024 (最佳+oral+highlight)(持续更新)
│ │ │ ├── 1 CVPR'24 获奖论文
│ │ │ │ ├── 4 最佳学生论文次优奖
│ │ │ │ │ ├── Objects as volumes: A stochastic geometry view of opaque solids.pdf
│ │ │ │ │ ├── Comparing the Decision-Making Mechanisms by Transformers and CNNs.pdf
│ │ │ │ ├── 2 最佳学生论文奖
│ │ │ │ │ ├── BIOCLIP:A Vision Foundation Model for the Tree of Life.pdf
│ │ │ │ │ ├── Mip-Splatting:Alias-free 3D Gaussian Splatting.pdf
│ │ │ │ ├── 3 最佳论文次优奖
│ │ │ │ │ ├── pixelSplat. 3D Gaussian Splats from lmage Pairs for Scalable Generalizable 3D Reconstruction.pdf
│ │ │ │ ├── 1 最佳论文奖
│ │ │ │ │ ├── Rich Human Feedback for Text-to-Image Generation.pdf
│ │ │ │ │ ├── Generative Image Dynamics.pdf
│ │ │ ├── 3 CVPR'24 oral论文(更新完毕)
│ │ │ │ ├── 18 多模态学习
│ │ │ │ │ ├── Describing Differences in Image Sets with Natural Language.pdf
│ │ │ │ │ ├── NoiseCLR:A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions in Diffusion Models.pdf
│ │ │ │ │ ├── MetaCloak.pdf
│ │ │ │ │ ├── InternVL:Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks.pdf
│ │ │ │ ├── 1 低层次视觉
│ │ │ │ │ ├── Specularity Factorization for Low-Light Enhancement.pdf
│ │ │ │ │ ├── FMA-Net:Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring.pdf
│ │ │ │ │ ├── Bilateral Event Mining and Complementary for Event Stream Super-Resolution.pdf
│ │ │ │ │ ├── FlowIE:Efficient Image Enhancement via Rectified Flow.pdf
│ │ │ │ │ ├── Towards Robust Event-guided Low-Light Image Enhancement.pdf
│ │ │ │ ├── 11三维视觉
│ │ │ │ │ ├── A Subspace-Constrained Tyler’s Estimator and its Applications to Structure from Motion.pdf
│ │ │ │ ├── 16 低层次视觉与遥感
│ │ │ │ │ ├── DART:Implicit Doppler Tomography for Radar Novel View Synthesis.pdf
│ │ │ │ │ ├── LDP: Language-driven Dual-Pixel Image Defocus Deblurring Network.pdf
│ │ │ │ ├── 14 多视角三维技术和传感器 2
│ │ │ │ │ ├── Learning to Produce Semi-dense Correspondences for Visual Localization.pdf
│ │ │ │ ├── 15 低样本学习、自监督学习和半监督学习
│ │ │ │ │ ├── CroSel.pdf
│ │ │ │ │ ├── LTGC:Long-tail Recognition via Leveraging LLMs-driven Generated Content.pdf
│ │ │ │ │ ├── Improving Semantic Correspondence with Viewpoint-Guided Spherical Maps.pdf
│ │ │ │ ├── 6 多视角三维技术和传感器
│ │ │ │ │ ├── Seeing the World through Your Eyes.pdf
│ │ │ │ │ ├── Tri-Perspective View Decomposition for Geometry-Aware Depth Completion.pdf
│ │ │ │ │ ├── Steerers:A Framework for Rotation Equivariant Keypoint Descriptors.pdf
│ │ │ │ │ ├── Point Transformer V3:Simpler Faster Stronger.pdf
│ │ │ │ │ ├── Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences.pdf
│ │ │ │ ├── 5 深度学习架构与技术
│ │ │ │ │ ├── Neural Lineage.pdf
│ │ │ │ │ ├── Learning Structure-from-Motion with Graph Attention Networks.pdf
│ │ │ │ │ ├── Neural Redshift:Random Networks are not Random Functions.pdf
│ │ │ │ │ ├── In Search of a Data Transformation That Accelerates Neural Field Training.pdf
│ │ │ │ │ ├── Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks.pdf
│ │ │ │ ├── 7 单视角三维技术
│ │ │ │ │ ├── WALT3D:Generating Realistic Training Data from Time-Lapse Imagery for Reconstructing Dynamic Objects Under Occlusion.pdf
│ │ │ │ │ ├── EscherNet:A Generative Model for Scalable View Synthesis.pdf
│ │ │ │ │ ├── Rethinking Inductive Biases for Surface Normal Estimation.pdf
│ │ │ │ ├── 10 自主导航和自我中心视觉
│ │ │ │ │ ├── SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection.pdf
│ │ │ │ │ ├── EgoGen:An Egocentric Synthetic Data Generator.pdf
│ │ │ │ │ ├── UnO:Unsupervised Occupancy Fields for Perception and Forecasting.pdf
│ │ │ │ ├── 3 人类行为和特征
│ │ │ │ │ ├── Stratified Avatar Generation from Sparse Observations.pdf
│ │ │ │ │ ├── Semantic Human Mesh Reconstruction with Textures.pdf
│ │ │ │ │ ├── URHand:Universal Relightable Hands.pdf
│ │ │ │ │ ├── MultiPly:Reconstruction of Multiple People from Monocular Video in the Wild.pdf
│ │ │ │ │ ├── Relightable Gaussian Codec Avatars.pdf
│ │ │ │ ├── 2 视觉与图形
│ │ │ │ │ ├── Eclipse:Disambiguating Illumination and Materials using Unintended Shadows.pdf
│ │ │ │ │ ├── Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation.pdf
│ │ │ │ │ ├── DiffusionLight:Light Probes for Free by Painting a Chrome Ball.pdf
│ │ │ │ ├── 9 医学与物理视觉
│ │ │ │ │ ├── Transcriptomics-guided Slide Representation Learning in Computational Pathology.pdf
│ │ │ │ ├── 17 图像与视频合成 2
│ │ │ │ │ ├── MonoHair:High-Fidelity Hair Modeling from a Monocular Video.pdf
│ │ │ │ │ ├── Alchemist:Parametric Control of Material Properties with Diffusion Models.pdf
│ │ │ │ │ ├── Visual Anagrams:Generating Multi-View Optical Illusions with Diffusion Models.pdf
│ │ │ │ ├── 8 视觉、语言与推理
│ │ │ │ │ ├── Visual Program Distillation:Distilling Tools and Programmatic Reasoning into Vision-Language Models.pdf
│ │ │ │ │ ├── LISA:Reasoning Segmentation via Large Language Model.pdf
│ │ │ │ │ ├── Eyes Wide Shut Exploring the Visual Shortcomings of Multimodal LLMs.pdf
│ │ │ │ ├── 12 动作和运动分析
│ │ │ │ │ ├── An N-Point Linear Solver for Line and Motion Estimation with Event Cameras.pdf
│ │ │ │ │ ├── FineParser:A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality Assessment.pdf
│ │ │ │ │ ├── Modeling Multimodal Social Interactions:New Challenges and Baselines with Densely Aligned Representations.pdf
│ │ │ │ │ ├── RoHM:Robust Human Motion Reconstruction via Diffusio.pdf
│ │ │ │ ├── 4 图像与视频合成
│ │ │ │ │ ├── Ranni:Taming Text-to-Image Diffusion for Accurate Instruction Following.pdf
│ │ │ │ │ ├── Attention Calibration for Disentangled Text-to-Image Personalization.pdf
│ │ │ │ │ ├── FreeU:Free Lunch in Diffusion U-Net.pdf
│ │ │ │ │ ├── Instruct-Imagen: Image Generation with Multi-modal Instruction.pdf
│ │ │ │ │ ├── Style Aligned Image Generation via Shared Attention.pdf
│ │ │ │ ├── 13 数据集和评估
│ │ │ │ │ ├── 360+x:A Panoptic Multi-modal Scene Understanding Dataset.pdf
│ │ │ │ │ ├── Deep Generative Model based Rate-Distortion for Image Downscaling Assessment.pdf
│ │ │ │ │ ├── Ego-Exo4D:Understanding Skilled Human Activity from First- and Third-Person Perspectives.pdf
│ │ │ ├── 4 CVPR'24 highlight论文(更新中)
│ │ │ │ ├── ODIN A Single Model for 2D and 3D Segmentation.pdf
│ │ │ │ ├── Enforcing Geometric and Physical Priors.pdf
│ │ │ │ ├── Scaling Up Dynamic Human-Scene Interaction Modeling.pdf
│ │ │ │ ├── CADTalk An Algorithm and Benchmark for Semantic Commenting of CAD Programs.pdf
│ │ │ │ ├── LucidDreamer Towards High-Fidelity Text-to-3D Generation via Interval Score Matching.pdf
│ │ │ │ ├── pix2gestalt Amodal Segmentation by Synthesizing Wholes.pdf
│ │ │ │ ├── Semantic-aware SAM for Point-Prompted Instance Segmentation.pdf
│ │ │ │ ├── Self-Supervised Dual Contouring.pdf
│ │ │ │ ├── Multi-view Aggregation Network for Dichotomous Image Segmentation.pdf
│ │ │ │ ├── From Correspondences to Pose Non-minimal Certifiably Optimal Relative Pose without Disambiguation.pdf
│ │ │ │ ├── 3D Face Reconstruction with the Geometric Guidance of Facial Part Segmentation.pdf
│ │ │ │ ├── Suppress and Rebalance Towards Generalized Multi-Modal Face Anti-Spoofing.pdf
│ │ │ │ ├── GraCo Granularity-Controllable Interactive Segmentation.pdf
│ │ │ │ ├── Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation.pdf
│ │ │ │ ├── RAVE Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models.pdf
│ │ │ │ ├── DiffusionLight Light Probes for Free by Painting a Chrome Ball.pdf
│ │ │ │ ├── Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation.pdf
│ │ │ │ ├── Towards Robust Event-guided Low-Light Image Enhancement A Large-Scale Real-World Event-Image Dataset and Novel Approach.pdf
│ │ │ │ ├── Eclipse Disambiguating Illumination and Materials using Unintended Shadows.pdf
│ │ │ │ ├── Boosting Neural Representations for Videos with a Conditional Decoder.pdf
│ │ │ │ ├── Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation.pdf
│ │ │ │ ├── LocLLM Exploiting Generalizable Human Keypoint Localization via Large Language Model.pdf
│ │ │ │ ├── HandDiff 3D Hand Pose Estimation with Diffusion on Image-Point Cloud.pdf
│ │ │ │ ├── From Activation to Initialization Scaling Insights for Optimizing Neural Fields.pdf
│ │ │ │ ├── ViT-CoMer Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.pdf
│ │ │ │ ├── NRDF Neural Riemannian Distance Fields for Learning Articulated Pose Priors.pdf
│ │ │ │ ├── Unbiased Estimator for Distorted Conics in Camera Calibration.pdf
│ │ │ │ ├── Restoration by Generation with Constrained Priors.pdf
│ │ │ │ ├── From Activation to Initialization Scaling Insights for Optimizing Neural Fields.pdf
│ │ │ │ ├── Time-, Memory- and Parameter-Efficient Visual Adaptation.pdf
│ │ │ │ ├── FreeU Free Lunch in Diffusion U-Net.pdf
│ │ │ │ ├── EAGLE Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation.pdf
│ │ │ │ ├── Human Motion Prediction Under Unexpected Perturbation.pdf
│ │ │ │ ├── XCube Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies.pdf
│ │ │ │ ├── Relightable and Animatable Neural Avatar from Sparse-View Video.pdf
│ │ │ │ ├── Bilateral Event Mining and Complementary for Event Stream Super-Resolution.pdf
│ │ │ │ ├── Breathing Life Into Sketches Using Text-to-Video Priors.pdf
│ │ │ │ ├── Efficient Deformable ConvNets Rethinking Dynamic and Sparse Operator for Vision Applications.pdf
│ │ │ │ ├── Enhancing Video Super-Resolution via Implicit Resampling-based Alignment.pdf
│ │ │ │ ├── HOLD Category-agnostic 3D Reconstruction of Interacting Hands and Objects from Vide.pdf
│ │ │ │ ├── DreamPropeller Supercharge Text-to-3D Generation with Parallel Sampling.pdf
│ │ │ │ ├── Ranni Taming Text-to-Image Diffusion for Accurate Instruction Following.pdf
│ │ │ │ ├── Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes.pdf
│ │ │ │ ├── Specularity Factorization for Low-Light Enhancement.pdf
│ │ │ │ ├── Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis.pdf
│ │ │ │ ├── HashPoint Accelerated Point Searching and Sampling for Neural Rendering.pdf
│ │ │ │ ├── 3D Human Pose Perception from Egocentric Stereo Videos.pdf
│ │ │ │ ├── Tri-Modal Motion Retrieval by Learning a Joint Embedding Space.pdf
│ │ │ │ ├── Learning Structure-from-Motion with Graph Attention Networks.pdf
│ │ │ │ ├── Real-Time Simulated Avatar from Head-Mounted Sensors.pdf
│ │ │ │ ├── Frequency-Adaptive Dilated Convolution for Semantic Segmentation.pdf
│ │ │ │ ├── Move as You Say, Interact as You Can Language-guided Human Motion Generation with Scene Affordance.pdf
│ │ │ │ ├── FinePOSE Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models.pdf
│ │ │ │ ├── 4D-DRESS A 4D Dataset of Real-world Human Clothing with Semantic Annotations.pdf
│ │ │ │ ├── PhysGaussian Physics-Integrated 3D Gaussians for Generative Dynamics.pdf
│ │ │ │ ├── GAvatar Animatable 3D Gaussian Avatars with Implicit Mesh Learning.pdf
│ │ │ │ ├── Fantastic Animals and Where to Find Them Segment Any Marine Animal with Dual SAM.pdf
│ │ │ │ ├── General Object Foundation Model for Images and Videos at Scale.pdf
│ │ │ │ ├── FMA-Net Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring.pdf
│ │ │ │ ├── Objects as volumes A stochastic geometry view of opaque solids.pdf
│ │ │ │ ├── Point Transformer V3 Simpler, Faster, Stronger.pdf
│ │ │ │ ├── CFPL-FAS Class Free Prompt Learning for Generalizable Face Anti-spoofing.pdf
│ │ │ │ ├── Seeing the World through Your Eyes.pdf
│ │ │ │ ├── Multi-agent Long-term 3D Human Pose Forecasting via Interaction-aware Trajectory Conditioning.pdf
│ │ │ │ ├── Steerers A framework for rotation equivariant keypoint descriptors.pdf
│ │ │ │ ├── In-Context Matting.pdf
│ │ │ │ ├── In Search of a Data Transformation That Accelerates Neural Field Training.pdf
│ │ │ │ ├── Matching 2D Images in 3D Metric Relative Pose from Metric Correspondences.pdf
│ │ │ │ ├── Point2CAD Reverse Engineering CAD Models from 3D Point Clouds.pdf
│ │ │ │ ├── Putting the Object Back into Video Object Segmentation.pdf
│ │ │ │ ├── MMM Generative Masked Motion Model.pdf
│ │ │ │ ├── Tri-Perspective View Decomposition for Geometry-Aware Depth Completion.pdf
│ │ │ │ ├── CAT-Seg Cost Aggregation for Open-Vocabulary Semantic Segmentation.pdf
│ │ │ │ ├── Neural Redshift Random Networks are not Random Functions.pdf
│ │ │ │ ├── Deciphering 'What' and 'Where' Visual Pathways from Spectral Clustering of Layer-Distributed Neural Representations.pdf
│ │ │ │ ├── No Time to Train Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation.pdf
│ │ │ │ ├── LeGO Leveraging a Surface Deformation Network for Animatable Stylized Face Generation with One Example.pdf
│ │ │ │ ├── Attention-Propagation Network for Egocentric Heatmap to 3D.pdf
│ │ │ │ ├── CAD-SIGNet CAD Language Inference from Point Clouds using Layer-wise Sketch Instance Guided Attention.pdf
│ │ │ ├── 2 CVPR'24 最佳论文提名(更新完毕)
│ │ │ │ ├── 2 开源代码
│ │ │ │ │ ├── Marigold-main.zip
│ │ │ │ │ ├── egtr-main.zip
│ │ │ │ │ ├── pixelsplat-main.zip
│ │ │ │ │ ├── mip-splatting-main.zip
│ │ │ │ │ ├── lambda_vit-main mlp.zip
│ │ │ │ │ ├── Registration-CorrMLP-master.zip
│ │ │ │ │ ├── PlatoNeRF-main.zip
│ │ │ │ │ ├── NVlabs-edm2-main.zip
│ │ │ │ │ ├── MemSAM-main.zip
│ │ │ │ │ ├── PaSCo-main.zip
│ │ │ │ │ ├── MMMU-main.zip
│ │ │ │ │ ├── bioclip-main.zip
│ │ │ │ │ ├── MapUncertaintyPrediction-main.zip
│ │ │ │ │ ├── NeRF-HuGS-master.zip
│ │ │ │ │ ├── spider-match-main.zip
│ │ │ │ ├── 1 提名论文
│ │ │ │ │ ├── 19 EGTR:Extracting Graph from Transformer for Scene Graph Generation.pdf
│ │ │ │ │ ├── 12 Grounding and Enhancing Grid-based Models for Neural Fields.pdf
│ │ │ │ │ ├── 2 Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation.pdf
│ │ │ │ │ ├── 4 MMMU A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI.pdf
│ │ │ │ │ ├── 14 Mip-Splatting:Alias-free 3D Gaussian Splatting.pdf
│ │ │ │ │ ├── 11 BIOCLIP:A Vision Foundation Model for the Tree of Life.pdf
│ │ │ │ │ ├── 15 pixelSplat. 3D Gaussian Splats from lmage Pairs for Scalable Generalizable 3D Reconstruction.pdf
│ │ │ │ │ ├── 13 NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes.pdf
│ │ │ │ │ ├── 1 Objects as volumes: A stochastic geometry view of opaque solids.pdf
│ │ │ │ │ ├── 18 Analyzing and Improving the Training Dynamics of Diffusion Models.pdf
│ │ │ │ │ ├── 8 PlatoNeRF 3D Reconstruction in Plato’s Cave via Single-View Two-Bounce Lidar.pdf
│ │ │ │ │ ├── 16 MLPCanBeAGoodTransformer Learner.pdf
│ │ │ │ │ ├── 5 Correlation-aware Coarse-to-fine MLPs for Deformable Medical Image Registration.pdf
│ │ │ │ │ ├── 9 Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation.pdf
│ │ │ │ │ ├── 6 Producing and Leveraging Online Map Uncertainty in Trajectory Prediction.pdf
│ │ │ │ │ ├── 3 Comparing the Decision-Making Mechanisms by Transformers and CNNs.pdf
│ │ │ │ │ ├── 10 Rich Human Feedback for Text-to-Image Generation.pdf
│ │ │ │ │ ├── 17 Generative Image Dynamics.pdf
│ │ │ │ │ ├── 7 PaSCo:Urban 3D Panoptic Scene Completion with Uncertainty Awareness.pdf
│ │ ├── 50篇大型语言模型提示工程必读
│ │ │ ├── Prompting in Autoregressive Large Language.pdf
│ │ │ ├── Exploring Visual Prompts for Adapting Large-Scale Models.pdf
│ │ │ ├── Large Language Models Understand and Can Be Enhanced by Emotional Stimuli.pdf
│ │ │ ├── LPML LLM-PROMPTING MARKUP LANGUAGE FOR.pdf
│ │ │ ├── Chain-of-Thought Prompting Elicits Reasoning in Large Language Models.pdf
│ │ │ ├── Joint Prompt Optimization of Stacked LLMs.pdf
│ │ │ ├── Contrastive Chain-of-Thought Prompting.pdf
│ │ │ ├── TAKE A STEP BACK- EVOKING REASONING VIA ABSTRACTION IN LARGE LANGUAGE MODELS.pdf
│ │ │ ├── Reprompting Automated Chain-of-Thought Prompt.pdf
│ │ │ ├── Program of Thoughts Prompting- Disentangling Computation from Reasoning for Numerical Reasoning Tasks.pdf
│ │ │ ├── LARGE LANGUAGE MODELS AS TOOL MAKERS.pdf
│ │ │ ├── A Systematic Survey of Prompt Engineering in Large Language Models- Techniques and Applications.pdf
│ │ │ ├── Rephrase and Respond- Let Large Language Models Ask Better Questions for Themselves.pdf
│ │ │ ├── CHAIN-OF-NOTE- ENHANCING ROBUSTNESS IN RETRIEVAL-AUGMENTED LANGUAGE MODELS.pdf
│ │ │ ├── PROMPTBREEDER.pdf
│ │ │ ├── Prompt Engineering Through the Lens of Optimal.pdf
│ │ │ ├── Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf
│ │ │ ├── SELF-CONSISTENCY IMPROVES CHAIN OF THOUGHT REASONING IN LANGUAGE MODELS.pdf
│ │ │ ├── Prompting Is Programming A Query Language for.pdf
│ │ │ ├── Chain of Code- Reasoning with a Language Model-Augmented Code Emulator.pdf
│ │ │ ├── ART- Automatic multi-step reasoning and tool-use for large language models.pdf
│ │ │ ├── Visual ChatGPT- Talking, Drawing and Editing with Visual Foundation Models.pdf
│ │ │ ├── Structured Chain-of-Thought Prompting for Code Generation.pdf
│ │ │ ├── Unleashing the potential of prompt engineering in Large Language Models- a comprehensive review.pdf
│ │ │ ├── Active Prompting with Chain-of-Thought for Large Language Models.pdf
│ │ │ ├── CHAIN-OF-SYMBOL PROMPTING FOR SPATIAL RELATIONSHIPS IN LARGE LANGUAGE MODELS.pdf
│ │ │ ├── Language Models are Few-Shot Learners.pdf
│ │ │ ├── Thread of Thought Unraveling Chaotic Contexts.pdf
│ │ │ ├── Pre-train, Prompt, and Predict- A Systematic Survey of Prompting Methods in Natural Language Processing.pdf
│ │ │ ├── Chain of Code Reasoning with.pdf
│ │ │ ├── REAC T- SYNERGIZING REASONING AND ACTING IN LANGUAGE MODELS.pdf
│ │ │ ├── CHAIN-OF-VERIFICATION REDUCES HALLUCINATION IN LARGE LANGUAGE MODELS.pdf
│ │ │ ├── Large Language Model Guided Tree-of-Thought.pdf
│ │ │ ├── CHAIN-OF-KNOWLEDGE- GROUNDING LARGE LANGUAGE MODELS VIA DYNAMIC KNOWLEDGE ADAPTING OVER HETEROGENEOUS SOURCES.pdf
│ │ │ ├── System 2 Attention (is something you might need too).pdf
│ │ │ ├── UPAR A KANTIAN-INSPIRED PROMPTING FRAME.pdf
│ │ │ ├── A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.pdf
│ │ │ ├── CHAIN-OF-TABLE- EVOLVING TABLES IN THE REASONING CHAIN FOR TABLE UNDERSTANDING.pdf
│ │ │ ├── OlaGPT Empowering LLMs With Human-like Problem-Solving.pdf
│ │ │ ├── A Systematic Survey of Prompt Engineering in Large Language Models Techniques and Applications.pdf
│ │ │ ├── Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Large Language Models.pdf
│ │ │ ├── Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic.pdf
│ │ │ ├── Boosting Logical Reasoning in Large Language Models through a New.pdf
│ │ │ ├── SHOW YOUR WORK- SCRATCHPADS FOR INTERMEDIATE COMPUTATION WITH LANGUAGE MODELS.pdf
│ │ │ ├── IMPLICIT CHAIN OF THOUGHT REASONING.pdf
│ │ │ ├── Tree of Thoughts- Deliberate Problem Solving with Large Language Models.pdf
│ │ │ ├── A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models.pdf
│ │ │ ├── LARGE LANGUAGE MODELS ARE HUMAN-LEVEL PROMPT ENGINEERS.pdf
│ │ │ ├── AUTOMATIC CHAIN OF THOUGHT PROMPTING IN LARGE LANGUAGE MODELS.pdf
│ │ │ ├── LARGE LANGUAGE MODELS AS OPTIMIZERS.pdf
│ │ ├── ICLR 2024(更新中)
│ │ │ ├── The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation.pdf
│ │ │ ├── Memory Efficient Optimizers with 4-bit States.pdf
│ │ │ ├── Language Is Not All You Need:Aligning Perception with Language Models.pdf
│ │ │ ├── Is Your Code Generated by ChatGPT Really Correct Rigorous Evaluation of Large Language Models for Code Generation.pdf
│ │ │ ├── Fine-Tuning Language Models with Just Forward Passes.pdf
│ │ │ ├── Hierarchical Integration Diffusion Model for Realistic Image Deblurring.pdf
│ │ │ ├── Textually Pretrained Speech Language Models.pdf
│ │ │ ├── VisionLLM:Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks.pdf
│ │ │ ├── Cappy:Outperforming and Boosting Large Multi-Task LMs with a Small Scorer.pdf
│ │ │ ├── One-2-3-45:Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization.pdf
│ │ │ ├── Direct Preference Optimization:Your Language Model is Secretly a Reward Model.pdf
│ │ │ ├── SimMTM:A Simple Pre-Training Framework for Masked Time-Series Modeling.pdf
│ │ │ ├── ProPILE:Probing Privacy Leakage in Large Language Models.pdf
│ │ │ ├── SnapFusion:Text-to-Image Diffusion Model on Mobile Devices within Two Seconds.pdf
│ │ │ ├── Efficient Diffusion Policies for Offline Reinforcement Learning.pdf
│ │ │ ├── Focused Transformer:Contrastive Training for Context Scaling.pdf
│ │ │ ├── LayoutPrompter:Awaken the Design Ability of Large Language Models.pdf
│ │ │ ├── Segment Everything Everywhere All at Once.pdf
│ │ │ ├── RAPHAEL:Text-to-Image Generation via Large Mixture of Diffusion Paths.pdf
│ │ │ ├── Towards Revealing the Mystery behind Chain of Thought:a Theoretical Perspective.pdf
│ │ │ ├── Elastic Decision Transformer.pdf
│ │ │ ├── Training Transformers with 4-bit Integers.pdf
│ │ │ ├── In-Context Impersonation Reveals Large Language Models' Strengths and Biases.pdf
│ │ │ ├── DaTaSeg:Taming a Universal Multi-Dataset Multi-Task Segmentation Model.pdf
│ │ │ ├── How to Turn Your Knowledge Graph Embeddings into Generative Models.pdf
│ │ │ ├── EvoPrompting:Language Models for Code-Level Neural Architecture Search.pdf
│ │ │ ├── Learning to Tokenize for Generative Retrieval.pdf
│ │ │ ├── VanillaNet:the Power of Minimalism in Deep Learning.pdf
│ │ │ ├── Unlimiformer:Long-Range Transformers with Unlimited Length Input.pdf
│ │ │ ├── RRHF:Rank Responses to Align Language Models with Human Feedback without tears.pdf
│ │ │ ├── Language Models Meet World Models:Embodied Experiences Enhance Language Models.pdf
│ │ │ ├── Does Graph Distillation See Like Vision Dataset Counterpart.pdf
│ │ │ ├── Stable and low-precision training for large-scale vision-language models.pdf
│ │ │ ├── Towards Label Position Bias in Graph Neural Networks.pdf
│ │ │ ├── Guiding Large Language Models via Directional Stimulus Prompting.pdf
│ │ │ ├── Bridging Discrete and Backpropagation:Straight-Through and Beyond.pdf
│ │ │ ├── Memory-Efficient Fine-Tuning of Compressed Large Language Models via sub-4-bit Integer Quantization.pdf
│ │ │ ├── Foundation Model is Efficient Multimodal Multitask Model Selector.pdf
│ │ │ ├── Scaling Data-Constrained Language Models.pdf
│ │ │ ├── Differentiable Blocks World:Qualitative 3D Decomposition by Rendering Primitives.pdf
│ │ │ ├── MVDiffusion:Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion.pdf
│ │ │ ├── Chameleon:Plug-and-Play Compositional Reasoning with Large Language Models.pdf
│ │ │ ├── Vision-Flan:Scaling Human-Labeled Tasks in Visual Instruction Tuning.pdf
│ │ │ ├── MarioGPT:Open-Ended Text2Level Generation through Large Language Models.pdf
│ │ │ ├── Recommender Systems with Generative Retrieval.pdf
│ │ │ ├── AlpacaFarm:A Simulation Framework for Methods that Learn from Human Feedback.pdf
│ │ │ ├── Grammar Prompting for Domain-Specific Language Generation with Large Language Models.pdf
│ │ │ ├── QLoRA:Efficient Finetuning of Quantized LLMs.pdf
│ │ │ ├── Can Language Models Solve Graph Problems in Natural Language.pdf
│ │ │ ├── DPM-Solver-v3:Improved Diffusion ODE Solver with Empirical Model Statistics.pdf
│ │ │ ├── 3D-LLM:Injecting the 3D World into Large Language Models.pdf
│ │ │ ├── ToolkenGPT:Augmenting Frozen Language Models with Massive Tools via Tool Embeddings.pdf
│ │ │ ├── HuggingGPT:Solving AI Tasks with ChatGPT and its Friends in HuggingFace.pdf
│ │ │ ├── Sample-efficient Multi-objective Molecular Optimization with GFlowNets.pdf
│ │ │ ├── Tailoring Self-Attention for Graph via Rooted Subtrees.pdf
│ │ │ ├── SheetCopilot:Bringing Software Productivity to the Next Level through Large Language Models.pdf
│ │ │ ├── MotionGPT:Human Motion as a Foreign Language.pdf
│ │ │ ├── Fine-Grained Human Feedback Gives Better Rewards for Language Model Training.pdf
│ │ │ ├── Learning Large Graph Property Prediction via Graph Segment Training.pdf
│ │ │ ├── White-Box Transformers via Sparse Rate Reduction.pdf
│ │ │ ├── Meta In-Context Learning:Harnessing Large Language Models for Electrical Data Classification.pdf
│ │ │ ├── Deductive Verification of Chain-of-Thought Reasoning.pdf
│ │ │ ├── Fairness-guided Few-shot Prompting for Large Language Models.pdf
│ │ │ ├── No Train No Gain:Revisiting Efficient Training Algorithms For Transformer-based Language Models.pdf
│ │ │ ├── ImageReward:Learning and Evaluating Human Preferences for Text-to-Image Generation.pdf
│ │ │ ├── Are aligned neural networks adversarially aligned.pdf
│ │ │ ├── Convolutions Die Hard:Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP.pdf
│ │ │ ├── Large Language Models of Code Fail at Completing Code with Potential Bugs.pdf
│ │ │ ├── A Decomposable Causal View of Compositional Zero-Shot Learning.pdf
│ │ │ ├── HyenaDNA:Long-Range Genomic Sequence Modeling at Single Nucleotide Resolution.pdf
│ │ │ ├── Tree of Thoughts:Deliberate Problem Solving with Large Language Models.pdf
│ │ │ ├── LIMA:Less Is More for Alignment.pdf
│ │ │ ├── Improving CLIP Training with Language Rewrites.pdf
│ │ │ ├── Language models are weak learners.pdf
│ │ │ ├── Reverse Engineering Self-Supervised Learning.pdf
│ │ │ ├── ProlificDreamer:High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation.pdf
│ │ │ ├── Large Language Models as Commonsense Knowledge for Large-Scale Task Planning.pdf
│ │ │ ├── AR-Diffusion:Auto-Regressive Diffusion Model for Text Generation.pdf
│ │ │ ├── Reflexion:language agents with verbal reinforcement learning.pdf
│ │ │ ├── Symbolic Discovery of Optimization Algorithms.pdf
│ │ │ ├── Language Models Don't Always Say What They Think:Unfaithful Explanations in Chain-of-Thought Prompting.pdf
│ │ │ ├── InstructBLIP:Towards General-purpose Vision-Language Models with Instruction Tuning.pdf
│ │ │ ├── Cheap and Quick:Efficient Vision-Language Instruction Tuning for Large Language Models.pdf
│ │ │ ├── Inference-Time Intervention:Eliciting Truthful Answers from a Language Model.pdf
│ │ │ ├── DoReMi:Optimizing Data Mixtures Speeds Up Language Model Pretraining.pdf
│ │ │ ├── Toolformer:Language Models Can Teach Themselves to Use Tools.pdf
│ │ │ ├── Transformers learn through gradual rank increase.pdf
│ │ │ ├── Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning.pdf
│ │ │ ├── GPT4Tools:Teaching Large Language Model to Use Tools via Self-instruction.pdf
│ │ │ ├── STEVE-1:A Generative Model for Text-to-Behavior in Minecraft.pdf
│ │ │ ├── Self-Refine:Iterative Refinement with Self-Feedback.pdf
│ │ │ ├── Are Emergent Abilities of Large Language Models a Mirage.pdf
│ │ │ ├── Augmenting Language Models with Long-Term Memory.pdf
│ │ │ ├── UniControl:A Unified Diffusion Model for Controllable Visual Generation In the Wild.pdf
│ │ │ ├── DiffComplete:Diffusion-based Generative 3D Shape Completion.pdf
│ │ │ ├── Any-to-Any Generation via Composable Diffusion.pdf
│ │ │ ├── SANeRF-HQ:Segment Anything for NeRF in High Quality.pdf
│ │ │ ├── Voicebox:Text-Guided Multilingual Universal Speech Generation at Scale.pdf
│ │ │ ├── MEGABYTE:Predicting Million-byte Sequences with Multiscale Transformers.pdf
│ │ │ ├── VisorGPT:Learning Visual Prior via Generative Pre-Training.pdf
│ │ │ ├── Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition.pdf
│ │ │ ├── Simple and Controllable Music Generation.pdf
│ │ │ ├── Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models.pdf
│ │ │ ├── Flocks of Stochastic Parrots:Differentially Private Prompt Learning for Large Language Models.pdf
│ │ │ ├── SwiftSage:A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks.pdf
│ │ │ ├── EmbodiedGPT:Vision-Language Pre-Training via Embodied Chain of Thought.pdf
│ │ ├── 20篇llm必读
│ │ │ ├── AWQ Activation-aware Weight Quantization.pdf
│ │ │ ├── The Internal State of an LLM Knows When It’s Lying.pdf
│ │ │ ├── OpenAGI When LLM Meets Domain Experts.pdf
│ │ │ ├── X-LLM.pdf
│ │ │ ├── Wider and Deeper LLM Networks.pdf
│ │ │ ├── Judging LLM-as-a-Judge.pdf
│ │ │ ├── Jailbroken How Does LLM Safety Training Fail.pdf
│ │ │ ├── Can LLM Already Serve as A Database Interface.pdf
│ │ │ ├── LLM-grounded Diffusion Enhancing Prompt Understanding of.pdf
│ │ │ ├── Why Johnny Can’t Prompt.pdf
│ │ │ ├── NExT-GPT Any-to-Any Multimodal LLM.pdf
│ │ │ ├── Large Language Models are Few-shot Testers.pdf
│ │ │ ├── AutoGen Enabling Next-Gen LLM.pdf
│ │ │ ├── Song_LLM-Planner_Few-Shot_Grounded_Planning_for_Embodied_Agents_with_Large_Language_ICCV_2023_paper.pdf
│ │ │ ├── CHATEVAL TOWARDS BETTER LLM-BASED EVALUATORS THROUGH MULTI-AGENT DEBATE.pdf
│ │ │ ├── Large language models (LLM) and ChatGPT what will the impact.pdf
│ │ │ ├── LLM-Pruner On the Structural Pruning.pdf
│ │ │ ├── The RefinedWeb Dataset for Falcon LLM.pdf
│ │ │ ├── LLM-BL E N D E R Ensembling Large Language Models.pdf
│ │ │ ├── LLM-Adapters An Adapter Family for Parameter-Efficient Fine-Tuning of.pdf
│ │ ├── ICLR 2024
│ │ │ ├── 【时间检验奖】Auto-Encoding Variational Bayes.pdf
│ │ ├── AAAI 2024 111篇
│ │ │ ├── Parallel Ranking of Ads and Creative Services for Real-time.pdf
│ │ │ ├── AT4CTR Auxiliary Match Tasks for Enhancing Click-Through Rate Prediction.pdf
│ │ │ ├── Upper Bounding Barlow Twins:A Novel Filter for Multi-relational.pdf
│ │ │ ├── Non-Excludable Bilateral Trade Between Groups.pdf
│ │ │ ├── Identification of Causal Structure in the Presence of Missing Data with Additive.pdf
│ │ │ ├── Few-shot Part Segmentation Reveals Compositional Logic for Industrial.pdf
│ │ │ ├── Learning Human-like Representations to Enable Learning Human Values.pdf
│ │ │ ├── OVD-Explorer:Optimism Should Not Be the Sole Pursuit of Exploration.pdf
│ │ │ ├── Federated Learning with Extremely Noisy Clients via Negative Distillation.pdf
│ │ │ ├── EarthVQA:Towards Queryable Earth via Relational Reasoning-Based Remote.pdf
│ │ │ ├── MDGNN:Multi-Relational Dynamic Graph Neural Network for Comprehensive and Dynamic Stock Investment Prediction.pdf
│ │ │ ├── Towards Fairness in Online Service with k Servers and its Application.pdf
│ │ │ ├── Unified framework for diffusion generative models in SO(3).pdf
│ │ │ ├── Text2Analysis:A Benchmark of Table Question Answering with Advanced.pdf
│ │ │ ├── Spectral-based Graph Neutral Networks for Complementary Item.pdf
│ │ │ ├── ECHO-GL Earnings Calls-Driven Heterogeneous Graph Learning for Stock.pdf
│ │ │ ├── Point Cloud Part Editing:Segmentation, Generation, Assembly, and.pdf
│ │ │ ├── IS-DARTS:Stabilizing DARTS through Precise Measurement.pdf
│ │ │ ├── Robust Active Measuring under Model Uncertainty.pdf
│ │ │ ├── MASTER:Market-Guided Stock Transformer for Stock Price Forecasting.pdf
│ │ │ ├── Provably Convergent Federated Trilevel Learning.pdf
│ │ │ ├── Exploring Gradient Explosion in Generative Adversarial Imitation.pdf
│ │ │ ├── AE-NeRF:Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis.pdf
│ │ │ ├── Learning Fair Policies for Multi-stage Problem Solving from.pdf
│ │ │ ├── AI-Based Energy Transportation Safety:Pipeline Radial Threat.pdf
│ │ │ ├── EFFECT SIZE ESTIMATION FOR DURATION RECOMMENDATION.pdf
│ │ │ ├── When Model Meets New Normals:Test-Time Adaptation for Unsupervised.pdf
│ │ │ ├── Fluctuation-based Adaptive Structured Pruning for Large Language.pdf
│ │ │ ├── ContraNovo:A Contrastive Learning Approach to Enhance De Novo Peptide.pdf
│ │ │ ├── CR-SAM: Curvature Regularized Sharpness-aware Minimization.pdf
│ │ │ ├── HuTuMotion:Human-Tuned Motion of Latent Motion Diffusions with.pdf
│ │ │ ├── Enhancing Job Recommendation through.pdf
│ │ │ ├── H-ensemble: An Information Theoretic Approach to Reliable Few-Shot.pdf
│ │ │ ├── Temporally and Distributionally Robust Optimization for Cold-start.pdf
│ │ │ ├── Structure-CLIP:Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations.pdf
│ │ │ ├── Probabilistic Offline Policy Ranking with Approximate Bayesian.pdf
│ │ │ ├── Foreseeing Reconstruction Quality of Gradient Inversion.pdf
│ │ │ ├── Successive POI Recommendation via Brain-inspired Spatiotemporal Aware Representation.pdf
│ │ │ ├── No More Shortcuts:Realizing the Potential of Temporal Self-Supervision.pdf
│ │ │ ├── PPEA-Depth:Progressive Parameter-efficient Adaptation for.pdf
│ │ │ ├── FedDiv:Collaborative Noise Filtering for Federated Learning with Noisy Labels.pdf
│ │ │ ├── Cached Transformers:Improving Transformers with Differentiable Memory.pdf
│ │ │ ├── Market-GAN Adding Control to Financial Market Data Generation with.pdf
│ │ │ ├── CORECODE: A Common Sense Annotated Dialogue Dataset with Benchmark.pdf
│ │ │ ├── Uncertainty Quantification for Data-Driven Change-Point Learning via.pdf
│ │ │ ├── Regulating Intermediate 3D Features for Vision-Centric Autonomous.pdf
│ │ │ ├── Imitation of Life:A Search Engine for Biologically Inspired Design.pdf
│ │ │ ├── Blind-Touch:Homomorphic Encryption-Based Distributed Neural Network.pdf
│ │ │ ├── Domain Invariant Learning for Gaussian Processes and Bayesian.pdf
│ │ │ ├── Effectiveness of Constant Stepsize in Markovian LSA and Statistical.pdf
│ │ │ ├── On Partial Optimal Transport:Revising the Infeasibility of Sinkhorn.pdf
│ │ │ ├── Peer Learning Learning Complex Policies in Groups from Scratch via Action.pdf
│ │ │ ├── Identification of Causal Structure with Latent Variables Based on Higher Order Cumulants.pdf
│ │ │ ├── MmAP:Multi-modal Alignment Prompt for Cross-domain Multi-task Learning.pdf
│ │ │ ├── DataElixir:Purifying Poisoned Dataset to Mitigate Backdoor Attacks.pdf
│ │ │ ├── Estimation of individual causal effects in network setup for multiple.pdf
│ │ │ ├── VITA:Carefully Chosen and Weighted Less Is Better in Medication.pdf
│ │ │ ├── SeGA:Preference-Aware Self-Contrasting Learning with Prompts for.pdf
│ │ │ ├── Scalable Geometric Fracture Assembly via Co-creation Space among Assemblers.pdf
│ │ │ ├── Fine-Grained Knowledge Selection and Restoration for Non-exemplar.pdf
│ │ │ ├── Augmented Negative Sampling for Collaborative Filtering.pdf
│ │ │ ├── Chasing Fairness in Graphs: A GNN Architecture Perspective.pdf
│ │ │ ├── LGMRec Local and Global Graph Learning for Multimodal Recommendation.pdf
│ │ │ ├── Fine-tuning Graph Neural Networks by Preserving Graph Generative.pdf
│ │ │ ├── Hierarchical and Incremental Structural Entropy Minimization for Unsupervised Social Event Detection.pdf
│ │ │ ├── Coreference Graph Guidance for Mind-Map Generation.pdf
│ │ │ ├── Doubly Perturbed Task Free Continual Learning.pdf
│ │ │ ├── Explaining Reinforcement Learning Agents Through Counterfactual Action Outcomes.pdf
│ │ │ ├── Progressive Poisoned Data Isolation for Training-time Backdoor Attack.pdf
│ │ │ ├── COOPER: Coordinating Specialized Agents towards a Complex Dialogue Goal.pdf
│ │ │ ├── BadRL:Sparse Targeted Backdoor Attack Against Reinforcement Learning.pdf
│ │ │ ├── Learning Multimodal Volumetric Features for Large-Scale Neuron Tracing.pdf
│ │ │ ├── Curriculum Learning and Imitation Learning for Model-free Control on Financial Time-series.pdf
│ │ │ ├── An Attentive Inductive Bias for Sequential Recommendation.pdf
│ │ │ ├── Entropic Open-set Active Learning.pdf
│ │ │ ├── EarnHFT Efficient Hierarchical Reinforcement Learning for High Frequency Trading.pdf
│ │ │ ├── Distributional Off-Policy Evaluation for Slate Recommendations.pdf
│ │ │ ├── Robust Loss Functions for Training Decision Trees with Noisy Labels.pdf
│ │ │ ├── VITA ‘Carefully Chosen and Weighted Less’ Is Better.pdf
│ │ │ ├── Big Learning Expectation Maximization.pdf
│ │ │ ├── Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling.pdf
│ │ │ ├── Competition among Pairwise Lottery Contests.pdf
│ │ │ ├── Envy-free House Allocation under Uncertainty Preferences.pdf
│ │ │ ├── Learning Domain-Independent Heuristics for Grounded and Lifted Planning.pdf
│ │ │ ├── RadOcc:Learning Cross-Modality Occupancy Knowledge through Rendering.pdf
│ │ │ ├── Root Cause Explanation of Outliers under Noisy Mechanisms.pdf
│ │ │ ├── Exploring Large Language Model for Graph Data Understanding.pdf
│ │ │ ├── Q-SENN: Quantized Self-explaining Neural Networks.pdf
│ │ │ ├── Knowledge Graph Error Detection with Contrastive Confidence Adaption.pdf
│ │ │ ├── Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition.pdf
│ │ │ ├── STEM Unleashing the Power of Embeddings for Multi-task Recommendation.pdf
│ │ │ ├── Protect Your Score: Contact Tracing with Differential Privacy.pdf
│ │ │ ├── Inducing Point Operator Transformer:A Flexible and Scalable Architecture for Solving PDEs.pdf
│ │ │ ├── Weakly Supervised Open-Vocabulary Object Detection.pdf
│ │ │ ├── Cautiously-Optimistic Knowledge Sharing for Cooperative Multi-Agent Reinforcement Learning.pdf
│ │ │ ├── Ada-Ranker A Data Distribution Adaptive Ranking Paradigm.pdf
│ │ │ ├── Topic Shifts as a Proxy for Assessing Politicization in Social Media.pdf
│ │ │ ├── No prejudice! Fair Federated Graph Neural Networks for Personalized.pdf
│ │ │ ├── Fortify Your Defenses:Strategic Allocation to Enhance Defense Grid.pdf
│ │ │ ├── MESED: A Multi-modal Entity Set Expansion Dataset with Fine-grained Semantic Classes and Hard Negative Entities.pdf
│ │ │ ├── CI-STHPAN Pre-trained Attention Network for Stock Selection with Channel-Independent Spatio-Temporal Hypergraph.pdf
│ │ │ ├── Towards Efficient Verification of Quantized Neural Networks.pdf
│ │ │ ├── On the Role of Server Momentum in Federated Learning.pdf
│ │ │ ├── Roll With the Punches:Expansion and Shrinkage of Soft Label Selection.pdf
│ │ │ ├── Bi-directional Adapter for Multi-modal Tracking.pdf
│ │ │ ├── FontDiffuser: One-Shot Font Generation via Denoising Diffusion with.pdf
│ │ │ ├── Signed Graph Neural Ordinary Differential Equation for Modeling.pdf
│ │ │ ├── Continuous Time Graph Representation with Sequential Survival Process.pdf
│ │ │ ├── FontDiffuser:One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning.pdf
│ │ │ ├── Brush Your Text:Synthesize Any Scene Text on Images via Diffusion Model.pdf
│ │ │ ├── LAMM:Label Alignment for Multi-Modal Prompt Learning.pdf
│ │ ├── 大模型MoE必读论文
│ │ │ ├── 【直播课原文】Pushing Mixture of Experts to the Limit Extremely Parameter Efficient MoE for Instruction Tuning.pdf
│ │ ├── LISA:大模型微调40篇
│ │ │ ├── 在大型视觉语言模型中评估物体幻觉.pdf
│ │ │ ├── MiniGPT-v2:大型语言模型作为视觉语言多任务学习的统一接口.pdf
│ │ │ ├── SPHINX:多模态大型语言模型的权重、任务和视觉嵌入的联合混合.pdf
│ │ │ ├── 利用显式推理链和可视化问题生成推进大型多模态模型.pdf
│ │ │ ├── 睁大眼睛?探索多模态LLMs的视觉缺陷.pdf
│ │ │ ├── LLaMA-VID:在大型语言模型中,一个图像值 2 个令牌.pdf
│ │ │ ├── LST:用于参数和内存高效迁移学习的梯形图侧调.pdf
│ │ │ ├── VL-PET:通过粒度控制进行视觉和语言参数高效调整.pdf
│ │ │ ├── mPLUG-Owl2:通过模态协作彻底改变多模态大型语言模型.pdf
│ │ │ ├── CaMML:适用于大型模型的情境感知多模态学习器.pdf
│ │ │ ├── Ziya-Visual:通过多任务指令调优的双语大型视觉语言模型.pdf
│ │ │ ├── Qwen-VL:用于理解、定位、文本阅读等的多功能视觉语言模型.pdf
│ │ │ ├── Lyrics-通过语义感知视觉对象促进细粒度语言-视觉对齐和理解.pdf
│ │ │ ├── MMBench:你的多模态模型是一个全能的玩家吗?.pdf
│ │ │ ├── OtterHD:高分辨率多模态模型.pdf
│ │ │ ├── 通过视觉指令调整改进基线.pdf
│ │ │ ├── 可视化指令调优.pdf
│ │ │ ├── 对比视觉-语言对齐使教学成为学习者的高效.pdf
│ │ │ ├── MiniGPT-4:使用高级大型语言模型增强视觉语言理解.pdf
│ │ │ ├── SVIT:扩展可视化指令调优.pdf
│ │ │ ├── InfMLLM:可视化语言任务的统一框架.pdf
│ │ │ ├── ReForm-Eval:通过统一重新制定面向任务的基准来评估大型视觉语言模型.pdf
│ │ │ ├── InstructBLIP:通过指令调整实现通用视觉语言模型.pdf
│ │ │ ├── Compacter:高效的低秩超复杂适配器层.pdf
│ │ │ ├── Shikra:释放多模态LLM的参照对话魔力.pdf
│ │ │ ├── Genixer:将多模态大型语言模型赋能为强大的数据生成器提供支持.pdf
│ │ │ ├── 眼见为实:提示 GPT-4V 进行更好的视觉指令调整.pdf
│ │ │ ├── SEED-Bench:对多模态LLMs进行生成式理解的基准测试.pdf
│ │ │ ├── UniPT:具有高效参数和存储器的迁移学习通用并行调优.pdf
│ │ │ ├── LISA: Layerwise Importance Sampling for Memory-efficient Large Language Model Fine-Tuning.pdf
│ │ │ ├── GlitchBench:大型多模态模型可以检测视频游戏故障吗?.pdf
│ │ │ ├── Video-LLaVA:通过投影前的对齐来学习统一的视觉表示.pdf
│ │ │ ├── 视觉语言预训练模型的近似提示调整.pdf
│ │ │ ├── VL-ADAPTER:用于视觉和语言任务的参数高效迁移学习.pdf
│ │ │ ├── ShareGPT4V:使用更好的字幕改进大型多模态模型.pdf
│ │ │ ├── 关于多模态语言模型的性能.pdf
│ │ │ ├── Visual Instruction Tuning with Polite Flamingo.pdf
│ │ │ ├── MM-Vet:评估大型多模态模型的集成能力.pdf
│ │ │ ├── HyperPELT:针对语言和视觉与语言任务的统一参数高效语言模型调优.pdf
│ │ │ ├── DoRA- Weight-Decomposed Low-Rank Adaptation.pdf
│ │ ├── ECCV24 收录论文83篇(更新中)
│ │ │ ├── 推荐工作
│ │ │ │ ├── FontStudio Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation.pdf
│ │ │ │ ├── LEGO Learning EGOcentric Action FrameGeneration via Visual Instruction Tuning.pdf
│ │ │ │ ├── FSGS Real Time Few shot View Synthesis using Gaussian Splatting.pdf
│ │ │ │ ├── Glyph-ByT5 A Customized Text Encoder for Accurate Visual Text Rendering.pdf
│ │ │ │ ├── ZipLoRA Any Subject in Any Style by Effectively Merging LoRAs..pdf
│ │ │ │ ├── DreamScene360 Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting.pdf
│ │ │ │ ├── SwapAnything Enabling Arbitrary Object Swapping in Personalized Visual Editing.pdf
│ │ │ │ ├── DiffiT Diffusion Vision Transformers for Image Generation.pdf
│ │ │ ├── Contrastive Region Guidance:Improving Grounding in Vision-Language Models without Training.pdf
│ │ │ ├── MIPI 2024 Challenge on Demosaic for Hybridevs Camera: Methods and Results.pdf
│ │ │ ├── BLINK:Multimodal Large Language Models Can See but Not Perceive.pdf
│ │ │ ├── CityGaussian:Real-time High-quality Large-Scale Scene Rendering with Gaussians.pdf
│ │ │ ├── Align, Minimize and Diversify A Source-Free Unsupervised Domain Adaptation Method for Handwritten Text Recognition.pdf
│ │ │ ├── DATENeRF:Depth-Aware Text-based Editing of NeRFs.pdf
│ │ │ ├── Dyadic Interaction Modeling for Social Behavior Generation.pdf
│ │ │ ├── DragAnything:Motion Control for Anything.pdf
│ │ │ ├── GiT:Towards Generalist Vision Transformer through Universal Language Interface.pdf
│ │ │ ├── SuperGaussian:Repurposing Video Models for 3D Super Resolution.pdf
│ │ │ ├── EvAC3D From Event-based Apparent Contours to 3D Models via Continuous Visual Hulls.pdf
│ │ │ ├── GScream:Learning 3D Geometry and Feature Consistent Gaussian Splatting for Object Removal.pdf
│ │ │ ├── N2F2:Hierarchical Scene Understanding with Nested Neural Feature Fields.pdf
│ │ │ ├── Object-Centric Diffusion for Efficient Video Editing.pdf
│ │ │ ├── SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas.pdf
│ │ │ ├── Listen to Look into the Future:Audio-Visual Egocentric Gaze Anticipation.pdf
│ │ │ ├── MixDQ:Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization.pdf
│ │ │ ├── DreamMotion:Space-Time Self-Similarity Score Distillation for Zero-Shot Video Editing.pdf
│ │ │ ├── PEAVS:Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers' Opinion Scores.pdf
│ │ │ ├── FreeInit:Bridging Initialization Gap in Video Diffusion Models.pdf
│ │ │ ├── SpecFormer Guarding Vision Transformer Robustness via Maximum Singular Value Penalization.pdf
│ │ │ ├── Empowering 3D Visual Grounding with Reasoning Capabilities.pdf
│ │ │ ├── Introducing HOT3D:An Egocentric Dataset for 3D Hand and Object Tracking.pdf
│ │ │ ├── Rasterized Edge Gradients:Handling Discontinuities Differentiably.pdf
│ │ │ ├── A Task is Worth One Word:Learning with Task Prompts for High-Quality Versatile Image Inpainting.pdf
│ │ │ ├── An Image is Worth 1`2 Tokens After Layer 2:Plug and Play Inference Acceleration for Large Vision Language Models.pdf
│ │ │ ├── Neural Graphics Texture Compression Supporting Random Access.pdf
│ │ │ ├── LA3 Efficient Label-Aware AutoAugment.pdf
│ │ │ ├── Towards Multimodal Open-Set Domain Generalization and Adaptation through Self-supervision.pdf
│ │ │ ├── Learning Neural Volumetric Pose Features for Camera Localization.pdf
│ │ │ ├── UniDream:UnifyingDiffusionPriorsforRelightableText-to-3DGeneration.pdf
│ │ │ ├── Prompt Federated Learning for Weather Forecasting:Toward Foundation Models on Meteorological Data.pdf
│ │ │ ├── Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance.pdf
│ │ │ ├── DGInStyle:Domain Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control.pdf
│ │ │ ├── Robo-ABC:Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipulation.pdf
│ │ │ ├── Agent3D-Zero: An automatic agent leverages VLM for zero-shot 3D understanding.pdf
│ │ │ ├── Compact3D:Smaller and Faster Gaussian Splatting with Vector Quantization.pdf
│ │ │ ├── Pix2Gif:Motion-Guided Diffusion for GIF Generation.pdf
│ │ │ ├── TriNeRFLet:A Wavelet Based Multiscale Triplane NeRF Representation.pdf
│ │ │ ├── ClusteringSDF:Self-Organized Neural Implicit Surfaces for 3D Decomposition.pdf
│ │ │ ├── Map-free Visual Relocalization:Metric Pose Relative to a Single Image.pdf
│ │ │ ├── T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy.pdf
│ │ │ ├── MVSplat:Efficient 3D Gaussian Splatting from Sparse Multi-View Images.pdf
│ │ │ ├── Training Full Spike Neural Networks via Auxiliary Accumulation Pathway.pdf
│ │ │ ├── ScanTalk:3D Talking Heads from Unregistered Scans.pdf
│ │ │ ├── VITATECS:A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models.pdf
│ │ │ ├── DenseNets Reloaded:Paradigm Shift Beyond ResNets and ViTs.pdf
│ │ │ ├── HYPE:Hyperbolic Entailment Filtering for Underspecified Images and Texts.pdf
│ │ │ ├── Open-Vocabulary SAM:Segment and Recognize Twenty-thousand Classes Interactively.pdf
│ │ │ ├── Controllable Human-Object Interaction Synthesis.pdf
│ │ │ ├── DragAPart:Learning a Part-Level Motion Prior for Articulated Objects.pdf
│ │ │ ├── DragVideo:Interactive Drag-style Video Editing.pdf
│ │ │ ├── GalLoP:Learning Global and Local Prompts.pdf
│ │ │ ├── GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection.pdf
│ │ │ ├── CoLLaVO:Crayon Large Language and Vision mOdel.pdf
│ │ │ ├── WordRobe:Text-Guided Generation of Textured 3D Garments.pdf
│ │ │ ├── AdaDistill:Adaptive Knowledge Distillation for Deep Face Recognition.pdf
│ │ │ ├── AnyLens:A Generative Diffusion Model with Any Rendering Lens.pdf
│ │ │ ├── PointLLM:Empowering Large Language Models to Understand Point Clouds.pdf
│ │ │ ├── E.T. the Exceptional Trajectories:Text-to-camera-trajectory generation with character awareness.pdf
│ │ │ ├── DreamReward:Text-to-3D Generation with Human Preference.pdf
│ │ │ ├── Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning.pdf
│ │ │ ├── Mismatch Quest:Visual and Textual Feedback for Image-Text Misalignment.pdf
│ │ │ ├── Pyramid Diffusion for Fine 3D Large Scene Generation.pdf
│ │ │ ├── MoAI:Mixture of All Intelligence for Large Language and Vision Models.pdf
│ │ │ ├── NIGHT - Non-Line-of-Sight Imaging from Indirect Time of Flight Data.pdf
│ │ │ ├── Modality Translation for Object Detection Adaptation Without Forgetting Prior Knowledge.pdf
│ │ │ ├── Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation.pdf
│ │ │ ├── PaPr Training-Free One-Step Patch Pruning with Lightweight ConvNets for Faster Inference.pdf
│ │ │ ├── ZeST:Zero-Shot Material Transfer from a Single Image.pdf
│ │ │ ├── GVGEN:A text-to-GS generation framework with volumetric representation.pdf
│ │ │ ├── MotionLCM:Real-time Controllable Motion Generation via Latent Consistency Model.pdf
│ │ │ ├── ManiGaussian:Dynamic Gaussian Splatting for Multi-task Robotic Manipulation.pdf
│ │ │ ├── MOTIONDIRECTOR:MOTION CUSTOMIZATION OF TEXT-TO-VIDEO DIFFUSION MODELS.pdf
│ │ ├── Code Llama论文(5月最新+内含24篇)
│ │ │ ├── 2 LLaMA 1、2 论文&源码
│ │ │ │ ├── 源码:llama-main.zip
│ │ │ │ ├── LLaMA: Open and Efficient Foundation Language Models.pdf
│ │ │ │ ├── Llama 2:Open Foundation and Fine-Tuned Chat Models.pdf
│ │ │ ├── 3 Code Llama 其他相关论文
│ │ │ │ ├── TinyLlama:An Open-Source Small Language Model.pdf
│ │ │ │ ├── S3LLM: Large-Scale Scientific Software Understanding.pdf
│ │ │ │ ├── IS SELF-REPAIR A SILVER BULLET FOR CODE GENERATION.pdf
│ │ │ │ ├── MFTCODER: BOOSTING CODE LLMS WITH MULTITASK.pdf
│ │ │ │ ├── README++:Benchmarking Multilingual Language Models for Multi-Domain Readability Assessment.pdf
│ │ │ │ ├── Open-TransMind:A New Baseline and Benchmark for 1st Foundation Model.pdf
│ │ │ │ ├── Binary Code Summarization:Benchmarking ChatGPT、GPT-4 and Other Large Language Models.pdf
│ │ │ │ ├── LLAMA PRO:Progressive LLaMA with Block Expansion.pdf
│ │ │ │ ├── LLaMA-LoRA Neural Prompt Engineering.pdf
│ │ │ │ ├── Open-SQL Framework: Enhancing Text-to-SQL on Open-source Large.pdf
│ │ │ │ ├── A Comparative Analysis of Large Language Models for Code.pdf
│ │ │ │ ├── A Controlled Experiment on the Energy Efficiency of the Source Code Generated by Code Llama.pdf
│ │ │ │ ├── CRUXEval:A Benchmark for Code Reasoning.pdf
│ │ │ │ ├── LLaMA-Reviewer:Advancing Code Review Automation with Large Language Models through Parameter-Efficient Fine-Tuning.pdf
│ │ │ │ ├── LLaMA-Adapter:Efficient Fine-tuning of Language.pdf
│ │ │ │ ├── LLaMA-Adapter V2:Parameter-Efficient Visual Instruction Model.pdf
│ │ │ │ ├── Semantic Similarity Loss for Neural Source Code.pdf
│ │ │ │ ├── Granite Code Models:A Family of Open.pdf
│ │ │ │ ├── Evaluating In-Context Learning of Libraries for Code Generation.pdf
│ │ │ │ ├── Making Large Language Models A Better Foundation For Dense Retrieval.pdf
│ │ │ │ ├── DebugBench:Evaluating Debugging Capability of Large Language Models.pdf
│ │ │ ├── 1 Code Llama 论文&源码
│ │ │ │ ├── 源码:codellama-main.zip
│ │ │ │ ├── 论文:Code Llama:Open Foundation Models for Code.pdf
│ │ ├── ICML 2024 67篇
│ │ │ ├── ICML'23
│ │ │ │ ├── 看不见的概括,逻辑推理和学位课程.pdf
│ │ │ │ ├── 适应零和不完全信息博弈中的博弈树.pdf
│ │ │ │ ├── 大型语言模型的水印.pdf
│ │ │ │ ├── 像素递归神经网络.pdf
│ │ │ │ ├── 混淆梯度给人一种虚假的安全感:规避对抗性示例的防御.pdf
│ │ │ │ ├── D-Adaptation 的无学习率学习.pdf
│ │ │ │ ├── 异质性治疗效果的因果等渗校准.pdf
│ │ │ │ ├── 通过影响函数理解黑盒预测.pdf
│ │ │ │ ├── Beyond Hawkes:时空点过程的神经多事件预测.pdf
│ │ │ │ ├── 用于统一通用逼近的 Leaky-ReLU 神经网络的最小宽度.pdf
│ │ │ │ ├── 通过噪声到噪声映射从噪声 3D 点云中学习有符号距离函数.pdf
│ │ │ │ ├── 用于子集选择的可解释行列式选择模型.pdf
│ │ │ │ ├── 正交解耦高斯过程的球形诱导特征.pdf
│ │ │ ├── ICML'24 最佳论文+时间检验奖
│ │ │ │ ├── Scaling Rectified Flow Transformers for High-Resolution Image Synthesis.pdf
│ │ │ │ ├── Debating with More Persuasive LLMs Leads to More Truthful Answers.pdf
│ │ │ │ ├── Information Complexity of Stochastic Convex OptimizationP:Applications to Generalization, Memorization, and Tracing.pdf
│ │ │ │ ├── Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo.pdf
│ │ │ │ ├── Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution.pdf
│ │ │ │ ├── DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition.pdf
│ │ │ │ ├── VideoPoet:A Large Language Model for Zero-Shot Video Generation.pdf
│ │ │ │ ├── Stealing part of a production language model.pdf
│ │ │ │ ├── Genie:Generative Interactive Environments.pdf
│ │ │ │ ├── Considerations for Differentially Private Learning with Large-Scale Public Pretraining.pdf
│ │ │ │ ├── Position:Measure Dataset Diversity, Don't Just Claim It.pdf
│ │ │ ├── ICML'24 oral(更新中)
│ │ │ │ ├── Transformers Learn Nonlinear Features In Context:Nonconvex Mean-field Dynamics on the Attention Landscape.pdf
│ │ │ │ ├── HowPrivate are DP-SGD Implementations.pdf
│ │ │ │ ├── Monitoring AI-Modified Content at Scale:A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews.pdf
│ │ │ │ ├── Hybrid2 Neural ODE Causal Modeling and an Application to Glycemic Response.pdf
│ │ │ │ ├── GaLore:Memory-Efficient LLM Training by Gradient Low-Rank Projection.pdf
│ │ │ │ ├── PrE-Text:Training Language Models on Private Federated Data in the Age of LLMs.pdf
│ │ │ │ ├── FedMBridge:Bridgeable Multimodal Federated Learning.pdf
│ │ │ │ ├── Position:Open-Endedness is Essential for Artificial Superhuman Intelligence.pdf
│ │ │ │ ├── Less is More:on the Over-Globalizing Problem in Graph Transformers.pdf
│ │ │ │ ├── Evolution of Heuristics:Towards Efficient Automatic Algorithm Design Using Large Language Model.pdf
│ │ │ │ ├── Expressivity and Generalization:Fragment-Biases for Molecular GNNs.pdf
│ │ │ │ ├── Locality-Sensitive Hashing-Based Efficient Point Transformer with Applications in High-Energy Physics.pdf
│ │ │ │ ├── Stop Regressing:Training Value Functions via Classification for Scalable Deep RL.pdf
│ │ │ │ ├── Emergent Equivariance in Deep Ensembles.pdf
│ │ │ │ ├── Improving Transformers with Dynamically Composable Multi-Head Attention.pdf
│ │ │ │ ├── Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling.pdf
│ │ │ │ ├── SAPG:Split and Aggregate Policy Gradients.pdf
│ │ │ │ ├── Position:Automatic Environment Shaping is the Next Frontier in RL.pdf
│ │ │ │ ├── Multiplicative Weights Update, Area Convexity and Random Coordinate Descent for Densest Subgraph Problems.pdf
│ │ │ │ ├── Weak-to-Strong Generalization:Eliciting Strong Capabilities With Weak Supervision.pdf
│ │ │ │ ├── Discovering Environments with XRM.pdf
│ │ │ │ ├── Unified Training of Universal Time Series Forecasting Transformers.pdf
│ │ │ │ ├── A Dynamic Algorithm for Weighted Submodular Cover Problem.pdf
│ │ │ │ ├── Theoretical Analysis of Learned Database Operations under Distribution Shift through Distribution Learnability.pdf
│ │ │ │ ├── SceneCraft:An LLM Agent for Synthesizing 3D Scenes as Blender Code.pdf
│ │ │ │ ├── Doubly Robust Causal Effect Estimation under Networked Interference via Targeted Learning.pdf
│ │ │ │ ├── Robust CLIP:Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models.pdf
│ │ │ │ ├── Evaluation of LLMs on Syntax-Aware Code Fill-in-the-Middle Tasks.pdf
│ │ │ │ ├── Position:Technical Research and Talent is Needed for Effective AI Governance.pdf
│ │ │ │ ├── Position:Opportunities Exist for Machine Learning in Magnetic Fusion Energy.pdf
│ │ │ │ ├── Online Matching with Stochastic Rewards:Provable Better Bound via Adversarial Reinforcement Learning.pdf
│ │ │ │ ├── How do Large Language Models Navigate Conflicts between Honesty and Helpfulness.pdf
│ │ │ │ ├── Is DPO Superior to PPO for LLM Alignment A Comprehensive Study.pdf
│ │ │ │ ├── Trained Random Forests Completely Reveal your Dataset.pdf
│ │ │ │ ├── Rethinking Data Shapley for Data Selection Tasks:Misleads and Merits.pdf
│ │ │ │ ├── Inferring the Long-Term Causal Effects of Long-Term Treatments from Short-Term Experiments.pdf
│ │ │ │ ├── Fast Co-Training under Weak Dependence via Stream-Based Active Learning.pdf
│ │ │ │ ├── Learning Useful Representations of Recurrent Neural Network Weight Matrices.pdf
│ │ │ │ ├── Bottleneck-Minimal Indexing for Generative Document Retrieval.pdf
│ │ │ │ ├── I.O Complexity of Attention or How Optimal is FlashAttention.pdf
│ │ │ │ ├── ACE:Off-Policy Actor-Critic with Causality-Aware Entropy Regularization.pdf
│ │ │ │ ├── Position:Beyond Personhood:Agency, Accountability, and the Limits of Anthropomorphic Ethical Analysis.pdf
│ │ │ │ ├── LoRA Training in the NTK Regime has No Spurious Local Minima.pdf
│ │ ├── 100篇大模型必读论文
│ │ │ ├── Solving Quantitative Reasoning Problems with Language Models.pdf
│ │ │ ├── A ConvNet for the 2020s..pdf
│ │ │ ├── KERPLE Kernelized Relative Positional Embedding for Length Extrapolation.pdf
│ │ │ ├── Emergent Abilities of Large Language Models.pdf
│ │ │ ├── Red Teaming Language Models with Language Models.pdf
│ │ │ ├── GET3D A Generative Model of High Quality 3D Textured Shapes Learned from Images.pdf
│ │ │ ├── GLM-130B An Open Bilingual Pre-trained Model.pdf
│ │ │ ├── Compositional character models for open vocabulary word representation.pdf
│ │ │ ├── Efficient Estimation of Word Representation in Vector Space.pdf
│ │ │ ├── Beyond the Imitation Game Quantifying and extrapolating the capabilities of language models.pdf
│ │ │ ├── A Survey on Knowledge Graphs Representation, Acquisition, and Applications.pdf
│ │ │ ├── Evaluating Large Language Models Trained on Code.pdf
│ │ │ ├── Multi-Grained Vision Language Pre-Training Aligning Texts with Visual Concepts.pdf
│ │ │ ├── When Vision Transformers Outperform ResNets without Pre-training or Strong Data Augmentations.pdf
│ │ │ ├── OFA Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework..pdf
│ │ │ ├── COLD A Benchmark for Chinese Offensive Language Detection.pdf
│ │ │ ├── Language models generalize beyond natural proteins.pdf
│ │ │ ├── High-Resolution Image Synthesis with Latent Diffusion Models.pdf
│ │ │ ├── Fine-Tuning Language Models from Human Preferences.pdf
│ │ │ ├── Imagen Video High Definition Video Generation with Diffusion Models.pdf
│ │ │ ├── No Language Left Behind Scaling Human-Centered Machine Translation.pdf
│ │ │ ├── Zero-Shot Video Question Answering via Frozen Bidirectional Language Models.pdf
│ │ │ ├── Towards Efficient Post-training Quantization of Pre-trained Language Models.pdf
│ │ │ ├── Retrieval Augmented Generation for.pdf
│ │ │ ├── Reducing Activation Recomputation in Large Transformer Models.pdf
│ │ │ ├── GPT Understands, Too.pdf
│ │ │ ├── Transformer-Xl Attentive Language Models Beyond A Fixed-Length Context.pdf
│ │ │ ├── InstructPix2Pix Learning to Follow Image Editing Instructions.pdf
│ │ │ ├── PPT Pre-trained Prompt Tuning for Few-shot Learning.pdf
│ │ │ ├── Generating Training Data with Language Models Towards Zero-Shot Language Understanding.pdf
│ │ │ ├── SmoothQuant Accurate and Efficient Post-Training Quantization for Large Language Models.pdf
│ │ │ ├── Tensor Programs V Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer.pdf
│ │ │ ├── Hierarchical Text-Conditional Image Generation with CLIP Latents.pdf
│ │ │ ├── Knowledgeable Prompt-tuning Incorporating Knowledge into Prompt Verbalizer for Text Classification.pdf
│ │ │ ├── BLOOM A 176B-Parameter Open-Access Multilingual Language Model.pdf
│ │ │ ├── SGM Sequence Generation Model for Multi-label Classification.pdf
│ │ │ ├── Pre-train, Prompt, and Predict A Systematic Survey of Prompting Methods in Natural Language Processing.pdf
│ │ │ ├── Improving Language Models by Retrieving from Trillions of Tokens.pdf
│ │ │ ├── Learning Transferable Visual Models From Natural Language Supervision.pdf
│ │ │ ├── BaGuaLu targeting brain scale pretrained models with over 37 million cores.pdf
│ │ │ ├── Zero-Shot Text-to-Image Generation.pdf
│ │ │ ├── CogView Mastering Text-to-Image Generation via Transformers.pdf
│ │ │ ├── Training Language Models with Memory Augmentation.pdf
│ │ │ ├── Denoising Diffusion Implicit Models.pdf
│ │ │ ├── WebGPT Browser-assisted question-answering with human feedback.pdf
│ │ │ ├── Fine-mixing Mitigating Backdoors in Fine-tuned Language Models.pdf
│ │ │ ├── GPT-NeoX-20B An Open-Source Autoregressive Language Model.pdf
│ │ │ ├── Character-level Convolutional Networks for Text Classification.pdf
│ │ │ ├── Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners.pdf
│ │ │ ├── FastMoE A Fast Mixture-of-Expert Training System.pdf
│ │ │ ├── Autoformalization with Large Language Models.pdf
│ │ │ ├── Evolutionary-scale prediction of atomic level protein structure with a language model.pdf
│ │ │ ├── Score-Based Generative Modeling through Stochastic Differential Equations.pdf
│ │ │ ├── ERNIE 3.0 Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation.pdf
│ │ │ ├── Versatile Diffusion Text, Images and Variations All in One Diffusion Model.pdf
│ │ │ ├── Discrete mean estimates and the Landau-Siegel zero.pdf
│ │ │ ├── Training Compute-Optimal Large Language Models.pdf
│ │ │ ├── Video PreTraining (VPT) Learning to Act by Watching Unlabeled Online Videos.pdf
│ │ │ ├── UnifiedSKG Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models.pdf
│ │ │ ├── Foundation Transformers.pdf
│ │ │ ├── Chain-of-Thought Prompting Elicits Reasoning in Large Language Models.pdf
│ │ │ ├── PAL Program-aided Language Models.pdf
│ │ │ ├── GLM General Language Model Pretraining with Autoregressive Blank Infilling.pdf
│ │ │ ├── Training language models to follow instructions with human feedback.pdf
│ │ │ ├── Colossal-AI A Unified Deep Learning System For Large-Scale Parallel Training.pdf
│ │ │ ├── Galactica A Large Language Model for Science.pdf
│ │ │ ├── Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval.pdf
│ │ │ ├── PaLM Scaling Language Modeling with Pathways.pdf
│ │ │ ├── OPT Open Pre-trained Transformer Language Models.pdf
│ │ │ ├── Few-shot Learning with Multilingual Language Models.pdf
│ │ │ ├── UL2 Unifying Language Learning Paradigms.pdf
│ │ │ ├── Prompt-and-Rerank A Method for Zero-Shot and Few-Shot Arbitrary Textual Style Transfer with Small Language Models.pdf
│ │ │ ├── InternImage Exploring Large-Scale Vision Foundation Models with Deformable Convolutions.pdf
│ │ │ ├── Sequence to Sequence Learning with Neural Networks.pdf
│ │ │ ├── AltCLIP Altering the Language Encoder in CLIP for Extended Language Capabilities.pdf
│ │ │ ├── Convolutional Neural Network for Sentence Classification.pdf
│ │ │ ├── Character-Aware Neural Language Models.pdf
│ │ │ ├── Holistic Evaluation of Language Models.pdf
│ │ │ ├── CPM A large-scale generative Chinese Pre-trained language model.pdf
│ │ │ ├── Language Models are Few-Shot Learners.pdf
│ │ │ ├── DiffusionDet Diffusion Model for Object Detection.pdf
│ │ │ ├── Improving language understanding by generative pre training.pdf
│ │ │ ├── DeepSpeed Data Efficiency Improving Deep Learning Model Quality and Training Efficiency via Efficient Data Sampling and Routing.pdf
│ │ │ ├── PaLI A Jointly-Scaled Multilingual Language-Image Model.pdf
│ │ │ ├── Language Models are Unsupervised Multitask Learners.pdf
│ │ │ ├── Git Re-Basin Merging Models modulo Permutation Symmetries.pdf
│ │ │ ├── How Much Knowledge Can You Pack Into the Parameters of a Language Model.pdf
│ │ │ ├── BLIP Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation..pdf
│ │ │ ├── Muse Text-To-Image Generation via Masked Generative Transformers.pdf
│ │ │ ├── The Stability-Efficiency Dilemma Investigating Sequence Length Warmup for Training GPT Models.pdf
│ │ │ ├── Masked Autoencoders Are Scalable Vision Learners.pdf
│ │ │ ├── A Survey on In-context Learning.pdf
│ │ │ ├── An Image is Worth 16x16 Words Transformers for Image Recognition at Scale.pdf
│ │ │ ├── Learning to summarize from human feedback.pdf
│ │ │ ├── ERNIE 3.0 Titan Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation.pdf
│ │ │ ├── Language Models as Knowledge Bases.pdf
│ │ │ ├── CodeGen An Open Large Language Model for Code with Multi-Turn Program Synthesis.pdf
│ │ │ ├── LAION-5B An open large-scale dataset for training next generation image-text models.pdf
│ │ │ ├── Generating Sequences With Recurrent Neural Networks.pdf
│ │ │ ├── Language Models as Zero-Shot Planners Extracting Actionable Knowledge for Embodied Agents.pdf
│ │ │ ├── Vision-Language Pre-Training with Triple Contrastive Learning.pdf
│ │ │ ├── 01必读.jpg
│ │ ├── EMNLP 19篇
│ │ │ ├── 自然语言生成的主动学习.pdf
│ │ │ ├── 通过概念化来解释嵌入空间.pdf
│ │ │ ├── IMTLab:用于构建、评估和诊断交互式机器翻译系统的开源平台.pdf
│ │ │ ├── 驾驭灰色地带:不确定性和过度自信的表达如何影响语言模型.pdf
│ │ │ ├── RAPL:一种用于少样本文档级关系提取的关系感知原型学习方法.pdf
│ │ │ ├── 重新审视机器翻译的跨语言分类.pdf
│ │ │ ├── 视觉、机器人技术及其他领域的语言基础.pdf
│ │ │ ├── 通过对NLP领域学术写作的对比分析来解决语言偏见.pdf
│ │ │ ├── 了解模型压缩对大型语言模型中社会偏见的影响.pdf
│ │ │ ├── 凝聚力:生成文本连贯性的增量与整体评估的新基准.pdf
│ │ │ ├── 用语言模型进行推理就是用世界模型进行规划.pdf
│ │ │ ├── 使用大型语言模型进行可解释的心理健康分析.pdf
│ │ │ ├── TopWORDS-Poetry:基于贝叶斯推理的中国古典诗歌同步文本分割和单词发现.pdf
│ │ │ ├── 学习用于多模态失语症类型检测的共同语音手势.pdf
│ │ │ ├── 具有 Wasserstein 独立性的公平文本分类.pdf
│ │ │ ├── ROBBIE:大型生成语言模型的鲁棒偏差评估.pdf
│ │ │ ├── 大型语言模型可以自我改进.pdf
│ │ │ ├── SODA:具有社会常识语境化的百万级对话提炼.pdf
│ │ │ ├── 混合倒挂索引是用于密集检索的鲁棒加速器.pdf
│ │ ├── CVPR 2024 (持续更新)
│ │ │ ├── 1 CVPR'24 获奖论文
│ │ │ │ ├── 4 最佳学生论文次优奖
│ │ │ │ │ ├── Objects as volumes: A stochastic geometry view of opaque solids.pdf
│ │ │ │ │ ├── Comparing the Decision-Making Mechanisms by Transformers and CNNs.pdf
│ │ │ │ ├── 3 最佳论文次优奖
│ │ │ │ │ ├── pixelSplat. 3D Gaussian Splats from lmage Pairs for Scalable Generalizable 3D Reconstruction.pdf
│ │ │ │ ├── 2 最佳学生论文奖
│ │ │ │ │ ├── Mip-Splatting:Alias-free 3D Gaussian Splatting.pdf
│ │ │ │ │ ├── BIOCLIP:A Vision Foundation Model for the Tree of Life.pdf
│ │ │ │ ├── 1 最佳论文奖
│ │ │ │ │ ├── Generative Image Dynamics.pdf
│ │ │ │ │ ├── Rich Human Feedback for Text-to-Image Generation.pdf
│ │ │ ├── 3 CVPR'24 oral论文(更新完毕)
│ │ │ │ ├── 10 自主导航和自我中心视觉
│ │ │ │ │ ├── EgoGen:An Egocentric Synthetic Data Generator.pdf
│ │ │ │ │ ├── SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection.pdf
│ │ │ │ │ ├── UnO:Unsupervised Occupancy Fields for Perception and Forecasting.pdf
│ │ │ │ ├── 15 低样本学习、自监督学习和半监督学习
│ │ │ │ │ ├── Improving Semantic Correspondence with Viewpoint-Guided Spherical Maps.pdf
│ │ │ │ │ ├── CroSel.pdf
│ │ │ │ │ ├── LTGC:Long-tail Recognition via Leveraging LLMs-driven Generated Content.pdf
│ │ │ │ ├── 13 数据集和评估
│ │ │ │ │ ├── 360+x:A Panoptic Multi-modal Scene Understanding Dataset.pdf
│ │ │ │ │ ├── Deep Generative Model based Rate-Distortion for Image Downscaling Assessment.pdf
│ │ │ │ │ ├── Ego-Exo4D:Understanding Skilled Human Activity from First- and Third-Person Perspectives.pdf
│ │ │ │ ├── 12 动作和运动分析
│ │ │ │ │ ├── An N-Point Linear Solver for Line and Motion Estimation with Event Cameras.pdf
│ │ │ │ │ ├── Modeling Multimodal Social Interactions:New Challenges and Baselines with Densely Aligned Representations.pdf
│ │ │ │ │ ├── FineParser:A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality Assessment.pdf
│ │ │ │ │ ├── RoHM:Robust Human Motion Reconstruction via Diffusio.pdf
│ │ │ │ ├── 5 深度学习架构与技术
│ │ │ │ │ ├── Learning Structure-from-Motion with Graph Attention Networks.pdf
│ │ │ │ │ ├── Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks.pdf
│ │ │ │ │ ├── In Search of a Data Transformation That Accelerates Neural Field Training.pdf
│ │ │ │ │ ├── Neural Lineage.pdf
│ │ │ │ │ ├── Neural Redshift:Random Networks are not Random Functions.pdf
│ │ │ │ ├── 7 单视角三维技术
│ │ │ │ │ ├── WALT3D:Generating Realistic Training Data from Time-Lapse Imagery for Reconstructing Dynamic Objects Under Occlusion.pdf
│ │ │ │ │ ├── EscherNet:A Generative Model for Scalable View Synthesis.pdf
│ │ │ │ │ ├── Rethinking Inductive Biases for Surface Normal Estimation.pdf
│ │ │ │ ├── 17 图像与视频合成 2
│ │ │ │ │ ├── Visual Anagrams:Generating Multi-View Optical Illusions with Diffusion Models.pdf
│ │ │ │ │ ├── Alchemist:Parametric Control of Material Properties with Diffusion Models.pdf
│ │ │ │ │ ├── MonoHair:High-Fidelity Hair Modeling from a Monocular Video.pdf
│ │ │ │ ├── 1 低层次视觉
│ │ │ │ │ ├── Towards Robust Event-guided Low-Light Image Enhancement.pdf
│ │ │ │ │ ├── Bilateral Event Mining and Complementary for Event Stream Super-Resolution.pdf
│ │ │ │ │ ├── Specularity Factorization for Low-Light Enhancement.pdf
│ │ │ │ │ ├── FMA-Net:Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring.pdf
│ │ │ │ │ ├── FlowIE:Efficient Image Enhancement via Rectified Flow.pdf
│ │ │ │ ├── 4 图像与视频合成
│ │ │ │ │ ├── FreeU:Free Lunch in Diffusion U-Net.pdf
│ │ │ │ │ ├── Attention Calibration for Disentangled Text-to-Image Personalization.pdf
│ │ │ │ │ ├── Instruct-Imagen: Image Generation with Multi-modal Instruction.pdf
│ │ │ │ │ ├── Ranni:Taming Text-to-Image Diffusion for Accurate Instruction Following.pdf
│ │ │ │ │ ├── Style Aligned Image Generation via Shared Attention.pdf
│ │ │ │ ├── 18 多模态学习
│ │ │ │ │ ├── NoiseCLR:A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions in Diffusion Models.pdf
│ │ │ │ │ ├── InternVL:Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks.pdf
│ │ │ │ │ ├── MetaCloak.pdf
│ │ │ │ │ ├── Describing Differences in Image Sets with Natural Language.pdf
│ │ │ │ ├── 6 多视角三维技术和传感器
│ │ │ │ │ ├── Point Transformer V3:Simpler Faster Stronger.pdf
│ │ │ │ │ ├── Steerers:A Framework for Rotation Equivariant Keypoint Descriptors.pdf
│ │ │ │ │ ├── Tri-Perspective View Decomposition for Geometry-Aware Depth Completion.pdf
│ │ │ │ │ ├── Seeing the World through Your Eyes.pdf
│ │ │ │ │ ├── Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences.pdf
│ │ │ │ ├── 14 多视角三维技术和传感器 2
│ │ │ │ │ ├── Learning to Produce Semi-dense Correspondences for Visual Localization.pdf
│ │ │ │ ├── 3 人类行为和特征
│ │ │ │ │ ├── Semantic Human Mesh Reconstruction with Textures.pdf
│ │ │ │ │ ├── Stratified Avatar Generation from Sparse Observations.pdf
│ │ │ │ │ ├── MultiPly:Reconstruction of Multiple People from Monocular Video in the Wild.pdf
│ │ │ │ │ ├── Relightable Gaussian Codec Avatars.pdf
│ │ │ │ │ ├── URHand:Universal Relightable Hands.pdf
│ │ │ │ ├── 16 低层次视觉与遥感
│ │ │ │ │ ├── DART:Implicit Doppler Tomography for Radar Novel View Synthesis.pdf
│ │ │ │ │ ├── LDP: Language-driven Dual-Pixel Image Defocus Deblurring Network.pdf
│ │ │ │ ├── 8 视觉、语言与推理
│ │ │ │ │ ├── Eyes Wide Shut Exploring the Visual Shortcomings of Multimodal LLMs.pdf
│ │ │ │ │ ├── Visual Program Distillation:Distilling Tools and Programmatic Reasoning into Vision-Language Models.pdf
│ │ │ │ │ ├── LISA:Reasoning Segmentation via Large Language Model.pdf
│ │ │ │ ├── 9 医学与物理视觉
│ │ │ │ │ ├── Transcriptomics-guided Slide Representation Learning in Computational Pathology.pdf
│ │ │ │ ├── 11三维视觉
│ │ │ │ │ ├── A Subspace-Constrained Tyler’s Estimator and its Applications to Structure from Motion.pdf
│ │ │ │ ├── 2 视觉与图形
│ │ │ │ │ ├── Eclipse:Disambiguating Illumination and Materials using Unintended Shadows.pdf
│ │ │ │ │ ├── Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation.pdf
│ │ │ │ │ ├── DiffusionLight:Light Probes for Free by Painting a Chrome Ball.pdf
│ │ │ ├── 4 CVPR'24 highlight论文(更新中)
│ │ │ │ ├── Learning Structure-from-Motion with Graph Attention Networks.pdf
│ │ │ │ ├── CFPL-FAS Class Free Prompt Learning for Generalizable Face Anti-spoofing.pdf
│ │ │ │ ├── Efficient Deformable ConvNets Rethinking Dynamic and Sparse Operator for Vision Applications.pdf
│ │ │ │ ├── Human Motion Prediction Under Unexpected Perturbation.pdf
│ │ │ │ ├── XCube Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies.pdf
│ │ │ │ ├── Boosting Neural Representations for Videos with a Conditional Decoder.pdf
│ │ │ │ ├── Deciphering 'What' and 'Where' Visual Pathways from Spectral Clustering of Layer-Distributed Neural Representations.pdf
│ │ │ │ ├── Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation.pdf
│ │ │ │ ├── ODIN A Single Model for 2D and 3D Segmentation.pdf
│ │ │ │ ├── LucidDreamer Towards High-Fidelity Text-to-3D Generation via Interval Score Matching.pdf
│ │ │ │ ├── Ranni Taming Text-to-Image Diffusion for Accurate Instruction Following.pdf
│ │ │ │ ├── Point2CAD Reverse Engineering CAD Models from 3D Point Clouds.pdf
│ │ │ │ ├── ViT-CoMer Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.pdf
│ │ │ │ ├── Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation.pdf
│ │ │ │ ├── Multi-agent Long-term 3D Human Pose Forecasting via Interaction-aware Trajectory Conditioning.pdf
│ │ │ │ ├── FinePOSE Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models.pdf
│ │ │ │ ├── HOLD Category-agnostic 3D Reconstruction of Interacting Hands and Objects from Vide.pdf
│ │ │ │ ├── Enhancing Video Super-Resolution via Implicit Resampling-based Alignment.pdf
│ │ │ │ ├── Relightable and Animatable Neural Avatar from Sparse-View Video.pdf
│ │ │ │ ├── In Search of a Data Transformation That Accelerates Neural Field Training.pdf
│ │ │ │ ├── FMA-Net Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring.pdf
│ │ │ │ ├── LocLLM Exploiting Generalizable Human Keypoint Localization via Large Language Model.pdf
│ │ │ │ ├── DreamPropeller Supercharge Text-to-3D Generation with Parallel Sampling.pdf
│ │ │ │ ├── Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes.pdf
│ │ │ │ ├── Breathing Life Into Sketches Using Text-to-Video Priors.pdf
│ │ │ │ ├── In-Context Matting.pdf
│ │ │ │ ├── From Correspondences to Pose Non-minimal Certifiably Optimal Relative Pose without Disambiguation.pdf
│ │ │ │ ├── Neural Redshift Random Networks are not Random Functions.pdf
│ │ │ │ ├── 3D Human Pose Perception from Egocentric Stereo Videos.pdf
│ │ │ │ ├── pix2gestalt Amodal Segmentation by Synthesizing Wholes.pdf
│ │ │ │ ├── Frequency-Adaptive Dilated Convolution for Semantic Segmentation.pdf
│ │ │ │ ├── HandDiff 3D Hand Pose Estimation with Diffusion on Image-Point Cloud.pdf
│ │ │ │ ├── RAVE Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models.pdf
│ │ │ │ ├── 4D-DRESS A 4D Dataset of Real-world Human Clothing with Semantic Annotations.pdf
│ │ │ │ ├── Bilateral Event Mining and Complementary for Event Stream Super-Resolution.pdf
│ │ │ │ ├── Real-Time Simulated Avatar from Head-Mounted Sensors.pdf
│ │ │ │ ├── Tri-Modal Motion Retrieval by Learning a Joint Embedding Space.pdf
│ │ │ │ ├── DiffusionLight Light Probes for Free by Painting a Chrome Ball.pdf
│ │ │ │ ├── From Activation to Initialization Scaling Insights for Optimizing Neural Fields.pdf
│ │ │ │ ├── FreeU Free Lunch in Diffusion U-Net.pdf
│ │ │ │ ├── MMM Generative Masked Motion Model.pdf
│ │ │ │ ├── Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis.pdf
│ │ │ │ ├── Attention-Propagation Network for Egocentric Heatmap to 3D.pdf
│ │ │ │ ├── GraCo Granularity-Controllable Interactive Segmentation.pdf
│ │ │ │ ├── No Time to Train Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation.pdf
│ │ │ │ ├── HashPoint Accelerated Point Searching and Sampling for Neural Rendering.pdf
│ │ │ │ ├── CAD-SIGNet CAD Language Inference from Point Clouds using Layer-wise Sketch Instance Guided Attention.pdf
│ │ │ │ ├── Tri-Perspective View Decomposition for Geometry-Aware Depth Completion.pdf
│ │ │ │ ├── Move as You Say, Interact as You Can Language-guided Human Motion Generation with Scene Affordance.pdf
│ │ │ │ ├── Seeing the World through Your Eyes.pdf
│ │ │ │ ├── Enforcing Geometric and Physical Priors.pdf
│ │ │ │ ├── CAT-Seg Cost Aggregation for Open-Vocabulary Semantic Segmentation.pdf
│ │ │ │ ├── Suppress and Rebalance Towards Generalized Multi-Modal Face Anti-Spoofing.pdf
│ │ │ │ ├── Unbiased Estimator for Distorted Conics in Camera Calibration.pdf
│ │ │ │ ├── Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation.pdf
│ │ │ │ ├── 3D Face Reconstruction with the Geometric Guidance of Facial Part Segmentation.pdf
│ │ │ │ ├── Scaling Up Dynamic Human-Scene Interaction Modeling.pdf
│ │ │ │ ├── General Object Foundation Model for Images and Videos at Scale.pdf
│ │ │ │ ├── Putting the Object Back into Video Object Segmentation.pdf
│ │ │ │ ├── Time-, Memory- and Parameter-Efficient Visual Adaptation.pdf
│ │ │ │ ├── Towards Robust Event-guided Low-Light Image Enhancement A Large-Scale Real-World Event-Image Dataset and Novel Approach.pdf
│ │ │ │ ├── GAvatar Animatable 3D Gaussian Avatars with Implicit Mesh Learning.pdf
│ │ │ │ ├── EAGLE Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation.pdf
│ │ │ │ ├── Point Transformer V3 Simpler, Faster, Stronger.pdf
│ │ │ │ ├── CADTalk An Algorithm and Benchmark for Semantic Commenting of CAD Programs.pdf
│ │ │ │ ├── Steerers A framework for rotation equivariant keypoint descriptors.pdf
│ │ │ │ ├── PhysGaussian Physics-Integrated 3D Gaussians for Generative Dynamics.pdf
│ │ │ │ ├── Specularity Factorization for Low-Light Enhancement.pdf
│ │ │ │ ├── Objects as volumes A stochastic geometry view of opaque solids.pdf
│ │ │ │ ├── LeGO Leveraging a Surface Deformation Network for Animatable Stylized Face Generation with One Example.pdf
│ │ │ │ ├── Semantic-aware SAM for Point-Prompted Instance Segmentation.pdf
│ │ │ │ ├── Restoration by Generation with Constrained Priors.pdf
│ │ │ │ ├── Multi-view Aggregation Network for Dichotomous Image Segmentation.pdf
│ │ │ │ ├── Fantastic Animals and Where to Find Them Segment Any Marine Animal with Dual SAM.pdf
│ │ │ │ ├── From Activation to Initialization Scaling Insights for Optimizing Neural Fields.pdf
│ │ │ │ ├── Self-Supervised Dual Contouring.pdf
│ │ │ │ ├── NRDF Neural Riemannian Distance Fields for Learning Articulated Pose Priors.pdf
│ │ │ │ ├── Matching 2D Images in 3D Metric Relative Pose from Metric Correspondences.pdf
│ │ │ │ ├── Eclipse Disambiguating Illumination and Materials using Unintended Shadows.pdf
│ │ │ ├── 2 CVPR'24 最佳论文提名(更新完毕)
│ │ │ │ ├── 2 开源代码
│ │ │ │ │ ├── spider-match-main.zip
│ │ │ │ │ ├── PlatoNeRF-main.zip
│ │ │ │ │ ├── Registration-CorrMLP-master.zip
│ │ │ │ │ ├── pixelsplat-main.zip
│ │ │ │ │ ├── PaSCo-main.zip
│ │ │ │ │ ├── NVlabs-edm2-main.zip
│ │ │ │ │ ├── NeRF-HuGS-master.zip
│ │ │ │ │ ├── MMMU-main.zip
│ │ │ │ │ ├── Marigold-main.zip
│ │ │ │ │ ├── MemSAM-main.zip
│ │ │ │ │ ├── mip-splatting-main.zip
│ │ │ │ │ ├── lambda_vit-main mlp.zip
│ │ │ │ │ ├── MapUncertaintyPrediction-main.zip
│ │ │ │ │ ├── egtr-main.zip
│ │ │ │ │ ├── bioclip-main.zip
│ │ │ │ ├── 1 提名论文
│ │ │ │ │ ├── 9 Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation.pdf
│ │ │ │ │ ├── 8 PlatoNeRF 3D Reconstruction in Plato’s Cave via Single-View Two-Bounce Lidar.pdf
│ │ │ │ │ ├── 6 Producing and Leveraging Online Map Uncertainty in Trajectory Prediction.pdf
│ │ │ │ │ ├── 7 PaSCo:Urban 3D Panoptic Scene Completion with Uncertainty Awareness.pdf
│ │ │ │ │ ├── 5 Correlation-aware Coarse-to-fine MLPs for Deformable Medical Image Registration.pdf
│ │ │ │ │ ├── 4 MMMU A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI.pdf
│ │ │ │ │ ├── 3 Comparing the Decision-Making Mechanisms by Transformers and CNNs.pdf
│ │ │ │ │ ├── 2 Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation.pdf
│ │ │ │ │ ├── 19 EGTR:Extracting Graph from Transformer for Scene Graph Generation.pdf
│ │ │ │ │ ├── 18 Analyzing and Improving the Training Dynamics of Diffusion Models.pdf
│ │ │ │ │ ├── 17 Generative Image Dynamics.pdf
│ │ │ │ │ ├── 16 MLPCanBeAGoodTransformer Learner.pdf
│ │ │ │ │ ├── 14 Mip-Splatting:Alias-free 3D Gaussian Splatting.pdf
│ │ │ │ │ ├── 15 pixelSplat. 3D Gaussian Splats from lmage Pairs for Scalable Generalizable 3D Reconstruction.pdf
│ │ │ │ │ ├── 13 NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes.pdf
│ │ │ │ │ ├── 12 Grounding and Enhancing Grid-based Models for Neural Fields.pdf
│ │ │ │ │ ├── 11 BIOCLIP:A Vision Foundation Model for the Tree of Life.pdf
│ │ │ │ │ ├── 10 Rich Human Feedback for Text-to-Image Generation.pdf
│ │ │ │ │ ├── 1 Objects as volumes: A stochastic geometry view of opaque solids.pdf
│ ├── 小黄搞AI大模型面试目录
│ │ ├── 小黄搞AI_大模型面试100问(PDF更新至90).pdf
│ │ ├── 小黄搞AI_大模型面试100问(PDF更新至74).pdf
│ │ ├── 小黄搞AI_大模型面试100问(PDF更新至107).pdf
│ ├── 大模型书籍
│ │ ├── Mastering Transformers_ Build state-of-the-art models from -- .pdf
│ │ ├── 预训练语言模型 2021 (邵浩 刘一烽) .pdf
│ │ ├── BERT基础教程:Transformer大模型实战 (苏达哈尔桑·拉维昌迪兰) .azw3
│ │ ├── 自然语言处理:基于预训练模型的方法_2021.pdf
│ │ ├── 精通Transformer:从零开始构建最先进的NLP模型_2023.epub
│ │ ├── Mastering NLP from Foundations to LLMs_ Apply advanced.pdf
│ │ ├── Building LLM Apps Create Intelligent Apps and Agents with Large Language Models_2024 .pdf
│ │ ├── 大规模语言模型:从理论到实践_2023.pdf
│ │ ├── 大语言模型_2024.pdf
│ │ ├── HuggingFace自然语言处理详解:基于BERT中文模型的任务实战.epub
│ │ ├── 大语言模型:基础与前沿_2024.epub
│ │ ├── 面向开发者的 LLM 入门课.pdf
│ │ ├── Transformer, BERT, and GPT:Including ChatGPT and Prompt Engineering_2024.pdf
│ │ ├── 大语言模型:基础与前沿_2024.pdf
│ │ ├── Transformers for Natural Language Processing Build, train, and fine-tune deep neural network architectures for NLP with... (--).pdf
│ │ ├── 扩散模型从原理到实战.epub
│ │ ├── Natural Language Processing with Transformers Building Language Applications with Hugging Face.pdf
│ │ ├── 中国人工智能系列白皮书——大模型技术(2023 版).pdf
│ │ ├── Transformers in Action (MEAP v7) _2024 .pdf
│ │ ├── Transformers生成式AI实用指南(提前发售 GPT双语) _2023 .epub
│ │ ├── 自然语言处理:原理、方法与应用.zip
│ │ ├── HuggingFace自然语言处理详解:基于BERT中文模型的任务实战.pdf
│ │ ├── Mastering Large Language Models Advanced techniques, applications, cutting-edge methods, and top LLMs_2024 .pdf
│ │ ├── 自然语言处理导论 2023 张奇.pdf
│ │ ├── Modern Generative AI with ChatGPT and OpenAI Models.pdf
│ │ ├── Generative AI with LangChain_ Build large language model.pdf
│ │ ├── BERT基础教程:Transformer大模型实战_2023.zip
│ │ ├── 精通Transformer:从零开始构建最先进的NLP模型_2023.pdf
│ │ ├── Getting Started with Google BERT_ Build and train .pdf
│ │ ├── 自然语言处理:原理、方法与应用 2023 (王志立 雷鹏斌 吴宇凡) .epub
│ │ ├── LLM Prompt Engineering For Developers The Art and Science of Unlocking LLMs True Potential_2024 .epub
│ │ ├── Mastering Large Language Models Advanced techniques, applications, cutting-edge methods, and top LLMs_2024 .epub
│ │ ├── Transformer自然语言处理实战:使用Hugging-Face-Transformers库构建NLP应用_2024.pdf
│ ├── 面试八股文
│ │ ├── 大模型校招面试题.pdf
│ │ ├── LLMs大模型面试问题和答案(97).pdf
│ │ ├── 大模型常见面试题及解答1.pdf
│ │ ├── 大模型 LLM 最全八股和答案.pdf
│ │ ├── AI大模型面试题(102).pdf
│ │ ├── 大模型岗位面试全纪录.pdf
│ │ ├── 大模型常考面试题总结(含答案).pdf
│ │ ├── 大模型常见面试题及解答2.pdf
│ │ ├── 大模型LLMS.pdf
│ │ ├── 从零开始大模型开发与微调基于PyTorch与ChatGLM.pdf
│ │ ├── 大模型常见面试题3.pdf
│ │ ├── 大模型落地应用案例集.pdf
│ ├── 大模型面试题
│ │ ├── 大模型(LLMs)参数高效微调(PEFT)面
│ │ │ ├── 适配器微调(Adapter-tuning)篇.pdf
│ │ │ ├── LoRA篇.pdf
│ │ │ ├── 参数高效微调篇PRFT.pdf
│ │ │ ├── 提示学习(Prompting)篇.pdf
│ │ ├── 大模型(LLMs)langchain面
│ │ │ ├── 基于LLM+向量库的文档对话经验面.pdf
│ │ │ ├── 大模型(LLMs)langchain面.pdf
│ │ ├── 31-LLM-Interview-Plus
│ │ │ ├── 大模型(LLMs)推理加速篇.pdf
│ │ │ ├── 大模型(LLMs)Tokenizer篇.pdf
│ │ │ ├── 多模态常见面试题.pdf
│ │ │ ├── 大模型校招面试题.pdf
│ │ │ ├── 大模型(LLMs)面试题答案Plus.pdf
│ │ │ ├── 大模型(LLMs)蒸馏面.pdf
│ │ │ ├── 大模型(LLMs)幻觉面.pdf
│ │ │ ├── 大模型(LLMs)分布式训练面.pdf
│ │ │ ├── 大模型(LLMs)显存问题面.pdf
│ │ │ ├── 大模型 RAG 检索增强生成面.pdf
│ │ │ ├── 大模型(LLMs)增量预训练篇.pdf
│ │ ├── 大模型(LLMs)强化学习—— PPO 面.pdf
│ │ ├── 大模型(LLMs)基础面.pdf
│ │ ├── 大模型(LLMs)强化学习——RLHF及其变种面.pdf
│ │ ├── 大模型(LLMs)训练集面.pdf
│ │ ├── 大模型(LLMs)进阶面.pdf
│ │ ├── 大模型(LLMs)评测面.pdf
│ │ ├── 大模型(LLMs)agent 面.pdf
│ │ ├── 大模型(LLMs)推理面.pdf
│ │ ├── 大模型(LLMs)幻觉面.pdf
│ │ ├── 大模型(LLMs)微调面.pdf
声明:本站所有文章,如无特殊说明或标注,均为本站原创发布。任何个人或组织,在未征得本站同意时,禁止复制、盗用、采集、发布本站内容到任何网站、书籍等各类媒体平台。如若本站内容侵犯了原著者的合法权益,可联系我们进行处理。