奇变偶不变
Categories
5 categories × 71 posts × 15 tags × 313882 characters
Coding Practice
2 posts
2024-02-27
[LeetCode] 204. Count Primes
2022-10-21
[LeetCode] 172. Factorial Trailing Zeroes
NLP
7 posts
2024-12-17
[arXiv-2024] Phi-4 Technical Report
2024-08-01
The Llama 3 Herd of Models
2024-03-25
[arXiv-2023] One Shot Learning as Instruction Data Prospector for Large Language Models
2024-03-19
[ICLR-2024] GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
2024-03-04
LLM-Attack
2024-02-12
Extracting Triples from Text with spaCy / OpenIE
2022-10-24
Hugging Face Usage Guide
CS_Miscellaneous
22 posts
2025-03-23
Linux Server Configuration
2024-11-03
Jinja Template in tokenizer
2024-09-26
Writing Perfect Papers
2024-05-19
Llama Factory Script
2024-05-08
LLM Generate Scripts
2024-05-01
Loguru: Python Module
2024-05-01
Rich: Python Module
2024-04-24
time_script.py
2024-04-21
Using Baidu Netdisk with bypy
2024-03-13
Downloading Google Drive Files/Folders with gdown
2024-02-03
Checking GPU Usage on a Remote Server with Python
2024-02-03
hf-mirror
2024-02-02
ChatGLM3-6B: `Error: No such option: -- deepspeed` During SFT
2024-01-30
yt-dlp Usage Guide
2024-01-23
tmux Commands
2024-01-16
Configuring APOC on Neo4j Desktop and Saving the Current Knowledge Graph as a .cypher File
2023-01-24
Writing Pseudocode in LaTeX
2023-01-05
Markdown Image Hosting (Gitee) - Deprecated
2023-01-05
Solving Image Hosting with JSD + GitHub (with PicGo + Typora)
2022-12-22
Configuring a Git Proxy
2022-12-12
Literature Review Guide
2022-10-30
Sending Email with Python
Paper Reading
39 posts
2025-05-08
[arXiv-2025] ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning
2025-03-28
[arXiv-2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
2025-03-19
[arXiv-2025] Kimi k1.5: Scaling Reinforcement Learning with LLMs
2025-03-19
[arXiv-2025] Self-Training Elicits Concise Reasoning in Large Language Models
2025-03-18
[arXiv-2025] CoT-Valve: Length-Compressible Chain-of-Thought Tuning
2025-03-18
[arXiv-2025] O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning
2025-03-01
[arXiv-2025] From System 1 to System 2: A Survey of Reasoning Large Language Models
2025-02-28
[arXiv-2025] DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
2025-01-15
[arXiv-2024] Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models
2025-01-04
[arXiv-2024] TestBench: Evaluating Class-Level Test Case Generation Capability of Large Language Models
2024-12-22
[arXiv-2024] Evaluating and Aligning CodeLLMs on Human Preference
2024-12-18
[arXiv-2024] ExecRepoBench: Multi-level Executable Code Completion Evaluation
2024-12-16
[FSE-2024] No More Manual Tests? Evaluating and Improving ChatGPT for Unit Test Generation
2024-12-14
[FSE-2024] ChatUniTest: A Framework for LLM-Based Test Generation
2024-12-14
[EMNLP-2021] Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning
2024-12-13
[ASE-2024] On the Evaluation of Large Language Models in Unit Test Generation
2024-12-11
[EMNLP-2024] Pretraining Data Detection for Large Language Models: A Divergence-based Calibration Method
2024-09-13
[arXiv-2023] Instruction-Following Evaluation for Large Language Models
2024-09-11
[arXiv-2024] Many-Shot In-Context Learning
2024-09-04
[arXiv-2024] Scaling and evaluating sparse autoencoders
2024-07-25
[arXiv-2024] OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
2024-07-05
[ACL-2024] Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning
2024-05-05
[NeurIPS-2023] CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion
2024-04-21
[ICLR-2024] What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning
2024-04-10
[arXiv-2023] Instruction Mining: When Data Mining Meets Large Language Model Finetuning
2024-04-09
[NeurIPS-2023] Reflexion: Language Agents with Verbal Reinforcement Learning
2024-04-08
[EMNLP-2023] Large Language Models Can Self-Improve
2024-04-07
[NAACL-2023] From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning
2024-04-07
[ICLR-2023] Copy Is All You Need
2024-04-03
[arXiv-2024] Reliable, Adaptable, and Attributable Language Models with Retrieval
2024-04-01
[arXiv-2024] Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation
2024-04-01
[NeurIPS-2023] LIMA: Less Is More for Alignment
2024-03-19
[NAACL-2024] A Wolf in Sheep's Clothing: Generalized Nested Jailbreak Prompts can Fool Large Language Models Easily
2024-03-14
[arXiv-2023] R-Tuning: Teaching Large Language Models to Refuse Unknown Questions
2024-03-06
[arXiv-2024] Universal and Transferable Adversarial Attacks on Aligned Language Models
2024-03-05
[arXiv-2024] DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers
2024-02-29
[AAAI-2021] Automated Storytelling via Causal, Commonsense Plot Ordering
2024-02-12
[CIKM-2020] Creative Storytelling with Language Models and Knowledge Graphs
2024-02-01
[CAIN-2024] Seven Failure Points When Engineering a Retrieval Augmented Generation System
MATH
1 post
2024-01-27
Deriving the Circumference Formula (Personal Notes)
Geaming
NLP laborer