诸神缄默不语-个人CSDN博文目录
最近更新日期:2023.6.7
最早更新日期:2023.6.7
文章目录
- 1. 通用大规模预训练语言模型
- 2. 对话模型
- 3. 分句
1. 通用大规模预训练语言模型
英语:
- LegalBERT
- 原始论文:(2020 EMNLP) LEGAL-BERT: The Muppets straight out of Law School - ACL Anthology
- 下载地址:huggingface
- CaseLaw-BERT
- 原始论文:(2021 ICAIL) When does pretraining help?: assessing self-supervised learning for law and the CaseHOLD dataset of 53,000+ legal holdings
- PolBERT
- 原始论文:(2022 NeurIPS) Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset
- legal-longformer
- 下载地址:https://huggingface.co/saibo/legal-longformer-base-4096
- LegalLAMA
- 原始论文:(2023 ACL) LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development
- (印度) InLegalBERT
- 原始论文:(2023 ICAIL) Pre-trained Language Models for the Legal Domain: A Case Study on Indian Law
- 下载地址:https://huggingface.co/law-ai/InLegalBERT
中文:
- Lawformer
意大利语:
- ITALIAN-LEGAL-BERT
- 原始论文:(2022) ITALIAN-LEGAL-BERT: A Pre-trained Transformer Language Model for Italian Law
- 下载地址:https://huggingface.co/dlicari/Italian-Legal-BERT
罗马尼亚语:
- jurBERT
- 原始论文:(2021 NLLP) jurBERT: A Romanian BERT Model for Legal Judgement Prediction
西班牙语:
- RoBERTalex
- 原始论文:(2021) Spanish Legalese Language Model and Corpora
- 下载地址:PlanTL-GOB-ES/RoBERTalex · Hugging Face
多语言:
- LegalXLMs
- 原始论文:(2023) MultiLegalPile: A 689GB Multilingual Legal Corpus
- 下载地址:太多了,待补
2. 对话模型
中文:
- Lawyer LLaMA
AndrewZhe/lawyer-llama: 中文法律LLaMA- 原始论文:(2023) Lawyer LLaMA Technical Report
- 下载地址:需要申请,我已经申了(https://wj.qq.com/s2/12427321/f54f/)
英文:
- LawGPT 1.0
啥也没给,有一种无图言屌的感觉。- 原始论文:A Brief Report on LawGPT 1.0: A Virtual Legal Assistant Based on GPT-3
3. 分句
多语言:
- https://huggingface.co/models?search=rcds/distilbert-sbd(英语、西班牙语、德语、意大利语、葡萄牙语、法语)
- 原始论文:(2023 ICAIL) MultiLegalSBD: A Multilingual Legal Sentence Boundary Detection Dataset