stereoplegic 's Collections Multilingual
updated
Dissecting In-Context Learning of Translations in GPTs
Paper
• 2310.15987
• Published
• 6
Monolingual or Multilingual Instruction Tuning: Which Makes a Better
Alpaca
Paper
• 2309.08958
• Published
• 2
X-LLM: Bootstrapping Advanced Large Language Models by Treating
Multi-Modalities as Foreign Languages
Paper
• 2305.04160
• Published
• 2
Ziya-VL: Bilingual Large Vision-Language Model via Multi-Task
Instruction Tuning
Paper
• 2310.08166
• Published
• 1
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large
Language Models in 167 Languages
Paper
• 2309.09400
• Published
• 87
A Paradigm Shift in Machine Translation: Boosting Translation
Performance of Large Language Models
Paper
• 2309.11674
• Published
• 33
Towards Effective Disambiguation for Machine Translation with Large
Language Models
Paper
• 2309.11668
• Published
• 1
Improving Translation Faithfulness of Large Language Models via
Augmenting Instructions
Paper
• 2308.12674
• Published
• 1
Empowering Cross-lingual Abilities of Instruction-tuned Large Language
Models by Translation-following demonstrations
Paper
• 2308.14186
• Published
• 1
Extrapolating Large Language Models to Non-English by Aligning Languages
Paper
• 2308.04948
• Published
• 1
Neural Machine Translation Models Can Learn to be Few-shot Learners
Paper
• 2309.08590
• Published
• 1
LongBench: A Bilingual, Multitask Benchmark for Long Context
Understanding
Paper
• 2308.14508
• Published
• 2
Multilingual Machine Translation with Large Language Models: Empirical
Results and Analysis
Paper
• 2304.04675
• Published
• 1
Baichuan 2: Open Large-scale Language Models
Paper
• 2309.10305
• Published
• 22
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Paper
• 2303.03915
• Published
• 7
Efficient Finetuning Large Language Models For Vietnamese Chatbot
Paper
• 2309.04646
• Published
• 2
Paper
• 2309.16609
• Published
• 38
OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model
Pre-trained from Scratch
Paper
• 2309.10706
• Published
• 17
Sequence-to-Sequence Spanish Pre-trained Language Models
Paper
• 2309.11259
• Published
• 1
Snowman: A Million-scale Chinese Commonsense Knowledge Graph Distilled
from Foundation Model
Paper
• 2306.10241
• Published
• 1
Zero-Shot Cross-Lingual Summarization via Large Language Models
Paper
• 2302.14229
• Published
• 1
Towards Unifying Multi-Lingual and Cross-Lingual Summarization
Paper
• 2305.09220
• Published
• 2
Distilling Efficient Language-Specific Models for Cross-Lingual Transfer
Paper
• 2306.01709
• Published
• 1
Parameter-Efficient Neural Reranking for Cross-Lingual and Multilingual
Retrieval
Paper
• 2204.02292
• Published
• 1
Composable Sparse Fine-Tuning for Cross-Lingual Transfer
Paper
• 2110.07560
• Published
• 2
Comparison between parameter-efficient techniques and full fine-tuning:
A case study on multilingual news article classification
Paper
• 2308.07282
• Published
• 1
A General-Purpose Multilingual Document Encoder
Paper
• 2305.07016
• Published
• 1
Transfer to a Low-Resource Language via Close Relatives: The Case Study
on Faroese
Paper
• 2304.08823
• Published
• 1
Massively Multilingual Lexical Specialization of Multilingual
Transformers
Paper
• 2208.01018
• Published
• 1
Improved Cross-Lingual Transfer Learning For Automatic Speech
Translation
Paper
• 2306.00789
• Published
• 1
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual
Speech Representation
Paper
• 2205.08180
• Published
• 1
Bootstrapping Multilingual AMR with Contextual Word Alignments
Paper
• 2102.02189
• Published
• 1
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
Paper
• 2212.09535
• Published
• 1
A Family of Pretrained Transformer Language Models for Russian
Paper
• 2309.10931
• Published
• 6
Are Large Language Model-based Evaluators the Solution to Scaling Up
Multilingual Evaluation?
Paper
• 2309.07462
• Published
• 4
MADLAD-400: A Multilingual And Document-Level Large Audited Dataset
Paper
• 2309.04662
• Published
• 25
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open
Generative Large Language Models
Paper
• 2308.16149
• Published
• 29
Steering Large Language Models for Machine Translation with Finetuning
and In-Context Learning
Paper
• 2310.13448
• Published
• 1
Terminology-Aware Translation with Constrained Decoding and Large
Language Model Prompting
Paper
• 2310.05824
• Published
• 1
GEMBA-MQM: Detecting Translation Quality Error Spans with GPT-4
Paper
• 2310.13988
• Published
• 1
SCALE: Synergized Collaboration of Asymmetric Language Translation
Engines
Paper
• 2309.17061
• Published
• 1
Fast Lexically Constrained Decoding with Dynamic Beam Allocation for
Neural Machine Translation
Paper
• 1804.06609
• Published
• 1
Skywork: A More Open Bilingual Foundation Model
Paper
• 2310.19341
• Published
• 6
Chinese Fine-Grained Financial Sentiment Analysis with Large Language
Models
Paper
• 2306.14096
• Published
• 1
Bactrian-X : A Multilingual Replicable Instruction-Following Model with
Low-Rank Adaptation
Paper
• 2305.15011
• Published
• 1
InstructAlign: High-and-Low Resource Language Alignment via Continual
Crosslingual Instruction Tuning
Paper
• 2305.13627
• Published
• 1
Not All Languages Are Created Equal in LLMs: Improving Multilingual
Capability by Cross-Lingual-Thought Prompting
Paper
• 2305.07004
• Published
• 1
M^3IT: A Large-Scale Dataset towards Multi-Modal Multilingual
Instruction Tuning
Paper
• 2306.04387
• Published
• 9
Prompt-Tuning Can Be Much Better Than Fine-Tuning on Cross-lingual
Understanding With Multilingual Language Models
Paper
• 2210.12360
• Published
• 1
GrammarGPT: Exploring Open-Source LLMs for Native Chinese Grammatical
Error Correction with Supervised Fine-Tuning
Paper
• 2307.13923
• Published
• 1
Evaluating GPT-3.5 and GPT-4 on Grammatical Error Correction for
Brazilian Portuguese
Paper
• 2306.15788
• Published
• 2
Are Pre-trained Language Models Useful for Model Ensemble in Chinese
Grammatical Error Correction?
Paper
• 2305.15183
• Published
• 1
CFGPT: Chinese Financial Assistant with Large Language Model
Paper
• 2309.10654
• Published
• 1
Chinese Open Instruction Generalist: A Preliminary Release
Paper
• 2304.07987
• Published
• 2
Okapi: Instruction-tuned Large Language Models in Multiple Languages
with Reinforcement Learning from Human Feedback
Paper
• 2307.16039
• Published
• 4
NusaWrites: Constructing High-Quality Corpora for Underrepresented and
Extremely Low-Resource Languages
Paper
• 2309.10661
• Published
• 1
AutoMoE: Heterogeneous Mixture-of-Experts with Adaptive Computation for
Efficient Neural Machine Translation
Paper
• 2210.07535
• Published
• 1
Only 5\% Attention Is All You Need: Efficient Long-range Document-level
Neural Machine Translation
Paper
• 2309.14174
• Published
• 1
Semantics-aware Attention Improves Neural Machine Translation
Paper
• 2110.06920
• Published
• 1
CodeApex: A Bilingual Programming Evaluation Benchmark for Large
Language Models
Paper
• 2309.01940
• Published
• 2
mLongT5: A Multilingual and Efficient Text-To-Text Transformer for
Longer Sequences
Paper
• 2305.11129
• Published
• 2
The Vault: A Comprehensive Multilingual Dataset for Advancing Code
Understanding and Generation
Paper
• 2305.06156
• Published
• 2
Constructing Multilingual Code Search Dataset Using Neural Machine
Translation
Paper
• 2306.15604
• Published
• 1
Neural Machine Translation for Code Generation
Paper
• 2305.13504
• Published
• 1
On Learning Meaningful Code Changes via Neural Machine Translation
Paper
• 1901.09102
• Published
• 1
CCT-Code: Cross-Consistency Training for Multilingual Clone Detection
and Code Search
Paper
• 2305.11626
• Published
• 1
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for
Programming Languages
Paper
• 2212.06742
• Published
• 3
Simultaneous Machine Translation with Large Language Models
Paper
• 2309.06706
• Published
• 1
New Trends in Machine Translation using Large Language Models: Case
Examples with ChatGPT
Paper
• 2305.01181
• Published
• 1
Language-Routing Mixture of Experts for Multilingual and Code-Switching
Speech Recognition
Paper
• 2307.05956
• Published
• 1
Distributional Data Augmentation Methods for Low Resource Language
Paper
• 2309.04862
• Published
• 1
Direct Neural Machine Translation with Task-level Mixture of Experts
models
Paper
• 2310.12236
• Published
• 3
CONFLATOR: Incorporating Switching Point based Rotatory Positional
Encodings for Code-Mixed Language Modeling
Paper
• 2309.05270
• Published
• 1
Memory-efficient NLLB-200: Language-specific Expert Pruning of a
Massively Multilingual Machine Translation Model
Paper
• 2212.09811
• Published
• 1
Order Matters in the Presence of Dataset Imbalance for Multilingual
Learning
Paper
• 2312.06134
• Published
• 3
Sabiá: Portuguese Large Language Models
Paper
• 2304.07880
• Published
• 4
YAYI 2: Multilingual Open-Source Large Language Models
Paper
• 2312.14862
• Published
• 14
Hyper-X: A Unified Hypernetwork for Multi-Task Multilingual Transfer
Paper
• 2205.12148
• Published
• 2
Efficient Monotonic Multihead Attention
Paper
• 2312.04515
• Published
• 8
TigerBot: An Open Multilingual Multitask LLM
Paper
• 2312.08688
• Published
• 6
MEGAVERSE: Benchmarking Large Language Models Across Languages,
Modalities, Models and Tasks
Paper
• 2311.07463
• Published
• 15
Towards General-Purpose Speech Abilities for Large Language Models Using
Unpaired Data
Paper
• 2311.06753
• Published
• 7
FinGPT: Large Generative Models for a Small Language
Paper
• 2311.05640
• Published
• 30
On the Impact of Cross-Domain Data on German Language Models
Paper
• 2310.07321
• Published
• 1
Assessment of Pre-Trained Models Across Languages and Grammars
Paper
• 2309.11165
• Published
• 1
Democratizing Neural Machine Translation with OPUS-MT
Paper
• 2212.01936
• Published
• 1
LINGUIST: Language Model Instruction Tuning to Generate Annotated
Utterances for Intent Classification and Slot Tagging
Paper
• 2209.09900
• Published
• 1
MaLA-500: Massive Language Adaptation of Large Language Models
Paper
• 2401.13303
• Published
• 12
LangBridge: Multilingual Reasoning Without Multilingual Supervision
Paper
• 2401.10695
• Published
• 5
Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is
Needed?
Paper
• 2312.12683
• Published
• 1
A Simple Framework to Accelerate Multilingual Language Model for
Monolingual Text Generation
Paper
• 2401.10660
• Published
• 2
PersianMind: A Cross-Lingual Persian-English Large Language Model
Paper
• 2401.06466
• Published
• 5
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper
• 2401.01055
• Published
• 55
Eliciting the Translation Ability of Large Language Models via
Multilingual Finetuning with Translation Instructions
Paper
• 2305.15083
• Published
• 2
Multilingual Instruction Tuning With Just a Pinch of Multilinguality
Paper
• 2401.01854
• Published
• 11
Aya Dataset: An Open-Access Collection for Multilingual Instruction
Tuning
Paper
• 2402.06619
• Published
• 57
PALO: A Polyglot Large Multimodal Model for 5B People
Paper
• 2402.14818
• Published
• 23
LexC-Gen: Generating Data for Extremely Low-Resource Languages with
Large Language Models and Bilingual Lexicons
Paper
• 2402.14086
• Published
• 12
Zero-shot Sentiment Analysis in Low-Resource Languages Using a
Multilingual Sentiment Lexicon
Paper
• 2402.02113
• Published
Translation Errors Significantly Impact Low-Resource Languages in
Cross-Lingual Learning
Paper
• 2402.02080
• Published
• 2
A Morphologically-Aware Dictionary-based Data Augmentation Technique for
Machine Translation of Under-Represented Languages
Paper
• 2402.01939
• Published
• 1
xCoT: Cross-lingual Instruction Tuning for Cross-lingual
Chain-of-Thought Reasoning
Paper
• 2401.07037
• Published
• 2
Constrained Decoding for Cross-lingual Label Projection
Paper
• 2402.03131
• Published
Do Llamas Work in English? On the Latent Language of Multilingual
Transformers
Paper
• 2402.10588
• Published
• 1
Neural Machine Translation with Byte-Level Subwords
Paper
• 1909.03341
• Published
• 1
Multilingual Byte2Speech Models for Scalable Low-resource Speech
Synthesis
Paper
• 2103.03541
• Published
GECTurk: Grammatical Error Correction and Detection Dataset for Turkish
Paper
• 2309.11346
• Published
ÚFAL at MultiLexNorm 2021: Improving Multilingual Lexical
Normalization by Fine-tuning ByT5
Paper
• 2110.15248
• Published
MonoByte: A Pool of Monolingual Byte-level Language Models
Paper
• 2209.11035
• Published
Tokenization Impacts Multilingual Language Modeling: Assessing
Vocabulary Allocation and Overlap Across Languages
Paper
• 2305.17179
• Published
OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient
Large-scale Multilingual Continued Pretraining
Paper
• 2311.08849
• Published
• 6
Machine Translation by Projecting Text into the Same
Phonetic-Orthographic Space Using a Common Encoding
Paper
• 2305.12371
• Published
Are Character-level Translations Worth the Wait? Comparing Character-
and Subword-level Models for Machine Translation
Paper
• 2302.14220
• Published
Bilingual End-to-End ASR with Byte-Level Subwords
Paper
• 2205.00485
• Published
GeMQuAD : Generating Multilingual Question Answering Datasets from Large
Language Models using Few Shot Learning
Paper
• 2404.09163
• Published
COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning
Paper
• 2403.18058
• Published
• 4
Are LLMs Effective Backbones for Fine-tuning? An Experimental
Investigation of Supervised LLMs on Chinese Short Text Matching
Paper
• 2403.19930
• Published
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model
Series
Paper
• 2405.19327
• Published
• 48
Enhancing Amharic-LLaMA: Integrating Task Specific and Generative
Datasets
Paper
• 2402.08015
• Published
• 1
Multilingual Large Language Models Are Not (Yet) Code-Switchers
Paper
• 2305.14235
• Published
Deterministic Reversible Data Augmentation for Neural Machine
Translation
Paper
• 2406.02517
• Published
Context-Aware Machine Translation with Source Coreference Explanation
Paper
• 2404.19505
• Published
Integrating Multi-scale Contextualized Information for Byte-based Neural
Machine Translation
Paper
• 2405.19290
• Published
MaskLID: Code-Switching Language Identification through Iterative
Masking
Paper
• 2406.06263
• Published
• 5
PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal
Documents
Paper
• 2406.13923
• Published
• 25
Rethinking Tokenization: Crafting Better Tokenizers for Large Language
Models
Paper
• 2403.00417
• Published
• 3
Breaking the Script Barrier in Multilingual Pre-Trained Language Models
with Transliteration-Based Post-Training Alignment
Paper
• 2406.19759
• Published
Glot500: Scaling Multilingual Corpora and Language Models to 500
Languages
Paper
• 2305.12182
• Published
• 1