Merge pull request #175 from diogo-gsa/palm2-model
Add PaLM2 to Models and Papers sectionspull/178/head
commit
7f7d2bceb7
|
@ -63,3 +63,4 @@ Aquesta secció consisteix en una col·lecció i resum de models LLM notables i
|
|||
| [PanGu-Σ](https://arxiv.org/abs/2303.10845v1) | 2023 | PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing |
|
||||
| [BloombergGPT](https://arxiv.org/abs/2303.17564v1)| 2023 |BloombergGPT: A Large Language Model for Finance|
|
||||
| [Cerebras-GPT](https://arxiv.org/abs/2304.03208) | 2023 | Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster |
|
||||
| [PaLM 2](https://ai.google/static/documents/palm2techreport.pdf) | 2023 | A Language Model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. |
|
|
@ -64,3 +64,4 @@ This section consists of a collection and summary of notable and foundational LL
|
|||
| [PanGu-Σ](https://arxiv.org/abs/2303.10845v1) | 2023 | PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing |
|
||||
| [BloombergGPT](https://arxiv.org/abs/2303.17564v1)| 2023 |BloombergGPT: A Large Language Model for Finance|
|
||||
| [Cerebras-GPT](https://arxiv.org/abs/2304.03208) | 2023 | Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster |
|
||||
| [PaLM 2](https://ai.google/static/documents/palm2techreport.pdf) | 2023 | A Language Model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. |
|
|
@ -64,3 +64,4 @@ Esta sección consta de una colección y resumen de LLMs notables y fundamentale
|
|||
| [PanGu-Σ](https://arxiv.org/abs/2303.10845v1) | 2023 | PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing |
|
||||
| [BloombergGPT](https://arxiv.org/abs/2303.17564v1)| 2023 |BloombergGPT: A Large Language Model for Finance|
|
||||
| [Cerebras-GPT](https://arxiv.org/abs/2304.03208) | 2023 | Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster |
|
||||
| [PaLM 2](https://ai.google/static/documents/palm2techreport.pdf) | 2023 | A Language Model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. |
|
||||
|
|
|
@ -64,3 +64,4 @@ Cette section est composée d'une collection et d'un résumé des LLMs notables
|
|||
| [PanGu-Σ](https://arxiv.org/abs/2303.10845v1) | 2023 | PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing |
|
||||
| [BloombergGPT](https://arxiv.org/abs/2303.17564v1)| 2023 |BloombergGPT: A Large Language Model for Finance|
|
||||
| [Cerebras-GPT](https://arxiv.org/abs/2304.03208) | 2023 | Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster |
|
||||
| [PaLM 2](https://ai.google/static/documents/palm2techreport.pdf) | 2023 | A Language Model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. |
|
|
@ -63,3 +63,4 @@ Questa sezione consiste in una raccolta e in una sintesi di LLM importanti e fon
|
|||
| [GPT-4](https://arxiv.org/abs/2303.08774v3) | 2023 |GPT-4 Technical Report |
|
||||
| [PanGu-Σ](https://arxiv.org/abs/2303.10845v1) | 2023 | PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing |
|
||||
| [BloombergGPT](https://arxiv.org/abs/2303.17564v1)| 2023 |BloombergGPT: A Large Language Model for Finance|
|
||||
| [PaLM 2](https://ai.google/static/documents/palm2techreport.pdf) | 2023 | A Language Model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. |
|
|
@ -62,3 +62,4 @@ import { Callout, FileTree } from 'nextra-theme-docs'
|
|||
| [GPT-4](https://arxiv.org/abs/2303.08774v3) | 2023 | GPT-4 Technical Report |
|
||||
| [PanGu-Σ](https://arxiv.org/abs/2303.10845v1) | 2023 | PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing |
|
||||
| [BloombergGPT](https://arxiv.org/abs/2303.17564v1)| 2023 |BloombergGPT: A Large Language Model for Finance |
|
||||
| [PaLM 2](https://ai.google/static/documents/palm2techreport.pdf) | 2023 | A Language Model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. |
|
|
@ -63,3 +63,4 @@ import { Callout, FileTree } from 'nextra-theme-docs'
|
|||
| [PanGu-Σ](https://arxiv.org/abs/2303.10845v1) | 2023 | PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing |
|
||||
| [BloombergGPT](https://arxiv.org/abs/2303.17564v1)| 2023 |BloombergGPT: A Large Language Model for Finance|
|
||||
| [Cerebras-GPT](https://arxiv.org/abs/2304.03208) | 2023 | Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster |
|
||||
| [PaLM 2](https://ai.google/static/documents/palm2techreport.pdf) | 2023 | A Language Model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. |
|
|
@ -63,3 +63,4 @@ Esta seção consiste em uma coleção e resumo de LLMs notáveis e fundamentais
|
|||
| [GPT-4](https://arxiv.org/abs/2303.08774v3) | 2023 |GPT-4 Technical Report |
|
||||
| [PanGu-Σ](https://arxiv.org/abs/2303.10845v1) | 2023 | PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing |
|
||||
| [BloombergGPT](https://arxiv.org/abs/2303.17564v1)| 2023 |BloombergGPT: A Large Language Model for Finance|
|
||||
| [PaLM 2](https://ai.google/static/documents/palm2techreport.pdf) | 2023 | A Language Model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. |
|
|
@ -63,3 +63,4 @@ Bu bölüm, dikkate değer ve temel LLM'lerin bir koleksiyonunu ve özetini içe
|
|||
| [GPT-4](https://arxiv.org/abs/2303.08774v3) | 2023 |GPT-4 Technical Report |
|
||||
| [PanGu-Σ](https://arxiv.org/abs/2303.10845v1) | 2023 | PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing |
|
||||
| [BloombergGPT](https://arxiv.org/abs/2303.17564v1)| 2023 |BloombergGPT: A Large Language Model for Finance|
|
||||
| [PaLM 2](https://ai.google/static/documents/palm2techreport.pdf) | 2023 | A Language Model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. |
|
|
@ -62,3 +62,4 @@ This section consists of a collection and summary of notable and foundational LL
|
|||
| [GPT-4](https://arxiv.org/abs/2303.08774v3) | 2023 |GPT-4 Technical Report |
|
||||
| [PanGu-Σ](https://arxiv.org/abs/2303.10845v1) | 2023 | PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing |
|
||||
| [BloombergGPT](https://arxiv.org/abs/2303.17564v1)| 2023 |BloombergGPT: A Large Language Model for Finance|
|
||||
| [PaLM 2](https://ai.google/static/documents/palm2techreport.pdf) | 2023 | A Language Model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. |
|
|
@ -130,6 +130,7 @@ A continuació es mostren els últims articles (ordenats per data de llançament
|
|||
|
||||
## Aplicacions
|
||||
|
||||
- [PaLM 2 Technical Report](https://ai.google/static/documents/palm2techreport.pdf) (May 2023)
|
||||
- [Are LLMs All You Need for Task-Oriented Dialogue?](https://arxiv.org/abs/2304.06556) (Abril 2023)
|
||||
- [HiPrompt: Few-Shot Biomedical Knowledge Fusion via Hierarchy-Oriented Prompting](https://arxiv.org/abs/2304.05973) (Abril 2023)
|
||||
- [Approximating Human Evaluation of Social Chatbots with Prompting](https://arxiv.org/abs/2304.05253) (Abril 2023)
|
||||
|
|
|
@ -130,6 +130,7 @@ The following are the latest papers (sorted by release date) on prompt engineeri
|
|||
|
||||
## Applications
|
||||
|
||||
- [PaLM 2 Technical Report](https://ai.google/static/documents/palm2techreport.pdf) (May 2023)
|
||||
- [Are LLMs All You Need for Task-Oriented Dialogue?](https://arxiv.org/abs/2304.06556) (April 2023)
|
||||
- [HiPrompt: Few-Shot Biomedical Knowledge Fusion via Hierarchy-Oriented Prompting](https://arxiv.org/abs/2304.05973) (April 2023)
|
||||
- [Approximating Human Evaluation of Social Chatbots with Prompting](https://arxiv.org/abs/2304.05253) (April 2023)
|
||||
|
|
|
@ -126,6 +126,7 @@ A continuación se presentan los últimos artículos (ordenados por fecha de pub
|
|||
|
||||
## Aplicaciones
|
||||
|
||||
- [PaLM 2 Technical Report](https://ai.google/static/documents/palm2techreport.pdf) (May 2023)
|
||||
- [Automated Reading Passage Generation with OpenAI's Large Language Model](https://arxiv.org/abs/2304.04616) (April 2023)
|
||||
- [WebBrain: Learning to Generate Factually Correct Articles for Queries by Grounding on Large Web Corpus](https://arxiv.org/abs/2304.04358) (April 2023)
|
||||
- [Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition](https://arxiv.org/abs/2304.04704) (April 2023)
|
||||
|
|
|
@ -126,6 +126,7 @@ Voici les derniers articles sur le prompt engineering (classés par date de publ
|
|||
|
||||
## Applications
|
||||
|
||||
- [PaLM 2 Technical Report](https://ai.google/static/documents/palm2techreport.pdf) (May 2023)
|
||||
- [Automated Reading Passage Generation with OpenAI's Large Language Model](https://arxiv.org/abs/2304.04616) (April 2023)
|
||||
- [WebBrain: Learning to Generate Factually Correct Articles for Queries by Grounding on Large Web Corpus](https://arxiv.org/abs/2304.04358) (April 2023)
|
||||
- [Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition](https://arxiv.org/abs/2304.04704) (April 2023)
|
||||
|
|
|
@ -122,6 +122,7 @@ Di seguito sono riportati gli articoli scientifici più recenti (ordinati in bas
|
|||
|
||||
## Applicazioni
|
||||
|
||||
- [PaLM 2 Technical Report](https://ai.google/static/documents/palm2techreport.pdf) (May 2023)
|
||||
- [Assessing Language Model Deployment with Risk Cards]() (Aprile 2023)
|
||||
- [Enhancing Large Language Models with Climate Resources](https://arxiv.org/abs/2304.00116) (Marzo 2023)
|
||||
- [BloombergGPT: A Large Language Model for Finance](https://arxiv.org/abs/2303.17564) (Marzo 2023)
|
||||
|
|
|
@ -119,6 +119,7 @@
|
|||
|
||||
## Applications
|
||||
|
||||
- [PaLM 2 Technical Report](https://ai.google/static/documents/palm2techreport.pdf) (May 2023)
|
||||
- [BloombergGPT: A Large Language Model for Finance](https://arxiv.org/abs/2303.17564) (March 2023)
|
||||
- [Medical Intervention Duration Estimation Using Language-enhanced Transformer Encoder with Medical Prompts](https://arxiv.org/abs/2303.17408) (March 2023)
|
||||
- [Soft-prompt tuning to predict lung cancer using primary care free-text Dutch medical notes](https://arxiv.org/abs/2303.15846) (March 2023)
|
||||
|
|
|
@ -121,6 +121,7 @@
|
|||
|
||||
## Applications
|
||||
|
||||
- [PaLM 2 Technical Report](https://ai.google/static/documents/palm2techreport.pdf) (May 2023)
|
||||
- [Assessing Language Model Deployment with Risk Cards]() (April 2023)
|
||||
- [Enhancing Large Language Models with Climate Resources](https://arxiv.org/abs/2304.00116) (March 2023)
|
||||
- [BloombergGPT: A Large Language Model for Finance](https://arxiv.org/abs/2303.17564) (March 2023)
|
||||
|
|
|
@ -119,6 +119,7 @@ A seguir estão os artigos mais recentes (classificados por data de lançamento)
|
|||
|
||||
## Aplicações
|
||||
|
||||
- [PaLM 2 Technical Report](https://ai.google/static/documents/palm2techreport.pdf) (May 2023)
|
||||
- [BloombergGPT: A Large Language Model for Finance](https://arxiv.org/abs/2303.17564) (March 2023)
|
||||
- [Medical Intervention Duration Estimation Using Language-enhanced Transformer Encoder with Medical Prompts](https://arxiv.org/abs/2303.17408) (March 2023)
|
||||
- [Soft-prompt tuning to predict lung cancer using primary care free-text Dutch medical notes](https://arxiv.org/abs/2303.15846) (March 2023)
|
||||
|
|
|
@ -118,7 +118,7 @@ Aşağıdakiler, hızlı mühendislikle ilgili en son makalelerdir (yayınlanma
|
|||
- [Scaling Laws for Neural Language Models](https://arxiv.org/abs/2001.08361) (Jan 2020)
|
||||
|
||||
## Uygulamalar
|
||||
|
||||
- [PaLM 2 Technical Report](https://ai.google/static/documents/palm2techreport.pdf) (May 2023)
|
||||
- [BloombergGPT: A Large Language Model for Finance](https://arxiv.org/abs/2303.17564) (March 2023)
|
||||
- [Medical Intervention Duration Estimation Using Language-enhanced Transformer Encoder with Medical Prompts](https://arxiv.org/abs/2303.17408) (March 2023)
|
||||
- [Soft-prompt tuning to predict lung cancer using primary care free-text Dutch medical notes](https://arxiv.org/abs/2303.15846) (March 2023)
|
||||
|
|
|
@ -118,7 +118,7 @@
|
|||
- [Scaling Laws for Neural Language Models](https://arxiv.org/abs/2001.08361) (Jan 2020)
|
||||
|
||||
## 应用
|
||||
|
||||
- [PaLM 2 Technical Report](https://ai.google/static/documents/palm2techreport.pdf) (May 2023)
|
||||
- [BloombergGPT: A Large Language Model for Finance](https://arxiv.org/abs/2303.17564) (March 2023)
|
||||
- [Medical Intervention Duration Estimation Using Language-enhanced Transformer Encoder with Medical Prompts](https://arxiv.org/abs/2303.17408) (March 2023)
|
||||
- [Soft-prompt tuning to predict lung cancer using primary care free-text Dutch medical notes](https://arxiv.org/abs/2303.15846) (March 2023)
|
||||
|
|
Loading…
Reference in New Issue