-
Quantization and Swap Memory for GPTQ
Here’s how it works: first, we “quantize” the weights in the neural network that makes up our language model. This means taking all those…
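To make that first step concrete, here is a minimal round-to-nearest sketch in plain NumPy (illustrative names only; it shows the basic idea of squeezing float weights into 16 integer levels, not GPTQ's actual error-compensating procedure):

```python
import numpy as np

def quantize_4bit(weights):
    # Map float weights onto 16 signed integer levels (round-to-nearest).
    scale = np.abs(weights).max() / 7          # 4-bit signed range is roughly [-8, 7]
    q = np.clip(np.round(weights / scale), -8, 7).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    # Recover approximate float weights for use in the forward pass.
    return q.astype(np.float32) * scale

w = np.random.randn(4, 4).astype(np.float32)
q, scale = quantize_4bit(w)
print("max reconstruction error:", np.abs(w - dequantize(q, scale)).max())
```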
-
GPTQ for Llama: 4 bits quantization using GPTQ
Let me break it down for you. GPTQ stands for “Generative Pre-trained Transformer Quantization,” a post-training method, and it’s a…
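For what that looks like in practice, here is a rough sketch of driving 4-bit GPTQ through the Hugging Face transformers API, assuming a recent release with optimum, auto-gptq and accelerate installed; the model id is only an example:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer, GPTQConfig

model_id = "meta-llama/Llama-2-7b-hf"          # example checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)

# 4 bits per weight, calibrated on a small text dataset.
gptq_config = GPTQConfig(bits=4, dataset="c4", tokenizer=tokenizer)

model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=gptq_config,
    device_map="auto",
)
```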
-
PubLayNet Dataset for Document Layout Analysis
These pages have been annotated with both bounding boxes and polygonal segmentations, which means we can see exactly where each text block, table, or figure is…
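As a quick illustration, PubLayNet ships its labels in COCO-style JSON, so a few lines are enough to walk the boxes and polygons; the file path below is an assumption about your local copy:

```python
import json

with open("publaynet/train.json") as f:        # assumed local path
    coco = json.load(f)

# PubLayNet categories: text, title, list, table, figure.
categories = {c["id"]: c["name"] for c in coco["categories"]}

for ann in coco["annotations"][:5]:
    x, y, w, h = ann["bbox"]                   # axis-aligned bounding box
    polygon = ann["segmentation"][0]           # flat [x1, y1, x2, y2, ...] outline
    print(categories[ann["category_id"]], (x, y, w, h), len(polygon) // 2, "polygon points")
```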
-
FlaxBartForCausalLMModule: A New Approach to Language Modeling
Now, what does this actually mean? Language models are basically computer programs that can understand and generate human-like text. They work by analyzing patterns in…
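Here is a small, hedged sketch of the public FlaxBartForCausalLM wrapper that drives this module (the checkpoint name is just an example, and loading a full BART checkpoint into the decoder-only class will warn about unused encoder weights):

```python
from transformers import AutoTokenizer, FlaxBartForCausalLM
import jax.numpy as jnp

tokenizer = AutoTokenizer.from_pretrained("facebook/bart-base")
model = FlaxBartForCausalLM.from_pretrained("facebook/bart-base")

inputs = tokenizer("The quick brown fox", return_tensors="np")
logits = model(**inputs).logits                # (batch, seq_len, vocab_size)

# Pattern matching in action: the most likely next token after the prompt.
next_id = int(jnp.argmax(logits[0, -1]))
print(tokenizer.decode([next_id]))
```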
-
FlaxBartForSequenceClassificationModule: A New Approach to Sequence Classification
The FlaxBartForSequenceClassificationModule is a variant of the BART (Bidirectional and Auto-Regressive Transformers) model, which has…
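A minimal usage sketch, assuming the Hugging Face Flax classes are available (the classification head here is untrained unless you load a fine-tuned checkpoint, so the prediction is only illustrative):

```python
from transformers import AutoTokenizer, FlaxBartForSequenceClassification
import jax.numpy as jnp

tokenizer = AutoTokenizer.from_pretrained("facebook/bart-base")
model = FlaxBartForSequenceClassification.from_pretrained("facebook/bart-base", num_labels=2)

inputs = tokenizer("This movie was surprisingly good.", return_tensors="np")
logits = model(**inputs).logits                # (batch, num_labels)
print("predicted class:", int(jnp.argmax(logits, axis=-1)[0]))
```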
-
FlaxBartDecoderLayerCollection
So how does it work? Well, imagine you have a big ol’ text document and you want to analyze the words inside it. FlaxBartDecoderLayerCollection…
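Here is a simplified, conceptual stand-in (not the actual Hugging Face implementation): a layer collection is just a module that builds N decoder layers and feeds the hidden states through them one after another.

```python
import flax.linen as nn
import jax
import jax.numpy as jnp

class ToyDecoderLayer(nn.Module):
    hidden_size: int

    @nn.compact
    def __call__(self, hidden_states):
        # Stand-in for self-attention + cross-attention + feed-forward.
        residual = hidden_states
        hidden_states = nn.Dense(self.hidden_size)(hidden_states)
        hidden_states = nn.gelu(hidden_states)
        return nn.LayerNorm()(residual + hidden_states)

class ToyDecoderLayerCollection(nn.Module):
    hidden_size: int
    num_layers: int

    @nn.compact
    def __call__(self, hidden_states):
        # Run the hidden states through each decoder layer in order.
        for i in range(self.num_layers):
            hidden_states = ToyDecoderLayer(self.hidden_size, name=f"layer_{i}")(hidden_states)
        return hidden_states

model = ToyDecoderLayerCollection(hidden_size=64, num_layers=4)
x = jnp.ones((1, 10, 64))                      # (batch, seq_len, hidden)
params = model.init(jax.random.PRNGKey(0), x)
print(model.apply(params, x).shape)
```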