How to use from
vLLM
Install from pip and serve model
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "rizla/rizla-69"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "rizla/rizla-69",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'
Use Docker
docker model run hf.co/rizla/rizla-69
Quick Links

Rizla-69

This is a crop of momo-qwen-72B

This repository contains a state-of-the-art machine learning model that promises to bring big changes to the field. The model is trained on [describe the dataset or type of data here].

License

This project is licensed under the terms of the Apache 2.0 license.

Model Architecture

The model uses [describe the model architecture here, e.g., a transformer-based architecture with a specific type of attention mechanism].

Training

The model was trained on [describe the hardware used, e.g., an NVIDIA Tesla P100 GPU] using [mention the optimization algorithm, learning rate, batch size, number of epochs, etc.].

Results

Our model achieved [mention the results here, e.g., an accuracy of 95% on the test set].

Usage

To use the model in your project, follow these steps:

  1. Install the Hugging Face Transformers library:
pip install transformers
Downloads last month
69
Safetensors
Model size
69B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for rizla/rizla-69

Quantizations
1 model