openhermes_2_5_mistral_7b_awq
Creates, updates, deletes, gets or lists an openhermes_2_5_mistral_7b_awq resource.
Overview
| Name | openhermes_2_5_mistral_7b_awq |
| Type | Resource |
| Id | cloudflare.ai.openhermes_2_5_mistral_7b_awq |
Fields
The following fields are returned by SELECT queries:
SELECT not supported for this resource, use SHOW METHODS to view available operations for the resource.
Methods
The following methods are available for this resource:
| Name | Accessible by | Required Params | Optional Params | Description |
|---|---|---|---|---|
workers_ai_post_run_hf_thebloke_openhermes_2_5_mistral_7b_awq | insert | account_id | queueRequest, tags | Runs inference on the @hf/thebloke/openhermes-2.5-mistral-7b-awq model. |
Parameters
Parameters can be passed in the WHERE clause of a query. Check the Methods section to see which parameters are required or optional for each operation.
| Name | Datatype | Description |
|---|---|---|
account_id | string | The Cloudflare account ID. |
queueRequest | string | |
tags | string |
INSERT examples
- workers_ai_post_run_hf_thebloke_openhermes_2_5_mistral_7b_awq
- Manifest
Runs inference on the @hf/thebloke/openhermes-2.5-mistral-7b-awq model.
INSERT INTO cloudflare.ai.openhermes_2_5_mistral_7b_awq (
frequency_penalty,
lora,
max_tokens,
presence_penalty,
prompt,
raw,
repetition_penalty,
response_format,
seed,
stream,
temperature,
top_k,
top_p,
functions,
messages,
tools,
account_id,
queueRequest,
tags
)
SELECT
{{ frequency_penalty }},
'{{ lora }}',
{{ max_tokens }},
{{ presence_penalty }},
'{{ prompt }}',
{{ raw }},
{{ repetition_penalty }},
'{{ response_format }}',
{{ seed }},
{{ stream }},
{{ temperature }},
{{ top_k }},
{{ top_p }},
'{{ functions }}',
'{{ messages }}',
'{{ tools }}',
'{{ account_id }}',
'{{ queueRequest }}',
'{{ tags }}'
;
# Description fields are for documentation purposes
- name: openhermes_2_5_mistral_7b_awq
props:
- name: account_id
value: "{{ account_id }}"
description: Required parameter for the openhermes_2_5_mistral_7b_awq resource.
- name: frequency_penalty
value: {{ frequency_penalty }}
description: |
Decreases the likelihood of the model repeating the same lines verbatim.
- name: lora
value: "{{ lora }}"
description: |
Name of the LoRA (Low-Rank Adaptation) model to fine-tune the base model.
- name: max_tokens
value: {{ max_tokens }}
description: |
The maximum number of tokens to generate in the response.
default: 256
- name: presence_penalty
value: {{ presence_penalty }}
description: |
Increases the likelihood of the model introducing new topics.
- name: prompt
value: "{{ prompt }}"
description: |
The input text prompt for the model to generate a response.
- name: raw
value: {{ raw }}
description: |
If true, a chat template is not applied and you must adhere to the specific model's expected formatting.
default: false
- name: repetition_penalty
value: {{ repetition_penalty }}
description: |
Penalty for repeated tokens; higher values discourage repetition.
- name: response_format
value:
json_schema: "{{ json_schema }}"
type: "{{ type }}"
- name: seed
value: {{ seed }}
description: |
Random seed for reproducibility of the generation.
- name: stream
value: {{ stream }}
description: |
If true, the response will be streamed back incrementally using SSE, Server Sent Events.
default: false
- name: temperature
value: {{ temperature }}
description: |
Controls the randomness of the output; higher values produce more random results.
default: 0.6
- name: top_k
value: {{ top_k }}
description: |
Limits the AI to choose from the top 'k' most probable words. Lower values make responses more focused; higher values introduce more variety and potential surprises.
- name: top_p
value: {{ top_p }}
description: |
Adjusts the creativity of the AI's responses by controlling how many possible words it considers. Lower values make outputs more predictable; higher values allow for more varied and creative responses.
- name: functions
value:
- code: "{{ code }}"
name: "{{ name }}"
- name: messages
description: |
An array of message objects representing the conversation history.
value:
- content: "{{ content }}"
role: "{{ role }}"
- name: tools
description: |
A list of tools available for the assistant to use.
value:
- description: "{{ description }}"
name: "{{ name }}"
parameters:
properties: "{{ properties }}"
required:
- "{{ required }}"
type: "{{ type }}"
function:
description: "{{ description }}"
name: "{{ name }}"
parameters:
properties: "{{ properties }}"
required:
- "{{ required }}"
type: "{{ type }}"
type: "{{ type }}"
- name: queueRequest
value: "{{ queueRequest }}"
- name: tags
value: "{{ tags }}"