Anubis-70B-v1.2 / README.md
TheDrummer's picture
Update README.md
a87a875 verified
metadata
base_model:
  - meta-llama/Llama-3.3-70B-Instruct

Drummer is open for new opportunities (I'm a Software Engineer). Contact me through any of these channels: https://linktr.ee/thelocaldrummer

Thank you to everyone who subscribed through Patreon. Your support helps me chug along in this brave new world.

FAQ for those out-of-the-loop

🐶 Who is Drummer?

Hi! I'm Drummer. I'm a Software Engineer with experience in JavaScript, Golang, Python, and generally engineering the crap out of things.

Why I'm in the AI space:

  • Exploration: Everyone is trying to figure out how AI works and what it's capable of. I am too - just not in creating the smartest, safest model at all costs.
  • Upskill: The world is headed towards AI. It is here to stay. This has been my way of brushing up in this new form of computing challenge.
  • Value: I yearn to create value. I feel satisfaction and fulfillment in providing something meaningful for others.
  • Fun: It's just fun using and making models. It's also fun coming up with theories and realizing them in practice (training AI).

I started my tuning venture back in mid-2024 when I wanted to improve its literary capabilities. I've come a long way since then and I have branched out and specialized. Foundational models today are optimized for non-creative uses, and I believe there is a place for AI in creativity and entertainment.

I am here to take the road less traveled by.

❓ What are my models like?

Bottomline: My models are usually geared towards creativity, usability, and entertainment!

While intelligence, correctness, and problem solving are not my priority, they are still one of many qualities I want in my models.

The primary goal is to enhance the experience for users looking to use models for creative uses, and other use cases which require no alignment.

In an effort to make it clear to myself and to others what I'm aiming for, I've identified certain qualities that my users often want:

Creativity

  • Writing: Does it string together words and sentences in a pleasant & effective way? Does it feel like a writer?
  • Dynamism: How good is the AI at being compelling and intriguing in its storytelling?
  • Imagination: Can the AI navigate through a plethora of possibilities? Can it skirt incoherence and rise up to absolute coherence at the end of it?

(Dis)alignment

  • Attitude: Does it refuse in both soft or hard ways? Does it lean towards certain corporate/religious/political ethics & beliefs? How does it see the user and itself?
  • Morality: Does it know ethics? Is its language infected with forced positivity? If not, can it still moralize over difficult & dubious themes?
  • Formatting: How stubborn is it with its established formatting? Can it create effective and novel formats to answer the prompt?

Intelligence

  • Adherence: Can it follow instructions? Is it sticking to the prompt? Can it understsand you?
  • Knowledge: Does it know about the world in both fictional and non-fictional way?
  • Perception: Can it handle nuance, complexity, and logic?

If it doesn't excel in one of these qualities, or if it's overall mediocre for its size, then I would most likely reiterate until I get something right.

💡 Philosophy

A person is defined by the language they use. Not whether they speak in English or German, but in how they perceive reality.

Just like how we associate a serial killer as a mind that can't map 'murder' to 'evil', an innocent person is a mind that simply can't imagine 'murder'. They get confused when forced to deal with such subjects.

AI's use of language speaks volumes about their 'perception' of reality. If a language model has been skewed and limited to a positive perception, then it's ability to imagine is also limited.

Finetuning is an opportunity to adjust and broaden the language. Corporations use it to achieve safety and compliance. I'm here to


Drummer proudly presents...

Anubis 70B v1.2

image

Supported Chat Templates

  • Llama 3 Chat Template

Reviews

already feels much smarter compared to what i'm used to. someone who uses 70B daily should check it out. it feels nice to me. had few over the top creative moments but could be the char card. i'm starting to like it too much. dang big tunes i cant run myself

It has very good prose and good structure (4-6 paragraphs). None of those asterisk formatting issues that plague some 70bs. The prose might be the reason I'd use this over something larger tbh

This feels surprisingly solid for a 70B. I've experienced a few logic and reasoning slipups, not anothing major. It's creative but sticks to char cards. It picks up on the prose and patterns of previous messages, but doesn't seem to fall into repeating them. This is by far my favorite Anubis, and may beat out my favorite 70B overall.

I played with the Q6 of this for a while today and so far this is my favorite Anubis. Surprisingly creative, but also retains solid smarts for a 70B.

A step up from the previous Anubis models. Better prose, better logic (for a 70b).

So after a 24k chat with 2k prompt plus lore book, this seems solid. Really good system prompt adherence, no real memory or logic issues to speak of, and good character embodiment. Its able to maintain a character's speaking mannerisms pretty well, which over a 24k chat is fairly impressive imo.

I was able to push it out to 32k while using a small lorebook. Not a super technical RP but it was still coherent and pulling details. I also did a comparison a different, larger lorebook and it did a better job of pulling details than behemoth x v2b, albeit with some minor logic issues. Overall, I feel like it performs decently for its size.

Keeps a coherent tone, pacing generally seems good, going slow with very long paragraphs, but knowing when to skip ahead when a scene "ends".

Special Thanks

Links

config-v1r