What is the secret behind selecting layers from different models? :)

by Shamane - opened Dec 17, 2023

Discussion

Shamane

Dec 17, 2023

Amazing work. I can see you have selected different layers from different models. Is it random?

Aryanne

Owner Dec 17, 2023

Thanks 🤗. In this merge I tried to be kinda symmetrical: the middle layers come from Astridboros, and some layers at the beginning and end come from Zephyr.

Shamane

Dec 18, 2023

Thanks. Any logic behind the order? Also, did you have to fine-tune the merged model again?

Aryanne

Owner Dec 18, 2023

> Any logic behind the order?

Symmetry. This way the model has, in the middle, 6 layers of PAIXAI/Astrid-3B and 6 layers of jondurbin/airoboros-3b-3p0 (both of which came from Astridboros), so it gets a bit of the style of both (kinda human-like/RP).
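To make the idea concrete, here is a rough sketch of a symmetric layer plan in plain Python. It is only an illustration, not the actual recipe used for this merge: the function `symmetric_plan`, the total of 32 layers, and the 10 edge layers are all assumptions picked for the example. Only the two blocks of 6 middle layers come from the description above.

```python
from typing import List, Tuple

# (model name, first layer index, end layer index, exclusive)
LayerSlice = Tuple[str, int, int]


def symmetric_plan(outer: str, mid_a: str, mid_b: str,
                   n_layers: int, edge: int, mid: int) -> List[LayerSlice]:
    """Hypothetical helper: `outer` supplies `edge` layers at each end,
    and each middle donor supplies `mid` layers centred in its own stack."""
    if 2 * edge > n_layers or mid > n_layers:
        raise ValueError("requested slices exceed the donor's layer count")
    start = (n_layers - mid) // 2  # first layer of the centred middle slice
    return [
        (outer, 0, edge),                    # beginning of the outer model
        (mid_a, start, start + mid),         # middle block from donor A
        (mid_b, start, start + mid),         # middle block from donor B
        (outer, n_layers - edge, n_layers),  # end of the outer model
    ]


# Assumed values: 32 layers per donor, 10 edge layers. Only mid=6 is from the thread.
plan = symmetric_plan("Zephyr", "PAIXAI/Astrid-3B", "jondurbin/airoboros-3b-3p0",
                      n_layers=32, edge=10, mid=6)

for name, lo, hi in plan:
    print(f"{name}: layers [{lo}, {hi})")

# Block sizes read the same forwards and backwards, which is the symmetry meant here.
sizes = [hi - lo for _, lo, hi in plan]
```

A list of slices like this could then be translated into whatever format a merge tool expects. The exact indices in the real merge may differ.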

> Also, did you have to fine-tune the merged model again?

I recommend fine-tuning it if you want, because swapping and inserting layers seems to confuse the model a bit.

Aryanne changed discussion status to closed Dec 25, 2023

Shamane

Jan 2, 2024

Thanks a lot
