Hacker News

This mirrors what I experienced when I enrolled in "free drawing" (no teaching) classes:

While people have considered me good at drawing since I was a child, I remember either repeating similar detailed drawings I had drawn before, or otherwise just taking plenty of time over a new one. I believe anyone with time and patience can make a nice drawing of a scene.

The "free drawing" class had no rules or lectures: you brought the materials you wanted to work with (some brought ink, others pencils, while I brought charcoal). The only thing fixed was the timing of the model's poses: in each session the first few poses were very short (say a minute), and the pose durations would then progressively lengthen, up to, say, 5-minute poses. At any time you were free to tear your picture up and retry drawing the pose.

My drawing skills improved considerably. The short "warmups" actually force you to get proportions and outlines right on the first tries. Conventional wisdom says haste makes waste, but when learning or refining skills, it seems natural selection has hardcoded the sensation of haste as a stressor that prompts attention and learning.

I am convinced I could have drawn drawings of similar quality before enrolling in those classes, except they would easily have taken me 5 or 10× as long. Being forced not to beat around the bush, and feeling the penalty of a hasty mistake (which further shrinks the time left for a second try), does seem to work.

My only gripe is that the technique is termed "consistency", whereas I would reserve that term for an improvement in output quality, not inference speed; although I understand they mean "consistency with what would ultimately have been generated one token at a time". I would rather dub it "Proficiency LLM": the same output is expected, only without the inhibition of stuttering toward the same conclusion.



Hi, we are the CLLM authors, and thanks for sharing your experience and insights! I can see this drawing-skill refinement process echoes the training process in CLLM; the only difference is that, at this point, the stressor in CLLM training does not get progressively more demanding.

For example, while drawing, you can set a very specific time limit on how long you are allowed to draw in each trial and make the time progressively shorter. In CLLM, maybe we can make the learning process progressively more difficult by mapping more and more distant states in the Jacobi trajectory to its final state.

We are using the term "consistency" because we draw a parallel between the consistency LLM and the consistency model in diffusion image generation, where the training processes are analogous.
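For readers unfamiliar with the Jacobi decoding the authors mention, here is a minimal toy sketch of the iteration (not the actual CLLM code; `next_token` below is a hypothetical deterministic stand-in for a greedy LM step). All positions are guessed at once and refined in parallel until the sequence stops changing; the intermediate guesses form the "Jacobi trajectory" whose states CLLM training maps to the final state:

```python
def next_token(prefix):
    # Toy deterministic "LM": next token is the sum of the prefix mod 10.
    return sum(prefix) % 10

def jacobi_decode(prompt, n_new, max_iters=100):
    """Guess all n_new tokens at once, then refine every position in
    parallel until the sequence reaches a fixed point. Returns the
    final guess and the full Jacobi trajectory of intermediate guesses."""
    guess = [0] * n_new                  # arbitrary initial guess
    trajectory = [list(guess)]
    for _ in range(max_iters):
        # Position i is recomputed from the prompt plus the *previous*
        # iteration's guesses for positions < i (all in parallel).
        new = [next_token(prompt + guess[:i]) for i in range(n_new)]
        trajectory.append(list(new))
        if new == guess:                 # fixed point reached
            break
        guess = new
    return guess, trajectory

def ar_decode(prompt, n_new):
    # Ordinary one-token-at-a-time greedy decoding, for comparison.
    out = list(prompt)
    for _ in range(n_new):
        out.append(next_token(out))
    return out[len(prompt):]

prompt = [3, 1, 4]
jac, traj = jacobi_decode(prompt, 5)
# The fixed point matches autoregressive greedy decoding exactly.
assert jac == ar_decode(prompt, 5)
```

The point of CLLM, as I understand the paper, is to train the model so that it jumps from early trajectory states to the fixed point in far fewer iterations, while the fixed point itself (the AR output) is unchanged.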


Do you use the same dataset to train and evaluate the model? Was the model trained on the GSM8K dataset, for example?


Yes, we consider both domain-specific applications (Spider for text2SQL, GSM8K for math, CodeSearchNet for Python) as well as open-domain conversational applications (ShareGPT). We use the test set from each application to evaluate CLLMs' performance in our paper.

On the other hand, technically CLLM works on any kind of query, but the speedup might vary. Feel free to try out our codebase for your use cases!


Is it just me, or does this read like it was written by an LLM ... ?!


It's just much more formal than people generally speak on HN.


lol I take that as a compliment. Good try but sadly no LLM in this writing :)


I had an interesting experience in an Invertebrate Zoology lab class one summer.

We students were brought into a lab, given specimens to draw, and the only instructions we received were 'You have 30 minutes to draw this. Go.'

There was no "here's how to draw. here's what to do and not to do". It was just basically "We don't care about any insecurities you might have. We don't care if you think you can't draw. No excuses, just fucking draw it. Now."

Not only did we draw, but we (all of us) improved enormously over the course of the class as more animals were brought in and the exercise was repeated over and over and over again throughout the summer.

What it taught us is that everyone, and I mean everyone, can draw. Our collective attitude shifted from "don't know if this is even possible" to "of course we can do this. this is easy. routine. trivial."

Highly recommended approach.

It was the most freeing and amazing class I had in college.


That sounds like a pretty awesome experience. Thanks for sharing.


Systems generally become more efficient when under stress. They are also forced into local optima - everything has upsides and downsides.


Interestingly, this is the idea behind Nassim Taleb's book "Antifragile" and the concept of "antifragility".

In essence, it promotes dynamic/evolutionary/always-learning behaviour rather than performing the same set of steps every time, and in the process, becoming stronger than before.

An example he shares is how the breakdown of muscle tissue through exercise leads to more muscle development and an increase in strength. I guess it's similar to LLM training using error/loss-reducing functions (practice makes perfect), but dissimilar in the sense that training is a one-time action.


> They are also forced into local optima

The good ol', "under pressure, you don't rise to the occasion, but sink to the level of your training"?



