grantmaking.ai Launch Round
I have developed a dynamical taxonomy of recursive collapse in autoregressive LMs, introducing MASSIF, a Multiscale Attractor Stability & Stress Inference Framework telemetry, measuring the dynamics within the hidden state geometry (see the study across a multitude of small open source LMs here: https://doi.org/10.5281/zenodo.20576209). The next stage of the project is the real-time applications of this telemetry in a LM. Building a custom architecture model which is enabled to measure these dynamic processes, report them in real-time and adjust its internal processes if criticality is detected. So far, models have no access to their own hidden process and no auti-corrective mechanisms.
The principles are described in greater detail here: https://www.linkedin.com/pulse/mycelia-self-regulating-telemetry-aware-language-daniel-solis-rajgf/
The research report of the ongoing project is here: https://www.linkedin.com/pulse/mapping-mycelia-dynamical-systems-case-study-real-time-daniel-solis-6rzff
The proposed telemetry framework and its application in LM has a direct impact on AI safety as it reveals the inner workings of the hidden state and makes the black-box transparent The method is not only predictive - providing interpretability and early warning, it is also prescriptive as it is able to condition the model to avoid instability and criticality before it even happens. Until now, all systems are analyzed and contained post-hoc. My suggested approach enables the application of countermeasures ex-ante.
Most costly aspect of my research is the compute - I am conducting my research on a T4 GPU with 16 GB vRAM. With the proper funding, I could scale it up. I also need to subsist. A focused research without existential distractions leads to faster and oftentimes more robust results.