You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Parameter-efficient optimization of conditional diffusion models using multi-resolution attention, classifier-free guidance ablation, and DDIM sampling — achieving 17% FID improvement with 85% reduced training time.
Parameter-efficient small LLM training experiments achieve <16MB/10 min training using recurrent/shared-block transformers, unigram/bigram logit calibration, and INT8-compressed language model architectures with MLX and PyTorch.