transformers

6 stories

Industrial Machinery #transformers#indian manufacturers

Indian Transformer Makers Shrug Off China Re-entry; Capacity Ramp-Up and Strong Demand Offset Competition

The Indian government granted a two-year exemption under the Public Procurement Order for four Chinese-owned manufacturers—TBEA, Nanjing Electric, New Northeast Electric, and Taikai Electric—to bid for local power transmission projects. Indian transformer makers, however, remain undeterred due to ongoing capacity expansions and strong demand, with only TBEA having significant capacity to compete. The exemption requires these Chinese entities to manufacture in India with 50-60% domestic content.

Jul 8, 2026 1 source

New Tokenization Method Merges Tokens to Improve Diffusion Transformer Efficiency

Technology

Artificial Intelligence #variable-length tokenization#learnable global merging

New Tokenization Method Merges Tokens to Improve Diffusion Transformer Efficiency

A research paper introduces a variable-length tokenizer that merges tokens instead of truncating them, enabling adaptive compression for diffusion transformers. The method, called learnable global merging, addresses representational alignment issues across token lengths and achieves a superior trade-off between image quality (gFID) and computational cost.

Jun 20, 2026 1 source

CoT Transformers Can Efficiently Simulate Word RAM Algorithms, New Research Shows

Technology

Artificial Intelligence #artificial intelligence#transformers

CoT Transformers Can Efficiently Simulate Word RAM Algorithms, New Research Shows

A new paper on arXiv demonstrates that chain-of-thought (CoT) transformers can efficiently simulate Word RAM algorithms, which are more intuitive and efficient than Turing machines for discussing algorithms. The authors show that with poly-logarithmic overhead, CoT transformers can execute algorithms like sorting and Dijkstra's in near-optimal steps, and extend the result to practical settings like continuous CoT and hybrid architectures.

Jun 20, 2026 1 source

Study: LLM Accuracy Declines Predictably as Reasoning Steps Increase in Clinical AI Tasks

Technology

Artificial Intelligence #ai#clinical ai

Study: LLM Accuracy Declines Predictably as Reasoning Steps Increase in Clinical AI Tasks

A study on arXiv introduces a hop-count taxonomy to predict LLM failure on clinical question answering. Tests across Claude and GPT models show monotone accuracy decline with reasoning depth, with extended thinking failing to flatten the curve.

Jun 16, 2026 1 source

New Drift-RAE Method Distills Transformers Efficiently Using Representation Autoencoders

Technology

Artificial Intelligence #transformers#representation autoencoders

New Drift-RAE Method Distills Transformers Efficiently Using Representation Autoencoders

A new research paper proposes Drift-RAE, a method for distilling pretrained flow models in representation autoencoder latent spaces. It overcomes anisotropy and large curvature challenges, achieving 1.77 FID on ImageNet 256 with only 10,000 distillation steps, outperforming existing RAE distillation methods.

Jun 16, 2026 1 source

New Theory Explains How Deep Transformers Achieve Adaptive Inference Using Function Vectors

Technology

Artificial Intelligence #artificial intelligence#deep learning

New Theory Explains How Deep Transformers Achieve Adaptive Inference Using Function Vectors

A new research paper introduces a theory of deep transformers as mean-field interacting systems that implement distributed inference using 'function vectors' to adaptively infer latent context variables at finer scales over layers. The theory predicts a relationship between non-Gaussian hierarchical structure and transformer depth, tested with constrained linear attention models.

Jun 16, 2026 1 source