专题:Parallel Computing and Optimization Techniques

This cluster of papers focuses on parallel computing, performance optimization, and various aspects of multicore and heterogeneous computing. It covers topics such as GPU computing, memory systems, benchmarking, power management, simulation platforms, and high-performance computing.
最新文献
Simulating the Sampling-Rate Hypothesis: Cadence, Proxy Faithfulness, and Latency in Runtime AI Oversight

article Full Text OpenAlex

EA-HK-INFRA-01: The Infrastructure Holographic Kernel — Infrastructure That Survives Its Own Deprecation

article Full Text OpenAlex

SPXI for Websites: Standing Protocol for Entity Inscription and Compression Survival (EA-SPXI-WEB-01 v3.0)

article Full Text OpenAlex

Github Repository link to the actual research.

article Full Text OpenAlex

Structural Replay in Dense Transformers Under a Frozen FP32 Regime: Evidence from Closed 3B/4B Models and Boundary Results

article Full Text OpenAlex

Fidelity Collapse in Asynchronous ERC-4626 Vaults: Prediction and Controlled Verification

article Full Text OpenAlex

Waste-to-Energy-Coupled AI Data Centers: Cooling Efficiency and Grid Resilience

article Full Text OpenAlex

Crimson Hexagonal Interface v2.0 — Governed Operating Surface for the Crimson Hexagonal Archive (EA-HEXAGON-OS-02) — Crimson Hexagonal Archive

article Full Text OpenAlex

L4 Glitch / Research Quarantine Stack — Core Specification v0.1

article Full Text OpenAlex

Trickums: The VRAM Illusion System of ForgeBorn

article Full Text OpenAlex

近5年高被引文献
Suspending OpenMP Tasks on Asynchronous Events: Extending the Taskwait Construct

book-chapter Full Text OpenAlex 12930 FWCI836.2713

UCSF ChimeraX: Tools for structure building and analysis

article Full Text OpenAlex 3851 FWCI816.1599

Algorithms+Data Structures = Programs

book-chapter Full Text OpenAlex 961 FWCI10.3989

IEEE Transactions on Parallel and Distributed Systems

paratext Full Text OpenAlex 581 FWCI0

PyTorch 2: Faster Machine Learning Through Dynamic Python Bytecode Transformation and Graph Compilation

article Full Text OpenAlex 534 FWCI251.5292

QLoRA: Efficient Finetuning of Quantized LLMs

preprint Full Text OpenAlex 490 FWCI0

TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings

article Full Text OpenAlex 407 FWCI181.2534

NextDenovo: an efficient error correction and accurate assembly tool for noisy long reads

article Full Text OpenAlex 358 FWCI110.3777

PREDICTIVE PERFORMANCE AND SCALABILITY MODELING OF A LARGE-SCALE APPLICATION

article Full Text OpenAlex 312 FWCI0

IEEE Transactions on Dependable and Secure Computing

paratext Full Text OpenAlex 310 FWCI0