About 4,220 results
Open links in new tab
  1. We introduce CLEVER, the first curated benchmark for evaluating the generation of specifications and formally verified code in Lean. The benchmark comprises of 161 programming problems; …

  2. CLEVER: A Curated Benchmark for Formally Verified Code Generation

    TL;DR: We introduce CLEVER, a hand-curated benchmark for verified code generation in Lean. It requires full formal specs and proofs. No few-shot method solves all stages, making it a strong …

  3. Submissions | OpenReview

    Jan 22, 2025 · Promoting openness in scientific communication and the peer-review process

  4. Evaluating the Robustness of Neural Networks: An Extreme Value...

    Feb 15, 2018 · Our analysis yields a novel robustness metric called CLEVER, which is short for Cross Lipschitz Extreme Value for nEtwork Robustness. The proposed CLEVER score is …

  5. 579 In this paper, we have proposed a novel counter- factual framework CLEVER for debiasing fact- checking models. Unlike existing works, CLEVER is augmentation-free and mitigates …

  6. Alias-Free Mamba Neural Operator | OpenReview

    Sep 25, 2024 · Functionally, MambaNO achieves a clever balance between global integration, facilitated by state space model of Mamba that scans the entire function, and local integration, …

  7. Provably Mitigating Overoptimization in RLHF: Your SFT Loss is...

    Jun 18, 2024 · With a clever usage of the equivalence between reward models and the corresponding optimal policy, the algorithm features a simple objective that combines (i) a …

  8. Weakly-Supervised Affordance Grounding Guided by Part-Level...

    Jan 22, 2025 · In this work, we focus on the task of weakly supervised affordance grounding, where a model is trained to identify affordance regions on objects using human-object …

  9. While, as we mentioned earlier, there can be thorny “clever hans” issues about humans prompting LLMs, an automated verifier mechanically backprompting the LLM doesn’t suffer from these. …

  10. La RoSA: Enhancing LLM Efficiency via Layerwise Rotated Sparse...

    May 1, 2025 · We use a clever technique that involves rotating the data within each layer of the model, making it easier to identify and keep only the most important parts for processing. This …