Considering Lit for the Beman Project

dsankel · July 26, 2024, 9:13pm

Abstract

The LLVM Integrated Tester (lit) is being proposed as the recommended test framework for Beman Projects. While it supports compilation failure testing, it has limited usage experience, introduces complexity, and has ergonomics challenges. We instead suggest recommending a more popular framework, such as GTest, supplemented with CMake functions adding negative compilation testing capabilities.

Link to full doc…

InbalL · July 29, 2024, 5:13pm

That sounds like the right direction, thank you for exploring that.
As discussed during the meeting, for a simple go-to solution we could recommend “doctest” (a somewhat improved version of gtest):

GitHub - doctest/doctest: The fastest feature-rich C++11/14/17/20/23 single-header testing framework
Better Ways to Test with doctest – the Fastest C++ Unit Testing Framework | The ReSharper C++ Blog

linusboehm · July 29, 2024, 5:38pm

I agree with the findings in the paper. Seems like there’s no one-size-fits-all testing solution. Having gtest, catch2, or doctest as a default recommendation sounds sensible. In cases where these are not an option (testing standard lib) or are missing features (assert compile failure) authors can go for a more suitable solution.

I don’t have any experience with doctest. I’ve used gtest and catch2 and was happy with both. I think catch2 was a bit easier to use compared to gtest and I like the expressiveness of the Given-When-Then-style tests in catch2.

bcraig · July 31, 2024, 12:48pm

I’m fine with the conclusion of not using lit. The python dependency and complexity of deploying lit are real issues.

However, I’m not sure about recommending tools like GTest since that will make it more difficult for standard library vendors to adopt Beman tests. None of the big three vendors use a testing framework like GTest, catch2, or doctest. Instead, they have a bunch of TUs, each with their own main. If Beman tests follow that pattern, then it is easy to move the tests into each respective STL vendor’s test suite. Those frameworks are all great for most library uses, but they are less great if the goal is to get into a standard library. I’ll also note that using one of the big test frameworks will end up as a barrier for freestanding testing.

My suggestion will be main tests, but orchestrated by cmake code. Much of that work would need to be done for negative compilation testing anyway, so making it work for the positive testing too doesn’t seem like much of a stretch.

I’ll note that lit is used by all of clang / llvm for testing, and not just the libc++ portion of the project. That demonstrates more of the flexibility of the framework. A long time ago (2015) I was able to use lit to cross-compile libc++ tests on my host machine, then run the tests on a simulator.

One of the benefits of lit that wasn’t mentioned was in-test build configuration control. The test framework can attempt to build and run each test under multiple configurations, and the tests can indicate whether they support that configuration or not. In libc++, this most commonly manifests as saying that a test only supports certain standards, but other features can be tested for (example). I can imagine Beman tests needing to annotate that they are expected to fail on certain implementations.
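As a rough illustration of the in-test configuration control described above, a libc++-style test states its constraints in comments at the top of the file, and the test runner reads them to decide whether the file should be built, run, or expected to fail. The sketch below assumes lit's `UNSUPPORTED` and `XFAIL` directives with libc++-style feature names such as `c++03`; `some-vendor-stdlib` and `make_answer` are hypothetical names invented for the example.

```cpp
// UNSUPPORTED: c++03, c++11
// XFAIL: some-vendor-stdlib

// The two comment lines above are directives for the test runner, not for the
// compiler: skip this file under C++03/C++11, and expect it to fail when the
// (hypothetical) feature 'some-vendor-stdlib' is set.

#include <cassert>
#include <memory>

// Hypothetical code under test; std::make_unique needs C++14 or later.
std::unique_ptr<int> make_answer() {
    return std::make_unique<int>(42);
}

// Body of the test; the file's main() would just call this and return 0.
void test_make_answer() {
    std::unique_ptr<int> p = make_answer();
    assert(p != nullptr);
    assert(*p == 42);
}
```

The test body itself stays a plain function with asserts; only the comment header is specific to the runner.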

I do feel that the adoption and documentation roadblock points are a bit overblown. If anything, the concern is in the other direction (i.e. the test frameworks are harder to understand). With lit tests (or other main tests), you write a main, add some asserts, and you are done (example). With the frameworks, you need to see how this framework defines its tests, how this framework spells its asserts and expectations, and so on.
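A minimal sketch of what such a framework-free test file might look like, assuming nothing beyond the standard library. `count_even` is a hypothetical stand-in for the code under test, and `run_tests` is a name chosen for the example.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical function under test; stands in for a library entry point.
std::size_t count_even(const std::vector<int>& values) {
    std::size_t n = 0;
    for (int v : values) {
        if (v % 2 == 0) {
            ++n;
        }
    }
    return n;
}

// Each test is an ordinary function that asserts what it expects.
void test_empty_input() { assert(count_even({}) == 0); }
void test_mixed_input() { assert(count_even({1, 2, 3, 4}) == 2); }

// Returns 0 on success. A failed assert aborts the process, and the nonzero
// exit status is what the build system reports as a test failure.
int run_tests() {
    test_empty_input();
    test_mixed_input();
    return 0;
}

// The test program's entry point would simply be:
//   int main() { return run_tests(); }
```

Note that `assert` compiles to nothing when `NDEBUG` is defined, so tests written this way need to be built without it.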

jwakely · August 1, 2024, 12:27pm

Don’t those kinds of cmake functions (such as icm) have even less usage experience than lit? (which is used by all of LLVM, not just by libc++ as stated in the doc).

dsankel · August 1, 2024, 1:53pm

icm’s build failure testing capability certainly has less usage experience than lit’s. A GitHub code search revealed only a couple of Open Source projects using it. It was interesting to see NVIDIA/stdexec as one of those.

By the way, I updated the document to clarify all of LLVM is using lit and that icm has had little uptake in the Open Source world.

correaa · August 5, 2024, 1:41am

In my opinion, header-only libraries should have the option to be tested with header-only test frameworks.
Also, the framework should generate almost no warnings when included.

That limits the possibilities a lot; as far as I know Gtest and Catch2 are not header-only.

Doctest is header only (but I never used it), so are Boost.LightweightTest and mu-t (seems experimental).
Boost.Test can be included as header only but the compile times are abysmal, and it generates lots of warnings.

Not sure about other possibilities, but this is something to take into account for header-only proposals that I expect to be the common case.

Sdowney · August 5, 2024, 1:52pm

I’m not terribly concerned about the python dependency as python is easily available and python has lots of tools for managing projects. See other thread on pre-commit. I should have a draft PR soon for example and optional.
Lit, on the other hand, doesn’t seem to be packaged, so would be very complicated to get working portably? Unless I’ve missed something. An LLVM development install is a huge ask.

However, for a lot of tests, they are going to look just like the source for lit tests. Making the negative tests, the failure to compile, more reusable should be a goal. That’s outside what xUnit style frameworks like gtest and catch2 do, in any case. We’re going to have to do something different in any case.

dsankel · August 9, 2024, 8:49pm

Welcome @correaa!

Could you elaborate a bit on the use case you have in mind? People using header-only libraries I know usually paste the headers in their repository somewhere. Will your users be using CMake? How would they build the tests?

dsankel · August 9, 2024, 10:23pm

This is a good point. One solution would be to forgo the unit test framework niceties and stick with assert. Although we wouldn’t be using lit, I expect tests written that way to be extremely portable.

correaa · August 11, 2024, 9:24pm

Yes, what I am saying is that, in general, for someone developing a header only library, it is unreasonable to also require (or strongly suggest) to use a test framework that needs compilation or binaries.

This is my experience:
I developed a Boost-like library (an array library to be specific) that is header only.
I started using Catch2 when it was header only and all was fine.
Then Catch2 changed in a way that needed compilation.

Given the complexity, and since this was going to be a Boost library anyway, I changed to Boost.Test, which also needed compilation.
This worked well for a while, because Boost.Test comes precompiled in many systems and it is as nice as Catch2.

However this became a burden when I needed to test the library against mildly exotic systems: 32-bit systems, apple-M systems and Windows (MSVC, clang, gcc).
In these cases, preparing Boost.Test for these systems required compilation of Boost.Test (or in the case of Windows, downloading large binaries) which was too heavy in the CI.

Boost.Test can be used in header-only mode, but it is not designed for that, so compilation times were extremely long because I have a few dozen cpp files in the test.

At the end of the day, recently, I ended up using Boost.LightweightTest (part of Boost.Core), which doesn’t have too many features but it is very lightweight and it has a proportional complexity (it is not an overkill) with the library I am trying to test.

I am not suggesting to use B.LWT specifically, I am trying to communicate the value of having the option of a lightweight, header-only framework for a small, header-only library, which I guess is going to be a common case for Beman.

For large projects, I guess it is justified to use compiled test frameworks, because the upfront cost is amortized by the complexity and compilation times of the large projects. Also they have more features, like test for thrown exceptions and nicer diagnostics.

I don’t know what the best way to proceed is. Complex libraries will benefit from feature-rich frameworks, like GTest, but I have the impression, from experience, that for the majority it will add unnecessary complexity.
At the same time, proposing (as a template) two different test frameworks, one header-only and one compiled can be beneficial in this sense, but also can be confusing.

I just learned from a comment in this thread that doctest is (or can be used as) header-only, so it could be what we are looking for. (Just keep an eye on compilation times if one has many individual cpp tests.)

Regarding your comment, “People using header-only libraries I know usually paste the headers in their repository somewhere”: I think what I am saying is independent of this usage pattern. In any case, a header-only library can still benefit from using CMake (I do it this way, but if someone still wants to use it by copying the headers to a directory, that is fine with me).

Jeff-Garland · August 22, 2024, 2:01pm

I’m late to the comment chain here – but we haven’t seemed to come to any resolution. I agree with @correaa here that most of the test frameworks are way too heavy. For boost.datetime I wrote a less than 100 sloc framework that did all I needed – it uses real functions instead of macros bc I hate debugging macros. It’s not really suitable, but it’s an example of the sort of thing a library author might resort to if we don’t have a recommendation on a header only framework.

github.com

boostorg/date_time/blob/develop/test/testfrmwk.hpp


#ifndef TEST_FRMWK_HPP___
#define TEST_FRMWK_HPP___

/* Copyright (c) 2002,2003 CrystalClear Software, Inc.
 * Use, modification and distribution is subject to the 
 * Boost Software License, Version 1.0. (See accompanying
 * file LICENSE_1_0.txt or http://www.boost.org/LICENSE_1_0.txt)
 * $Date$
 */


#include <iostream>
#include <string>
#include <boost/config.hpp>

//! Really simple test framework for counting and printing
class TestStats
{
public:

This file has been truncated.
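To give a feel for how small a function-based (macro-free) helper can be, here is a sketch. It is not the contents of the truncated file above; `CheckCounter` and its members are hypothetical names, and the design simply assumes the caller passes a label because a function cannot see the source text of the expression it receives.

```cpp
#include <cassert>
#include <iostream>
#include <string>

// Hypothetical counter class for illustration only.
class CheckCounter {
public:
    // Records one check and prints the label when the condition is false.
    bool check(const std::string& label, bool condition) {
        if (condition) {
            ++passed_;
        } else {
            ++failed_;
            std::cout << "FAIL :: " << label << '\n';
        }
        return condition;
    }

    int passed() const { return passed_; }
    int failed() const { return failed_; }

    // Suitable as the return value of main(): 0 only if nothing failed.
    int exit_code() const { return failed_ == 0 ? 0 : 1; }

private:
    int passed_ = 0;
    int failed_ = 0;
};
```

A test file would create one `CheckCounter`, call `check("description", expression)` for each expectation, and return `exit_code()` from `main`. Because `check` is a real function, a debugger can step into it directly.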

boost.lwt looks pretty good (except macros) but obviously way more comprehensive.

Professionally we’ve switched to using boost.ut. It’s no macro (yay!), but does require c++20. You can read the docs for yourself, but it scales up from ‘very simple’ to extremely feature rich. We’ve encountered zero issues with it. Would be nice if Kris would actually put it into boost, but I can understand his reluctance to bother at this point.

ClausKlein · November 26, 2024, 6:02pm

No macros is very important in my opinion.

`run-clang-tidy` on `tests` is the ultimate check for `unit test frameworks`.

`doctest` does not pass this clang-tidy test!

pdimov · November 28, 2024, 5:47pm

The macros are needed to capture the text of the tested expression; e.g. in Boost.LWT, BOOST_TEST_EQ(x, y) can output "Test x == y failed.".

Why is that a problem?
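A small sketch of the mechanism, for readers unfamiliar with it. This is not Boost.LWT's implementation; `SKETCH_TEST_EQ` and `describe_failure` are hypothetical names, and the macro returns the message as a string rather than printing it so that the behaviour is easy to inspect.

```cpp
#include <cassert>
#include <sstream>
#include <string>

// Hypothetical helper: builds a failure message from the captured source text.
inline std::string describe_failure(const char* lhs_text, const char* rhs_text) {
    std::ostringstream out;
    out << "Test " << lhs_text << " == " << rhs_text << " failed.";
    return out.str();
}

// Hypothetical macro: the preprocessor's '#' operator turns each argument into
// a string literal, so the message can quote the expressions exactly as they
// were written in the test. Yields an empty string when the comparison holds.
#define SKETCH_TEST_EQ(lhs, rhs) \
    (((lhs) == (rhs)) ? std::string{} : describe_failure(#lhs, #rhs))
```

With `int x = 1, y = 2;`, the expression `SKETCH_TEST_EQ(x, y)` yields `Test x == y failed.`. An ordinary function receiving `x` and `y` would only see the values 1 and 2, which is why the spelling has to be captured by the preprocessor.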

ClausKlein · November 29, 2024, 12:51pm

see

Jeff-Garland · December 5, 2024, 12:33am

pdimov> Why is that a problem?

Hi Peter - welcome! I doubt that it really is…despite my months old comment.

clausklein> see…

Claus – I understand that doctest fails some of the tidy tests, but not all macros do, right? And while I’m a fan of ditching macro based testing I don’t think that’s a hill we should die on. Although I’m starting to think we might need to just make one of our own since no one can ever agree on the testing framework.

river · December 16, 2024, 9:10pm

I don’t think clang-tidy is helpful in evaluating tests. I also suggest a more careful approach to incorporating clang-tidy: it is often pedantic, with ideology-focused suggestions that produce so many false positives that they outweigh its usefulness in finding bugs.

I specifically replied to one of CK’s clang-tidy reports about this, and I wish to document that reply here. Link: Switch Implementation by wusatosi · Pull Request #43 · bemanproject/inplace_vector · GitHub

Original comment by CK (all error messages are shown in my reply, omitted here for clarity):
This code need some more love:

bash-5.2$ run-clang-tidy -p build -checks='-*,bugprone-*'
Enabled checks:
    bugprone-argument-comment
...
...
...
      |           ^
/Users/clausklein/Workspace/cpp/beman-project/inplace_vector/tests/beman/inplace_vector/ref_impl.test.cpp:493:25: note: move occurred here
  493 |     vector<MoveOnly, 3> b(std::move(a));
      |                         ^
33493 warnings generated.
Suppressed 33486 warnings (33486 in non-user code).
Use -header-filter=.* to display errors from all non-system headers. Use -system-headers to display errors from system headers as well.

bash-5.2$

My reply:

Hey @ClausKlein, sorry I didn’t reply to your message before merging.

This is simply to port the reference implementation. We might incorporate suggestions from clang-tidy, but the main concern currently is to review the reference implementation against the paper. Plus, clang-tidy gives pedantic warnings that are often of no value.

I don’t really think we should run clang-tidy checks on test files, especially when this clearly checks exception throwing:

[1/3][2.9s] /usr/local/opt/llvm/bin/clang-tidy -checks=-*,bugprone-* -p=build /Users/clausklein/Workspace/cpp/beman-project/inplace_vector/tests/beman/inplace_vector/inplace_vector.test.cpp
/Users/clausklein/Workspace/cpp/beman-project/inplace_vector/tests/beman/inplace_vector/inplace_vector.test.cpp:75:7: warning: empty catch statements hide issues; to handle exceptions appropriately, consider re-throwing, handling, or avoiding catch altogether [bugprone-empty-catch]
   75 |     } catch (const std::out_of_range &) {
      |       ^
/Users/clausklein/Workspace/cpp/beman-project/inplace_vector/tests/beman/inplace_vector/inplace_vector.test.cpp:83:7: warning: empty catch statements hide issues; to handle exceptions appropriately, consider re-throwing, handling, or avoiding catch altogether [bugprone-empty-catch]
   83 |     } catch (const std::out_of_range &) {
      |       ^
27827 warnings generated.
Suppressed 27825 warnings (27825 in non-user code).
Use -header-filter=.* to display errors from all non-system headers. Use -system-headers to display errors from system headers as well.

I don’t think this is valuable for testing structs, also there’s no instance of misuse reported.

[3/3][4.7s] /usr/local/opt/llvm/bin/clang-tidy -checks=-*,bugprone-* -p=build /Users/clausklein/Workspace/cpp/beman-project/inplace_vector/tests/beman/inplace_vector/ref_impl.test.cpp
/Users/clausklein/Workspace/cpp/beman-project/inplace_vector/tests/beman/inplace_vector/ref_impl.test.cpp:139:16: warning: 2 adjacent parameters of 'non_copyable' of convertible types are easily swapped by mistake [bugprone-easily-swappable-parameters]
  139 |   non_copyable(int i, double d) : i_(i), d_(d) {}
      |                ^~~~~~~~~~~~~~~
/Users/clausklein/Workspace/cpp/beman-project/inplace_vector/tests/beman/inplace_vector/ref_impl.test.cpp:139:20: note: the first parameter in the range is 'i'
  139 |   non_copyable(int i, double d) : i_(i), d_(d) {}
      |                    ^
/Users/clausklein/Workspace/cpp/beman-project/inplace_vector/tests/beman/inplace_vector/ref_impl.test.cpp:139:30: note: the last parameter in the range is 'd'
  139 |   non_copyable(int i, double d) : i_(i), d_(d) {}
      |                              ^
/Users/clausklein/Workspace/cpp/beman-project/inplace_vector/tests/beman/inplace_vector/ref_impl.test.cpp:139:23: note: 'int' and 'double' may be implicitly converted
  139 |   non_copyable(int i, double d) : i_(i), d_(d) {}
      |                       ^

Why not let tests throw exceptions out of main? I don’t think wrapping the main function in a try-catch and returning -1 is of any value for tests.

/Users/clausklein/Workspace/cpp/beman-project/inplace_vector/tests/beman/inplace_vector/ref_impl.test.cpp:336:5: warning: an exception may be thrown in function 'main' which should not throw exceptions [bugprone-exception-escape]
  336 | int main() {
      |     ^

The reference implementation aimed at a std-style implementation (originally under the std namespace). We would like to move away from reserved identifiers, but it’s not a top priority.

/Users/clausklein/Workspace/cpp/beman-project/inplace_vector/tests/beman/inplace_vector/ref_impl.test.cpp:338:42: warning: declaration uses identifier '__non_trivial', which is a reserved identifier [bugprone-reserved-identifier]
  338 |     using beman::__iv_detail::__storage::__non_trivial;
      |                                          ^~~~~~~~~~~~~
      |                                          _non_trivial
/Users/clausklein/Workspace/cpp/beman-project/inplace_vector/tests/beman/inplace_vector/ref_impl.test.cpp:339:42: warning: declaration uses identifier '__trivial', which is a reserved identifier [bugprone-reserved-identifier]
  339 |     using beman::__iv_detail::__storage::__trivial;
      |                                          ^~~~~~~~~
      |                                          _trivial
/Users/clausklein/Workspace/cpp/beman-project/inplace_vector/tests/beman/inplace_vector/ref_impl.test.cpp:340:42: warning: declaration uses identifier '__zero_sized', which is a reserved identifier [bugprone-reserved-identifier]
  340 |     using beman::__iv_detail::__storage::__zero_sized;
      |                                          ^~~~~~~~~~~~
      |                                          _zero_sized

The intent is literally to test the move-from object:

/Users/clausklein/Workspace/cpp/beman-project/inplace_vector/tests/beman/inplace_vector/ref_impl.test.cpp:486:11: warning: 'a' used after it was moved [bugprone-use-after-move]
  486 |     CHECK(a.size() == std::size_t{3});
      |           ^
/Users/clausklein/Workspace/cpp/beman-project/inplace_vector/tests/beman/inplace_vector/ref_impl.test.cpp:484:7: note: move occurred here
  484 |     b = std::move(a);
      |       ^
/Users/clausklein/Workspace/cpp/beman-project/inplace_vector/tests/beman/inplace_vector/ref_impl.test.cpp:495:11: warning: 'a' used after it was moved [bugprone-use-after-move]
  495 |     CHECK(a.size() == std::size_t{3});
      |           ^
/Users/clausklein/Workspace/cpp/beman-project/inplace_vector/tests/beman/inplace_vector/ref_impl.test.cpp:493:25: note: move occurred here
  493 |     vector<MoveOnly, 3> b(std::move(a));
      |                         ^

I hope this line-by-line explanation shows why I am not a big advocate for clang-tidy. I am sorry, but this report is not helpful at all to our implementation.
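For a test that inspects a moved-from object on purpose, as in the last excerpt above, one possible option (not something proposed in the PR discussion) is to mark the deliberate use with clang-tidy's `NOLINTNEXTLINE` comment instead of disabling the check everywhere. The sketch below uses `std::vector` as a stand-in for the container under test, and `moved_from_vector_is_reusable` is a hypothetical name.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Returns true when the destination holds the moved elements and the
// moved-from source can still be queried and reused.
bool moved_from_vector_is_reusable() {
    std::vector<int> a{1, 2, 3};
    std::vector<int> b(std::move(a));

    // Deliberate query of the moved-from object; the comment on the next line
    // asks clang-tidy to skip this one diagnostic for the line that follows it.
    // NOLINTNEXTLINE(bugprone-use-after-move)
    const std::size_t leftover = a.size();
    (void)leftover;  // the value is unspecified for std::vector, so it is not asserted

    a.clear();        // clear() has no preconditions, so reuse after it is well defined
    a.push_back(4);
    return b.size() == 3 && a.size() == 1;
}
```

The suppression stays next to the line it excuses, so other `bugprone-use-after-move` findings in the same file would still be reported.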

perghosh · April 27, 2025, 2:37pm

Late to the party

Just wanted to contribute my opinion on how important it is to have a simple and flexible system for testing, smaller is usually better than those that have features that hardly anyone uses. They are mostly blockers for new users.

Not much is needed just to review results from code.

It’s also easy to write your own code for testing and then also get the advantage of adapting it to your specific needs without dependencies.

Jeff-Garland · April 29, 2025, 2:29am

Agree. beman.scope is adopting catch2 for the moment anyway, and it seems pretty straightforward. It’s still more than I’d like – compiles a library for a header only lib test – but not as heavy as gtest.

But my overall take is this issue will end up like licenses – pick one of these 3 solutions.

ClausKlein · April 30, 2025, 9:42am

I agree, we should only check the implementation and the examples with clang-tidy.
