Skip to main content

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [List Home]
Re: [tsf-dev] TSF process feedback - part 2

Hi Derek
On 2026-05-05 00:59, Derek M Jones wrote:
It's worth thinking about what TSF is claims to do at a high-level. I think the optimal way to do this is evaluating TSF

Who is the potential customer, i.e., the people paying with their attention
and work time?

That's quite unusual as a definition of "customer", but ok; I'm sticking with my previous answer...

Beneficiaries of this will ideally include:

- engineers/product teams (e.g. demonstrate "best-practices" without need to absorb N proprietary standards) - safety assessors (e.g. reduce time spent locating and normalising work-products)
- security assessors
- quality assessors
- marketing (only if we gain more traction)

The major established users are going to stay with the established tools,
those that they have always used.

Yes I expect they will, until they (the people and the tools) are superseded.

These people have been trained to write safety cases.

In some cases, true. In others, I believe the activity has already been delegated to folks without that training, with or without the support of LLMs.

Nobody writes a safety case because they want to.

I'm sure some folks do.

The next tier down are starting to be asked about conformance to
this that and the other.
External stakeholders need to be satisfied.
Regulators, legal department, major customers.

Agreed.

There seem to be lots of tools already out there
A survey of 10 tools, filtered down from 46 tools
https://dl.acm.org/doi/fullHtml/10.1145/3342481

Thanks for this.

Grok's responses to a few basic questions
https://x.com/i/grok/share/33b5581aa03f493284b661326197609f

I'm not going there :)

Most of these are academic tools, with no support past
the end of the research grant that paid for them.

Quite.

The target customer is an infrequent user.
Deterred by the effort needed to build a case.
Deterred by the cost of acquiring necessary skills.

This may be *a* target "customer" by your definition, but not the only one.

Need to address a broad market and support an ecosystem.

True.

Where are we?

Different interpretations of a scoring function.
Different methods of combining values to create a score.

Defining one scoring function restricts the appeal of the tool.

Why not allow users to calculate any score using any method?

I agree - and I expect we'll get to that.

Provide builtin support for the common methods of
combining values and scoring functions.

Yes, makes sense.

Also support the calculation of multiple scores.

I think we already do. There's nothing to stop folks applying different algorithms (as I understand it some folks are already doing this).

Writing conformance  statements is a skill that takes
practice to learn and become good enough.

LLMs are very good at analyzing sequences of words.

Why not provide a skills assistant that:

   o checks individual compliance statements

   o checks a set of compliance statements

   o checks a compliance document

This is an interesting idea - I just need to conquer my instinctive suspicion that LLMs are mostly snake-oil.

A review of LLMs as judges
https://arxiv.org/html/2511.02203v1
More LLM-based tools at the end of this thread
https://x.com/i/grok/share/33b5581aa03f493284b661326197609f

A simplistic analysis of the contents of one of the TA-analysis conformance
files
https://x.com/i/grok/share/f70c5d3b6e654c96a7f72d0b80f3b9bd

Simplistic indeed. The first point it makes is mostly incorrect, as far as I can tell. I could also argue with a lot of the criticisms, but frankly the thought of having to debate with any of the LLM services just makes me depressed.

I used to spend a lot of time reading programming language standards,
where "shall" and "must" have specific meanings.  The apparent random
switching between the two, in these .md files certainly triggered me.

The mention of LLMs triggers me, but I'm trying to get over it :-)

Also is the "we" set of people the same as the "our" set of people?

I use we to refer to the (nascent) community of folks taking an interest in TSF... which presently includes you :)

These .md files need a lot of cleaning up.

Point taken.

A plug for my hybrid talk at the British Computer Society
tomorrow
https://www.eventbrite.co.uk/e/building-models-of-software-process-behaviours-helped-by-llms-tickets-1987814433481

Is it actually hybrid? The location says "in person"

br
Paul


Back to the top