Update OpenAI detector #2863

rgmz · 2024-05-17T13:01:53Z

Description:

User-level API keys have been deprecated, now project-level API keys (sk-proj-) are the default. They have also introduced service account API keys (sk-$name-).

https://help.openai.com/en/articles/9186755-managing-your-work-in-the-api-platform-with-projects#h_20805964da

Checklist:

Tests passing (make test-community)?
Lint passing (make lint this requires golangci-lint)?

abmussani

Thanks @rgmz for updating the open-ai detector. I've left some comments.

abmussani · 2024-05-31T15:23:53Z

pkg/detectors/openai/openai.go

+	}()
+
+	switch res.StatusCode {
+	case 200:


@rgmz previously verification was based on range of 2xx Http status. Is there any particular reason to change it ?

Many of the detectors are based on a template with generic conditions. I think it's prudent to only consider something verified if it matches known/documented status codes.

Unexpected API/site changes, and thus status code changes, will propagate as an error rather than creating false-positives.

The API reference of OpenAI has well defined the error codes (also In there python library). In my opinion, Its good to parse the response but It wont make any effect on verification.

In my opinion, Its good to parse the response but It wont make any effect on verification.

I agree that parsing error codes doesn't make sense for OpenAI (although there are some APIs where 401 vs 403 is an importance difference).

I don't think it's a good practice to be liberal with what status codes are considered "valid" when only a specific one is expected. It's a recipe for false positives, imo.

abmussani · 2024-05-31T15:40:11Z

pkg/detectors/openai/openai.go

+		if err = json.NewDecoder(res.Body).Decode(&orgs); err != nil {
+			return false, nil, err
+		}
+
+		org := orgs.Data[0]
+		extraData := map[string]string{
+			"id":          org.ID,
+			"title":       org.Title,
+			"user":        org.User,
+			"description": org.Description,
+			"role":        org.Role,
+			"is_personal": strconv.FormatBool(org.Personal),
+			"is_default":  strconv.FormatBool(org.Default),
+			"total_orgs":  fmt.Sprintf("%d", len(orgs.Data)),
+		}
+		return true, extraData, nil


@rgmz Metadata here is an additional information and the credentials are already been verified based on HTTP Status code. Base on the conversation we had on #2807 and #2808 , these are non-fatal errors and can be ignored without affecting the verification result.

..these are non-fatal errors and can be ignored without affecting the verification result.

The question I posed was actually whether it makes sense to return "verified" alongside an error to indicate a potential problem with the response. Generally speaking, I think it's much safer to handle errors rather than ignore them if err == nil { ... }.

There's a larger problem of surfacing errors and giving them precedence, but that's a different topic. e.g., the GitHub detector failing due to a DNS timeout and the JDBC detector failing because of an invalid hosts shouldn't be treated equally.

You are right that these errors should not be ignored. Unfortunately, right now, Detector result struct does not supports to hold these kind of errors (other than verification). In future, we can have separate error field(s) like @dustin-decker proposed to name it as EnrichmentError.

Unfortunately, right now, Detector result struct does not supports to hold these kind of errors (other than verification).

I would still consider them verification errors. e.g., invalid JSON can be indicative of false-positives or the API not behaving as expected (#2099).

Thanks @rgmz for sharing this case study. It is valid use case to convince me. One more case came in my mind is if an API started to redirected to valid page (with status 200) with HTML in response (may be home page), It will also be indicative of false-positives.

abmussani

Thank you. LGTM

) Co-authored-by: āh̳̕med <13666360+0x1@users.noreply.github.com>

rgmz force-pushed the feat/openai-update branch from 8e9e806 to b942f4c Compare May 17, 2024 13:08

rgmz changed the title ~~Update OpenAI detectors~~ Update OpenAI detector May 17, 2024

rgmz force-pushed the feat/openai-update branch 2 times, most recently from 6e49335 to 85c9275 Compare May 17, 2024 14:33

rgmz mentioned this pull request May 21, 2024

Update regex for OpenAI's API key #2868

Closed

2 tasks

rgmz force-pushed the feat/openai-update branch from 85c9275 to 340a50b Compare May 22, 2024 20:03

rgmz force-pushed the feat/openai-update branch from 340a50b to 6964268 Compare May 30, 2024 20:04

abmussani reviewed May 31, 2024

View reviewed changes

feat(openai): add project and service account keys

02e67c9

rgmz force-pushed the feat/openai-update branch from 6964268 to 02e67c9 Compare June 5, 2024 00:38

abmussani approved these changes Jun 5, 2024

View reviewed changes

Merge branch 'main' into feat/openai-update

397f8f5

dustin-decker approved these changes Jun 5, 2024

View reviewed changes

dustin-decker merged commit 024b219 into trufflesecurity:main Jun 5, 2024
11 of 12 checks passed

rgmz deleted the feat/openai-update branch June 5, 2024 15:33

rgmz added a commit to rgmz/trufflehog that referenced this pull request Jun 5, 2024

feat(openai): add project and service account keys (trufflesecurity#2863

71c1930

) Co-authored-by: āh̳̕med <13666360+0x1@users.noreply.github.com>

rgmz added a commit to rgmz/trufflehog that referenced this pull request Jun 6, 2024

feat(openai): add project and service account keys (trufflesecurity#2863

083443e

) Co-authored-by: āh̳̕med <13666360+0x1@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update OpenAI detector #2863

Update OpenAI detector #2863

rgmz commented May 17, 2024 •

edited

abmussani left a comment

abmussani May 31, 2024

rgmz May 31, 2024

abmussani Jun 3, 2024 •

edited

rgmz Jun 3, 2024

abmussani May 31, 2024

rgmz Jun 3, 2024

abmussani Jun 3, 2024

rgmz Jun 5, 2024 •

edited

abmussani Jun 5, 2024

abmussani left a comment

Update OpenAI detector #2863

Update OpenAI detector #2863

Conversation

rgmz commented May 17, 2024 • edited

Description:

Checklist:

abmussani left a comment

Choose a reason for hiding this comment

abmussani May 31, 2024

Choose a reason for hiding this comment

rgmz May 31, 2024

Choose a reason for hiding this comment

abmussani Jun 3, 2024 • edited

Choose a reason for hiding this comment

rgmz Jun 3, 2024

Choose a reason for hiding this comment

abmussani May 31, 2024

Choose a reason for hiding this comment

rgmz Jun 3, 2024

Choose a reason for hiding this comment

abmussani Jun 3, 2024

Choose a reason for hiding this comment

rgmz Jun 5, 2024 • edited

Choose a reason for hiding this comment

abmussani Jun 5, 2024

Choose a reason for hiding this comment

abmussani left a comment

Choose a reason for hiding this comment

rgmz commented May 17, 2024 •

edited

abmussani Jun 3, 2024 •

edited

rgmz Jun 5, 2024 •

edited