index.html
+11 -9
@@ -40,7 +40,7 @@ <h1>How's GPT-4o Doing?</h1>
 <p>You can contribute your own tests, too! See the <a href="https://github.com/roboflow/gpt-checkup?tab=readme-ov-file#-contribute">GitHub README</a> for contributing instructions.</p>
 </div>
 <div class="header_subtitle">
-<p>Tests are run every day at 1am PT. Last updated January 26, 2025.</p>
+<p>Tests are run every day at 1am PT. Last updated January 27, 2025.</p>
 <p>Made with ❤️ by the team at <a href="https://roboflow.com">Roboflow</a>.</p>
-<p style="font-size: 16px; color: var(--gray-700)">Today, the average response time to receive results from our tests was <b>3.74 seconds</b> per request.</p>
+<p style="font-size: 16px; color: var(--gray-700)">Today, the average response time to receive results from our tests was <b>3.73 seconds</b> per request.</p>
 <p class="subtitle">This number only accounts for requests made by this application.</p>
 </div>
 <div class="chart">
 <div class="chart_box chart_box_green">
-<p>3.74 s</p>
+<p>3.73 s</p>
 </div>
 </div>
 </div>
@@ -122,7 +122,7 @@ <h3><span class="explainer_icon far fa-comment-dots"></span>Prompt</h3>
 <h3><span class="explainer_icon far fa-image"></span>Image</h3>
 <img class="test_image" src="images/fruit.jpeg" alt="Image of the input into GPT-4" />
 <h3><span class="explainer_icon far fa-sparkles"></span>Result</h3>
-<pre>7</pre>
+<pre>8</pre>
 <p class="subtitle" style="margin-top: 16px; text-align: center">Test submitted by <a href="https://roboflow.com" target="_blank">Roboflow</a></p>
 </div>
 </div>
@@ -230,7 +230,7 @@ <h3><span class="explainer_icon far fa-comment-dots"></span>Prompt</h3>
 <h3><span class="explainer_icon far fa-image"></span>Image</h3>
 <img class="test_image" src="images/fruit.jpeg" alt="Image of the input into GPT-4" />
 <h3><span class="explainer_icon far fa-sparkles"></span>Result</h3>
 <button type="button" class="dropdown dropdown_learn active">Learn about this test</button>
@@ -417,10 +417,12 @@ <h3><span class="explainer_icon far fa-comment-dots"></span>Prompt</h3>
 <h3><span class="explainer_icon far fa-image"></span>Image</h3>
 <img class="test_image" src="images/annotationqa.jpeg" alt="Image of the input into GPT-4" />
 <h3><span class="explainer_icon far fa-sparkles"></span>Result</h3>
-<pre>It appears that the dataset captures cars on the road with bounding boxes (red boxes). The image shows several cars labeled correctly, but there is at least one car (the white car on the right) that seems unlabeled.
+<pre>To count the missing annotations, I would need to know the total number of cars visible in the image versus the number of cars with red bounding boxes. Based on the image:
 
-Here's the result in JSON format:
+1. **Cars annotated with bounding boxes:** There are 6 red bounding boxes visible.
+2. **Cars visible in the scene, including unannotated ones:** It appears there is one car near the farthest end of the scene without a bounding box on it.
"result": "I was thinking earlier today that I have gone through, to use the lingo, eras of listening to each of Swift's Eras. Meta indeed. I started listening to Ms. Swift's music after hearing the *Midnights* album. A few weeks after hearing the album for the first time, I found myself playing various songs on repeat. I listened to the album in order multiple times."
+},
+"handwriting_ocr": {
+"score": 1,
+"success": true,
+"price": 0.00974,
+"pass_fail": "Pass",
+"response_time": 8.458041667938232,
+"result": "The words of songs on the album have been echoing in my head all week. \"Fades into the grey of my day old tea.\""
"result": "To count the missing annotations, I would need to know the total number of cars visible in the image versus the number of cars with red bounding boxes. Based on the image:\n\n1. **Cars annotated with bounding boxes:** There are 6 red bounding boxes visible.\n2. **Cars visible in the scene, including unannotated ones:** It appears there is one car near the farthest end of the scene without a bounding box on it.\n\n### JSON Output:\n```json\n{\n\"missing\": 1\n}\n```"