Dear AIC Team,
After building the docker container like the Submission.md docs mentioned and testing it locally, it spun up the containers and the evaluation worked well. When I then uploaded and submitted it on the website, it failed without explanation and logs. Is there a chance to perhaps receive some more information regarding this, or more detailed logs to debug why it may have not worked on the the submission portal.
We’ve just rolled out changes that make the eval logs available for your submissions. The logs could provide some insights on why the submissions are failing. For example, a common case is that the AIC engine was timing out when waiting for the submitted model to be ready (see timeout limits in aic/docs/challenge_rules.md at main · intrinsic-dev/aic · GitHub )
You should see be able view these logs in the stdout files in your next submission.
Following the recent rollout, we have observed a regression in the evaluation script. Specifically, the system is failing to wait for the aic_model node to transition from the ‘unconfigured’ state to ‘active,’ resulting in an immediate timeout error.
While this issue was resolved in a prior version, it appears the fix may not have been captured in the current eval image on the submission portal. This likely accounts for the widespread failures reported by other users. Could we verify that the latest stable changes are correctly reflected in the portal’s environment?
Every transition for a node has 60 second timeout. If your node takes more than 60 seconds to configure, where different model initialisation happens, it will trigger timeout error.
I removed the warm-ups which i was doing earlier so that moving from configure to activate takes less than 60 seconds.
Currently, my submission is no longer failing; however, I am receiving a score only for Trial 1. The stdout file does not contain any logs from the aic_model node, which is making it difficult to identify the root cause of the issue.
Due to the absence of these logs, I am unable to determine the possible error or understand why the subsequent trials are not being evaluated as expected. Our team name is TCS-PLGRM.
I am dealing with the same issue and would appreciate any visibility and insights. I have had 5 failed submissions: Runs 207, 236, 238, 255, 256 , 5 different images, 5 different timing profiles (637 → 296 → 185 → 132 → 126 s), all with zero log output. All these images work perfectly fine locally and even on another machine for a sanity check. Team name is DevicetoCloud.
Is it possible to get access to container logs? I have also filed a issue on github if thats helpful.
I have written a post explaining the problem and the fix that worked for me (not using sim time and deferring imports to init) have u tried those things?