They could, back when everyone was using pre-AI context engines that were actually capable of it. Autocorrect is in the same boat. It used to change things correctly to match the context, and nowadays it will change words to other words that don't work at all within the rest of the context.
Though I am doubtful that whatever detects music and sounds in the video ever had any kind of context seeking in the first place.