Some
websites report a "not found" error by returning a standard web page with a "200 OK" response code, falsely reporting that the page loaded properly; this is known as a
soft 404. The term "soft 404" was introduced in 2004 by Ziv Bar-Yossef
et al. Soft 404s are problematic for automated methods of discovering whether a link is broken. Some search engines, like
Yahoo and
Google, use automated processes to detect soft 404s. Soft 404s can occur as a result of configuration errors when using certain HTTP server software, for example with the
Apache software, when an Error Document 404 (specified in a
.htaccess file) is specified as an absolute path (e.g. http://example.com/error.html) rather than a relative path (/error.html). This can also be done on purpose to force some browsers (like
Internet Explorer) to display a customized 404 error message rather than replacing what is served with a browser-specific "friendly" error message (in Internet Explorer, this behavior is triggered when a 404 is served and the received HTML is shorter than a certain length, and can be manually disabled by the user). There are also "soft 3XX" errors where content is returned with a status 200 but comes from a redirected page, such as when missing pages are redirected to the domain root/home page.
Proxy servers Some
proxy servers generate a 404 error when a
500-range error code would be more correct. If the proxy server is unable to satisfy a request for a page because of a problem with the remote host (such as hostname resolution failures or refused TCP connections), this should be described as a 5xx Internal Server Error, but might deliver a 404 instead. This can confuse programs that expect and act on specific responses, as they can no longer easily distinguish between an absent web server and a missing web page on a web server that is present.
Intentional 404s In July 2004, the UK telecom provider
BT Group deployed the
Cleanfeed content blocking system, which returns a 404 error to any request for content identified as potentially illegal by the
Internet Watch Foundation. Other ISPs return a
HTTP 403 "forbidden" error in the same circumstances. The practice of employing fake 404 errors as a means to conceal
censorship has also been reported in
Thailand and
Tunisia. In Tunisia, where censorship was severe before the
2011 revolution, people became aware of the nature of the fake 404 errors and created an imaginary character named "
Ammar 404" who represents "the invisible censor". ==Microsoft Internet Server 404 substatus error codes==