Essential International Standards and Registries for Web Developers

- Programming, Quality Assurance, Security

Latest revision:


The following is a collection of free international standards, registries and references that I have gathered over the years while developing websites and web services. Although very precise and technical by nature, these references are extremely useful for verifying that a specific implementation is actually correct, and for mitigating unexpected interoperability issues between systems on the Internet.

As it's not always clear how a technology is used based on its name or acronym, I included the primary use case for each reference along with its name and/or acronym.

Also, many of these standards build on one another, so I tried to list them in that order as much as possible while maintaining clarity.

Note that many of these standards use a metalanguage defined by the following:

Real world stuff

The following provides IDs and critical information about worldwide social, political and cultural concepts often referenced.

Plain text

The following explains how text is handled by a computer.

Note that the most popular character encoding is UTF-8, a superset of ASCII.
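As a minimal sketch of what "superset of ASCII" means in practice, the Python snippet below encodes the same strings under both encodings. The sample strings and variable names are illustrative assumptions, not taken from any of the references.

```python
# ASCII-only text produces byte-for-byte identical output in ASCII and UTF-8.
ascii_text = "Hello, web!"
same_bytes = ascii_text.encode("ascii") == ascii_text.encode("utf-8")

# Characters outside ASCII are encoded as multi-byte sequences in UTF-8.
accented = "caf\u00e9"  # "café" with a precomposed e-acute (U+00E9)
utf8_bytes = accented.encode("utf-8")

print(same_bytes)                      # whether both encodings agree
print(len(accented), len(utf8_bytes))  # character count vs. byte count
```

The character count and byte count differ for the second string, which is why length checks on text should state whether they count characters or bytes.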

IP addresses

The following explains how computers can identify and talk to each other on the Internet.
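For illustration, here is a short sketch using Python's standard `ipaddress` module to parse IPv4 and IPv6 addresses and test network membership. The addresses come from ranges reserved for documentation and are assumptions chosen for this example.

```python
import ipaddress

# Parse one IPv4 and one IPv6 address (both from documentation-only ranges).
v4 = ipaddress.ip_address("192.0.2.10")
v6 = ipaddress.ip_address("2001:db8::1")

# A network is written as an address plus a prefix length (CIDR notation).
network = ipaddress.ip_network("192.0.2.0/24")
in_network = v4 in network

print(v4.version, v6.version)
print(in_network, network.num_addresses)
```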

Domain names

The following explains how to find information about a particular domain on the Internet, including the IP addresses of its services.

Note that host names are domain names on which a website can be hosted.

Note that the original version of IDNA is not fully backwards-compatible with the current version. While the current version is used in all major browsers nowadays, some other clients may still be in transition. For more information, refer to UTS #46.
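The incompatibility can be observed with Python's built-in `idna` codec, which implements the original IDNA specification (RFC 3490). The sketch below is only an illustration with made-up host names. Under the original rules the German sharp s is mapped to "ss", whereas the current version of IDNA keeps it as a distinct character, so the same name can resolve to a different host depending on the client.

```python
# Python's built-in "idna" codec follows the original IDNA (2003) rules.
plain = "b\u00fccher.example"   # "bücher.example"
sharp_s = "fa\u00df.de"         # "faß.de"

# Non-ASCII labels are converted to an ASCII form prefixed with "xn--".
encoded_plain = plain.encode("idna")

# Under the original rules, the sharp s is mapped to "ss" before encoding,
# so no "xn--" label is produced at all.
encoded_sharp_s = sharp_s.encode("idna")

print(encoded_plain)
print(encoded_sharp_s)
```

Supporting the current version of IDNA in Python requires a third-party library, which is one example of a client still being in transition.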

TLS

The following explains how an encrypted connection can be established between two machines over a network.

Note that SSL is an obsolete technology that was superseded by TLS.
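As a hedged sketch, this is how a client could be configured in Python to refuse obsolete protocol versions. The choice of TLS 1.2 as the minimum is an assumption made for this example, not a recommendation drawn from the references.

```python
import ssl

# create_default_context() enables certificate and host name verification.
context = ssl.create_default_context()

# Refuse anything older than TLS 1.2 (assumed policy for this sketch).
context.minimum_version = ssl.TLSVersion.TLSv1_2

print(context.verify_mode == ssl.CERT_REQUIRED)
print(context.check_hostname)
print(context.minimum_version)
```

The resulting context can then be passed to a socket wrapper or an HTTP client to establish the actual connection.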

Also note that another standard, DANE, enables TLS public keys to be stored securely in the DNS. This technically makes certificates obsolete and also enables mandatory secure connections to servers, but DANE currently suffers from operational issues that prevent widespread support and deployment.

Emails

The following explains how emails work.

Note that Pluralsight subscribers can watch my course Configuring and Managing SPF, DKIM, and DMARC, which covers some of these topics.

XML

The following explains how to use XML, a data format that underlies many of the document formats used on the web.

Note that newer versions of XML and XPath exist, but are seldom used.
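To make this concrete, here is a small sketch that parses an XML document and queries it with the limited XPath subset supported by Python's standard `xml.etree.ElementTree` module. The document content and element names are invented for this example.

```python
import xml.etree.ElementTree as ET

# A tiny, made-up XML document.
document = """
<library>
  <book id="b1"><title>First</title></book>
  <book id="b2"><title>Second</title></book>
</library>
"""

root = ET.fromstring(document)

# Path expression selecting every title of every book.
titles = [title.text for title in root.findall("./book/title")]

# Predicate selecting a book by attribute value.
second = root.find("./book[@id='b2']/title").text

print(titles)
print(second)
```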

JSON

The following explains how to use JSON, a data format commonly used by websites.
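As a brief illustration, the sketch below serializes a Python value to JSON text and parses it back with the standard `json` module. The payload is a made-up example.

```python
import json

# A made-up payload covering the JSON value types: object, array,
# string, number, boolean and null.
payload = {
    "name": "caf\u00e9",
    "tags": ["a", "b"],
    "count": 3,
    "active": True,
    "note": None,
}

# Serialize without extra whitespace, keeping non-ASCII characters as-is.
text = json.dumps(payload, ensure_ascii=False, separators=(",", ":"))

# Parsing the text gives back an equal Python value.
restored = json.loads(text)

print(text)
print(restored == payload)
```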

URLs

The following explains how to interpret URLs.
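For example, a URL can be split into its components with Python's standard `urllib.parse` module. The URL below is an invented example, and this module follows its own parsing rules, which may differ in edge cases from what browsers do.

```python
from urllib.parse import parse_qs, urlsplit

# An invented URL that exercises every component.
url = "https://user@example.com:8443/path/to/page?q=web+standards&lang=en#section-2"

parts = urlsplit(url)

# The query string is a list of key/value pairs; "+" decodes to a space.
query = parse_qs(parts.query)

print(parts.scheme, parts.username, parts.hostname, parts.port)
print(parts.path, parts.fragment)
print(query)
```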

HTTP

The following explains how web clients interact with websites.
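As a rough sketch of the HTTP/1.1 message format, the snippet below builds a request by hand and parses a hard-coded response without touching the network. Both messages are invented for this example, and a real client should rely on an HTTP library rather than on string handling like this.

```python
from http import HTTPStatus

# A minimal request: request line, headers, then an empty line.
request = (
    "GET /index.html HTTP/1.1\r\n"
    "Host: example.com\r\n"
    "Accept: text/html\r\n"
    "\r\n"
)

# A hard-coded response standing in for what a server might send back.
raw_response = (
    "HTTP/1.1 200 OK\r\n"
    "Content-Type: text/html; charset=utf-8\r\n"
    "Content-Length: 13\r\n"
    "\r\n"
    "<p>Hello</p>\n"
)

# The empty line separates the header section from the body.
head, _, body = raw_response.partition("\r\n\r\n")
status_line, *header_lines = head.split("\r\n")
version, code, reason = status_line.split(" ", 2)

# Header names are case-insensitive, so normalize them to lowercase.
headers = {}
for line in header_lines:
    name, _, value = line.partition(":")
    headers[name.strip().lower()] = value.strip()

status = HTTPStatus(int(code))
print(version, status.value, status.phrase)
print(headers["content-type"], len(body.encode("utf-8")))
```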

Static web

The following explains how to write a web page.

Note that RSS is an obsolete technology that was superseded by Atom.

Metadata

The following describes non-standard HTML meta tags found on the Internet. Note that standard ones are documented in the HTML specifications.

Client-side programming

The following explains how to write and automatically interact with dynamic web pages. Note that some API implementations are already described in the HTML definition listed in the previous section.

External APIs

The following describes common ways web servers implement public-facing APIs.

Note that I am not including SOAP, WSDL or any other technology used on top of them due to their many competing versions and extensions, and as they are rarely used outside of complex financial transactions. Many API providers that do use them generally offer REST APIs anyway.

Data access management

The following explains how websites should manage access to secure data, including cases where authentication is done by a third party.

Accessibility

The following explains how to write web pages that are accessible to people with disabilities.

Markdown

The following defines a human-readable plain text format that can be easily converted to hypertext with tools.

Other common data formats

The following defines a few other common data formats that can be found on the web.
