We are seeing novel uses of markup languages, like publishing, syndication and even for desktop applications. Think about HTML, XML if you are uncomfortable with the markup languages phrase. There are different categories of markups - presentational (HTML), procedural (Tex, PostScript) and descriptive (XML).
The biggest advantage of a markup language is that it includes both - the data and information about the data, usually called the metadata. This metadata can be used for specifying various aspects of the data, like its presentation and behavior, and hence so many of its applications.
What Is Semantic Markup?
One more aspect of the data that can be specified using the metadata is its meaning. You can mark the data with the elements that specify what is the data about.
This also means that we need a markup language that can be extensible to express semantics. Nothing better than a descriptive markup language, like XML. XML brought to the table a culture of enclosing data between an open tag and a closed tag. Every element is composed of an open tag, data and a closed tag, e.g., <n1>Abhijit</n1> where Abhijit is the data and is enclosed between open and closed n1 tags.
Expressing the earlier example semantically would result into <name>Abhijit</name>.
In non-technical terms semantic language are labels; labels to identify the data enclosed. The strongest benefit of using semantics is illustrated nicely by Tim Bray at beginning of this article. The markup conveys meaning of the data. More importantly, the markup can be used to express relations. An apt example is that HTML cite tag can be used to specify source from where information is being cited.
The real intention of using Semantic Markup is to allow not only humans but even the machines to understand meaning of the data. For example, if the cite tag is globally used, a software program can be used to identify which source has been used by which article and create an inventory. In fact, Tim Berners Lee specified explicitly - semantic means machine processable.
Success of Semantic Markup
Semantic markup is not enforced. However, its success is affected by its uniformity and global use. Imagine if every individual came up with proprietary labels to identify data. Software program that works for one would not work for another, negating the benefit of Semantic Markup.
The ideal example to use is RSS, a feed format used for syndicating content. The basic idea of RSS would fail if feed for every web site had its own schema and tags. There would be a RSS parser for every web site that would not work for the other. The benefit lies in using the same labels for the same data. It should not be semantic only for you or me, it should be semantic for all.
This also leads to the point that descriptive markup language by itself is not semantic. Descriptive markup languages and schemas are only function as tools for creating Semantic Markup. None of them convey or hold the meaning that semantics are trying to convey.
That is why today we have XHTML, successor to HTML that is descriptive to express the semantics. However, XHTML by itself is not semantic, but it can be used efficiently to create Semantic markup.
The Larger Goal
The larger goal of using Semantic Markup is to build a Semantic Web, also called Web of data. In a simpler way, Semantic Web is about building knowledge by parsing through available information. This is exemplified by projects such as XFN which helps in representing relationships between individuals. This is one of the best ways of building a network of people.
Tim Berners Lee writes about his vision of the Semantic Web in his book Weaving The Web
I have a dream for the Web . . . and it has two parts.
In the first part, the Web becomes a much more powerful means for collaboration between people. I have always imagined the information space as something to which everyone has immediate and intuitive access, and not just to browse, but to create. [...] Furthermore, the dream of people-to-people communication through shared knowledge must be possible for groups of all sizes, interacting electronically with as much ease as they do now in person.
In the second part of the dream, collaborations extend to computers. Machines become capable of analyzing all the data on the Web - the content, links, and transactions between people and computers. A “Semantic Web,” which should make this possible, has yet to emerge, but when it does, the day-to-day mechanisms of trade, bureaucracy, and our daily lives will be handled by machines talking to machines, leaving humans to provide the inspiration and inuition. The intelligent “agents” people have touted for ages will finally materialize. This machine-understandable Web will come about through the implementation of a series of technical advancements and social agreements that are now beginning (and which I describe in the next chapter.)
The Semantic Web comprises of different technologies like RDF, XML and OWL. Again, Semantic Web is not necessary to build the knowledge, but it provides a standard way of doing it that everyone can follow.
Semantic Markup can make automation easier. One of the bigger problems in automation is that the data is not addressed the same everywhere. Semantic Markup can be used to build Semantic Web and software can be used for maximum benefit.


October 19th, 2006 at 6:05 pm
[...] My third article on Semantic HTML in fadtastic.net is up. It is a kind of follow up to my article on the generic Semantic Markup. [...]
October 19th, 2006 at 10:54 pm
[...] Semantic Markup [...]
November 13th, 2006 at 9:14 am
[...] Or if it really works well, we should have News 2.0, Radio 2.0, Humanity 2.0, or Life 2.0! I wonder the conversations when version numbers would be flying around. Now Semantic Web is being touted as Web 3.0. Were semantics not stressed on earlier? Is the software community not aware of its benefits? Does Web 3.0 now imply that every website that has to qualify should have semantic markup? [...]
November 26th, 2006 at 10:35 am
[...] I always insist of developing the markup, in fact semantic markup, before it is styled. The basic reason is that content is still the king and the markup structures the content. Markup is what the search engines see. It is important enough to design it separately without thinking about how it looks. Then implement the layout using CSS. Then apply styles. This also helps in physically separating the three aspects - markup, layout and style. Do no change the markup to ever define how it looks. If you want a visual representation, check out a design lifecycle as it should be. [...]
December 6th, 2006 at 10:12 am
[...] I am sure a lot of you are aware of WYSIWYG. Most of the editors where you can select the font, the style and the colors are WYSIWYG editors, where the underlying markup is hidden under these GUI controls for formatting. I got introduced to What You See Is What You Mean (WYSIWYM) through Peter Krantz. The difference can be expressed in one word - semantics - but makes a world of difference! [...]
January 3rd, 2007 at 11:11 am
[...] Alex Faaborg illustrates one of the best uses of the Semantic Web. Mozilla Firefox 3 might identify of microformats and associating them with specific applications. As an example, how about if you can export a contact from the HTML page to an application. It can identify a vcard or hResume and associate it with your contacts manager. [...]
January 13th, 2007 at 6:04 pm
[...] This is very interesting, it seems HTML 5 is acknowledging the Semantic Web, which was done using the divs. I think eventually usage of div will reduce, I am not sure if it can be completely eliminated. This can strong impact choosing HTML or XHTML for your web site. I am not sure if these elements will be built into the XHTML versions too. If not would HTML 5 be considered more semantic? I wonder if these things will be addressed in the specification being worked upon. [...]
April 13th, 2007 at 9:02 am
[...] additional tip that has helped me is to create a semantic markup. Of course, its benefits are not unknown, however I find it useful for even communication between project members. The markup can be [...]
May 2nd, 2007 at 8:49 am
[...] Chris Pearson has written an excellent guide to semantic markup for blogs. It is a collection of excellent tips for blogs on arriving at the semantic stage. [...]
September 15th, 2007 at 12:43 pm
[...] advantages of semantic markup are getting popular. I see that more developers are now getting involved in discussions and realize [...]