Tidying up your HTML with PHP
This talk will focus on introducing the new Tidy extension included as part of the upcoming PHP5 release and how it can be used to make working with and generating properly-formed HTML in a fast and effective manner. Specifically this session will focus on: * How to use Tidy to diagnose existing HTML for errors * Using Tidy to clean and repair HTML documents * An overview of the most useful Tidy options * Using the Tidy OO interface to navigate the HTML doc tree * Examples of how to navigate HTML effectively using Tidy With the introduction of the Tidy extension, users will no longer need to rely on mess regular expressions to mine data such as URLs, e-mail addresses, or entire tables from HTML documents. Furthermore, thanks to the diagnosing technologies provided by Tidy HTML documents can be diagnosed and even corrected on the fly to ensure complete HTML or XHTML compliance before being sent to the end user. This talk assumes users are familiar with basic PHP object-oriented and procedural constructs.
John Coggeshall is a PHP contributor consultant, and author who started losing sleep over PHP around five years ago. Lately you'll find him losing sleep meeting deadlines for books or online columns on a wide range of PHP topics or contributing to the PHP project. You can find his work online at O'Reilly Networks onlamp.com and Zend Technologies, or at his website http://www.coggeshall.org/. John has also contributed to Apress' Professional PHP4 and is currently in the process of writing the PHP Developer's Handbook published by Sams Publishing.