
Friday 7 November 2008

Recursion

Recursion is an important feature of many programming languages. Recursion simply means a method calling itself. This may sound a little "disturbing" to any budding programmer, but it is a very important attribute of the Java programming language. One may assume that this will result in endless method calls. Technically speaking, that is right: recursion may indeed result in endless method calls that consume memory so rapidly they can humble even some of the more powerful computers out there. However, this problem only arises when the successive method calls fail to move a parameter towards the base case at which the method calls terminate. Recursion is a very important feature of Java because some tasks cannot be executed as effectively using other techniques. Take the case of opening a directory and printing the contents of that directory. That parent directory may also contain other directories (sub-directories) whose contents need to be printed out, and so on. Recursion performs this task very quickly and easily. Practical examples of the use of
recursion are the tree and ls command-line utilities found in most operating systems, which display the contents of a directory, and virus scanners, which deploy a similar mechanism to scan files and directories for viruses and other malware. Recursion allows a method to continually call itself until the parameter matches the specified base case, at which point the recursive method calls end.

For a recursive operation to eventually complete, each time a recursive method calls itself it manipulates its arguments so that they gradually increment, decrement, or otherwise transform towards the value of the base case. At that point, the method recognizes the base case and returns a result to the previous copy of the method. A sequence of returns ensues until the original method call returns the final result to the caller. When a method calls itself, it passes arguments (parameters) which the called method (itself) first compares to the base case. If they do not match, the method executes code that makes the problem simpler (for example, decrementing the passed parameter), and the simpler value is passed as an argument to the method yet again. A little confusing, huh? Once you get round to understanding this, you will appreciate the power of recursion.
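
Before tackling directories, here is a minimal stand-alone sketch of the idea (a simple illustration of mine, not part of the tree program described below): the classic factorial method. Each call passes a smaller argument, moving steadily towards the base case that stops the chain of calls.

//Factorial.java
public class Factorial
{
    public static long factorial(int n)
    {
        if (n == 0) //base case: the recursive calls end here
        {
            return 1;
        }//End of if statement block
        
        //recursive case: the argument is decremented towards the base case
        return n * factorial(n - 1);
    }//End of method factorial
    
    public static void main(String[] args)
    {
        System.out.println(factorial(5)); //prints 120
    }//End of method main
}//End of class Factorial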

I have created a simple program that is similar to the Microsoft Windows Command Prompt utility "tree" and its extension "tree /f", which display the sub-directories only, or the sub-directories together with their contents, of a parent directory. It is also similar to the "ls" utility of *NIX-based operating systems. It consists of the following files;

  • FileIterator.java
  • IsDirectory.java
  • IsNotDirectory.java

The program first opens a directory and prints out its contents: first the files, then the sub-directories, opening each sub-directory and printing out its contents (sub-directories and files) and so on.
The program makes use of two utility classes that determine whether a given directory entry is a file or a directory. The program accomplishes the following tasks;
  1. Provides a command line option that allows the user to specify
    whether files and directories should be written out or just directories.
  2. Indents the contents of every directory and sub-directory.
  3. Adds the number of files and total size of those files in each
    directory in brackets after each directory name.


//FileIterator.java
package com.blogspot.emmanueltoko;
import java.io.File;
import java.io.IOException;
import java.util.Scanner;

/**
* Class prints out all the sub-directories and files of a directory.
* It calls utility methods recursively to open and print out the contents of a
* directory. Each directory, as well as its sub-directories, has its size on
* disk printed out, as well as the number of files contained therein
*
* @author Emmanuel Toko
*/
public class FileIterator
{

    String indent = ""; //indentation initially not set
    
    public static void main(String[] args)
    {
        File rootFolder = new File(
          "C:/Program Files");
        FileIterator fi = new FileIterator();
        
        //Used to read input from keyboard in console
        Scanner input = new Scanner(System.in);
        
        //variable to store the option entered from keyboard
        String selection = null;
        
        //Keep asking for input until the user enters something
        do
        {
            System.out.print(
              "\nEnter 'tree' to output directories only\n" +
              " or 'tree /f' to output directories and files: ");
            
            //nextLine() is used instead of next() so that the two-word
            //option "tree /f" is read in full
            selection = input.nextLine().trim();
        } while (selection.isEmpty());
        
        System.out.println();
        
        //Check user input to determine next course of action
        if (selection.equals("tree"))
        {
            fi.printDirectories(rootFolder, "");
            System.out.println(
              "\nScouring of given directories complete!!");
            System.out.println();
        }//End of if statement block
        else if (selection.equals("tree /f"))
        {
            fi.printContents(rootFolder, "");
            System.out.println(
              "\nScouring of given directories complete!!");
            System.out.println();
        } //End of else if statement block
        else
        {
            System.out.println("\nYou did not enter either option");
            System.out.println("Application will now exit");
            System.exit(0);
        } //End of else statement block
    }//End of method main
    
    /**
    * Method prints out both directories and files contained in the given
    * abstract path
    * Method prints out the files first and then the sub-directories
    * @param folder - File object parameter with the path of the directory to
    *     be scoured.
    * @param ind - String parameter with the given indentation so that files
    *    and sub-directories are printed some spaces to the right
    *    of the parent directory
    */
    public void printContents(File folder, String ind)
    {
        /*
        * Make attempt to read the directory contents and print them out
        * However some exceptions may be thrown such as inability to
        * open a directory
        */
        
        try
        {
            //List the ordinary files contained in folder
            File[] files = folder.listFiles(new IsNotDirectory(folder));
            for(int i = 0; i < files.length; i++)
            {
                System.out.print(indent + " ");
                System.out.printf("%s%n", files[i].getName());
            }//End of for statement block
            

            //List the sub-directories contained in folder
            File[] directorys = folder.listFiles(new IsDirectory(folder));
            for(int i = 0; i < directorys.length; i++)
            {
                indent = ind + " ";
                System.out.print(indent);
                
                long fileSize = getSize(directorys[i]);
                int fileNum = directorys[i].listFiles().length;
                
                /*
                * 1024 bytes make a kilobyte. Dividing by 1024.00
                * rather than 1024 forces floating-point division,
                * so the size prints with two decimal places
                */
                System.out.printf(
                    "%s (Directory Size: %.2f KB %s)%n",
                    directorys[i].getName(),
                    fileSize/1024.00, "| Number of files: " +
                    fileNum);
                
                //Recursive call: scour this sub-directory in turn
                printContents(directorys[i], indent);
            } //End of for statement block
        }//End of try statement block
        catch (Exception ex) //catch any unexpected exceptions
        {
            System.out.println("Exception encountered: " +
              ex.getMessage());
        } //End of catch statement block
    } //End of method printContents

    

    /**
    * Method prints out only sub-directories given the path
    *
    * @param folder - File object parameter with the path of the directory to
    *     be scoured.
    * @param ind - String parameter with the given indentation so that
    *     sub-directories are printed some spaces to the right
    *     of the parent directory
    */
    public void printDirectories(File folder, String ind)
    {
        try
        {
            File[] directorys = folder.listFiles(new IsDirectory(folder));
            for(int i = 0; i < directorys.length; i++)
            {
                indent = ind + " ";
                System.out.print(indent);
                
                long fileSize = getSize(directorys[i]);
                int fileNum = directorys[i].listFiles().length;
                
                System.out.printf("%s (Directory Size: %.2f KB %s)%n",
                  directorys[i].getName(),
                  fileSize/1024.00, "| Number of files: " +
                  fileNum);
                
                //Recursive call: scour this sub-directory in turn
                printDirectories(directorys[i], indent);
            }//End of for statement block
        }//End of try statement block
        catch (Exception ex) //Catch any kind of exception that may be thrown
        {
            System.out.println("Exception: " + ex.getMessage());
        } //End of Exception catch statement block
    } //End of method printDirectories

    

    /**
    * Utility method that returns the size of the given file or directory.
    * The size returned is a long value holding the actual number of bytes.
    * This method returns -1 if the directory contents cannot be listed.
    *
    * @param directory a File object representing a file or directory.
    * @return the size of the given file or directory as a long value.
    */
    public long getSize(File directory)
    {
        long size = 0;
        if(directory.isDirectory())
        {
            File[] files = directory.listFiles();
            
            if(files != null)
            {
                for(int i = 0; i < files.length; i++)
                {
                    //Recursive call: files[i] may itself be a directory
                    long tmpSize = getSize(files[i]);
                    if(tmpSize != -1)
                    {
                        size += tmpSize;
                    }//End of inner if statement block
                }//End of for statement block
                
                return size;
            }//End of outer if statement block
            return -1; //listFiles() returned null: contents could not be read
        }//End of outer most if statement block
        
        //Base case: a plain file simply reports its own length
        return directory.length();
    }//End of method getSize
}//End of class FileIterator

The File class does not have a method for directly determining the size of a directory. Calling the length() method on a File object representing a directory does not give a sensible result the way it does on a File object representing an actual file on disk.
The getSize() method of class FileIterator handles this by recursively summing the lengths of all the files the directory contains.

The IsDirectory class determines whether a given parameter of reference type File is a directory.
This class implements the FileFilter interface of package java.io, which is a filter for abstract pathnames. Instances of this interface may be passed to the listFiles(FileFilter) method of the File class. The interface has only one method, accept (overridden here), which tests whether or not the specified abstract pathname should be included in a pathname list.
Its parameter, pathname, is a File reference variable holding the abstract pathname to be tested; accept returns true if and only if the pathname should be included.


//IsDirectory.java
package com.blogspot.emmanueltoko;

import java.io.FileFilter;
import java.io.File;

public class IsDirectory implements FileFilter
{
    private File file;
    
    public IsDirectory(File file)
    {
        this.file = file;
    }//End of one argument constructor
    
    @Override
    public boolean accept(File file)
    {
        return file.isDirectory(); //true if and only if file is a directory
    }//End of overridden method accept
}//End of class IsDirectory

The IsNotDirectory class determines whether a given parameter of reference type File is a file. This class implements the FileFilter interface of package java.io, which is a filter for abstract pathnames. Instances of this interface may be passed to the listFiles(FileFilter) method of the File class. The interface has only one method, accept (overridden here), which tests whether or not the specified abstract pathname should be included in a pathname list.
Its parameter, pathname, is a File reference variable holding the abstract pathname to be tested; accept returns true if and only if the pathname should be included.


//IsNotDirectory.java
package com.blogspot.emmanueltoko;

import java.io.FileFilter;
import java.io.File;

public class IsNotDirectory implements FileFilter
{
    private File file;
    
    public IsNotDirectory(File file)
    {
        this.file = file;
    } //End of one argument constructor
    
    @Override
    public boolean accept(File file)
    {
        return file.isFile(); //true if and only if file is an ordinary file
    }//End of overridden method accept
}//End of class IsNotDirectory

The code has largely been well documented, so things should be clear and easily understood without a lot of elaboration. Note: if you run this code directly without any modifications on Windows, all will be well. However, for any other operating system, or simply to scan a different parent folder, you may change the argument to the rootFolder File constructor to your preference. If you have installed many programs, you will be inundated with many scrolling directory and file names.

There are some features of recursion that you will quickly notice if you run this program on a not-so-powerful computer. When the parent directory to be scoured holds a great many entries (for example, the directory containing the Java API documentation, if you have downloaded it), you will notice that after the parent directory name is printed out, a few seconds of inactivity may pass before the number of files and the size of the directory are printed out. This is because of the further recursive calls to the methods printDirectories() and getSize(), which first scan down to the last sub-directory of a given directory, determining the file sizes and summing them progressively, before the number of files and the parent directory size are printed out. I developed and ran this program on a Compaq Deskpro Pentium II 370 MHz desktop computer (incredible, huh? but yes, that computer can handle Java JDK 6 without even breaking a sweat). You probably won't want to install NetBeans or Eclipse on it, though.

Tuesday 21 October 2008

Celebrating 10 Years of NetBeans!

Welcome to the NetBeans 10th Birthday Celebration! Join us as we celebrate our vibrant community and recognize people and projects that have shaped our success!
Since 1998, NetBeans has grown from a student project into an award winning Open Source project, IDE and application platform. With millions of downloads a year worldwide, the NetBeans community boasts an active and diverse following of software developers, students, instructors and partner companies. Throughout the week, visit this page for unique content—video greetings, community profiles and more—that will highlight some of the key initiatives and dedicated individuals whose contributions have kept the NetBeans momentum going strong.

Enter the NetBeans Decathlon!

NetBeans is 10! Enter the Decathlon! Celebrate the NetBeans spirit with the NetBeans Decathlon! Ten interactive tasks for the NetBeans community to show and share what the NetBeans Project means to them. Collect points for each completed task for a chance to win a cool limited edition NetBeans T-shirt!

Video Greeting from James Gosling

James Gosling Wishes NetBeans a Happy 10th! What better way to kick off our 10th birthday bash than with a retrospective and birthday greeting from the Godfather of Java, James Gosling!

NetBeans Dream Team

Over the years, the NetBeans IDE has won awards and received positive reviews from the industry, but our best endorsements still come from within the NetBeans community, from developers who use our tool almost exclusively and encourage adoption among other developers. The NetBeans Dream Team is a group of super users who do just that and more. We are proud to acknowledge their commitment to the NetBeans project.

Saturday 23 August 2008

Why PostgreSQL Instead of MySQL

Why PostgreSQL Instead of MySQL: Comparing Reliability and Speed in 2007

Introduction
For years, the common industry perception has been that MySQL is faster and easier to use than PostgreSQL. PostgreSQL is perceived as more powerful, more focused on data integrity, and stricter at complying with SQL specifications, but correspondingly slower and more complicated to use.
Like many perceptions formed in the past, these things aren't as true with the current generation of releases as they used to be. Both systems have evolved with landmark releases that make comparing the two a lot more complicated.
MySQL 5.0 (October 2005) finally added a "strict mode" to narrow the gap in terms of data integrity and spec compliance. It also added support for stored procedures, views, triggers, and cursors, all considered essential features for some classes of database deployments.
PostgreSQL 8.1 (November 2005) featured major performance improvements, particularly in scalability. Focusing on improving performance has been central to all the 8.X releases up to the current 8.3.
As innovation on these databases has progressed, each development community has actively made changes to address their respective sets of perceived disadvantages. The result is that it has become more difficult to objectively determine which database is likely to be better suited for a given application. This document aims to clarify what situations PostgreSQL would be more appropriate for than MySQL, attempting to fairly compare the current production versions of each and discuss their strengths and weaknesses. The main areas covered here are the fundamental data integrity and speed issues of the core database software. Since it's often the case that you can trade off performance against reliability, both these topics need to be considered together in order to get an accurate view of the landscape.
The position of this paper is that when the two are compared using the high level of data integrity demanded by a serious transactional database application, the current generation PostgreSQL performs similarly or better than MySQL (particularly under heavy user loads and with complex queries), while retaining its lead in the areas of SQL standards compliance and a rich feature set. It is also hoped that by exploring the differences between the two systems, you might come to appreciate how the fundamental approach of the PostgreSQL design team pervasively prioritizes reliable and predictable behavior. Similar portions of the MySQL implementation have some seams resulting from how features like transactional support and strict mode were added onto the software well into its design lifecycle rather than being integral from the start.

Compared Versions, Feature Sets, and Focus
The current production-ready versions as this is written in August of 2007 are PostgreSQL 8.2 and MySQL 5.0, and those are what's being compared here. Since both PostgreSQL 8.1 and 8.2 are currently supported versions with good performance, some comments here may refer to them collectively. 8.2 is moderately faster (perhaps as much as 30% on some workloads), but deploying 8.1 is still a completely viable option right now, particularly because more operating system vendors bundle and support it than the relatively new 8.2.
Both systems have newer versions in testing (PostgreSQL 8.3 and MySQL 5.1) at this time that offer incremental improvements to some areas this document addresses, but there are no fundamental differences so large in either new version that it's believed the general guidelines here would be invalidated. The official release of the MySQL Falcon engine in the future is one likely disruptive point on the horizon. The Asynchronous Commit feature in PostgreSQL 8.3 is another upcoming event that will shift the reliability/performance trade-off options available significantly.
What is specifically not addressed here are the feature sets of the two products in areas outside of these fundamentals. Because of the scale of the changes in PostgreSQL 8.1 and MySQL 5.0, many of the documents covering this topic are too out of date to recommend. Some pages that may be helpful include:
Open Source Database Feature Comparison Matrix
MySQL vs. PostgreSQL
MySQL Development Roadmap
PostgreSQL Feature Matrix
While feature checklists are useful, some system behaviors require a fairly deep understanding of the respective systems to appreciate. For instance, the internals of how PostgreSQL compresses TOAST data are invisible to the user, but can result in a startling improvement in system performance with certain types of data.
Another area outside of the scope of this document is that more applications support MySQL as the database of choice than PostgreSQL, and certainly that is an important factor for deciding which of these databases is better suited for a particular situation. Work on adding PostgreSQL support to some popular applications can be tracked at Software Ports. One thing you should evaluate when considering how applications use MySQL is that if they were initially targeted at versions before 5.0, they may not be compatible with newer features like the strict mode introduced in that version. If that's the case, such applications may be limited to the capabilities of the older version they were written against, and it may require a sort of porting effort to take advantage of the modern MySQL features.

Reliability


Data Integrity
Before version 5.0, MySQL well deserved its reputation for allowing inconsistent data to be inserted into the database. Guaranteeing Data Integrity with MySQL 5.0 explains the issues with older MySQL versions, and how they may be addressed using the strict SQL Mode available in the current version. Of course, any MySQL client is allowed to change its own SQL Mode to override this, with the result that these validation constraints are still not necessarily enforced by the server. Some good examples on this topic can be found at When MySQL Bites: Quirks to Watch Out For.
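As an illustration of that loophole, a single statement is enough for a client to silently disable strict mode for its own session. Here is a minimal JDBC sketch (my own example, not from the original article; the connection details are placeholders):

//Sketch: a MySQL client opting out of server-side strict mode
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.Statement;

public class SqlModeOverride
{
    public static void main(String[] args) throws Exception
    {
        Class.forName("com.mysql.jdbc.Driver"); //load the MySQL driver
        Connection conn = DriverManager.getConnection(
          "jdbc:mysql://localhost/test", "user", "password");
        Statement stmt = conn.createStatement();
        
        //Clears the session SQL mode: STRICT_TRANS_TABLES and friends no
        //longer apply to this connection, whatever the server default is
        stmt.execute("SET SESSION sql_mode = ''");
        
        stmt.close();
        conn.close();
    }
}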
PostgreSQL has always been strict about making sure data is valid before allowing it into the database, and there is no way for a client to bypass those checks.

Transactions and the Database Engine Core
The database core that gave MySQL its original reputation for speed is MyISAM. This engine has excellent read performance and its parser is very efficient for straightforward queries, which combine to make it very fast in read-intensive applications like web applications involving simple SELECTs. However, it is commonly known that MyISAM is more vulnerable to data corruption than most serious database applications would tolerate, and after a crash there may be a substantial delay while it rebuilds its indexes before the server can restart. Furthermore, it does not support foreign keys or transactions that would allow the database to have ACID properties. MyISAM also has issues dealing with concurrent reads and updates, since it only provides table level locking.
The integration of the InnoDB Storage Engine into MySQL greatly improved on MyISAM in terms of data integrity, adding a more robust log replaying mechanism for crash recovery and enabling ACID compliant transactions. However, this new approach comes with much more overhead, and InnoDB tables are not as fast as MyISAM ones for pure read loads. In addition, the internal MySQL metadata tables are still stored using MyISAM, which means they remain vulnerable to the traditional corruption issues associated with that storage engine. This issue is worked around using some complicated locking methods that have the potential to make a table alteration block for some time.
You should also be aware that it's possible in some environments (typically shared web hosting) to create what you believe to be a transaction-safe InnoDB table, but actually get non-ACID MyISAM instead. As is too often the case with MySQL, this will not generate an error; it will quietly do the wrong thing instead. See Whoops, no InnoDB table support for details about how to confirm you got what you wanted when creating your tables on a system that may be running an older MySQL version.
PostgreSQL has always focused on data integrity at the transaction level, keeping locking issues to a minimum, and barring hardware failure or grossly improper configuration it is difficult to corrupt a database.
It is worth observing that the database engine is part of the core of PostgreSQL, whereas InnoDB is a dual-licensed product presently licensed from Oracle Corporation. It is uncertain how Oracle may alter InnoDB in the future as they act in competition with MySQL AB, whereas PostgreSQL has no such conflict of interests. MySQL AB has been working on a new database engine core called Falcon in order to free themselves from this situation, but historically developing a database core engine that is both fast and reliable has required many years of work and testing before a mature product suitable for production use is available. Initial benchmarks suggest Falcon has plenty of rough edges that need to be addressed.

Foreign Keys
Proper implementation of design techniques like Database Normalization rely on the ability of the database to use Foreign keys to map relationships between tables. In MySQL, foreign keys are only supported with InnoDB. One problem with their implementation is that it is limited and will silently ignore some standard syntax. For example, when creating a table, even in the upcoming 5.1 release of MySQL "the CHECK clause is parsed but ignored by all storage engines". The basic design philosophy of PostgreSQL is to produce errors or warnings in similar situations where an operation is ambiguous or unsupported.

Transactional DDL
In PostgreSQL, when you are inside a transaction almost any operation can be undone. There are some irreversible operations (like creating or destroying a database or tablespace), but normal table modifications can be backed out by issuing a ROLLBACK (http://www.postgresql.org/docs/current/interactive/sql-rollback.html) via its Write-Ahead Log design. That supports backing out even large changes to DDL like table creation.
MySQL doesn't support any sort of rollback when using MyISAM. With InnoDB, the server has an implicit commit that occurs even if the normal auto-commit behavior is turned off. This means that any single table alteration or similar change is immediately committed.
Experienced PostgreSQL DBAs know to take advantage of its features here to protect themselves when doing complicated work like schema upgrades. If you put all such changes into a transaction block, you can make sure they all apply atomically or not at all. This drastically lowers the possibility that the database will be corrupted by a typo or other such error in the schema change, which is particularly important when you're modifying multiple related tables where a mistake might destroy the relational key. There is no similarly safe way to adjust multiple schema sections with MySQL.
See Transactional DDL in PostgreSQL: A Competitive Analysis for detailed examples demonstrating these differences.
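To make this concrete, here is a sketch (my own, not from the original article; the table names are hypothetical) of wrapping two schema changes in one transaction through JDBC against PostgreSQL:

//Sketch: atomic schema changes using PostgreSQL's transactional DDL
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.Statement;

public class TransactionalDdlSketch
{
    public static void main(String[] args) throws Exception
    {
        Class.forName("org.postgresql.Driver"); //load the PostgreSQL driver
        Connection conn = DriverManager.getConnection(
          "jdbc:postgresql://localhost/test", "user", "password");
        conn.setAutoCommit(false); //open a transaction block
        
        Statement stmt = conn.createStatement();
        try
        {
            //Both schema changes apply atomically, or not at all
            stmt.execute("CREATE TABLE audit_log (entry text)");
            stmt.execute("ALTER TABLE accounts ADD COLUMN balance numeric");
            conn.commit();
        }
        catch (Exception ex)
        {
            //On PostgreSQL this backs out the DDL itself; on MySQL each
            //statement above would already have been implicitly committed
            conn.rollback();
        }
        finally
        {
            stmt.close();
            conn.close();
        }
    }
}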

Speed


Default configuration
Historically, the initial PostgreSQL configuration was designed to support older flavors of UNIX where allocating large amounts of memory wasn't necessarily possible. The result was that its use of memory for caching results was, by default, very pessimistic. On modern systems that have lots of memory available, this severely hinders untuned PostgreSQL performance.
The defaults have gotten much less pessimistic in recent releases. System configuration is now examined at database initialization time and more memory allocated if it is possible to do so. As a result, the untuned default configurations on recent PostgreSQL versions perform significantly better than older versions. In addition, changes in cache management in versions 8.1 and 8.2 allow even modest amounts of cache to be used more effectively than they used to be.
The primary tunable for both database systems works similarly, by allocating a block of shared memory dedicated to the database. MySQL tunes this with key_buffer_size when using MyISAM, and with innodb_buffer_pool_size when using InnoDB (note that you still need some MyISAM space for system tables even when InnoDB is the main storage engine for regular tables). PostgreSQL sizes its main memory space with shared_buffers.
The MySQL key_buffer_size defaults to using 8MB of memory. Earlier PostgreSQL configurations would also allocate 8MB of memory for the shared_buffers cache if possible. On a server like a current generation Linux system, it's expected the recent PostgreSQL releases would set shared_buffers to at least 24MB by default when the database cluster is created.
It is still worthwhile to go through the configuration files to tune them to match the available memory on a database server, as all these defaults are dramatically undersized compared to the amount of RAM in current systems. For a modern dedicated server, the rule of thumb for both PostgreSQL and MySQL is to size the dedicated memory to at least 1/4 of the total RAM in the machine, perhaps increasing to as much as 1/2 of RAM on the high side of normal. It's not out of the question to push this percentage even higher when using systems with very large amounts of RAM; MySQL InnoDB guidelines suggest even 80% isn't unreasonable. Performance comparisons using the defaults with either database are completely unrealistic of how a real system would be configured. Initial guidelines in this area can be found at 5-Minute Introduction to PostgreSQL Performance, Optimizing the mysqld variables and Optimizing the MySQL Server.

Benchmarks
Benchmarks are very difficult to do well; creating truly comparable benchmarks is a complex art. Many of the older performance benchmarks that have shown MySQL to be much faster than PostgreSQL have suffered from a number of problem areas:
Configuration: It's not unheard of to see a tuned MySQL compared to an untuned PostgreSQL instance. As mentioned above, untuned PostgreSQL used to be particularly pessimistic about what resources it had available. A truly fair comparison would match the amount of memory used by each system.
Transaction support: MyISAM benchmarks involve "transactions" that provide none of the ACID guarantees that PostgreSQL offers. This would frequently mean that apples were being compared to oranges.
Transaction grouping: Related to the above, PostgreSQL would sometimes be hindered in naive benchmarks that don't properly group transactions the way a real application would. That can add the overhead of not just one transaction, but perhaps hundreds of thousands, to the cost of doing updates.
Serial versus concurrent behaviour: A number of the behaviors of MyISAM are tuned for having a single user accessing the database. For example, its use of table locks to control access to tables means that under heavy user loads, it will slow dramatically. PostgreSQL degrades more gracefully with large numbers of simultaneous connections. Beware of naive benchmarks that involve a simple stream of database requests across a single connection.

Sun Microsystems 2007 jAppServer2004 Benchmark Results
Sun Microsystems, a neutral vendor selling hardware that runs many database types, has recently submitted test results on the well regulated SPECjAppServer2004 using both PostgreSQL and MySQL. There are just enough hardware differences between the two systems that it isn't fair to directly compare the two results. But the fact that both scores are close to one another and the configuration is similar does suggest that while there may be performance differences between the two database systems, the magnitude of that difference is not particularly large with this application type.
For comparison's sake, an Oracle on HP result offers a similar magnitude of performance on less impressive hardware, suggesting both open-source databases still lag the best of the commercial products in absolute performance efficiency. Some suggest Oracle's lead is even larger if you pick examples to put it in a better light, but be sure to read Benchmark Brou-Ha-Ha for some comments on actual pricing here (and to pick up some comments on a second PostgreSQL result using a smaller server). Note that Josh Berkus is a Sun employee whose role there includes being a member of the PostgreSQL Core Team, and his commentary should be evaluated accordingly.
If you do a fair comparison that includes software licensing costs, the performance per dollar figures for both PostgreSQL and MySQL are both similar to one another and very good relative to the average for the database industry. It would however be untrue to say that these open-source solutions are always a better choice than commercial offerings like Oracle just based on that; certainly the feature sets and absolute performance of each solution need to be considered as well.

Transaction Locking and Scalability
PostgreSQL uses a robust locking model called MVCC that limits situations where individual clients interfere with each other. A short summary of the main benefit of MVCC would be "readers are never blocked by writers". MVCC is used to implement a pessimistic implementation of the four SQL standard transaction isolation levels: "when you select the level Read Uncommitted you really get Read Committed, and when you select Repeatable Read you really get Serializable, so the actual isolation level may be stricter than what you select." The default transaction isolation level is "read committed".
MySQL's InnoDB implements MVCC using a rollback segment, inspired by Oracle's design; their new Falcon engine works similarly. InnoDB supports all four SQL standard transaction isolation levels, with the default being "repeatable read".
When comparing the two models, PostgreSQL enforces client separation where the data operated on is always consistent under all circumstances; as the MVCC documentation states, "the reason that PostgreSQL only provides two isolation levels is that this is the only sensible way to map the standard isolation levels to the multiversion concurrency control architecture." MySQL allows configurations where client code that doesn't commit transactions properly can result in a data view that would be considered inconsistent by PostgreSQL's stricter standards. However, in situations where it's acceptable for data being read to have small inconsistencies, being able to use a less strict locking could be a performance advantage for MySQL.
Even when both systems are configured to one of the strict levels of transaction locking, the differences between the two implementations are subtle enough that which implementation will work better for a particular application is hard to state definitively. Recommended reading to understand this complicated topic is "Transactional Information Systems: Theory, Algorithms, and the Practice of Concurrency Control" by Weikum & Vossen. Speaking in the terminology used there, PostgreSQL uses multi-version timestamp ordering (MVTO) while InnoDB and Oracle use multi-version read consistency (MVRC). The main difference is that PostgreSQL is with-REDO/no-UNDO because it stores every row version in the main table, while Oracle/InnoDB implements with-REDO/with-UNDO where they reconstruct a block and/or row image from the log to provide read consistency. If you're willing to consider a third architecture, that of IBM's DB2, as a comparison point, additional good references on this topic are A not-so-very technical discussion of Multi Version Concurrency Control and Leverage your PostgreSQL V8.1 skills to learn DB2. IBM is clearly not a fan of the MVCC approach.
Partially because the PostgreSQL locking implementation is very mature (it's always active and performance of the associated code is accordingly critical), even in situations where MySQL initially appears faster PostgreSQL can pull ahead and scale to higher throughput when the number of simultaneous users becomes large. A good example of such a situation is demonstrated in the tweakers.net database test.

Counting rows in a table
One operation that PostgreSQL is known to be slow performing is doing a full count of rows in a table, typically using this SQL:
SELECT COUNT(*) FROM table
The reason why this is slow is related to the MVCC implementation in PostgreSQL. The fact that multiple transactions can see different states of the data means that there can be no straightforward way for "COUNT(*)" to summarize data across the whole table; PostgreSQL must walk through all rows, in some sense. This normally results in a sequential scan reading information about every row in the table.
Some DBMSes provide the ability for "COUNT(*)" queries to work via consulting an index. Unfortunately, in PostgreSQL, this strategy does not work, as MVCC visibility information is not stored at the index level. It is necessary to actually examine the rows themselves to determine if they are visible to the transaction or not.
In MySQL, MyISAM tables cache the row count information, making this type of count operation almost instant. That is the reason why there exists so much MySQL code that uses this construct assuming it's a trivial operation. But if you're using InnoDB instead, this is no longer the case. See COUNT(*) for Innodb Tables and COUNT(*) vs COUNT(col) for notes on the limitations of MySQL in this area. MySQL designs that may be deployed on InnoDB can't assume that a full row count will be fast, and therefore are hampered by similar limitations to those present in PostgreSQL.
It is worth observing that it is only this precise form of aggregate that must be so pessimistic; if augmented with a "WHERE" clause like
SELECT COUNT(*) FROM table WHERE status = 'something'
PostgreSQL, MySQL, and most other database implementations will take advantage of available indexes against the restricted field(s) to limit how many records must be counted, which can greatly accelerate such queries. PostgreSQL will still need to read the resulting rows to verify that they exist; other database systems may only need to reference the index in this situation.
One popular approach for applications that need a row count but can tolerate it not including transactions that are in the middle of being committed is to use a trigger-based mechanism to count the rows in the table. In PostgreSQL, another alternative when only an approximate count is needed is to use the reltuples field from the pg_class catalog table.
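As a sketch of that last alternative (my own example; the table name is hypothetical), the planner's estimate can be read through JDBC along these lines:

//Sketch: reading PostgreSQL's approximate row count from pg_class
import java.sql.Connection;
import java.sql.PreparedStatement;
import java.sql.ResultSet;
import java.sql.SQLException;

public class ApproximateCount
{
    //Returns the planner's row estimate for the named table
    public static float approximateRows(Connection conn, String table)
        throws SQLException
    {
        PreparedStatement ps = conn.prepareStatement(
          "SELECT reltuples FROM pg_class WHERE relname = ?");
        ps.setString(1, table);
        ResultSet rs = ps.executeQuery();
        //reltuples is an estimate maintained by VACUUM and ANALYZE,
        //not an exact, transactionally consistent count
        float estimate = rs.next() ? rs.getFloat(1) : -1;
        rs.close();
        ps.close();
        return estimate;
    }
}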

Join Complexity
PostgreSQL uses cost-based query optimization methods in order to get good performance for many different types of joins. Query costs are assessed based on planner statistics collected when tables are analyzed combined with adjustable planner costs, and advanced features such as the Genetic Query Optimizer allow optimizing even very complicated joins efficiently.
MySQL doesn't have this level of sophistication in its planner, and the tunables for Controlling Query Optimizer Performance are crude. Developers instead do things like explicitly provide index hints to make sure joins are executed correctly. To make this task easier, MySQL provides a Query Profiler that is easier to work with than typical EXPLAIN data. Regardless of hinting, subselect optimization is a known weak spot in MySQL. There is also a fairly serious subquery null handling bug in MySQL 5.0 (which at this time appears to be still present in 5.1).
Finding order in execution provides several comparisons of how the two databases handle queries differently. Because of its more robust automatic optimization, PostgreSQL usually does a better job of handling complicated joins than MySQL--but only if the planner is properly configured (setting the effective_cache_size tunable too small is one common mistake) and statistics about the tables are kept up to date (typically via auto-vacuum). The fact that you must give the PostgreSQL optimizer correct information to work with, and can't explicitly control which join it uses, is a somewhat controversial design decision. The core PostgreSQL developers feel that it's more important to focus on improving the optimizer so it works correctly in all cases instead of just allowing queries to hint at a plan as a workaround for problems.
There are some add-on tools some find useful for exploring the PostgreSQL planner. pgAdmin includes an explain plan viewer (sample). Another option is Visual Explain, originally a RedHat component that has been kept current and improved by Enterprise DB. It comes bundled with the EnterpriseDB Advanced Server package and can be built to run against other PostgreSQL installations using the source code to their Developer Studio package.


Credits and Feedback
This document was written by Greg Smith with substantial contributions by Christopher Browne, Lukas Kahwe Smith, and other members of the PostgreSQL Advocacy mailing list. Some of the references linked to by this document point to articles also written by these authors.
Corrections, suggestions, flames, and similar feedback should be addressed to Greg, an independent consultant whose only affiliation with The PostgreSQL Global Development Group consists of submitting patches to improve the upcoming 8.3 release. He feels that PostgreSQL stands on its own merits and comparisons with MySQL should be as factual as possible, and feedback will be treated accordingly.

Monday 11 August 2008

Comments

Java programs can have two kinds of comments: implementation comments and documentation comments. Implementation comments are those found in C++, which are delimited by /*...*/, and //. Documentation comments (known as “doc comments”) are Java-only, and are delimited by /**...*/. Doc comments can be extracted to HTML files using the javadoc tool. Implementation comments are meant for commenting out code or for comments about the particular implementation. Doc comments are meant to describe the specification of the code, from an implementation-free perspective, to be read by developers who might not necessarily have the source code at hand.

Comments should be used to give overviews of code and provide additional information that is not readily available in the code itself. Comments should contain only information that is relevant to reading and understanding the program. For example, information about how the corresponding package is built or in what directory it resides should not be included as a comment. Discussion of nontrivial or non-obvious design decisions is appropriate, but avoid duplicating information that is present in (and clear from) the code. It is too easy for redundant comments to get out of date. In general, avoid any comments that are likely to get out of date as the code evolves.

Note: The frequency of comments sometimes reflects poor quality of code. When you feel compelled to add a comment, consider rewriting the code to make it clearer. Comments should not be enclosed in large boxes drawn with asterisks or other characters. Comments should never include special characters such as form-feed and backspace.

Implementation Comment Formats
Programs can have four styles of implementation comments: block, single-line, trailing and end-of-line.

Block Comments
Block comments are used to provide descriptions of files, methods, data structures and algorithms. Block comments should be used at the beginning of each file and before each method. They can also be used in other places, such as within methods. Block comments inside a function or method should be indented to the same level as the code they describe. A block comment should be preceded by a blank line to set it apart from the rest of the code. Block comments have an asterisk “*” at the beginning of each line except the first.

/*
* Here is a block comment.
*/


Block comments can start with /*-, which is recognized by indent(1) as the beginning of a block comment that should not be reformatted. Example:

/*
* Here is a block comment with some very special
* formatting that I want indent(1) to ignore.
*
* one
* two
* three
*/

Note: If you don’t use indent(1), you don’t have to use /*- in your code or make any other concessions to the possibility that someone else might run indent(1) on your code.

Single-Line Comments
Short comments can appear on a single line indented to the level of the code that follows. If a comment can’t be written in a single line, it should follow the block comment format. A single-line comment should be preceded by a blank line. Here’s an example of a single-line comment in Java code:

if (condition)
{
/* Handle the condition. */
...
}

Trailing Comments
Very short comments can appear on the same line as the code they describe, but should be shifted far enough to separate them from the statements. If more than one short comment appears in a chunk of code, they should all be indented to the same tab setting. Avoid the assembly language style of commenting every line of executable code with a trailing comment.

Here’s an example of a trailing comment in Java code:

if (a == 2)
{
return TRUE; /* special case */
}
else
{
return isprime(a); /* works only for odd a */
}

End-Of-Line Comments
The // comment delimiter begins a comment that continues to the newline. It can comment out a complete line or only a partial line. It shouldn’t be used on consecutive multiple lines for text comments; however, it can be used in consecutive multiple lines for commenting out sections of code. Examples of all three styles follow:


if (foo > 1)
{
// Do a double-flip.
...
}
else
{
return false; // Explain why here
}
//if (bar > 1)
//{
//
// // Do a triple-flip.
// ...
//}
//else
//{
// return false;
//}


Documentation Comments
For further details, see “How to Write Doc Comments for Javadoc” which includes information on the doc comment tags (@return, @param, @see):
http://java.sun.com/products/jdk/javadoc/writingdoccomments.html
For further details about doc comments and javadoc, see the javadoc home page at:
http://java.sun.com/products/jdk/javadoc/
Doc comments describe Java classes, interfaces, constructors, methods, and fields. Each doc comment is set inside the comment delimiters /**...*/, with one comment per API. This comment should appear just before the declaration:

/**
* The Example class provides ...
*/

class Example
{
...
}

Notice that classes and interfaces are not indented, while their members are. The first line of doc comment (/**) for classes and interfaces is not indented; subsequent doc comment lines each have 1 space of indentation (to vertically align the asterisks). Members, including constructors, have 4 spaces for the first doc comment line and 5 spaces thereafter. If you need to give information about a class, interface, variable, or method that isn’t appropriate for documentation, use an implementation block comment or single-line comment immediately after the declaration. For example, details about the implementation of a class should go in such an implementation block comment following the class statement, not in the class doc comment.
Doc comments should not be positioned inside a method or constructor definition block, because Java associates documentation comments with the first declaration after the comment.

Wednesday 6 August 2008

Sets

A Set is a collection that contains no duplicate elements; this interface is a member of the Java Collections Framework. The Set interface contains only methods inherited from Collection and adds the restriction that duplicate elements are prohibited.
The additional stipulation on constructors is, not surprisingly, that all constructors must create a set that contains no duplicate elements (as defined above). The behavior of a set is not specified if the value of an object is changed in a manner that affects equals comparisons while the object is an element in the set. A special case of this prohibition is that it is not permissible for a set to contain itself as an element.
Some set implementations have restrictions on the elements that they may contain. For example, some implementations prohibit null elements, and some have restrictions on the types of their elements.
Attempting to add an ineligible element throws an unchecked exception, typically NullPointerException or ClassCastException. Attempting to query the presence of an ineligible element may throw an exception, or it may simply return false; some implementations will exhibit the former behavior and some will exhibit the latter. More generally, attempting an operation on an ineligible element whose completion would not result in the insertion of an ineligible element into the set may throw an exception or it may succeed, at the option of the implementation. Such exceptions are marked as "optional" in the specification for this interface.
Two Set instances are equal if they contain the same elements.

The Java platform contains three general-purpose Set implementations: HashSet, TreeSet, and LinkedHashSet. HashSet, which stores its elements in a hash table, is the best-performing implementation; however, it makes no guarantees concerning the order of iteration. This implies that when iterating through a HashSet and, for example, printing out the elements in the set, there is no guarantee that the elements will be printed in the exact order that they were entered into the HashSet;
in particular, the order is not guaranteed to remain constant over time. This class permits the null element.

A simple way to remove the duplicates from a Collection c is to pass the Collection, c, as the parameter of the HashSet constructor HashSet(Collection c),
for example Set set = new HashSet(c);
NOTE: By convention all general-purpose collection implementations have a constructor that takes a Collection argument. This constructor, known as a conversion constructor, initializes the new collection to contain all of the elements in the specified collection, whatever the given collection's subinterface or implementation type. In other words, it allows you to convert the collection's implementation type, for example from an ArrayList to a HashSet.
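
Below is a small sketch of the conversion constructor in action, stripping the duplicates out of a List:

//ConversionConstructorDemo.java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class ConversionConstructorDemo
{
    public static void main(String[] args)
    {
        List<String> c = Arrays.asList("Java", "C", "Java", "C++", "C");
        
        //The conversion constructor silently drops the duplicates in c
        Set<String> set = new HashSet<String>(c);
        
        //Prints something like [Java, C++, C]; the order is not guaranteed
        System.out.println(set);
    } //End of method main
} //End of class ConversionConstructorDemo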


//HashSetDemo.java

import java.util.Set;
import java.util.HashSet;

/**
* Demonstrates the use of HashSet
*/
public class HashSetDemo
{
     public static void main(String[] args)
     {
        String[] array = {"Java", "C", "C++", "SmallTalk", "C", "C++", "Delphi", "Java",
         "Lisp", "Ada", "C", "many others..."};

        Set<String> set = new HashSet<String>();

        //Add array elements to HashSet
        for (String item : array)
        {
           set.add(item);
        } //End of for statement block
        
        //Notice that the elements printed out are not in the order that they were
        //inserted into the array
        System.out.print(set);
 
        System.out.println();
     } //End of method main
} //End of class HashSetDemo


Below is a sample of the output of this application. It may (and probably will) vary from the result produced on your computer.
However, the main point is to illustrate that HashSet does not tolerate duplicates and offers no guarantee on the order
of the elements returned.

[SmallTalk, C, Ada, Lisp, many others..., C++, Delphi, Java]

Storing Binary Data in PostgreSQL



PostgreSQL™ provides two distinct ways to store binary data. Binary data can be stored in a table using the data type bytea or by using the Large Object feature which stores the binary data in a separate table in a special format and refers to that table by storing a value of type oid in your table.

NOTE: PostgreSQL does not have a BLOB (Binary Large Object) data type, so defining a column as a BLOB for the storage of images (or any other file, for that matter) will result in an error.

In order to determine which method is appropriate you need to understand the limitations of each method. The bytea data type is not well suited for storing very large amounts of binary data. While a column of type bytea can hold up to 1 GB of binary data, it would require a huge amount of memory to process such a large value. The Large Object method for storing binary data is better suited to storing very large values, but it has its own limitations. Specifically, deleting a row that contains a Large Object reference does not delete the Large Object; deleting the Large Object is a separate operation that needs to be performed. Large Objects also have some security issues, since anyone connected to the database can view and/or modify any Large Object, even if they don't have permissions to view/update the row containing the Large Object reference.

Version 7.2 was the first release of the JDBC driver that supports the bytea data type. The introduction of this functionality in 7.2 has introduced a change in behavior as compared to previous releases. Since 7.2, the methods getBytes(), setBytes(), getBinaryStream(), and setBinaryStream() operate on the bytea data type. In 7.1 and earlier, these methods operated on the oid data type associated with Large Objects. It is possible to revert the driver back to the old 7.1 behavior by setting the property compatible on the Connection object to the value 7.1. More details on connection properties are available in the section called “Connection Parameters”.

To use the bytea data type you should simply use the getBytes(), setBytes(), getBinaryStream(), or setBinaryStream() methods.

To use the Large Object functionality you can use either the LargeObject class provided by the PostgreSQL™ JDBC driver, or the getBLOB() and setBLOB() methods.
Important

You must access Large Objects within an SQL transaction block. You can start a transaction block by calling setAutoCommit(false).

For example, suppose you have a table containing the file names of images and you also want to store the image in a bytea column:

CREATE TABLE images (imgname text, img bytea);


To insert an image, you would use:

File file = new File("myimage.gif");
FileInputStream fis = new FileInputStream(file);
PreparedStatement ps = conn.prepareStatement("INSERT INTO images VALUES (?, ?)");
ps.setString(1, file.getName());
ps.setBinaryStream(2, fis, (int)file.length());
ps.executeUpdate();
ps.close();
fis.close();

Here, setBinaryStream() transfers a set number of bytes from a stream into the column of type bytea. This also could have been done using the setBytes() method if the contents of the image were already in a byte[].
Note

The length parameter to setBinaryStream must be correct. There is no way to indicate that the stream is of unknown length. If you are in this situation, you must read the stream yourself into temporary storage and determine the length. Now with the correct length you may send the data from temporary storage on to the driver.
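For example, one way to buffer a stream of unknown length (a sketch of mine, not from the JDBC documentation) is to copy it into a ByteArrayOutputStream first; the resulting byte[] has a known length and can be handed to setBytes():

//Sketch: buffering a stream whose length is not known in advance
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;

public class StreamBuffer
{
    //Reads the whole stream into memory so that its exact length is known
    public static byte[] readFully(InputStream in) throws IOException
    {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buf = new byte[2048];
        int n;
        while ((n = in.read(buf)) != -1)
        {
            out.write(buf, 0, n);
        }
        return out.toByteArray();
    }
}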

Retrieving an image is even easier. (We use PreparedStatement here, but the Statement class can equally be used.)

PreparedStatement ps = conn.prepareStatement("SELECT img FROM images WHERE imgname = ?");
ps.setString(1, "myimage.gif");
ResultSet rs = ps.executeQuery();

while (rs.next())
{
byte[] imgBytes = rs.getBytes(1);
// use the data in some way here
}
rs.close();
ps.close();


Here the binary data was retrieved as a byte[]. You could have used an InputStream object instead.

Alternatively you could be storing a very large file and want to use the LargeObject API to store the file:

CREATE TABLE imageslo (imgname text, imgoid oid);

To insert an image, you would use:
// All LargeObject API calls must be within a transaction block
conn.setAutoCommit(false);

// Get the Large Object Manager to perform operations with
LargeObjectManager lobj = ((org.postgresql.PGConnection)conn).getLargeObjectAPI();

// Create a new large object
long oid = lobj.createLO(LargeObjectManager.READ | LargeObjectManager.WRITE);

// Open the large object for writing
LargeObject obj = lobj.open(oid, LargeObjectManager.WRITE);

// Now open the file
File file = new File("myimage.gif");
FileInputStream fis = new FileInputStream(file);

// Copy the data from the file to the large object
byte[] buf = new byte[2048];
int s;

while ((s = fis.read(buf, 0, 2048)) > 0)
{
    obj.write(buf, 0, s);
}

// Close the large object
obj.close();

// Now insert the row into imageslo
PreparedStatement ps = conn.prepareStatement("INSERT INTO imageslo VALUES (?, ?)");
ps.setString(1, file.getName());
ps.setLong(2, oid);
ps.executeUpdate();
ps.close();
fis.close();

// Finally, commit the transaction.
conn.commit();


Retrieving the image from the Large Object:
// All LargeObject API calls must be within a transaction block
conn.setAutoCommit(false);

// Get the Large Object Manager to perform operations with
LargeObjectManager lobj = ((org.postgresql.PGConnection)conn).getLargeObjectAPI();

PreparedStatement ps = conn.prepareStatement("SELECT imgoid FROM imageslo WHERE imgname = ?");
ps.setString(1, "myimage.gif");
ResultSet rs = ps.executeQuery();

while (rs.next())
{
    // Open the large object for reading
    long oid = rs.getLong(1);
    LargeObject obj = lobj.open(oid, LargeObjectManager.READ);

    // Read the data
    byte[] buf = new byte[obj.size()];
    obj.read(buf, 0, obj.size());
    // Do something with the data read here

    // Close the object
    obj.close();
}

rs.close();
ps.close();

// Finally, commit the transaction.
conn.commit();
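One caveat with the loop above: allocating a buffer of obj.size() bytes defeats the point of using Large Objects for truly huge values. A minimal chunked-read sketch, reusing lobj and oid from the example above (the processing step is a placeholder):

// Hedged sketch: read the Large Object in fixed-size chunks instead of one big array.
LargeObject obj = lobj.open(oid, LargeObjectManager.READ);
byte[] chunk = new byte[2048];
int n;

while ((n = obj.read(chunk, 0, chunk.length)) > 0)
{
    // process chunk[0..n-1] here, e.g. write it to an OutputStream
}
obj.close();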

NOTE: This is an extract from Chapter 7, PostgreSQL JDBC Documentation. Please download the whole documentation from the PostgreSQL JDBC website.

Friday 29 February 2008

Poll-taking servlet

This is a very simple Servlet that manages a simple poll-taking scenario. It gathers people's opinions on their favorite programming language. The languages involved in the poll are Java, C++, C, Python and C#. These are stored in a MySQL database created with the following script;


CREATE TABLE `elections` (
`id` int(11) NOT NULL AUTO_INCREMENT,
`language` varchar(20) DEFAULT NULL,
`votes` int(11) NOT NULL DEFAULT 0,
PRIMARY KEY (`id`)
) ENGINE=InnoDB AUTO_INCREMENT=6 DEFAULT CHARSET=latin1;

in a database named 'applications'.
This is what you should expect to see after inserting all the values into your new table;

mysql> SELECT * FROM elections;
+----+----------+-------+
| id | language | votes |
+----+----------+-------+
| 1 | Java | 0 |
| 2 | C++ | 0 |
| 3 | C | 0 |
| 4 | Python | 0 |
| 5 | C# | 0 |
+----+----------+-------+
5 rows in set (0.00 sec)

For those who do not know how to insert data into tables using SQL statements, this should be of help (I expect this code to work in MySQL and PostgreSQL databases; it may also work in others as-is, or with a few adjustments);

INSERT INTO elections(language, votes) VALUES ('Java', 0);
Remember the semi-colon at the end of the statement------^.


As those with advanced knowledge of Servlet programming will notice, the database user name and password are open for all to see (in other words, a sitting duck). However, this post is not about security but rather about managing a voting process.

You have to create a sub-directory for your web application in the webapps sub-directory of your Apache Tomcat web server. Mine, I called 'Dynamic'. You can give yours any other name of your choice.
Then, create a sub-directory inside Dynamic. You must name it WEB-INF; that is how Tomcat determines that this is a web application.
You can also create another sub-directory under Dynamic to contain other .html or .jsp web pages. Mine, I called 'Pages'. You can also store your images or web page assets such as CSS and JavaScript files here.
Then, under the WEB-INF sub-directory, create another sub-directory. You must name it classes. This is the directory where all your compiled Java files (.class files) must be saved.
So, this below is the tree structure of my Dynamic directory, which is located in the Apache Tomcat webapps directory.

C:\PROGRA~1\APACHE~1\TOMCAT 6.0\WEBAPPS\DYNAMIC
+---Pages
|       Vote.html
|
\---WEB-INF
    |   web.xml
    |
    \---classes
            Elections.class
            Elections.java

Start by placing the web.xml file into the WEB-INF sub-directory of your web application directory.
The code for the web.xml file is this;

<web-app>
    <description>
        Dynamic site for elections
    </description>

    <servlet>
        <servlet-name>Vote</servlet-name>
        <servlet-class>Elections</servlet-class>
    </servlet>

    <servlet-mapping>
        <servlet-name>Vote</servlet-name>
        <url-pattern>/Vote</url-pattern>
    </servlet-mapping>
</web-app>



Now, the web page that invokes the servlet has the following code.
<html>
<head>
    <title>Vote Now</title>
</head>
<body bgcolor="#CCCC00" style="font-family: verdana">
    <h2>Please participate in our poll.</h2>
    <p>
        Vote for your favorite programming language: <br />
    </p>
    <form action="/Dynamic/Vote" method="post">
        <input type="radio" name="language" value="1" >Java<br />
        <input type="radio" name="language" value="2" >C++<br />
        <input type="radio" name="language" value="3" >C<br />
        <input type="radio" name="language" value="4" >Python<br />
        <input type="radio" name="language" value="5" >C#<br />
        <input type="submit" value="Vote Now">
    </form>
</body>
</html>

Now, the real code for the Servlet is this. It has been aptly named 'Elections.java'.

//import all packages relevant for processing the servlet and database access.

import javax.servlet.http.HttpServletResponse;
import javax.servlet.http.HttpServletRequest;
import java.io.PrintWriter;
import java.io.IOException;
import javax.servlet.http.HttpServlet;
import javax.servlet.ServletException;
import java.sql.*;

public class Elections extends HttpServlet
{
    private Connection con;
    private Statement stmt; // NOTE: shared across requests; not thread-safe, but fine for this demo

    public void init() throws ServletException
    {
        final String URL = "jdbc:mysql://localhost/applications";
        final String JDBC_NAME = "com.mysql.jdbc.Driver";
        final String user = "root";
        final String password = "mysql";

        //try to load the MySQL Connector/J driver.
        try
        {
            Class.forName(JDBC_NAME);
        }
        catch (ClassNotFoundException ex)
        {
            //don't swallow the error; without the driver the servlet is useless
            throw new ServletException("Could not load the JDBC driver", ex);
        }

        try
        {
            con = DriverManager.getConnection(URL, user, password);
            stmt = con.createStatement();
        }
        catch (SQLException sqlEx)
        {
            throw new ServletException("Could not connect to the database", sqlEx);
        }
    }

    public void doPost(HttpServletRequest request, HttpServletResponse response)
            throws IOException, ServletException
    {
        response.setContentType("text/html");
        PrintWriter out = response.getWriter();

        out.print("<html><head>");

        /* Convert the String value posted from Vote.html to an int
           using the Integer method parseInt() */
        int value = Integer.parseInt(request.getParameter("language"));
        String query;

        try
        {
            query = "UPDATE elections SET votes = votes + 1 WHERE id = " + value;
            stmt.executeUpdate(query);

            query = "SELECT SUM(votes) FROM elections";
            ResultSet rsTotal = stmt.executeQuery(query);
            rsTotal.next();
            int total = rsTotal.getInt(1);
            rsTotal.close();

            out.println("<title>");
            out.print("Results until now");
            out.print("</title></head><body>");
            out.print("<h1>Here are the results so far</h1>");

            ResultSet rs = stmt.executeQuery("SELECT * FROM elections");
            out.println("<p>");
            out.print("<table>");
            int votes;

            while (rs.next())
            {
                out.print("<tr border = 0>");
                out.println("<td>" + rs.getString(2) + "</td>");

                votes = rs.getInt(3);
                out.println("<td>" + " " + "Votes: ");
                out.printf("%d %.2f%s%s", votes, (double) votes / total * 100, "%", "</td>");

                //draw a simple bar chart: one colored cell per vote
                for (int i = 1; i <= votes; i++)
                {
                    out.println("<td border = 0 bgcolor = blue height = 5>" + " " + "</td>");
                }
                out.println("</tr>");
            }
            rs.close();

            out.print("</table>");
            out.println("</p>");
            out.println("Total number of votes is: " + total);
            out.print("</body></html>");
        }
        catch (Exception ex)
        {
            //If an error is encountered, display a message to the user
            out.print("<title>Encountered Error!</title></head><body>");
            out.println("Encountered an error while processing your vote.<br />");
            out.println("Please try again later!");
            out.println("<br />");
            out.println(ex.getMessage());
            out.println("</body></html>");
        }
    }

    //release any used resources
    public void destroy()
    {
        try
        {
            if (stmt != null)
            {
                stmt.close();
            }
            if (con != null)
            {
                con.close();
                con = null;
            }
        }
        catch (Exception ex)
        {
            //nothing sensible to do at shutdown
        }
    }
}
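One more remark: the update above builds its SQL by concatenating the vote id into the string. That is safe here only because the value has already been parsed as an int; as a general habit, a PreparedStatement is the better tool. A minimal sketch of that variant (assuming the same con connection and parsed value):

// Hedged sketch: the vote update using a PreparedStatement instead of concatenation.
PreparedStatement update = con.prepareStatement("UPDATE elections SET votes = votes + 1 WHERE id = ?");
update.setInt(1, value); // 'value' as parsed in doPost()
update.executeUpdate();
update.close();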
To compile this, remember to include the servlet API JAR in the -classpath option of the javac command. You can download the servlet-2_5-api.jar file from java.sun.com, or you can use the servlet-api.jar file found in the lib sub-directory of your Tomcat web container.
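For example, on my machine the command is something like this (the Tomcat path is an assumption; adjust it to your own installation);

javac -classpath "C:\Program Files\Apache Software Foundation\Tomcat 6.0\lib\servlet-api.jar" Elections.java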