The correct choice is (d): The overlapping EST sequences are computationally processed to represent a set of expressed genes.
Explanation: The database is built by combining information from dbEST, the GenBank mRNA database, and “electronically spliced” genomic DNA. Only ESTs with 3' poly-A ends are clustered, which minimizes the problem of chimerism. The resulting 3' EST sequences provide a more unique representation of the transcripts.
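To make the clustering idea concrete, here is a minimal toy sketch in Python. It is not the pipeline the database actually uses. It assumes that a "3' poly-A end" can be detected as a run of trailing A's and that "overlap" can be approximated by a shared exact k-mer. The function names, the thresholds (`min_tail`, `k`), and the example reads are all hypothetical.

```python
from typing import Dict, List


def has_poly_a_tail(seq: str, min_len: int = 8) -> bool:
    """Return True if the read ends in at least `min_len` consecutive A's."""
    return seq.upper().endswith("A" * min_len)


def shares_kmer(a: str, b: str, k: int = 12) -> bool:
    """Crude overlap test (an assumption): do the reads share any exact k-mer?"""
    kmers = {a[i:i + k] for i in range(len(a) - k + 1)}
    return any(b[i:i + k] in kmers for i in range(len(b) - k + 1))


def cluster_3prime_ests(ests: Dict[str, str], min_tail: int = 8,
                        k: int = 12) -> List[List[str]]:
    """Keep only poly-A-tailed reads, then group them by single linkage."""
    # Drop reads without a poly-A tail; trim the tail from the rest so that
    # the tails themselves do not count as overlap between unrelated reads.
    kept = {name: s.upper().rstrip("A")
            for name, s in ests.items() if has_poly_a_tail(s, min_tail)}
    clusters: List[List[str]] = []
    for name, seq in kept.items():
        # Every existing cluster that this read overlaps gets merged with it.
        hits = [c for c in clusters
                if any(shares_kmer(seq, kept[m], k) for m in c)]
        merged = [name]
        for c in hits:
            merged.extend(c)
            clusters.remove(c)
        clusters.append(sorted(merged))
    return clusters


# Hypothetical reads: two from the same transcript, one from another,
# and one lacking a poly-A tail (excluded before clustering).
body1 = "ATGGCCTTGACCGATTGCAGTCCGATAGGCTTAC"
body2 = "GGCTAGCTAGGATCCGATCGTTAGCCGATCGGAT"
example_ests = {
    "est_a": body1 + "A" * 10,
    "est_b": body1[10:] + "A" * 12,
    "est_c": body2 + "A" * 9,
    "est_d": body1,  # no poly-A tail
}
example_clusters = cluster_3prime_ests(example_ests)
```

Here `est_a` and `est_b` land in one cluster, `est_c` forms its own, and `est_d` is discarded because it has no poly-A tail. A real pipeline would use proper sequence alignment instead of exact k-mer matching.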