Gene BURPS668_A2859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2859 
Symbol 
ID4887924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2719574 
End bp2721508 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content71% 
IMG OID640132795 
Productx-prolyl-dipeptidyl aminopeptidase 
Protein accessionYP_001063851 
Protein GI126444780 
COG category[R] General function prediction only 
COG ID[COG2936] Predicted acyl esterases 
TIGRFAM ID[TIGR00976] putative hydrolase, CocE/NonD family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.208132 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATTTC ATCGAACCGA ACCGCGTCGC GCCTGGATTG CCGTGCTCGC CGCCGCCGCG 
ACGCTTGCCG CTTGCGGCGG CGATGACGGC GCGGGCGTCC CGGGTGCATC CGCGCTCGCT
CAGGTCCAGC AGGAGGGAGC CGCGCCGTCC GCGGCCGGCG CACCGCAGCT CGCCGCACGC
TTCTCGCCGT CCGGCGTGCC GTATGCCAGC CTGTCGAGCG GCGGCCGCTA TCGGCCCGTG
ATTCAGAACG GGCAGGTGCA GCCGTCGCTG TCGGGCGGCA CGATTGCCGA GGAAGCGTGG
GTCGAGACGC CCGTCGATTC CGACGGGGAC GGCGCGAAGG ATCGGATTCA CGTGCGCATC
GTGCGTCCGT CCGAAACCGC GTCGGGCGCG CGCACGCCTG TCATCGTGCT CGCGAGCCCT
TACTACAACG GGCTGGCCGA TAGCCCGAAC CACAACGTCG ACGTCGAGCT CGACGGCACG
CCGCATCCCG CCGCTTCGGC GTCCGCGCGA ATCATGGCCG CCGCGCCGCA GACGCGGATC
TGGCAGCAGC TCGACGCGGC CGCCGCCGGG CGTTCGTGGA TCGAAGGCTA TTTCGTGCCG
CGCGGCTTCA CGGTCGTGTA CGCGGATTCG CTCGGCACGG CCGGCTCGGA CGGCTGCCCG
ACGATCCTCA CGCGCGACGA ATCGGTCGCG ATGGCGTCGG TGATCCGCTG GCTCGGGCGC
GGCGCGGCCG CGAAGGACGC GAACGGCAAG CCGCTCGTCG CGACCTGGTC GACGGGGCAC
GTCGGCATGT ACGGCGTATC GTACGACGGC ACGCTGCCGA AGATGGTCGC AAGCCTGCGC
ACGCGCGGGC TCGATGCGAT CGTGCCGGTT GCCGGGCTCA CCGACATGTA CGGCTACTAC
CGCTCGGGCG GGCTCGTGCG CGCACCCGAC GGCTATCAGG GCGAGGACGT CGACGTCTAC
ATCAAGGCGC TGCTGACGAA CCCGCATCCG GAGCGCTGCA CGCATCTGAT CGACGAGGCG
CTGCAGAAGG AGGATCGCAA GACGGGCGAT TATTCGGCGT TCTGGGCGGC GCGCGAGATT
CCGAGCGCGC TCGCGGTCGC GCCCGCGCTC GTCGCGCAAG GGCTCGCCGA CGACAACGTG
AGGACCGACC AGTCGACGTC GTGGTATCTC GCGATGCGGC GTCAGGGCGT GCCCACGCAG
TTGTGGCTGC ACCGCGCGCA CCATACCGAT CCGACCCGCG TGCCCGCGAT GGCCGACGCG
TGGACCGGGC AGGTGAACCG CTGGTTCACG CGTTATCTGC TCGGCTACGA CAACGGCGTC
GAGCGCAGCC CGGGCTCGGT GATCGAGCAG TCGGACGGCA CGCTGCTGAA GGAGGCGAGC
TGGCCCGCGC GCGGCGCATC GTCCGTCACG TATTTCGCGG GCGGCGACGG CGCGGGCACC
GGCACGCTGC TGACGCAGCC GACGGGCGGC CCGCTCGCGA AGTTCACCGA CGACGCGCGC
ATCATGGCGC TCGCGCTGGC GAACGCGAAC ACGGGCGAGC ATCGCAGCCG CTTCGAGACG
GCGCCCGTCG CGAGCGCGAC GCGGCTCTCC GGCACCGCGA CCGCGCGCGT GCGCCTGACG
TTCTCGGCAA CCGCGAACGT GACCGCGCTG CTGATCGATC GCGCACCGGA CGGCAGCGCG
ACGATCATCA CCCGCGCGTG GACGGATCCG CGCAACCGTC TGTCGAGCTG GTTCTCGGAG
CCGGTGTTGC CCGGCATGCC GTACGATCTG CGCCTCGCGT TCATGCCGCG CGACTACCGG
CTCGAAGCGG GACATCGGCT CGGGCTCGTC GTGCTGTCGA GCGACAACGA GGCGACGCTG
CGGCCGACGC CGGGCACCGA GCTGACGCTC GATCCGGCCG GCACGAGCGT GACGGTGCCG
CTGCTTCCGG CTTGA
 
Protein sequence
MRFHRTEPRR AWIAVLAAAA TLAACGGDDG AGVPGASALA QVQQEGAAPS AAGAPQLAAR 
FSPSGVPYAS LSSGGRYRPV IQNGQVQPSL SGGTIAEEAW VETPVDSDGD GAKDRIHVRI
VRPSETASGA RTPVIVLASP YYNGLADSPN HNVDVELDGT PHPAASASAR IMAAAPQTRI
WQQLDAAAAG RSWIEGYFVP RGFTVVYADS LGTAGSDGCP TILTRDESVA MASVIRWLGR
GAAAKDANGK PLVATWSTGH VGMYGVSYDG TLPKMVASLR TRGLDAIVPV AGLTDMYGYY
RSGGLVRAPD GYQGEDVDVY IKALLTNPHP ERCTHLIDEA LQKEDRKTGD YSAFWAAREI
PSALAVAPAL VAQGLADDNV RTDQSTSWYL AMRRQGVPTQ LWLHRAHHTD PTRVPAMADA
WTGQVNRWFT RYLLGYDNGV ERSPGSVIEQ SDGTLLKEAS WPARGASSVT YFAGGDGAGT
GTLLTQPTGG PLAKFTDDAR IMALALANAN TGEHRSRFET APVASATRLS GTATARVRLT
FSATANVTAL LIDRAPDGSA TIITRAWTDP RNRLSSWFSE PVLPGMPYDL RLAFMPRDYR
LEAGHRLGLV VLSSDNEATL RPTPGTELTL DPAGTSVTVP LLPA