Gene BURPS1710b_A1100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A1100 
SymbolpepX 
ID3692695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp1375823 
End bp1377967 
Gene Length2145 bp 
Protein Length714 aa 
Translation table11 
GC content71% 
IMG OID637731354 
Productx-prolyl-dipeptidyl aminopeptidase 
Protein accessionYP_336258 
Protein GI76819094 
COG category[R] General function prediction only 
COG ID[COG2936] Predicted acyl esterases 
TIGRFAM ID[TIGR00976] putative hydrolase, CocE/NonD family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.240812 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGTCA GCGGGATGCG GGCATCGGCG CGGCGAATCG ACGTGCGGCG CGCGTTCCAG 
CGCAACGCCC GCATCGCGGC GCCGACGCAC CCGGCTTCAC GGGGGCGACG ACGTGCGCGA
TGCGGCCGGC AGCGTCCGCC GGCATGTTCA ACACGACGCC ATCCCGCACG GGATGACCGT
TTTCAGACGA TCATCGAAAC AAGGACAACA ATGAGATTTC ATCGAACCGA ACCGCGTCGC
GCCTGGATTG CCGTGCTCGC CGCCGCCGCG ATGCTTGCCG CTTGCGGCGG CGATGACGGC
GCGGGCGTCC CGGGTGCATC CGCGCTCGCT CAGGTCCAGC AGGAGGGAGC CGCGCCGTCC
GCGGCCGGCG CACCGCAGCT CGCCGCACGC TTCTCGCCGT CCGGCGTGCC GTATGCCAGC
CTGTCGAGCG GCGGCCGCTA TCGGCCCGTG ATTCAGAACG GGCAGGTGCA GCCGTCGCTG
TCGGGCGGCA CGATTGCCGA GGAAGCGTGG GTCGAGACGC CCGTCGATTC CGACGGGGAC
GGCGCGAAGG ATCGGATTCA CGTGCGCATC GTGCGTCCGT CCGAAACCGC GTCGGGCGCG
CGCACGCCTG TCATCGTGCT CGCGAGCCCT TACTACAACG GGCTGGCCGA TAGCCCGAAC
CACAACGTCG ACGTCGAGCT CGACGGCACG CCGCATCCCG CCGCTTCGGC GTCCGCGCGA
ATCATGGCCG CCGCGCCGCA GACGCGGATC TGGCAGCAGC TCGACGCGGC CGCCGCCGGG
CGTTCGTGGA TCGAAGGCTA TTTCGTGCCG CGCGGCTTCA CGGTCGTGTA CGCGGATTCG
CTCGGCACGG CCGGCTCGGA CGGCTGCCCG ACGATCCTCA CGCGCGACGA ATCGGTCGCG
ATGGCGTCGG TGATCCGCTG GCTCGGGCGC GGCGCGGCCG CGAAGGACGC GAACGGCAAG
CCGCTCGTCG CGACCTGGTC GACGGGGCAC GTCGGCATGT ACGGCGTATC GTACGACGGC
ACGCTGCCGA AGATGGTCGC AAGCCTGCGC ACGCGCGGGC TCGATGCGAT CGTGCCGGTT
GCCGGGCTCA CCGACATGTA CGGCTACTAC CGCTCGGGCG GGCTCGTGCG CGCACCCGAC
GGCTATCAGG GCGAGGACGT CGACGTCTAC ATCAAGGCGC TGCTGACGAA CCCGCATCCG
GAGCGCTGCA CGCATCTGAT CGACGAGGCG CTGCAGAAGG AGGATCGCAA GACGGGCGAT
TATTCGGCGT TCTGGGCGGC GCGCGAGATT CCGAGCGCGC TCGCGGTCGC GCCCGCGCTC
GTCGCGCAAG GGCTCGCCGA CGACAACGTG AGGACCGACC AGTCGACGTC GTGGTATCTC
GCGATGCGGC GTCAGGGCGT GCCCACGCAG TTGTGGCTGC ACCGCGCGCA CCATACCGAT
CCGACTCGCG TGCCCGCGAT GGCCGACGCG TGGACCGGGC AGGTGAACCG CTGGTTCACG
CGTTATCTGC TCGGCTACGA CAACGGCGTC GAGCGCAGCC CGGGCTCGGT GATCGAGCAG
GCGGACGGCA CGCTGCTGAA GGAGGCGAGC TGGCCCGCGC GCGGCGCATC GTCCGTCACG
TATTTCGCGG GCGGCGACGG CGCGGGCACC GGCACGCTGC TGACGCAGCC GACGGGCGGC
CCGCTCGCGA AGTTCACCGA CGACGCGCGC ATCATGGCGC TCGCGCTGGC GAACGCGAAC
ACGGGCGAGC ATCGCAGCCG CTTCGAGACG GCGCCCGTCG CGAGCGCGAC GCGGCTCTCC
GGCACCGCGA CCGCGCGCGT GCGCCTGACG TTCTCGGCAA CCGCGAACGT GACCGCGCTG
CTGATCGATC GCGCACCGGA CGGCAGCGCG ACGATCATCA CCCGCGCGTG GACGGATCCG
CGCAACCGTC TGTCGAGCTG GTTCTCGGAG CCGGTGTTGC CCGGCATGCC GTACGATCTG
CGCCTCGCGT TCATGCCGCG CGACTACCGG CTCGAAGCGG GACATCGGCT CGGGCTCGTC
GTGCTGTCGA GCGACAACGA GGCGACGCTG CGGCCGACGC CGGGCACCGA GCTGACGCTC
GATCCGGCCG GCACGAGCGT GACGGTGCCG CTGCTTCCGG CTTGA
 
Protein sequence
MPVSGMRASA RRIDVRRAFQ RNARIAAPTH PASRGRRRAR CGRQRPPACS TRRHPARDDR 
FQTIIETRTT MRFHRTEPRR AWIAVLAAAA MLAACGGDDG AGVPGASALA QVQQEGAAPS
AAGAPQLAAR FSPSGVPYAS LSSGGRYRPV IQNGQVQPSL SGGTIAEEAW VETPVDSDGD
GAKDRIHVRI VRPSETASGA RTPVIVLASP YYNGLADSPN HNVDVELDGT PHPAASASAR
IMAAAPQTRI WQQLDAAAAG RSWIEGYFVP RGFTVVYADS LGTAGSDGCP TILTRDESVA
MASVIRWLGR GAAAKDANGK PLVATWSTGH VGMYGVSYDG TLPKMVASLR TRGLDAIVPV
AGLTDMYGYY RSGGLVRAPD GYQGEDVDVY IKALLTNPHP ERCTHLIDEA LQKEDRKTGD
YSAFWAAREI PSALAVAPAL VAQGLADDNV RTDQSTSWYL AMRRQGVPTQ LWLHRAHHTD
PTRVPAMADA WTGQVNRWFT RYLLGYDNGV ERSPGSVIEQ ADGTLLKEAS WPARGASSVT
YFAGGDGAGT GTLLTQPTGG PLAKFTDDAR IMALALANAN TGEHRSRFET APVASATRLS
GTATARVRLT FSATANVTAL LIDRAPDGSA TIITRAWTDP RNRLSSWFSE PVLPGMPYDL
RLAFMPRDYR LEAGHRLGLV VLSSDNEATL RPTPGTELTL DPAGTSVTVP LLPA