Gene BURPS668_3578 details


Gene Information

Locus tag: BURPS668_3578
Symbol:
ID: 4883420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 3507079
End bp: 3508059
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 73%
IMG OID: 640129506
Product: glycerophosphoryl diester phosphodiesterase family protein
Protein accession: YP_001060583
Protein GI: 126441964
COG category: [C] Energy production and conversion
COG ID: [COG0584] Glycerophosphoryl diester phosphodiesterase
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
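
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3507079, 3508059)
print(length)                     # prints 981
print(protein_length_aa(length))  # prints 326
```

Both values agree with the Gene Length and Protein Length fields of this record.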


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.448018
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCAAC TGATCGACAC CCTCGGACGG CGCGCGGTGC TCATCGGCGT CGCGCTCGGC 
CTGGCCGCCT GCGCGGGCGG CGGCGGCCCG CCGGGCGAGG CGCCGGCGAC GCTGCCGCGC
ATCGTCGCGC ATCGCGGCGG CGCGGCCGAT GCGCCGGAGA ACACACTCGA TGCGATCCGG
GCGGCGGTCG CGAATCGGGC GGACGCGATT TGGCTGACCG TCCAACTGAG CCGCGACGGC
GTGCCGGTGC TGTATCGGCC CGCCGATCTA TCGGCGCTCA CGCGCTCGAG CGGCCCGGTC
GCCGGCCACA CGGCCGCGCA GCTCGCGCAG ATGAACGCCG GCTGGCAATT CCGCGATGCG
GGCGGGCGGT ATCCGTATCG CGCGCGCCCG GTCGGCATTC CGACGTTGCG CGACGCGCTG
CGCGCGATTC CGCCCGCGAT GCCGATCGTG CTCGACATGA AGGCGGTGCC CGCCGCGCCG
CAGGCGAAGG CCGTCGCGGA CGTGCTGACG AGCGAGGCCG CGTGGCCGCG CGTGACGATC
TATTCGACCG ATGCCGCTTA TCAGACCGCG TTCGCCTCGT ATCCGCAGGC ACGGCTCTTC
GAATCGCGCG ATGCGACGCG CGGGCGGCTC GTCGACGTGC TGCTCGGCGG CGCGTGCGAA
CGCGCGCCCG AGGCGCCTGC GACGGCGCCC ATATGGACCG GCTTCGAAAT GCATCGAAAC
ATGACGGTGA GCGAGCGCTT CACGCTCGGC GAAGGCGTAT CGCCCGTGAA GGCGACGTTG
TGGACGCCCG CGACCGTCGC GTGCTTCAGG CGGCGCGCGG ACGTGCGGAT TCTCGCGATC
GCGGTGAACG ACGCCGACGA TTACCGCATG GCCGCGTGCC TCGGGCTCGA TGCGGTGCTC
GCGGATTCGC CGCGCGAGAT GGCGGAAATC CGGTCGGCGC TGCGGGCGCG GCCGTTGCGG
TGCGAGACGG GGGCGCGATA G
 
Protein sequence
MNQLIDTLGR RAVLIGVALG LAACAGGGGP PGEAPATLPR IVAHRGGAAD APENTLDAIR 
AAVANRADAI WLTVQLSRDG VPVLYRPADL SALTRSSGPV AGHTAAQLAQ MNAGWQFRDA
GGRYPYRARP VGIPTLRDAL RAIPPAMPIV LDMKAVPAAP QAKAVADVLT SEAAWPRVTI
YSTDAAYQTA FASYPQARLF ESRDATRGRL VDVLLGGACE RAPEAPATAP IWTGFEMHRN
MTVSERFTLG EGVSPVKATL WTPATVACFR RRADVRILAI AVNDADDYRM AACLGLDAVL
ADSPREMAEI RSALRARPLR CETGAR
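
The gene and protein sequences above are printed in space-separated blocks of ten. As a hedged sketch, the Python below strips that layout, computes a GC fraction, and translates codons with a table built from the standard NCBI base ordering. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The codon-to-residue assignments of translation table 11 match the standard code; the sketch does not handle the alternative start codons that table 11 permits, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence shown in this record.
first_line = "ATGAATCAAC TGATCGACAC CCTCGGACGG CGCGCGGTGC TCATCGGCGT CGCGCTCGGC"
print(translate(clean(first_line)))  # prints MNQLIDTLGRRAVLIGVALG
```

The printed residues match the first two blocks of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.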