Gene BURPS1106A_3603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3603 
Symbol 
ID4899407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3508538 
End bp3509518 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content73% 
IMG OID640136829 
Productglycerophosphoryl diester phosphodiesterase family protein 
Protein accessionYP_001067834 
Protein GI126455274 
COG category[C] Energy production and conversion 
COG ID[COG0584] Glycerophosphoryl diester phosphodiesterase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCAAC TGATCGACAC CCTCGGACGG CGCGCGGTGC TCATCGGCGT CGCGCTCGGC 
CTGGCCGCCT GCGCGGGCGG CGGCGGCCCG CCGGGCGAGG CGCCGGCGAC GCTGCCGCGC
ATCGTCGCGC ATCGCGGCGG CGCGGCCGAT GCGCCGGAGA ACACACTCGA TGCGATCCGG
GCGGCGGTCG CGAATCGGGC GGACGCGATT TGGCTGACCG TCCAACTGAG CCGCGACGGC
GTGCCGGTGC TGTATCGGCC CGCCGATCTA TCGGCGCTCA CGCGCTCGAG CGGCCCGGTC
GCCGGCCACA CGGCCGCGCA GCTCGCGCAG ATGAACGCCG GCTGGCAATT CCGCGATGCG
GGCGGGCGGT ATCCGTATCG CGCGCGCCCG GTCGGCATTC CGACGTTGCG CGACGCGCTG
CGCGCGATTC CGCCCGCGAT GCCGATCGTG CTCGACATGA AGGCGGTGCC CGCCGCGCCG
CAGGCGAAGG CCGTCGCGGA CGTGCTGACG AGCGAGGCCG CGTGGCCGCG CGTGACGATC
TATTCGACCG GTGCCGCTTA TCAGACCGCG TTCGCCTCGT ATCCGCAGGC ACGGCTCTTC
GAATCGCGCG ATGCGACGCG CGGGCGGCTC GTCGACGTGC TGCTCGGCGG CGCGTGCGAA
CGCGCGCCCG AGGCGCCTGC GACGGCGCCC ATATGGACCG GCTTCGAAAT GCATCGAAAC
ATGACGGTGA GCGAGCGCTT CACGCTCGGC GAAGGCGTAT CGCCCGTGAA GGCGACGTTG
TGGACGCCCG CGACCGTCGC GTGCTTCAGG CGGCGCGCGG ACGTGCGGAT TCTCGCGATC
GCGGTGAACG ACGCCGACGA TTACCGCACG GCCGCGTGCC TCGGGCTCGA TGCGGTGCTC
GCGGATTCGC CGCGCGAGAT GGCGGAAATC CGGTCGGCGC TGCGGGCGCG GCCGTTGCGG
TGCGAGACGG GGGCGCGATA G
 
Protein sequence
MNQLIDTLGR RAVLIGVALG LAACAGGGGP PGEAPATLPR IVAHRGGAAD APENTLDAIR 
AAVANRADAI WLTVQLSRDG VPVLYRPADL SALTRSSGPV AGHTAAQLAQ MNAGWQFRDA
GGRYPYRARP VGIPTLRDAL RAIPPAMPIV LDMKAVPAAP QAKAVADVLT SEAAWPRVTI
YSTGAAYQTA FASYPQARLF ESRDATRGRL VDVLLGGACE RAPEAPATAP IWTGFEMHRN
MTVSERFTLG EGVSPVKATL WTPATVACFR RRADVRILAI AVNDADDYRT AACLGLDAVL
ADSPREMAEI RSALRARPLR CETGAR