Gene BURPS1710b_3705 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_3705 
SymbolthiE 
ID3691886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp4043224 
End bp4044327 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content75% 
IMG OID637730160 
Productthiamine-phosphate pyrophosphorylase 
Protein accessionYP_335069 
Protein GI76808807 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0352] Thiamine monophosphate synthase 
TIGRFAM ID[TIGR00693] thiamine-phosphate pyrophosphorylase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCCG CGTTGCCCGA CGCGTTCTGG CCGCCCGCCG ACGAGCTTAC CGAGGCCGCC 
GAGCGGATTC GCGCGACGCT CGGTGCGTGG CCGCGGCCGG CCGTGCGCAC GCGGATCTGT
CTCGCGCCGC CGGAGCAGCC GCGCGCGGCC GACCTGTGGG TCGCCATCGC GGGCGACGCC
GGCGCGCACG CCGCGCACAT CGCGCGGCTG AACGCGGCGG GCGCGCGGGC GATCGTCATC
GACGATGCAT CGGCCACGCT CCACACGGGC GCGGCGCGCC ATGCGCTCGC GTCGCGCGCG
CCGCTCGCCG ACGACTGGAT CGCGGCGCTC GCGGCGTTTC TCGATTGCGG CTTCGCCGCG
TCCGACGCGC TCGTGCTCGC GCTCGCATGG CGCGACGGCG ACGAGGCGCG CGGCGGCGAT
CCGTGGCCCG TCGATCCGGC ACGCTTTCCG CGCGTGCTCG GCCTGCCCGC CGCGCCCGAA
CCGGCGTTTG CGCCGTGCCC GCAGCGGCTC GGCCTGTATC CGGTGCTGCC GAGCGCCGAA
TGGGTCGAGC GCGTGCTCGA TTGCGGCGTG CGGACCGTGC AACTGCGCGT GAAGGACGCC
TCGCCCGACG CGCTGCGCGC GGAGATCGAG CGGGCCGTTG CCGCGGGCCG CCGCCATCCG
GACGCGCGCG TGTTCATCAA CGATCACTGG CGGCTCGCGC TCGACGCGGG CGCATACGGC
GTGCACCTCG GCCAGGAGGA TCTGGAGACC GCCGATCTCG GCGCGATCGC GCGGGCGGGC
GCGCGGCTCG GCCTGTCGAG CCACGGGTAT TACGAAATGC TCGTCGCGCT GCAGTTCAAG
CCGAGCTATC TCGCGCTCGG CCCGGTGTTC GCGACCGCGA CGAAGGCGGT TGCCGCGCCG
CCGCAAGGCC TCGCGCGGCT TGCGCGCTAC GTGCGCTTCG CCGGGCCGCA GGCGCCGCTC
GTCGCGATCG GCGGAATCGC GCCCGACACG CTCGGCGCGG TGCTGGCGGC GGGCGTCGGC
AGCGCGGCCG TCGTCAGCGC AATCACGGCG GCGGCCGATT ACCGGGAAGC GATTGTTGCA
TTGCAGCAAA ACTTCGGACG ATAA
 
Protein sequence
MSAALPDAFW PPADELTEAA ERIRATLGAW PRPAVRTRIC LAPPEQPRAA DLWVAIAGDA 
GAHAAHIARL NAAGARAIVI DDASATLHTG AARHALASRA PLADDWIAAL AAFLDCGFAA
SDALVLALAW RDGDEARGGD PWPVDPARFP RVLGLPAAPE PAFAPCPQRL GLYPVLPSAE
WVERVLDCGV RTVQLRVKDA SPDALRAEIE RAVAAGRRHP DARVFINDHW RLALDAGAYG
VHLGQEDLET ADLGAIARAG ARLGLSSHGY YEMLVALQFK PSYLALGPVF ATATKAVAAP
PQGLARLARY VRFAGPQAPL VAIGGIAPDT LGAVLAAGVG SAAVVSAITA AADYREAIVA
LQQNFGR