Gene BURPS668_3678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3678 
SymbolthiE 
ID4883104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3598948 
End bp3600051 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content75% 
IMG OID640129606 
Productthiamine-phosphate pyrophosphorylase 
Protein accessionYP_001060682 
Protein GI126441181 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0352] Thiamine monophosphate synthase 
TIGRFAM ID[TIGR00693] thiamine-phosphate pyrophosphorylase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.336628 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCCG CGTTGCCCGA CGCGTTCTGG CCGCCCGCCG ACGAGCTTAC CGAGGCCGCC 
GAGCGGATTC GCGCGACGCT CGGTGCGTGG CCGCGGCCGG CCGTGCGCAC GCGGATCTGT
CTCGCGCCGC CCGAGCAGCC GCGCGCGGCC GACCTGTGGG TCGCCATCGC GGGCGACGCC
GGCGCGCACG CCGCGCACAT CGCGCGGCTG AACGCGGCGG GCGCGCAGGC GATCGTCATC
GACGATGCAT CGGCGACGCT CCACACGGGC GCGGCGCGCC ATGCGCTCGC GTCGCGCGCG
CCGCTCGCCG ACGACTGGAT CGCGGCGCTC GCGGCGTTTC TCGATTGCGG CTTCGCCGCG
TCCGACGCAC TCGTGCTCGC GCTCGCATGG CGCGACGGCG ACGAGGCGCG CGGCGGCGAT
CCGTGGCCCG TCGATCCGGC ACGCTTTCCG CGCGTGCTCG GCCTGCCCGC CGCGCCCGAA
CCGGCGTTCG CGCCGTGCCC GCAGCGGCTC GGCCTGTATC CGGTGCTGCC GAGCGCCGAA
TGGGTCGAGC GCGTGCTCGA TTGCGGCGTG CGGACCGTGC AACTGCGCGT GAAGGACGCC
TCGCCCGACG CGCTGCGCGC GGAGATCGAG CGGGCCGTTG CCGCGGGCCG CCGCCATCCG
GACGCGCGCG TGTTCATCAA CGATCACTGG CGGCTCGCGC TCGACGCGGG CGCATACGGC
GTGCACCTCG GCCAGGAGGA TCTGGAGACC GCCGATCTCG GCGCGATCGC GCGGGCGGGC
GCGCGGCTCG GCCTGTCGAG CCACGGGTAT TACGAAATGC TCGTCGCGCT GCAGTTCAAG
CCGAGCTATC TCGCGCTCGG CCCGGTGTTC GCGACCGCGA CGAAGGCGGT TGCCGCGCCG
CCGCAAGGCC TCGCGCGGCT TGCGCGCTAC GTGCGCTTCG CCGGGCCGCA GGCGCCGCTC
GTCGCGATCG GCGGAATCGC GCCCGACACG CTCGGCGCGG TGCTGGCGGC GGGCGTCGGC
AGCGCGGCCG TCGTCAGCGC GATCACGGCG GCGGCCGATT ACCGGGAAGC GATTGTTGCA
TTGCAGCAAA ACTTCGGACG ATAA
 
Protein sequence
MSAALPDAFW PPADELTEAA ERIRATLGAW PRPAVRTRIC LAPPEQPRAA DLWVAIAGDA 
GAHAAHIARL NAAGAQAIVI DDASATLHTG AARHALASRA PLADDWIAAL AAFLDCGFAA
SDALVLALAW RDGDEARGGD PWPVDPARFP RVLGLPAAPE PAFAPCPQRL GLYPVLPSAE
WVERVLDCGV RTVQLRVKDA SPDALRAEIE RAVAAGRRHP DARVFINDHW RLALDAGAYG
VHLGQEDLET ADLGAIARAG ARLGLSSHGY YEMLVALQFK PSYLALGPVF ATATKAVAAP
PQGLARLARY VRFAGPQAPL VAIGGIAPDT LGAVLAAGVG SAAVVSAITA AADYREAIVA
LQQNFGR