Gene BURPS1106A_3736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3736 
SymbolthiE 
ID4900941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3646165 
End bp3647268 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content75% 
IMG OID640136962 
Productthiamine-phosphate pyrophosphorylase 
Protein accessionYP_001067966 
Protein GI126453292 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0352] Thiamine monophosphate synthase 
TIGRFAM ID[TIGR00693] thiamine-phosphate pyrophosphorylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCCG CGTTGCCCGA CGCGTTCTGG CCGCCCGCCG ACGAGCTTAC CGAGGCCGCC 
GAGCGGATTC GCGCGACGCT CGGTGCGTGG CCGCAGCCGG CCGTGCGCAC GCGGATCTGT
CTCGCGCCGC CCGAGCAGCC GCGCGCGGCC GACCTGTGGG TCGCCATCGC GGGCGACGCC
GGCGCGCACG CCGCGCAGAT CGCGCGGCTG AACGCGGCGG GCGCGCAGGC GATCGTCATC
GACGATGCAT CGGCGACGCT CCACACGGGC GCGGCGCGCC ATGCGCTCGC GTCGCGCGCG
CCGCTCGCCG ACGACTGGAT CGCGGCGCTC GCGGCGTTTC TCGATTGCGG CTTCGCCGCG
TCCGACGCAC TCGTGCTCGC GCTCGCATGG CGCGACGGCG ACGAGGCGCG CGGCGGCGAT
CCGTGGCCCG TCGATCCGGC ACGCTTTCCG CGCGTGCTCG GCCTGCCCGC CGCGCCCGAA
CCGGCGTTCG CGCCGTGCCC GCAGCGGCTC GGCCTGTATC CGGTGCTGCC GAGTGCCGAA
TGGGTCGAGC GCGTGCTCGA TTGCGGCGTG CGGACCGTGC AACTGCGCGT GAAGGACGCC
TCGCCCGACG CGCTGCGCGC GGAGGTCGAG CGGGCCGTTG CCGCGGGCCG CCGCCATCCG
GACGCGCGCG TGTTCATCAA CGATCACTGG CGGCTCGCGC TCGACGCGGG CGCATACGGC
GTGCACCTCG GCCAGGAGGA TCTGGAGACC GCCGATCTCG GCGCGATCGC GCGGGCGGGC
GCGCGGCTCG GCCTGTCGAG CCACGGGTAT TACGAAATGC TCGTCGCGCT GCAGTTCAAG
CCGAGCTATC TCGCGCTCGG CCCGGTGTTC GCGACCGCGA CGAAGGCGGT TGCCGCGCCG
CCGCAAGGCC TCGCGCGGCT TGCGCGCTAC GTGCGCTTCG CCGGGCCGCA GGCGCCGCTC
GTCGCGATCG GCGGAATCGC GCCCGACACG CTCGGCGCGG TGCTGGCGGC GGGCGTCGGC
AGCGCGGCCG TCGTCAGCGC GATCACGGCG GCGACCGATT ACCGGGAAGC GATTGTTGCA
TTGCAGCAAA ACTTCGGACG ATAA
 
Protein sequence
MSAALPDAFW PPADELTEAA ERIRATLGAW PQPAVRTRIC LAPPEQPRAA DLWVAIAGDA 
GAHAAQIARL NAAGAQAIVI DDASATLHTG AARHALASRA PLADDWIAAL AAFLDCGFAA
SDALVLALAW RDGDEARGGD PWPVDPARFP RVLGLPAAPE PAFAPCPQRL GLYPVLPSAE
WVERVLDCGV RTVQLRVKDA SPDALRAEVE RAVAAGRRHP DARVFINDHW RLALDAGAYG
VHLGQEDLET ADLGAIARAG ARLGLSSHGY YEMLVALQFK PSYLALGPVF ATATKAVAAP
PQGLARLARY VRFAGPQAPL VAIGGIAPDT LGAVLAAGVG SAAVVSAITA ATDYREAIVA
LQQNFGR