Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_3678 |
Symbol | thiE |
ID | 4883104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 3598948 |
End bp | 3600051 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640129606 |
Product | thiamine-phosphate pyrophosphorylase |
Protein accession | YP_001060682 |
Protein GI | 126441181 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0352] Thiamine monophosphate synthase |
TIGRFAM ID | [TIGR00693] thiamine-phosphate pyrophosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.336628 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCCG CGTTGCCCGA CGCGTTCTGG CCGCCCGCCG ACGAGCTTAC CGAGGCCGCC GAGCGGATTC GCGCGACGCT CGGTGCGTGG CCGCGGCCGG CCGTGCGCAC GCGGATCTGT CTCGCGCCGC CCGAGCAGCC GCGCGCGGCC GACCTGTGGG TCGCCATCGC GGGCGACGCC GGCGCGCACG CCGCGCACAT CGCGCGGCTG AACGCGGCGG GCGCGCAGGC GATCGTCATC GACGATGCAT CGGCGACGCT CCACACGGGC GCGGCGCGCC ATGCGCTCGC GTCGCGCGCG CCGCTCGCCG ACGACTGGAT CGCGGCGCTC GCGGCGTTTC TCGATTGCGG CTTCGCCGCG TCCGACGCAC TCGTGCTCGC GCTCGCATGG CGCGACGGCG ACGAGGCGCG CGGCGGCGAT CCGTGGCCCG TCGATCCGGC ACGCTTTCCG CGCGTGCTCG GCCTGCCCGC CGCGCCCGAA CCGGCGTTCG CGCCGTGCCC GCAGCGGCTC GGCCTGTATC CGGTGCTGCC GAGCGCCGAA TGGGTCGAGC GCGTGCTCGA TTGCGGCGTG CGGACCGTGC AACTGCGCGT GAAGGACGCC TCGCCCGACG CGCTGCGCGC GGAGATCGAG CGGGCCGTTG CCGCGGGCCG CCGCCATCCG GACGCGCGCG TGTTCATCAA CGATCACTGG CGGCTCGCGC TCGACGCGGG CGCATACGGC GTGCACCTCG GCCAGGAGGA TCTGGAGACC GCCGATCTCG GCGCGATCGC GCGGGCGGGC GCGCGGCTCG GCCTGTCGAG CCACGGGTAT TACGAAATGC TCGTCGCGCT GCAGTTCAAG CCGAGCTATC TCGCGCTCGG CCCGGTGTTC GCGACCGCGA CGAAGGCGGT TGCCGCGCCG CCGCAAGGCC TCGCGCGGCT TGCGCGCTAC GTGCGCTTCG CCGGGCCGCA GGCGCCGCTC GTCGCGATCG GCGGAATCGC GCCCGACACG CTCGGCGCGG TGCTGGCGGC GGGCGTCGGC AGCGCGGCCG TCGTCAGCGC GATCACGGCG GCGGCCGATT ACCGGGAAGC GATTGTTGCA TTGCAGCAAA ACTTCGGACG ATAA
|
Protein sequence | MSAALPDAFW PPADELTEAA ERIRATLGAW PRPAVRTRIC LAPPEQPRAA DLWVAIAGDA GAHAAHIARL NAAGAQAIVI DDASATLHTG AARHALASRA PLADDWIAAL AAFLDCGFAA SDALVLALAW RDGDEARGGD PWPVDPARFP RVLGLPAAPE PAFAPCPQRL GLYPVLPSAE WVERVLDCGV RTVQLRVKDA SPDALRAEIE RAVAAGRRHP DARVFINDHW RLALDAGAYG VHLGQEDLET ADLGAIARAG ARLGLSSHGY YEMLVALQFK PSYLALGPVF ATATKAVAAP PQGLARLARY VRFAGPQAPL VAIGGIAPDT LGAVLAAGVG SAAVVSAITA AADYREAIVA LQQNFGR
|
| |