Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2718 |
Symbol | |
ID | 7976536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2750718 |
End bp | 2751923 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644799516 |
Product | thiamine biosynthesis protein ThiI |
Protein accession | YP_002950675 |
Protein GI | 239828051 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATATG ACCGTATTTT AATTCGCTAT GGAGAAATGA CGACAAAAGG TCGAAATCGA AACTTGTTTG TTCGCCGTTT AAAAGATAAT GTCGCTAAAA AACTTCATGC GTTTCAGAAT ATTAAAATCG AATATATGCG TGATCGCATG TACATTTTAC TAAACGGAGA ACCGCATGAA CCGATTATTG ACAAATTAAA AAATGTATTT GGCATTCATT CATTTAGTTT AGCAATGAAA TGTCATAATG AATTGAACGA AATAAAAGAA ACAGCATTGG CAGCTGTTAA GCAGCTTCCT CATGAAGGAA AAACGTTTAA AATAAGCGCT CGAAGAGTGG ACAAGCAATT TCCGTATGGA AGCAGCGAGT TAAATTATGA AATTGGCGCT CATATTTTGC GCAATACAGA CGGGTTAACG GTAAATGTAC ATGATCCTGA TATCGATGTC CGCGTGGAAG TTCGCCAAGA AGGCACATAT ATCACGTGTC ACGATATTCC AGGTCCCGGG GGCTTGCCGG TGGGATCGAG CGGCAAAGCA ATGCTCATGC TTTCCGGTGG AATCGATAGC CCAGTTGCCG GGTATTTGGC AATGAAGCGC GGATTGGAAA TAGAAGCGGT TCACTTTTTT AGTCCGCCGT TTACAAGCGA TCGGGCAAAA CAAAAAGTGA TTGATTTAGT GAAAAAATTA ACCACGTACG GAGGAAAAAT AAAACTTCAC ATTGTTCCTT TTACAGAAGT GCAGCAAGCC ATTTATACAC AAGTGCCAAA CGAATACTCG CTTATTTCAA CAAGAAGAGC AATGCTAAAA ATTACTGATG CGTTGCGTCA GCGGCATCGC GCACTTGCGA TTGTGACAGG GGAAAGCCTT GGACAAGTTG CCAGCCAAAC TCTAGAAAGC ATGTTTGTCA TTAATGATGT CACCACAACG CCGATTTTGC GGCCGCTTGT ATCGATGGAT AAAATAGAAA TTATCGACAT TGCTAAAAAA ATTGACACAC ACGATATTTC GATTCTTCCA TATGAAGATT GTTGTACCAT TTTTACGCCA AGATCGCCAA AAACAAAACC AAAAAAAGAA AAAGTGGTTC ATTATGAAAG TTTTGTTGAT TTACAACCAT TAATAGAGAA GGCGATCGCA AATACGGAAA CGATGGTCAT CGACGAACAT TCCGCAACGG AAGATGAGTT TGAACAGTTA TTCTAA
|
Protein sequence | MKYDRILIRY GEMTTKGRNR NLFVRRLKDN VAKKLHAFQN IKIEYMRDRM YILLNGEPHE PIIDKLKNVF GIHSFSLAMK CHNELNEIKE TALAAVKQLP HEGKTFKISA RRVDKQFPYG SSELNYEIGA HILRNTDGLT VNVHDPDIDV RVEVRQEGTY ITCHDIPGPG GLPVGSSGKA MLMLSGGIDS PVAGYLAMKR GLEIEAVHFF SPPFTSDRAK QKVIDLVKKL TTYGGKIKLH IVPFTEVQQA IYTQVPNEYS LISTRRAMLK ITDALRQRHR ALAIVTGESL GQVASQTLES MFVINDVTTT PILRPLVSMD KIEIIDIAKK IDTHDISILP YEDCCTIFTP RSPKTKPKKE KVVHYESFVD LQPLIEKAIA NTETMVIDEH SATEDEFEQL F
|
| |