Gene GWCH70_2718 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2718 
Symbol 
ID7976536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2750718 
End bp2751923 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content40% 
IMG OID644799516 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_002950675 
Protein GI239828051 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATATG ACCGTATTTT AATTCGCTAT GGAGAAATGA CGACAAAAGG TCGAAATCGA 
AACTTGTTTG TTCGCCGTTT AAAAGATAAT GTCGCTAAAA AACTTCATGC GTTTCAGAAT
ATTAAAATCG AATATATGCG TGATCGCATG TACATTTTAC TAAACGGAGA ACCGCATGAA
CCGATTATTG ACAAATTAAA AAATGTATTT GGCATTCATT CATTTAGTTT AGCAATGAAA
TGTCATAATG AATTGAACGA AATAAAAGAA ACAGCATTGG CAGCTGTTAA GCAGCTTCCT
CATGAAGGAA AAACGTTTAA AATAAGCGCT CGAAGAGTGG ACAAGCAATT TCCGTATGGA
AGCAGCGAGT TAAATTATGA AATTGGCGCT CATATTTTGC GCAATACAGA CGGGTTAACG
GTAAATGTAC ATGATCCTGA TATCGATGTC CGCGTGGAAG TTCGCCAAGA AGGCACATAT
ATCACGTGTC ACGATATTCC AGGTCCCGGG GGCTTGCCGG TGGGATCGAG CGGCAAAGCA
ATGCTCATGC TTTCCGGTGG AATCGATAGC CCAGTTGCCG GGTATTTGGC AATGAAGCGC
GGATTGGAAA TAGAAGCGGT TCACTTTTTT AGTCCGCCGT TTACAAGCGA TCGGGCAAAA
CAAAAAGTGA TTGATTTAGT GAAAAAATTA ACCACGTACG GAGGAAAAAT AAAACTTCAC
ATTGTTCCTT TTACAGAAGT GCAGCAAGCC ATTTATACAC AAGTGCCAAA CGAATACTCG
CTTATTTCAA CAAGAAGAGC AATGCTAAAA ATTACTGATG CGTTGCGTCA GCGGCATCGC
GCACTTGCGA TTGTGACAGG GGAAAGCCTT GGACAAGTTG CCAGCCAAAC TCTAGAAAGC
ATGTTTGTCA TTAATGATGT CACCACAACG CCGATTTTGC GGCCGCTTGT ATCGATGGAT
AAAATAGAAA TTATCGACAT TGCTAAAAAA ATTGACACAC ACGATATTTC GATTCTTCCA
TATGAAGATT GTTGTACCAT TTTTACGCCA AGATCGCCAA AAACAAAACC AAAAAAAGAA
AAAGTGGTTC ATTATGAAAG TTTTGTTGAT TTACAACCAT TAATAGAGAA GGCGATCGCA
AATACGGAAA CGATGGTCAT CGACGAACAT TCCGCAACGG AAGATGAGTT TGAACAGTTA
TTCTAA
 
Protein sequence
MKYDRILIRY GEMTTKGRNR NLFVRRLKDN VAKKLHAFQN IKIEYMRDRM YILLNGEPHE 
PIIDKLKNVF GIHSFSLAMK CHNELNEIKE TALAAVKQLP HEGKTFKISA RRVDKQFPYG
SSELNYEIGA HILRNTDGLT VNVHDPDIDV RVEVRQEGTY ITCHDIPGPG GLPVGSSGKA
MLMLSGGIDS PVAGYLAMKR GLEIEAVHFF SPPFTSDRAK QKVIDLVKKL TTYGGKIKLH
IVPFTEVQQA IYTQVPNEYS LISTRRAMLK ITDALRQRHR ALAIVTGESL GQVASQTLES
MFVINDVTTT PILRPLVSMD KIEIIDIAKK IDTHDISILP YEDCCTIFTP RSPKTKPKKE
KVVHYESFVD LQPLIEKAIA NTETMVIDEH SATEDEFEQL F