Gene GWCH70_0374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_0374 
Symbol 
ID7979275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp428536 
End bp430257 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content47% 
IMG OID644797363 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_002948563 
Protein GI239825939 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAACA TCAAATCTTT CATGCAATTT CCTGCGAGTA AAAAGGTGTA TGTCGAAGGT 
TCAAGACCGG ATATTCGTGT TCCGATGAGA GAAATTACGT TAAGTCCAAC AAAGACAGAA
ACAGGAATCA TCGAAAATGA ACCTGTTCGT GTTTATGACA CAAGCGGTCC TTACACCGAT
CCTGATTTTC AGCCGGACAT TCAAAAAGGA TTGCCGCCGC TAAGAAAACG CTGGATTTTA
GAAAGGGGCG ACGTTGAAGA ATACGAGGGG CGTCCAGTGA AACCGGAAGA TAACGGATTC
CGTAATGGGA AGCAATCAGA GATTACTCTG CCTTTCGAGA GAAAACCGCT GCGCGCAAAA
AAAGGAAAAA CTGTGACGCA AATGCATTAC GCAAAACGAG GGATTATTAC GCCGGAGATG
GAATTTGTTG CGATTCGCGA AAATATAGAT CCGGAAATCG TGCGTCAAGA AGTTGCCGCA
GGCAGAGCGA TTATTCCTTC GAATATTAAC CATCCGGAAA GTGAACCGAT GATTATTGGG
AGTCGTTTTC ATGTAAAAAT TAACGCGAAT ATTGGCAATT CCGCAGTCAC ATCATCCATT
GAAGAAGAAG TCGAAAAAAT GCTTTGGGCG GTGCGCTGGG GAGCAGACAC GATTATGGAC
TTATCGACGG GAAAACACAT TCATGCGACA CGAGAATATA TTATTCGCAA TTCCCCTGTA
CCTGTCGGGA CGGTTCCGAT TTATCAAGCG CTTGAAAAAG TGAACGGCGT CGTCGAAGAC
TTAACGTGGG AAATTTACCG TGATACGCTC ATCGAACAAG CGGAACAAGG GGTGGACTAT
TTTACGATTC ACGCGGGGGT GCTGCTTCGT TACATACCGA TGACCGTAAA CCGAACGACT
GGCATTGTTT CACGCGGCGG CTCGATTATG GCGCAATGGT GTTTAGCCCA CCATGAAGAA
AACTTCTTAT ACACGCATTT TGAAGAAATA TGCGAGATTT TGAAAACGTA CGATATTGCA
GTATCGCTTG GAGACGGGCT GCGCCCAGGC TCGATTGCCG ATGCGAATGA CGAAGCGCAG
TTTGCTGAAT TGGAGACGCT CGGAGAATTA ACAGAAATCG CTTGGAAGCA TGATGTGCAA
GTGATGATTG AAGGACCAGG GCATGTGCCG ATGCATAAAA TTAAGGAAAA TGTCGATAAG
CAAATCGAAA TTTGCAAAGG TGCGCCGTTT TACACATTAG GACCGCTTAC GACCGATATC
GCGCCGGGAT ATGACCACAT TACATCCGCG ATCGGAGCAG CGATCATTGG CGCGTACGGA
ACGGCGATGC TTTGTTACGT GACGCCAAAA GAGCATTTAG GGTTGCCGAA TAAAGATGAT
GTGCGCGAAG GAGTGATCGC TTATAAAATC GCAGCACACG CGGCCGATTT AGCCAAAGGG
CACCCGGGAG CGCAACGGCG GGATGACGCG TTATCAAAAG CGCGCTTTGA GTTTCGTTGG
AACGATCAGT TCAACCTTTC GCTTGATCCA GATCGAGCTC GGGAATATCA TGATGAAACA
TTGCCGGCGG AAGGGGCGAA AGTCGCGCAC TTTTGTTCGA TGTGCGGGCC GAAGTTTTGC
TCGATGAAAA TTTCACATGA GCTGCAAAAA ACGGTGAGGG AAGAAGGGAT GAAACAAAAA
GCGAAGGAGT TTGTCGAAAA TGGCTCATCT CTCTATCGAT GA
 
Protein sequence
MQNIKSFMQF PASKKVYVEG SRPDIRVPMR EITLSPTKTE TGIIENEPVR VYDTSGPYTD 
PDFQPDIQKG LPPLRKRWIL ERGDVEEYEG RPVKPEDNGF RNGKQSEITL PFERKPLRAK
KGKTVTQMHY AKRGIITPEM EFVAIRENID PEIVRQEVAA GRAIIPSNIN HPESEPMIIG
SRFHVKINAN IGNSAVTSSI EEEVEKMLWA VRWGADTIMD LSTGKHIHAT REYIIRNSPV
PVGTVPIYQA LEKVNGVVED LTWEIYRDTL IEQAEQGVDY FTIHAGVLLR YIPMTVNRTT
GIVSRGGSIM AQWCLAHHEE NFLYTHFEEI CEILKTYDIA VSLGDGLRPG SIADANDEAQ
FAELETLGEL TEIAWKHDVQ VMIEGPGHVP MHKIKENVDK QIEICKGAPF YTLGPLTTDI
APGYDHITSA IGAAIIGAYG TAMLCYVTPK EHLGLPNKDD VREGVIAYKI AAHAADLAKG
HPGAQRRDDA LSKARFEFRW NDQFNLSLDP DRAREYHDET LPAEGAKVAH FCSMCGPKFC
SMKISHELQK TVREEGMKQK AKEFVENGSS LYR