Gene Information |
Locus tag | GWCH70_0374 |
Symbol | |
ID | 7979275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 428536 |
End bp | 430257 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644797363 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_002948563 |
Protein GI | 239825939 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
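The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; both assumptions are consistent with the values listed, but the record itself does not state them. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon (assumed to be counted in the gene length) encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(428536, 430257)
print(length)                     # 1722
print(protein_length_aa(length))  # 573
```

Both results agree with the Gene Length (1722 bp) and Protein Length (573 aa) fields.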
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAACA TCAAATCTTT CATGCAATTT CCTGCGAGTA AAAAGGTGTA TGTCGAAGGT TCAAGACCGG ATATTCGTGT TCCGATGAGA GAAATTACGT TAAGTCCAAC AAAGACAGAA ACAGGAATCA TCGAAAATGA ACCTGTTCGT GTTTATGACA CAAGCGGTCC TTACACCGAT CCTGATTTTC AGCCGGACAT TCAAAAAGGA TTGCCGCCGC TAAGAAAACG CTGGATTTTA GAAAGGGGCG ACGTTGAAGA ATACGAGGGG CGTCCAGTGA AACCGGAAGA TAACGGATTC CGTAATGGGA AGCAATCAGA GATTACTCTG CCTTTCGAGA GAAAACCGCT GCGCGCAAAA AAAGGAAAAA CTGTGACGCA AATGCATTAC GCAAAACGAG GGATTATTAC GCCGGAGATG GAATTTGTTG CGATTCGCGA AAATATAGAT CCGGAAATCG TGCGTCAAGA AGTTGCCGCA GGCAGAGCGA TTATTCCTTC GAATATTAAC CATCCGGAAA GTGAACCGAT GATTATTGGG AGTCGTTTTC ATGTAAAAAT TAACGCGAAT ATTGGCAATT CCGCAGTCAC ATCATCCATT GAAGAAGAAG TCGAAAAAAT GCTTTGGGCG GTGCGCTGGG GAGCAGACAC GATTATGGAC TTATCGACGG GAAAACACAT TCATGCGACA CGAGAATATA TTATTCGCAA TTCCCCTGTA CCTGTCGGGA CGGTTCCGAT TTATCAAGCG CTTGAAAAAG TGAACGGCGT CGTCGAAGAC TTAACGTGGG AAATTTACCG TGATACGCTC ATCGAACAAG CGGAACAAGG GGTGGACTAT TTTACGATTC ACGCGGGGGT GCTGCTTCGT TACATACCGA TGACCGTAAA CCGAACGACT GGCATTGTTT CACGCGGCGG CTCGATTATG GCGCAATGGT GTTTAGCCCA CCATGAAGAA AACTTCTTAT ACACGCATTT TGAAGAAATA TGCGAGATTT TGAAAACGTA CGATATTGCA GTATCGCTTG GAGACGGGCT GCGCCCAGGC TCGATTGCCG ATGCGAATGA CGAAGCGCAG TTTGCTGAAT TGGAGACGCT CGGAGAATTA ACAGAAATCG CTTGGAAGCA TGATGTGCAA GTGATGATTG AAGGACCAGG GCATGTGCCG ATGCATAAAA TTAAGGAAAA TGTCGATAAG CAAATCGAAA TTTGCAAAGG TGCGCCGTTT TACACATTAG GACCGCTTAC GACCGATATC GCGCCGGGAT ATGACCACAT TACATCCGCG ATCGGAGCAG CGATCATTGG CGCGTACGGA ACGGCGATGC TTTGTTACGT GACGCCAAAA GAGCATTTAG GGTTGCCGAA TAAAGATGAT GTGCGCGAAG GAGTGATCGC TTATAAAATC GCAGCACACG CGGCCGATTT AGCCAAAGGG CACCCGGGAG CGCAACGGCG GGATGACGCG TTATCAAAAG CGCGCTTTGA GTTTCGTTGG AACGATCAGT TCAACCTTTC GCTTGATCCA GATCGAGCTC GGGAATATCA TGATGAAACA TTGCCGGCGG AAGGGGCGAA AGTCGCGCAC TTTTGTTCGA TGTGCGGGCC GAAGTTTTGC TCGATGAAAA TTTCACATGA GCTGCAAAAA ACGGTGAGGG AAGAAGGGAT GAAACAAAAA GCGAAGGAGT TTGTCGAAAA TGGCTCATCT CTCTATCGAT GA
|
Protein sequence | MQNIKSFMQF PASKKVYVEG SRPDIRVPMR EITLSPTKTE TGIIENEPVR VYDTSGPYTD PDFQPDIQKG LPPLRKRWIL ERGDVEEYEG RPVKPEDNGF RNGKQSEITL PFERKPLRAK KGKTVTQMHY AKRGIITPEM EFVAIRENID PEIVRQEVAA GRAIIPSNIN HPESEPMIIG SRFHVKINAN IGNSAVTSSI EEEVEKMLWA VRWGADTIMD LSTGKHIHAT REYIIRNSPV PVGTVPIYQA LEKVNGVVED LTWEIYRDTL IEQAEQGVDY FTIHAGVLLR YIPMTVNRTT GIVSRGGSIM AQWCLAHHEE NFLYTHFEEI CEILKTYDIA VSLGDGLRPG SIADANDEAQ FAELETLGEL TEIAWKHDVQ VMIEGPGHVP MHKIKENVDK QIEICKGAPF YTLGPLTTDI APGYDHITSA IGAAIIGAYG TAMLCYVTPK EHLGLPNKDD VREGVIAYKI AAHAADLAKG HPGAQRRDDA LSKARFEFRW NDQFNLSLDP DRAREYHDET LPAEGAKVAH FCSMCGPKFC SMKISHELQK TVREEGMKQK AKEFVENGSS LYR
|
| |
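The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping and compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not the whole-gene value.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two display blocks of the gene sequence, for illustration only.
example = clean_sequence("ATGCAAAACA TCAAATCTTT")
print(len(example))          # 20
print(gc_fraction(example))  # 0.25
```

Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the reported 47%, bearing in mind that the record does not say how that figure was rounded.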