Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1722 |
Symbol | |
ID | 3833022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1765892 |
End bp | 1767172 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637829647 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_430567 |
Protein GI | 83590558 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00870726 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.564232 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGG TTTTAGATGC CCGTGCTGGA AAAATTACTC CGGAAATGGA AAAAGTGGCG GCAGATGAAA AGGTGGATGT GGAATTTGTC CGGGCAGGGG TGGCGGAGGG GACTATTGTA ATCCCCCGGA ATACCAACCG GAAGGTCCTT AAACCCTGCG GTATCGGGAG GGGTTTACGT ATCAAGGTGA ATGCCCTGAT CGGGACCTCC AGCGACCGGG ATGACCGGCA AATGGAGATG CGGAAGATCG CTGCGGCCGA GGCGGCAGGG TGTGATTCCT TTATGGATTT AAGCACCGGC GGGGATATCG ATGAGATGCG GCGGCTCACC CTTGCCCACG CCAGGGTTCC GGTAGGCAGC GTGCCCATTT ATCAGGCAGC CATCGAAGCC ATTGAAAAGC GGGGCAGTAT TGTAGCGATG ACGCCGGACG ACATGTTTGC GGCCGTGGAA AAACAGGCAA GGGACGGGAT AGATTTCATG GCCATTCACA GCGCCCTGAA TTTCGAGATC CTCGAAAGGC TTCAGGCTAG CGGCAGGGTG ACCGACATTG TCAGCCGCGG TGGGGCCTTC CTCACCGGCT GGATGCTGCA CAACCAGAAA GAGAATCCCC TTTATGAGCA GTTCGACAGG TTGCTCGAAA TCTTGCTGAA GTACGATGTC ACCCTCAGCC TCGGCGACGC CATTCGTCCG GGTTCTACAG CCGACTCCCT GGACGGGGCC CAACTGCAGG GAATGATCGT GGCCGGGGAA CTGGTCAGGC GCGCCAGGGA AGCCGGCGTG CAGGTTATGG TCGAGGGTCC GGGACATGTT CCCCTCAACC ATGTGGAAAC GACAATGAAA CTACAGAAAA GCCTGTGCGG GGGCGCGCCT TACTTTATTC TGGGTACCCT GGCTACTGAT GTGGCGCCGG GATATGACCA TATCACTGCC GCAATAGGGG GTGCCCTTGC CGGGACGGTT GGGGCGGATT TTATCTGCTA TGTGACACCG GCGGAGCATC TGGGGTTACC AACAGAGCAG GACGTTAAAG AAGGGGTGAT TGCCGCCCGC ATTGCCGCCC ATGCCGCCGA TCTGGCCAGG GGAAACAGGC AGGCCTGGGA GCGGGATCTG CAAATGGCGC GGGCGCGGGT CGCCCTCGAT GTGGAAAAGC AGATAAGCCT TGCCATTGAT CAGGAAAAGG CACGCTCGTT GCTCGACGGT ACCGGGGAAG ACGGGGTTTG TGCTGCCTGT GGGACGAACT GCGCAGCCCT GGTGGCCGCC CGTTATTTCG GGATGAACTG A
|
Protein sequence | MSQVLDARAG KITPEMEKVA ADEKVDVEFV RAGVAEGTIV IPRNTNRKVL KPCGIGRGLR IKVNALIGTS SDRDDRQMEM RKIAAAEAAG CDSFMDLSTG GDIDEMRRLT LAHARVPVGS VPIYQAAIEA IEKRGSIVAM TPDDMFAAVE KQARDGIDFM AIHSALNFEI LERLQASGRV TDIVSRGGAF LTGWMLHNQK ENPLYEQFDR LLEILLKYDV TLSLGDAIRP GSTADSLDGA QLQGMIVAGE LVRRAREAGV QVMVEGPGHV PLNHVETTMK LQKSLCGGAP YFILGTLATD VAPGYDHITA AIGGALAGTV GADFICYVTP AEHLGLPTEQ DVKEGVIAAR IAAHAADLAR GNRQAWERDL QMARARVALD VEKQISLAID QEKARSLLDG TGEDGVCAAC GTNCAALVAA RYFGMN
|
| |