Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1723 |
Symbol | |
ID | 3833023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1767185 |
End bp | 1768483 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637829648 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_430568 |
Protein GI | 83590559 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000000414436 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.578112 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAACCTGA TAGAAAGCGC CCGGGCCGGC TTGATTACCC CCGAAATGGA GCAGGTGGCC GTTCAGGAGG GTGTAACTCC GGAATTCGTG AGGCAGGGTG TGGCTGATGG GACGATAGTC ATCCTGCGCA ATGCCCGCCG GCAGAACGTC ACTCCAGTTG GTGTCGGTAA AGGGCTGAGG ACAAAGGTCA GCGCCAGCGT CGGTTTGTAC GGAGAGACGG GCGGTATTGA TGTGGAGGTT GCCAAGATTA AAGCCGCCGT GGAAGCGGGA ACGGACGCCA TCATGGATCT GAGCGTCAGC GGGGACATCG AGGCCATGCT TGCGGAAACG CTGGCCGTTT CCCCCAAGCC CGTCGGTACC TTGCCCCTTT ACCAGGCCAT GGCCGAAGCC GGCAGAAAAT ACGGTTCTTC CGTTAACATG AGAGATGAAG ACTTGTTTGA TGTAATTGAA CGCCACGCGG CCGCCGGGGT AGACTTTCTG GCCCTGCACT GCGGGACTAC TATGAATATT GTAGAACGCG CCAGAAACGA GGGCCGGATC GATCCTCTGG TAAGCTACGG GGGTTCCCAC CTGATCGGGT GGATGCTGGC GAACCGGAGG GAAAACCCCC TTTATGAACA CTTTGACCGG GTTCTTGCGA TTGCCCGGAA GTACGATGTT ACCATCAGCT TTGCCGACGG CATGCGACCG GGATGCCTGG CCGATTCCCT GGATGGCCCC CAGGTGGAAG AGCTGGTTGT TTTGGGAGAG CTGGTCAGGC GGGCGAGGGA AGCCGGTGTA CAGGTGATGG TAAAAGGGCC GGGTCATGTA CCCTTGCAGC AACTAAAGGC GACGGTTGTC CTGGAAAAAA GTCTCTGCCA CGGGGCGCCG TATTTTGTCT TCGGCCCCCT GGTAACAGAT ATCGCAATCG GTTATGACCA TATCAATGCT GCTATCGGGG GTGCCTTGAG CGCCTGGGCG GGTGCGGAGT TTCTCTGTTA TGTAACTGCC GCCGAACATG TGGGGATTCC GGATATTGAC CAGGTCCGGG AGGGAGTGAT TGCCGCTCGC ATTGCCGCCC ATGCCGCCGA CCTGGCCAAC GGCCTTACCT GTGCCCGGGA ATGGGATCGG GAGCTTTCCC GGGCGAGAAA AGAACTGGAC TGGAAGCGGC AGATTGCTCT CGCCATAGAC CCCGAACGGG CGGGAAGGCT GAGAGAAGAA AGAAGCGACG CCGCGGCGGC GGGATGTGCC ATGTGCGGTA AATACTGCGC CATGGAAATC GTATCCAGAT ACCTGGGCAC AGCCAGACAT ACATGTTAG
|
Protein sequence | MNLIESARAG LITPEMEQVA VQEGVTPEFV RQGVADGTIV ILRNARRQNV TPVGVGKGLR TKVSASVGLY GETGGIDVEV AKIKAAVEAG TDAIMDLSVS GDIEAMLAET LAVSPKPVGT LPLYQAMAEA GRKYGSSVNM RDEDLFDVIE RHAAAGVDFL ALHCGTTMNI VERARNEGRI DPLVSYGGSH LIGWMLANRR ENPLYEHFDR VLAIARKYDV TISFADGMRP GCLADSLDGP QVEELVVLGE LVRRAREAGV QVMVKGPGHV PLQQLKATVV LEKSLCHGAP YFVFGPLVTD IAIGYDHINA AIGGALSAWA GAEFLCYVTA AEHVGIPDID QVREGVIAAR IAAHAADLAN GLTCAREWDR ELSRARKELD WKRQIALAID PERAGRLREE RSDAAAAGCA MCGKYCAMEI VSRYLGTARH TC
|
| |