Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1846 |
Symbol | |
ID | 3831707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1904263 |
End bp | 1905384 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829778 |
Product | hypothetical protein |
Protein accession | YP_430689 |
Protein GI | 83590680 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCCACCA CCACGGCAGA ATTTATACGC CGGGAGCTCT CCCGGGGGGA ACTGGCGGAC ATAGCTGCCA GGGTATATGC CGGGGAAAGG CTGACCCGGG AGGACGGTAT ACGTCTCTGG GAGAGTCAGG ACCTCCTTGG TATTGGCTAC CTGGCCGACC TGGCCCGGCA GCGGACCTGC GGTGATACCG TATATTTTAT TAATAATGCC CACATCAATT ATACCAATAT CTGTCAAAAT CTCTGCGATT TATGTGCCTT TGGGAGACAG ACAGGTACTC CGGGGGCTTA TACCCTCACC CTGGCGGAGA TTGAAGCGAA GGCCAGGGCG GCGGCTGCCG CAGGTGTCAC CGAGATCCAT ATTGTCGGCG GCCTGAATCC GGAATTACCC TATGATTACT ACCTGGAACT AATCCGAACC GTGCGCCGGG CGGCACCCGG GGCCTGCATC CAGGCCTTTG ACGCTGTGGA AATTGACTTT ATAGCCTCCC GGGCCGGGCG CCCCGTAGCT GACGTCCTCC AGGAACTCCG GCAGGCGGGC CTGGACTCCC TTCCCGGGGG TGGGGCCGAA GTTTTTGCCC CGGAGGTGCG CCGGCGCCTT TGCGCCAAAA AGATCGATGG ACAGCGATGG CTCCAGATCC ACGAGACGGC CCACCGGCTG GGCATACCCA CCAACGCTAC CATGCTCTAC GGCCACCTGG AGACGGCAGC CGACCGGGTG GATCACCTCC TGGCCCTCCG GGAACTCCAG GACCGGACAG GGGGCTTCCT GGCTTTTATT CCCCTGGCCT TTCACCCGGC CAACACGGCC TTCAGCAACC TGCCGGGAAC CACCGGCGTA GATGACTTGA AGATGTTGGC CATCAGCCGC CTGCTTCTGG ACAACTTCCG TCACATCAAA GCCTTCTGGA TTATGATTGG GCCGAAGTTA GCCCAGGTGG CCCTGCATTT TGGGGTAAAC GACCTGGACG GTACCGTCCG GGAAGAGCAT ATCTTTCACG ACGCCGGGGC TGAAACGCCC CAGTACCAGC CTGCTGAAAG CTTTTTACAG AGTATCCGGG CGGCCGGCCG GATCCCGGTG GAAAGGGATA CTTTATACCG GGAAATCCGC CGCTATGCCT GA
|
Protein sequence | MPTTTAEFIR RELSRGELAD IAARVYAGER LTREDGIRLW ESQDLLGIGY LADLARQRTC GDTVYFINNA HINYTNICQN LCDLCAFGRQ TGTPGAYTLT LAEIEAKARA AAAAGVTEIH IVGGLNPELP YDYYLELIRT VRRAAPGACI QAFDAVEIDF IASRAGRPVA DVLQELRQAG LDSLPGGGAE VFAPEVRRRL CAKKIDGQRW LQIHETAHRL GIPTNATMLY GHLETAADRV DHLLALRELQ DRTGGFLAFI PLAFHPANTA FSNLPGTTGV DDLKMLAISR LLLDNFRHIK AFWIMIGPKL AQVALHFGVN DLDGTVREEH IFHDAGAETP QYQPAESFLQ SIRAAGRIPV ERDTLYREIR RYA
|
| |