Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0198 |
Symbol | |
ID | 3832271 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 193951 |
End bp | 195123 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637828134 |
Product | thiamine biosynthesis protein ThiI |
Protein accession | YP_429076 |
Protein GI | 83589067 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATACAT CCCTACTGGT CCGTTACGGC GAGATCAGCC TCAAGGGCAA CAACCGCCCA TACTTTGAAG ATAAACTCCT GGCCAACATG CGCCGGGCCC TGGCCGGCCT GCCGCCCCGC AGAATGCGCA AGACCTTCGG CCGCGTCTTC GTGGAGCTCC ATGACGACCT GGAAGCCGTA GCCCGGCGTT TGCAGCGCGT CTTTGGCATC GTCTCCATGA GCCCGGTAGC CACAGCTCCC CTGGAGCTGG AAGCCATCAA AAAAGCCGCC CTGGCCGTCT TAAAGGATTC CCCCGGCAGC ACCTTTAAAG TCCAGGCCCA GCGGCCCAAT AAACGCTTTC CCCTCACCTC ACCTGAAGTC AACCAGGAAC TGGGGGCCTA CCTCCTCACC CACAGCCAGG GCCAACGGGT AGACGTTCAC CATCCCGACC GGGTTATCCA TGTGGAAATC CGCGACGAAG GGGCCTATAT CTACTCCCGC ATCATCCCCG GACCCGGCGG CCTGCCCGTA GGGGTCACCG GCCGGGGGCT GCTCCTGATC TCCGGCGGCA TCGACAGCCC GGTGGCCGGC TATATGGGCA TGAAACGGGG CCTGGAACTC ACGGCCCTCC ATTTTCACAG CTTCCCTTTT ACCAGCGAGC GCTCCAAGGA AAAGGTCATC GACCTCTGCC GGGTCCTGGC AGGCTACAGC GGACCTTTGC GCCTGGTGGT GGCCCCCTTT ACCAATATCC AGAAGGCCAT CCGCCAGAAC TGCCCCCAGG AGTTTTACGT CACCATCATG CGGCGGATGA TGTTCCGCAT CGCCAGGGCG GTGGCTGCCA AAGAGGAGGC CCCGGCCATC CTCACGGGGG AGAGCCTGGG CCAGGTGGCC AGCCAGACCC TCCAGAGCAT GGCGGTAATC AACAAGGTGG TCGACCTGCC GGTCTTAAGG CCCCTGGTGG CCTGGGACAA GAGCGAGATT ATCGAGGTGG CCCGCCGCAT CGGCACCTAC GACATCTCCA TCCGGCCCTA CGAGGACTGC TGCACCCTCT TTGTCCCCAA ACACCCGGCC ACCAAACCGC CCCTGGCCCG GGTGGAAGCG GCCGAAAAGA ATCTGGCCGT GGTGGAACTC GTCGCGGAGT GCCTGGAAAA TCTAGAAATC CTGACGGTGG AGCCCGAAGC TGACGTAGTG TAA
|
Protein sequence | MYTSLLVRYG EISLKGNNRP YFEDKLLANM RRALAGLPPR RMRKTFGRVF VELHDDLEAV ARRLQRVFGI VSMSPVATAP LELEAIKKAA LAVLKDSPGS TFKVQAQRPN KRFPLTSPEV NQELGAYLLT HSQGQRVDVH HPDRVIHVEI RDEGAYIYSR IIPGPGGLPV GVTGRGLLLI SGGIDSPVAG YMGMKRGLEL TALHFHSFPF TSERSKEKVI DLCRVLAGYS GPLRLVVAPF TNIQKAIRQN CPQEFYVTIM RRMMFRIARA VAAKEEAPAI LTGESLGQVA SQTLQSMAVI NKVVDLPVLR PLVAWDKSEI IEVARRIGTY DISIRPYEDC CTLFVPKHPA TKPPLARVEA AEKNLAVVEL VAECLENLEI LTVEPEADVV
|
| |