Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1844 |
Symbol | |
ID | 3831705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1902337 |
End bp | 1903419 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637829776 |
Product | hypothetical protein |
Protein accession | YP_430687 |
Protein GI | 83590678 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAAA CTATCGCCTG CAAAGTGCTT GACGGTGAGC GTTTGAGTTT TCACGAAGCG GAGCGTCTTT ATCAGGAGGC CGGCCTGCTG GAACTGGGTT ACCTGGCCAA CCTGGTCCGC CAGAAGCTGC ACCCCGAAGG GACGGTCACC TTTGTCGTGG ACCGGAATAT CAACTATACC AATATTTGCG TCAATGCCTG CCGTTTCTGT GCCTTTTATC GCCTGCCCGG GGATCCGGAA GGCTACCTCC TCAGCCGGGA GGAAATCGGC CGTAAAATCG AGGCGACCCT GGCTGCAGGG GGAACCCAGA TCCTGATGCA GGGGGGCCTG CATCCCGATC TAGACCTGGC CTGGTTTGAG GACCTCTTCT CCTGGATTAA ATCCCGCTAT CCGGTCACCC TTCATTCCCT ATCTCCGGTA GAAATCGATG ACCTGGCCCG GAAGGAAGGG CTGCCGGTGA TAGAGGTTTT GCGGCGGCTG AAGAAAGCAG GCCTCGATTC CCTGCCGGGC GGTGGGGCGG AGATCCTGGT CGACAGGGTT CGCGGCCGGG TTAGCCCCAA AAAGACCGGT GCGGCCCGCT GGCTGGAAGT GATGCGCGCC GCCCATGCCC TGGGGATGAA ATCAACCGCC ACCATGGTTT TCGGGTTGGG GGAAACTATG GCTGAGCGGA TCGCTCACCT GGAGGCTATC CGGCAGCTCC AGGATGAAAC GGGAGGTTTT ACAGCCTTCA TCCCCTGGAG TTTCCAGCCA GGCAATACCG AACTTGGCGG CGTGGAAGCT AGCACCACCG AATACCTGAA ACTCCTGGCC CTTTCCCGGC TTTACCTGGA TAATATCCCC AATATCCAGG TCTCCTGGGT AACCCAGGGA ACCAAGGTGG CTCAGGCGGC CCTTTTCTTT GGCGCCAACG ATTTCGGTTC CACCATGCTG GAGGAAAACG TAGTCCGGGC GGCTGGTGCT TCCTTCCGTG CCGATCGGGA GGAGATCCTT CGCTGCATCC AGGCGGCCGG TTTCCGGCCC GCCCAGCGCG ATAATGAATA TCATATTTTG CGCTACTACG AGGCAGGGCA GGTTAACCGG TGA
|
Protein sequence | MKETIACKVL DGERLSFHEA ERLYQEAGLL ELGYLANLVR QKLHPEGTVT FVVDRNINYT NICVNACRFC AFYRLPGDPE GYLLSREEIG RKIEATLAAG GTQILMQGGL HPDLDLAWFE DLFSWIKSRY PVTLHSLSPV EIDDLARKEG LPVIEVLRRL KKAGLDSLPG GGAEILVDRV RGRVSPKKTG AARWLEVMRA AHALGMKSTA TMVFGLGETM AERIAHLEAI RQLQDETGGF TAFIPWSFQP GNTELGGVEA STTEYLKLLA LSRLYLDNIP NIQVSWVTQG TKVAQAALFF GANDFGSTML EENVVRAAGA SFRADREEIL RCIQAAGFRP AQRDNEYHIL RYYEAGQVNR
|
| |