Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0512 |
Symbol | |
ID | 3831814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 531172 |
End bp | 532572 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637828446 |
Product | hypothetical protein |
Protein accession | YP_429385 |
Protein GI | 83589376 |
COG category | [S] Function unknown |
COG ID | [COG2719] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAACAAG AATTTAAAAC CATCGCCCAG GCTATTGAAA CCATCCACGA CCAGGCCAGG AAATTCGGCC TGGACTTTTT TCCGGTCTAC TTCGAGCTCT GCCCGGCTGA CGTCCTCTAT GCCTTCGGCG CTTACGGTAT GCCCACCAGG TTCGCCCACT GGACCTTCGG GAAACACTTT TATAAAATGA AGCTCCAGTA CGACTTTAAC CTCAGCCGTA TCTACGAACT GGTCATCAAC TCCAACCCGT GTTATGCCTT TCTCCTGGAG GGCAATGACC TCATCCAGAA TAAACTGGTC ATTGCCCATG TTTTCGCCCA CAGCGATTTC TTTAAAAACA ACATCTATTT TACCGCCACC TCCCGCCAGA TGGTGGAGAC CATGGCCGTT CACGCCGCCA AGATCCGGGA ATACGAATTC AAGTACGGTC ACCGGGAGGT GGAGATCTTT CTCGATGCCG TGCTGGCCAT CCAGGAGCAT ATTGAACCCC CGGGCCCCTT CGGATATAAA GAAGAAGAGA ATGAAGAAAA TGAGGATACC AGGCCGCACC GCCGGGAGAC GCCCTACGAC GACCTCTGGA TCCTGGACGG CAGACCGAAG GAACCCCCGC CGGAACGCAA CCGCAAAATA CCACCCCGGC CTACCAAGGA CATGGTCGGT TTTATCATGG CGAACAGCCC CGAACTGGAG GACTGGCAGC GGGAGGTCAT GGCCATGATT CGCGAGGAGA TGCAGTACTT CTGGCCCCAG ATGGAGACCA AGATCATCAA CGAGGGCTGG GCCGCTTACT GGCATGCCCG GATTATCCGG GAACTGGATC TGACCCCGGC CGAAACTGTT GATTTTGCCC GCCTGCACGC AAGCGTTCTT CAACCGGGCT ACCGGCAGAT CAACCCCTAC CTGGTTGGCA GCAAGATCTT TGAGGACATT GAGAAGCGCT GGGAAAACCC CAGCCAGGAG GAGCGGGAAC GCTACGGCCG TACGGGAGGA GAGGGCCGCT CCAAGATTTT TGAGGTCCGC TCCTGCGAGA ATGACATTTC CTTCCTGCGT AATTATCTGA CCAGGGAACT GGTCGAGGAA TTGGATCTGT ACCTCTACCA GAAGGTCGGC TCCGAATGGG TGGTGGTGGA AAAGGATTGG GAAAAGGTCC GGGACGGCCT GGTGAGCCGC CTGATTAATT GCGGTTACCC GTACATCGTT GTGGAGGATG CCGACTACCA GCGCCGGGGC GAACTCTACC TTAAACACCG CTACGAAGGC CTGGAACTGG ATGTCTCTTA CCTGGAAAAA ACCCTGCCCC ACGTCTACCT CCTCTGGGGC CGGCCCGTCC ACCTGGAGAC CATCATCGAC GGCAAAACAA CTGTTTTTAG CTATGATGGC AAGAAAAATT GCCGGCGCTA A
|
Protein sequence | MEQEFKTIAQ AIETIHDQAR KFGLDFFPVY FELCPADVLY AFGAYGMPTR FAHWTFGKHF YKMKLQYDFN LSRIYELVIN SNPCYAFLLE GNDLIQNKLV IAHVFAHSDF FKNNIYFTAT SRQMVETMAV HAAKIREYEF KYGHREVEIF LDAVLAIQEH IEPPGPFGYK EEENEENEDT RPHRRETPYD DLWILDGRPK EPPPERNRKI PPRPTKDMVG FIMANSPELE DWQREVMAMI REEMQYFWPQ METKIINEGW AAYWHARIIR ELDLTPAETV DFARLHASVL QPGYRQINPY LVGSKIFEDI EKRWENPSQE ERERYGRTGG EGRSKIFEVR SCENDISFLR NYLTRELVEE LDLYLYQKVG SEWVVVEKDW EKVRDGLVSR LINCGYPYIV VEDADYQRRG ELYLKHRYEG LELDVSYLEK TLPHVYLLWG RPVHLETIID GKTTVFSYDG KKNCRR
|
| |