Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0231 |
Symbol | |
ID | 3832559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 229959 |
End bp | 231398 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637828167 |
Product | radical SAM family protein |
Protein accession | YP_429109 |
Protein GI | 83589100 |
COG category | [R] General function prediction only |
COG ID | [COG0535] Predicted Fe-S oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.721774 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAATTGGG GACTAATAAA CCAAGCCAAA CCAGGGAAGG AGAGGATAGC CTTGATTAAC GCCCGTGCCC ATTTAGAGCT GGCCAAAAAG TATGTAACCG AAAAGGTCCT CCAGGAAGCC TTCAGCTATA TGGAAAAAAA CCCAGAGGAG AATTTTCCCC GCATCCTGAA TACCGCCCGG TTATTGGCCA GGGAGGAGGT ACATAAGCAA CAGATCGCCA AAGTGCTGGA GGCCTACCGG ACCAACCCCA GCATCCACGC CTACGTGAAT CGCCTCTTCA AAGTGCATCC TAATGTTAAA CAGCGCCTGA TCTACAACTG GTTCGTCAAC GCCATGCTCC TCGGTATACC TCGCCAGCAC CAGGTCTCCC AGGAAACCGG GGTTCATATA CCTAATTTCT TCCTTCTGGA CCCCACCAGC GACTGCAACC TGCGCTGTCA CGGCTGCTGG GCCGGGGAGT ATGCCCACCA CGACACCCTG GAACTGGATC TGGTGGACCG CCTCTGCCGC GAGGCCAAGG CGGTCGGCAT TTACTGGCTG GCCATGTCCG GCGGCGAGCC CTTCCGCTGG CCCCATCTCT TTGAACTGGC CGAGCGCCAT CCCGATATGG CCTTTATGCT CTATACCAAC GGCACGCTCA TCGATGACGC CGTGGCCGAC CGCATGGTGG AGGTCGGCAA CATCACGCCG GCCATTAGCC TGGAAGGCTG GCGGGAACGC ACCGACGCCC GCCGGGGCCG GGGTGTCTTT GACCGGGTAA TGGCCGCCAT GGACCGCCTG CGGGAGCGGG GTCTGGTCTT CGGGGTTTCC ATCACCATTA CCAGGGAAAA CGCGGAGGAG GTCACCAGCG ATGAGTTCAT CGACTTCCTG TTGGAGAAGG GTGTAGTCTA CGGCTGGAGT TTCCATTATA TACCCATCGG CCGGGATCCC AATCCCGAAC TCATGGTCAC TCCCGAGCAG CGGGCCTACC TGGCTGAGCG CATTCCCTAT ATTCGTAACC ACAAGGGGCT GCAGATTGCC GATTTCTGGA ATGACGGCGA GCTGACCCTG GGATGCATCG CCGGCGGCCG GCGCTACTTC CACATCACCG CCAGCGGGGC AGTGGAGCCC TGCGCCTTCA TTCACTTCTC CATGGACAAC ATCAAAGAGA AGAGCCTGCT GGAGGTTCTC CAGTCGCCCC TCTTCCGGGC CTATCAGCGC CGCCAGCCGT TTAGCGATAA CCTGCTCAGG CCCTGCCCCC TCATCGATGT CCCTGAGGGC CTGCGGCAGA TCGTAGCCGA AACCGGGGCT AAACCAACCC ACCCGGGCGC AGATACAGCC CTGAAAGGTT CTATCGGCGC CTATCTGGAC GCCAACGCCG CCCGCTGGGG CGAGGTGGCT GACAGGATCT GGCGGGAACG TCACCCGGAG CCCCAAAAAG AATTGACAGC CGGGAAGTAA
|
Protein sequence | MNWGLINQAK PGKERIALIN ARAHLELAKK YVTEKVLQEA FSYMEKNPEE NFPRILNTAR LLAREEVHKQ QIAKVLEAYR TNPSIHAYVN RLFKVHPNVK QRLIYNWFVN AMLLGIPRQH QVSQETGVHI PNFFLLDPTS DCNLRCHGCW AGEYAHHDTL ELDLVDRLCR EAKAVGIYWL AMSGGEPFRW PHLFELAERH PDMAFMLYTN GTLIDDAVAD RMVEVGNITP AISLEGWRER TDARRGRGVF DRVMAAMDRL RERGLVFGVS ITITRENAEE VTSDEFIDFL LEKGVVYGWS FHYIPIGRDP NPELMVTPEQ RAYLAERIPY IRNHKGLQIA DFWNDGELTL GCIAGGRRYF HITASGAVEP CAFIHFSMDN IKEKSLLEVL QSPLFRAYQR RQPFSDNLLR PCPLIDVPEG LRQIVAETGA KPTHPGADTA LKGSIGAYLD ANAARWGEVA DRIWRERHPE PQKELTAGK
|
| |