Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2186 |
Symbol | |
ID | 3831656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2283537 |
End bp | 2285261 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637830108 |
Product | NADH dehydrogenase (ubiquinone), 30 kDa subunit |
Protein accession | YP_431018 |
Protein GI | 83591009 |
COG category | [C] Energy production and conversion |
COG ID | [COG3261] Ni,Fe-hydrogenase III large subunit [COG3262] Ni,Fe-hydrogenase III component G |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 54 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.418476 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACAGG GTTCTGAGTC CAAGCGGGCA CAAAAATATA TAAAAGATTT GCGGTCGAAA TTCCCGGGGG CGATCCTGGA GGAAAGCTCC CAGGCTCCAG ACCAGATTAC TGTGACAGTG AAACTTAACG ACCTTCCCGG CATCGTGGAA GAGCTATATT ATCGTCATAA TGGCTGGCTA TCCACCATGC TCGGTAACGA CGAGCGCGAG CTTAACGGTT GTTTCGCCCT CTATTATGTC TTGTCCATGG AAGGGAACAG CAACAAAAGG GAGAACACCT GGATAACCGT CAAAGCCCTG GTACCCGAGA AAGAGCCGCA ATTTCCATCG GTAACTCCCC GGGTGCCGGC GGCGGTGTGG TATGAGCGGG AAGTGCGGGA CATGTTCGGC CTGGAACCGG TGGGAATCCC AGATGCGCGG AGGCTGGTCT TGCCTGATGA TTGGCCGGAC AACTTGCATC CGTTGCGAAA AGACGCCATG GATTACCGGT ACCGCCCGGA ACCTGTTGAG GAGAACTACA ATTTTATCGA AGTCGAAGGG GAGGGAATTG TGGAAGTTCC ACTGGGACCC CTCCACATCA CCTCCGATGA ACCCGGACAT TTCCGCCTCT TTGTTGACGG GGAGTACATC GTGGATGCTG ACTACCGGCT CTTTTATGTA CACCGGGGTA TGGAGAAACT GGCCGAAAAC CGTATGGATT ACGACCAAAT TACCTTCCTG GCCGAGCGGA TTTGCGGCAT CTGCGGCTAC GCCCACAGTG TGGCGTATGC TGCGGCGGTG GAAACGGCCA ACGGAATTGA GGTGCCGCCG CGGGCTCAGT ATATCCGCAC CATTTTGCTA GAAGTGGAGA GGTTGCACAG CCACCTCTTG AATCTGGGGC TGGCGGCCCA TCTGGTGGGT TTCGATTCGG GTTTCATGCA TTTCTTCCGG GTACGGGAGA AGGCCATGCA GATGGCCGAG ATTCTTACCG GGGGCCGGAA AACCTATGGA ATAAACCTGA TTGGCGGCGT TCGCCGGGAT ATTTTCAAAG AAGAGCGGGA CCAGGTCTTA CGGCTGATTG CGGAGATCCG GACCGAACTG GATGAACTAC TTGATATCCT GATAAACACC CCCAACTTCA TCTCGCGGAC CCAGGGGGTA GGCCGCCTGG AACGCCAGGT AGCCCGTGAC TTCAGCCCGG TTGGTCCTAA TATGCGGGGT TCCGGGTACG CCCGTGATAC CAGAGCGGAC CACCCCTACG GCGCTTACGA CCGGGTATCC TGGGAGGTCA TTTCCAAGGA CGGTTGCGAC GTACTTTCCA GGGAACTGGT CCGGGCAGCA GAGCTGTATG AGTCCTTCAA TATAATCGAG AGGTGCTTGA CCGAAATGCC CCCCGGCCCG GTTCTCACGG AAGGGTTTGC GTATAAACCG CATACCTTCG CGTTGGGATA TACTGAAGCC CCGAGGGGAG AAAATGTCCA CTGGCTGATG ACGGGCAACA ACCAGAAACT TTATCGCTGG CGGGTACGGG CTGCCACCTA CAATAACTGG CCGGCGTTAC GGTATATGTT TAGAGGCAAC ACCGTGTCTG ATGCCCCCCT GATTGTGGCC AGCATCGATC CCTGCTATTC CTGCACCGAA AGGGTCACTA TGGTCGATGT GCGTAAAAAG AAGGCGAGAA CGATTGAATA CAAAGAACTG GAGCGTTACT GCCGGGAAAG AAAATACTCG CCGCTTAAAT TTTAG
|
Protein sequence | MGQGSESKRA QKYIKDLRSK FPGAILEESS QAPDQITVTV KLNDLPGIVE ELYYRHNGWL STMLGNDERE LNGCFALYYV LSMEGNSNKR ENTWITVKAL VPEKEPQFPS VTPRVPAAVW YEREVRDMFG LEPVGIPDAR RLVLPDDWPD NLHPLRKDAM DYRYRPEPVE ENYNFIEVEG EGIVEVPLGP LHITSDEPGH FRLFVDGEYI VDADYRLFYV HRGMEKLAEN RMDYDQITFL AERICGICGY AHSVAYAAAV ETANGIEVPP RAQYIRTILL EVERLHSHLL NLGLAAHLVG FDSGFMHFFR VREKAMQMAE ILTGGRKTYG INLIGGVRRD IFKEERDQVL RLIAEIRTEL DELLDILINT PNFISRTQGV GRLERQVARD FSPVGPNMRG SGYARDTRAD HPYGAYDRVS WEVISKDGCD VLSRELVRAA ELYESFNIIE RCLTEMPPGP VLTEGFAYKP HTFALGYTEA PRGENVHWLM TGNNQKLYRW RVRAATYNNW PALRYMFRGN TVSDAPLIVA SIDPCYSCTE RVTMVDVRKK KARTIEYKEL ERYCRERKYS PLKF
|
| |