Gene Information |
Locus tag | Moth_0980 |
Symbol | |
ID | 3830856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1007151 |
End bp | 1008266 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637828909 |
Product | NADH dehydrogenase subunit D |
Protein accession | YP_429838 |
Protein GI | 83589829 |
COG category | [C] Energy production and conversion |
COG ID | [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 |
TIGRFAM ID | [TIGR01962] NADH dehydrogenase I, D subunit |
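The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace in the sequence is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Values taken from the record for Moth_0980.
length_bp = gene_length(1007151, 1008266)   # 1116, matches "Gene Length"
length_aa = protein_length(length_bp)       # 371, matches "Protein Length"
print(length_bp, length_aa)
```

Passing the gene sequence from the Sequence section to `gc_percent` gives a value that can be compared with the reported GC content, which the record rounds to a whole percent.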
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGCTG CCGCGGAGAA TTTACGAACG GAAGAGATTC AACTTAATAT GGGTCCCCAG CATCCCAGCA CCCATGGTGT TTACCGGGCC CTTTTAACCC TTGATGGTGA AAAGGTCGTC GGGGTAGAGA ATATTATCGG TTACCTGCAC CGGGGTATCG AGAAACTGGC TGAGGATCGC ACTTATACCC AGGTTATTCC CTATACCGAT CGCCTGGATT ACCTGGCCGG GATGCTGAAC AATCTGGGTT ATGTCCAGAC GGTGGAAAAG CTCCTGGGGC TCGAAGTCCC GGAGCGGGCC GAGTATTTGC GGGTAATTAT GGCCGAACTC TCACGGCTGG CCAGCCATAT GGTCATGGTA GCCTCCATGG CCCTTGATCT CTCCGGTTGG ACGGCCTGGT TCCCCCCCTT CCGGGAACGT GAGCGGATCC TGGATCTTTT TGAGATGACG TGCGGTTCCA GATTGACGGT CAGTTATATG CGCATTGGCG GTGTAGCTGC CGACATACCG CCCGGTTTTC TGCCGGCCCT GGAGAGCTTT TTAAACGACC TGCCCCGGAT GATTGCCGAA ATGAACGGGC TGATTACCGG GAATGAGATC TTTAAGGCCC GCTGCCAGGG GGTGGGTAAA ATTGACCTGG AAACGGCCCT GGCCTATGGC ATCACCGGGC CTAACCTGCG GGCCTGCGGA TTGCCCTTTG ACCTGCGAAA AGCGCGGCCC TATAGTATCT ATGATCGCTT TGATTTTGAT ATCCCCACCC TGAATAACGG GGATAGTTAC GACCGGTTTG TTATTCGCCT GCTAGAAATG GAACAGAGTG CCCGAATTAT CCGCCAGGCA ATGGAGCAAC TCCCCGATGG TCCGGTGCAG GCCAAGGTGC CGCGGGTGCT TAAACTTCCC CGGGGCGAGG TCTACCACCA GATCGAGGGT GCCAAGGGTA TCCTGGGCTT CTACCTGGTC AGCGATGGTG GCAGCAAACC CTACCGTCTC CATATCCACA GCCCCTCGTT CGTTAACTTG GGAGCCCTGC CCAGGATTTC CGTCGGTGGT ACTATCCAGG ACTTCGTCGT CAATATCGCT TCCATTGACA TCGTCCTGGG CGAGGTTGAT CGTTAA
|
Protein sequence | MAAAAENLRT EEIQLNMGPQ HPSTHGVYRA LLTLDGEKVV GVENIIGYLH RGIEKLAEDR TYTQVIPYTD RLDYLAGMLN NLGYVQTVEK LLGLEVPERA EYLRVIMAEL SRLASHMVMV ASMALDLSGW TAWFPPFRER ERILDLFEMT CGSRLTVSYM RIGGVAADIP PGFLPALESF LNDLPRMIAE MNGLITGNEI FKARCQGVGK IDLETALAYG ITGPNLRACG LPFDLRKARP YSIYDRFDFD IPTLNNGDSY DRFVIRLLEM EQSARIIRQA MEQLPDGPVQ AKVPRVLKLP RGEVYHQIEG AKGILGFYLV SDGGSKPYRL HIHSPSFVNL GALPRISVGG TIQDFVVNIA SIDIVLGEVD R
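The relationship between the two sequences above can be illustrated with a minimal translation sketch. Translation table 11 (bacterial, archaeal and plant plastid code) assigns the same amino acids to codons as the standard code and differs mainly in which codons may act as starts. Because this gene begins with ATG, the sketch does not treat alternative start codons specially. The names `CODON_TABLE` and `translate_cds` are hypothetical, and the function assumes an unspliced, in-frame sequence that contains only A, C, G and T.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is removed
    first. Any trailing partial codon is ignored.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate_cds("ATGGCTGCTG CCGCGGAGAA TTTACGAACG"))  # MAAAAENLRT
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene (GAT CGT TAA) translate to the closing residues D and R followed by a stop.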