Gene Moth_0980 details


Gene Information

Locus tag: Moth_0980
Symbol:
ID: 3830856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1007151
End bp: 1008266
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 56%
IMG OID: 637828909
Product: NADH dehydrogenase subunit D
Protein accession: YP_429838
Protein GI: 83589829
COG category: [C] Energy production and conversion
COG ID: [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7
TIGRFAM ID: [TIGR01962] NADH dehydrogenase I, D subunit
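
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1007151, 1008266)
print(length, protein_length(length))  # 1116 371
```

Both results agree with the Gene Length and Protein Length fields of this record.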


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGCTG CCGCGGAGAA TTTACGAACG GAAGAGATTC AACTTAATAT GGGTCCCCAG 
CATCCCAGCA CCCATGGTGT TTACCGGGCC CTTTTAACCC TTGATGGTGA AAAGGTCGTC
GGGGTAGAGA ATATTATCGG TTACCTGCAC CGGGGTATCG AGAAACTGGC TGAGGATCGC
ACTTATACCC AGGTTATTCC CTATACCGAT CGCCTGGATT ACCTGGCCGG GATGCTGAAC
AATCTGGGTT ATGTCCAGAC GGTGGAAAAG CTCCTGGGGC TCGAAGTCCC GGAGCGGGCC
GAGTATTTGC GGGTAATTAT GGCCGAACTC TCACGGCTGG CCAGCCATAT GGTCATGGTA
GCCTCCATGG CCCTTGATCT CTCCGGTTGG ACGGCCTGGT TCCCCCCCTT CCGGGAACGT
GAGCGGATCC TGGATCTTTT TGAGATGACG TGCGGTTCCA GATTGACGGT CAGTTATATG
CGCATTGGCG GTGTAGCTGC CGACATACCG CCCGGTTTTC TGCCGGCCCT GGAGAGCTTT
TTAAACGACC TGCCCCGGAT GATTGCCGAA ATGAACGGGC TGATTACCGG GAATGAGATC
TTTAAGGCCC GCTGCCAGGG GGTGGGTAAA ATTGACCTGG AAACGGCCCT GGCCTATGGC
ATCACCGGGC CTAACCTGCG GGCCTGCGGA TTGCCCTTTG ACCTGCGAAA AGCGCGGCCC
TATAGTATCT ATGATCGCTT TGATTTTGAT ATCCCCACCC TGAATAACGG GGATAGTTAC
GACCGGTTTG TTATTCGCCT GCTAGAAATG GAACAGAGTG CCCGAATTAT CCGCCAGGCA
ATGGAGCAAC TCCCCGATGG TCCGGTGCAG GCCAAGGTGC CGCGGGTGCT TAAACTTCCC
CGGGGCGAGG TCTACCACCA GATCGAGGGT GCCAAGGGTA TCCTGGGCTT CTACCTGGTC
AGCGATGGTG GCAGCAAACC CTACCGTCTC CATATCCACA GCCCCTCGTT CGTTAACTTG
GGAGCCCTGC CCAGGATTTC CGTCGGTGGT ACTATCCAGG ACTTCGTCGT CAATATCGCT
TCCATTGACA TCGTCCTGGG CGAGGTTGAT CGTTAA
 
Protein sequence
MAAAAENLRT EEIQLNMGPQ HPSTHGVYRA LLTLDGEKVV GVENIIGYLH RGIEKLAEDR 
TYTQVIPYTD RLDYLAGMLN NLGYVQTVEK LLGLEVPERA EYLRVIMAEL SRLASHMVMV
ASMALDLSGW TAWFPPFRER ERILDLFEMT CGSRLTVSYM RIGGVAADIP PGFLPALESF
LNDLPRMIAE MNGLITGNEI FKARCQGVGK IDLETALAYG ITGPNLRACG LPFDLRKARP
YSIYDRFDFD IPTLNNGDSY DRFVIRLLEM EQSARIIRQA MEQLPDGPVQ AKVPRVLKLP
RGEVYHQIEG AKGILGFYLV SDGGSKPYRL HIHSPSFVNL GALPRISVGG TIQDFVVNIA
SIDIVLGEVD R