Gene Moth_0986 details

Gene Information

Locus tag: Moth_0986
Symbol:
ID: 3830862
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1012452
End bp: 1013981
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 54%
IMG OID: 637828915
Product: proton-translocating NADH-quinone oxidoreductase, chain M
Protein accession: YP_429844
Protein GI: 83589835
COG category: [C] Energy production and conversion
COG ID: [COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M)
TIGRFAM ID: [TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M
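
The start and end coordinates, the gene length, and the protein length listed above are consistent with one another, which a few lines of arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1012452
end_bp = 1013981

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1530
print(protein_length_aa)  # 509
```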


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.979156
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGAACTTTC CGATTTTGAC AGCTATTATG CTGGCCCCGG TAGCCGGGCT GCTGCTCATC 
CTGCTGATCC CGGAAAGGGA GCAGCTGACC ATTAAAATCA CGGCTGCTGC CGCCACCTTT
GTTTCCCTGG TGCTGGCAAT CCTGGCTTAT GTCCAGTATG ATCATGCCCG GGGAGGCTTG
CAGTTCCTCC AGGATATCCC CTGGGTACCG GGATTCGGCA TCAATTACTC CGTTGGCGTC
GACGGCATCA GTATGCCCCT GGTCCTGCTG ACGGCCATTG TTATCTTTAC CGGTGTCTTC
GCCTCCTGGG ATATGACCCA GCGGGTGAAG GAATTCTTTA TCTTCCTGTT GATGCTGGTG
ACCGGCGTCT TCGGTGTCTT CATCAGCCGC GACCTCTTTT TCTTCTACCT CTTCTTTGAG
GTGGCCGTTA TCCCCATGTA CCTCCTCATT GGTATTTGGG GCAGCACCCG GAAGGAATAC
GCGGCTATGA AGCTGACCCT TTACCTCCTG GTCGGCAGCG CCTTTGCCCT GATTGGTATT
ATCGCCCTGT TCCTTTATGC GTCCCAGCAA CTGGGGTATG CTACCTTTGA CATCCAGACC
CTGGCCACGG TCAAGTACGA CCTGGGGTTC CAAAAGTTTA TCTTCTTCCT GATGCTCATC
GGTTTTGGCG TGCTGGTGCC CATCTGGCCC CTGCACCTGT GGTCTCCCGA CGGCCATGTG
GCGGCACCAA CGGCCGTCAG TATGCTCCAC GCCGGCGTTT TAATGAAACT AGGTGCTTAT
GGCCTTATTC GCGCCGGAGT ATTTCTGTTC CCCGAAGGAG CTAAATTTTG GGCGCCTTTG
ATTGCCGTTC TGTGTATTGT TAACGTGGTT TATGGTGCCA TGATCGCCAT GGTCCAGAGG
GATTTGAAAT TCGTTATCGG CTACAGCAGT GTGAGCCACA TGGGCTATGT TCTCCTGGGA
ATAGCTTCTT TGAATATCCT GAGCCTGGAC GGGGCTGTAT CCCAGATGTT TGCTCACGGT
ATTATGACGG CCCTCTTCTT TGCACTAGTG GGTAATATCT ATCATAAAGC CCACACCCGG
GAGATTGCCC GCTTCGGCGG CCTTGCCCAC CAGATGCCGC GGGTGGCGGC CGGTTTCCTA
ATTGGCGGCC TTGCCTCCCT GGGGCTACCC GGCCTTAATA ACTTTGTGGC CGAGTTTCTC
ATCTTCATAG GCTCCTTTAC CCGCGACCAG GCCCTGTTCG GTGGTATCCT GCCCTTCCGG
ATCCTCTCGA TCCTGGCCAT CTCCGGTATT GTCATTACCG CCACCTATAT TCTCCGGGTA
GTCATGAAGA CTTTCTTCGG ACCCAGGAAA CCGGAATGGG ATCACCTGGA GGATGCCCGC
GGGGTGGAAA TGGTGCCCGT TGTCGTCTTG ATTGCAACTT TGCTGCTCTT TGGCCTCTTA
CCCTCCTTGC AAATTGATAT GATCAATAGC GGCATAACCC CGCTGGTAGC CAAAGTTCAA
GCGGCGAAGG CGATTGGGGG TATCTTTTAA
 
Protein sequence
MNFPILTAIM LAPVAGLLLI LLIPEREQLT IKITAAAATF VSLVLAILAY VQYDHARGGL 
QFLQDIPWVP GFGINYSVGV DGISMPLVLL TAIVIFTGVF ASWDMTQRVK EFFIFLLMLV
TGVFGVFISR DLFFFYLFFE VAVIPMYLLI GIWGSTRKEY AAMKLTLYLL VGSAFALIGI
IALFLYASQQ LGYATFDIQT LATVKYDLGF QKFIFFLMLI GFGVLVPIWP LHLWSPDGHV
AAPTAVSMLH AGVLMKLGAY GLIRAGVFLF PEGAKFWAPL IAVLCIVNVV YGAMIAMVQR
DLKFVIGYSS VSHMGYVLLG IASLNILSLD GAVSQMFAHG IMTALFFALV GNIYHKAHTR
EIARFGGLAH QMPRVAAGFL IGGLASLGLP GLNNFVAEFL IFIGSFTRDQ ALFGGILPFR
ILSILAISGI VITATYILRV VMKTFFGPRK PEWDHLEDAR GVEMVPVVVL IATLLLFGLL
PSLQIDMINS GITPLVAKVQ AAKAIGGIF
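
The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11, where GTG is one of the permitted initiation codons and is read as methionine when it starts a coding sequence. The following is a minimal sketch of that translation and of a GC-content calculation. The helper names (`clean`, `translate_cds`, `gc_percent`) are hypothetical, and the `START_CODONS` set lists only three of the initiation codons of table 11, which is an assumption made to keep the example short.

```python
from itertools import product

# Codon table for NCBI translation table 11, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the table 11 initiation codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        # An initiation codon in first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            amino_acid = "M"
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence shown above.
first_row = "GTGAACTTTC CGATTTTGAC AGCTATTATG CTGGCCCCGG TAGCCGGGCT GCTGCTCATC"
last_row = "GCGGCGAAGG CGATTGGGGG TATCTTTTAA"

print(translate_cds(first_row))  # MNFPILTAIMLAPVAGLLLI
print(translate_cds(last_row))   # AAKAIGGIF
```

The two printed fragments correspond to the first 20 and last 9 residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives the value that the GC content field reports in rounded form.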