Gene Moth_0985 details


Gene Information

Locus tag: Moth_0985
Symbol:
ID: 3830861
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1010581
End bp: 1012455
Gene Length: 1875 bp
Protein Length: 624 aa
Translation table: 11
GC content: 55%
IMG OID: 637828914
Product: proton-translocating NADH-quinone oxidoreductase, chain L
Protein accession: YP_429843
Protein GI: 83589834
COG category: [C] Energy production and conversion; [P] Inorganic ion transport and metabolism
COG ID: [COG1009] NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit
TIGRFAM ID: [TIGR01974] proton-translocating NADH-quinone oxidoreductase, chain L
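
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that encodes no residue. The function names are illustrative and are not part of the record.

```python
# Values copied from the record above.
START_BP = 1010581
END_BP = 1012455


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS of gene_len bp that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the stop codon encodes no residue


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1875 624
```

Both results agree with the Gene Length (1875 bp) and Protein Length (624 aa) fields.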


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAATC ATGCCTGGTT GATACCGGTT TTTCCGGCCC TTGCTTTTCC CATAATTATT 
TTTCTGACCC GTAGAGTGCG CCATTTAAGC GCCCTGGTGG GCATCGCCGC CATCGGCGCC
AGCTTTGTCA TGGCTGTGGG GGTATTACGG GAAGTCCTGC TTAACGGGAT AACCATGTCC
CGGCCGGTGG AGTATGCCGC CACCTGGCTG GGGGTCCCCG GTCTTTTGAA GATTGAAGCA
GGCGTCCTGA TTGATCCCCT GGCAGCGGTA ATGCTCCTGG TAGTCACCCT GGTAGCCCTG
CTGGTGGAAA TTTATTCAGT GGGTTATATG CACGGCGACC CCGGATTTTC CACTTTTTTC
GGCTACCTGT CCCTGTTCAG CGCTTCAATG CTGGGACTGG TCCTGGCCAA TAACTACTTT
ATGATATTTT TCTTCTGGGA GCTTGTTGGA CTTTGTTCCT ATCTTTTAAT AGGTTTCTAT
TACCACAAGC AGTCGGCGGC CCGGGCCGGC TTGAAGGCCT TTGTTACCAA CAGGGTGGCT
GACTTCGGTT TCATGCTGGG CTTCTTTTTC CTCTTTGCCA TCTTTGGCAC CTTTAATTTC
CGGGAACTGG CGGCAGCCAT TCCCAGTTAC AAGAATACCG GCTTCCTGGC CCTGGCGGCG
GCCCTGGTGT TTATCGGTCC TATCGGCAAG TCGGCCCAGT TCCCTTTACA TGTCTGGTTG
CCGGACGCCA TGGAGGGTCC TACACCGGTT AGCGCCCTGA TCCATGCTGC GACAATGGTG
GCAGCCGGTG TCTATTTACT GGCGCGAGCC TTTGTCCTCT TTGCCAGCCT GCCGGGGATT
ATGCTCTTAG TTGCCTATGT AGGCGGTTTT ACGGCCCTTT TTGCGGCCAC CATAGCCATT
ACCCAGCGGG ACATCAAGCG CATCCTGGCC TATTCTACCA TCAGCCAGCT GGGATATATG
GTCATGGCCA TGGGGATCGG CAGTATGACG GCCGGAATGT TCCACCTCAT GACCCATGCT
TTCTTTAAAG CCCTCCTCTT TCTGGGAGCC GGAAGCGTTA TCCATGCCCT GGAAGAACAG
GATATTTTCC GCATGGGCGG TTTATATAAG GATATGAAGG TCACGGTAAG TACCTTTGTT
ATTGCCGCCC TGGCCCTGGC CGGGGTGCCG CCCCTGGCCG GTTTCTGGAG CAAGGACGAG
ATCCTCGCCG GTGCCTTTGA TCACGGGTTT ACCGGTCTCT ACATCATTGG TACATTGGTA
GCCTTCTTGA CAGCCTTCTA TATGTTCCGG CTGATCTTCG TGGCCTTTTT CGGCGACCGC
CGTGCCGGAC TCCATGCTCA CGAATCGCCG TTAACGATGA CGGTGCCCCT AGTCATCCTA
GCGGTACTTT CAGTGGTTTC CGGTTTTGTA GGGGCGCCCT TTGTGAGCCA CGGGTTCAGC
AGCTTTGTTT ATTATGGCGA ACCCCATCTG GTAGAACCAA ACTATGGGGT GATGCTGCTT
TCAACCATCG TAGCCTTGGC TGGCATCGGC CTGGCCTGGG TCCTTTACGG TCGTCCCAGT
GATGTGCCGG CAAGGCTGGC CGAACGCTAC CACAGCATCT ATAAGCTTCT GGTCAACAAG
TACTATATTG ATGAGGTCTA CCTGTGGCTT TTCCATCGTG TCGGCCTTGG GCTGGCCGAA
GCCTTTAACT GGAACGATCG CCATGTTGTT GATGGCGTCT TTGATGGTAT CGGCGATGTA
ACCCGGTTGT CGGGCCATAG ACTACGCTTG ATCCAGACGG GAAACCTCCA GACCTACGCC
TTGGTTATCT TTACGGCCGT GGTAATCATT GCCCTCTGGA TGGCAGCACC GGTGTTGGGA
GGGGTGATCC AGTGA
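
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch that assumes GC content means the percentage of G and C bases over the full sequence length; the record does not state how its value was derived or rounded. The helper names are illustrative.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases; upper-case."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage on the first line of the gene sequence only; paste the whole
# sequence to compare against the value reported in the record.
first_line = "ATGATCAATC ATGCCTGGTT GATACCGGTT TTTCCGGCCC TTGCTTTTCC CATAATTATT"
print(round(gc_content(first_line), 1))
```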
 
Protein sequence
MINHAWLIPV FPALAFPIII FLTRRVRHLS ALVGIAAIGA SFVMAVGVLR EVLLNGITMS 
RPVEYAATWL GVPGLLKIEA GVLIDPLAAV MLLVVTLVAL LVEIYSVGYM HGDPGFSTFF
GYLSLFSASM LGLVLANNYF MIFFFWELVG LCSYLLIGFY YHKQSAARAG LKAFVTNRVA
DFGFMLGFFF LFAIFGTFNF RELAAAIPSY KNTGFLALAA ALVFIGPIGK SAQFPLHVWL
PDAMEGPTPV SALIHAATMV AAGVYLLARA FVLFASLPGI MLLVAYVGGF TALFAATIAI
TQRDIKRILA YSTISQLGYM VMAMGIGSMT AGMFHLMTHA FFKALLFLGA GSVIHALEEQ
DIFRMGGLYK DMKVTVSTFV IAALALAGVP PLAGFWSKDE ILAGAFDHGF TGLYIIGTLV
AFLTAFYMFR LIFVAFFGDR RAGLHAHESP LTMTVPLVIL AVLSVVSGFV GAPFVSHGFS
SFVYYGEPHL VEPNYGVMLL STIVALAGIG LAWVLYGRPS DVPARLAERY HSIYKLLVNK
YYIDEVYLWL FHRVGLGLAE AFNWNDRHVV DGVFDGIGDV TRLSGHRLRL IQTGNLQTYA
LVIFTAVVII ALWMAAPVLG GVIQ
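
The protein sequence can be checked against the gene sequence by translating it. The sketch below builds the codon table from the standard NCBI amino acid string, whose codon-to-residue assignments translation table 11 shares with the standard code (table 11 differs only in which codons may act as start codons). It does not treat alternative start codons specially, which is sufficient here because the gene begins with ATG. The function name is illustrative.

```python
from itertools import product

# Amino acids in NCBI order: bases T, C, A, G, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a CDS, ignoring whitespace, up to the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGATCAATC ATGCCTGGTT GATACCGGTT"))  # MINHAWLIPV
```

Applied to the last line of the gene sequence (GGGGTGATCC AGTGA), the same function returns GVIQ, matching the end of the protein sequence.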