Gene Moth_2186 details


Gene Information

Locus tag: Moth_2186
Symbol:
ID: 3831656
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2283537
End bp: 2285261
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 55%
IMG OID: 637830108
Product: NADH dehydrogenase (ubiquinone), 30 kDa subunit
Protein accession: YP_431018
Protein GI: 83591009
COG category: [C] Energy production and conversion
COG ID: [COG3261] Ni,Fe-hydrogenase III large subunit
        [COG3262] Ni,Fe-hydrogenase III component G
TIGRFAM ID:
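
The start, end, and length fields in this record agree with one another. The sketch below reproduces the arithmetic under two assumptions that the record does not state: that the coordinates are 1-based and inclusive, and that the final codon is a stop codon excluded from the protein length (the gene sequence listed in this record does end in TAG).

```python
# Values copied from the record.
start_bp = 2283537
end_bp = 2285261

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1725 574
```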


Plasmid Coverage information

Num covering plasmid clones: 54
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.418476
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACAGG GTTCTGAGTC CAAGCGGGCA CAAAAATATA TAAAAGATTT GCGGTCGAAA 
TTCCCGGGGG CGATCCTGGA GGAAAGCTCC CAGGCTCCAG ACCAGATTAC TGTGACAGTG
AAACTTAACG ACCTTCCCGG CATCGTGGAA GAGCTATATT ATCGTCATAA TGGCTGGCTA
TCCACCATGC TCGGTAACGA CGAGCGCGAG CTTAACGGTT GTTTCGCCCT CTATTATGTC
TTGTCCATGG AAGGGAACAG CAACAAAAGG GAGAACACCT GGATAACCGT CAAAGCCCTG
GTACCCGAGA AAGAGCCGCA ATTTCCATCG GTAACTCCCC GGGTGCCGGC GGCGGTGTGG
TATGAGCGGG AAGTGCGGGA CATGTTCGGC CTGGAACCGG TGGGAATCCC AGATGCGCGG
AGGCTGGTCT TGCCTGATGA TTGGCCGGAC AACTTGCATC CGTTGCGAAA AGACGCCATG
GATTACCGGT ACCGCCCGGA ACCTGTTGAG GAGAACTACA ATTTTATCGA AGTCGAAGGG
GAGGGAATTG TGGAAGTTCC ACTGGGACCC CTCCACATCA CCTCCGATGA ACCCGGACAT
TTCCGCCTCT TTGTTGACGG GGAGTACATC GTGGATGCTG ACTACCGGCT CTTTTATGTA
CACCGGGGTA TGGAGAAACT GGCCGAAAAC CGTATGGATT ACGACCAAAT TACCTTCCTG
GCCGAGCGGA TTTGCGGCAT CTGCGGCTAC GCCCACAGTG TGGCGTATGC TGCGGCGGTG
GAAACGGCCA ACGGAATTGA GGTGCCGCCG CGGGCTCAGT ATATCCGCAC CATTTTGCTA
GAAGTGGAGA GGTTGCACAG CCACCTCTTG AATCTGGGGC TGGCGGCCCA TCTGGTGGGT
TTCGATTCGG GTTTCATGCA TTTCTTCCGG GTACGGGAGA AGGCCATGCA GATGGCCGAG
ATTCTTACCG GGGGCCGGAA AACCTATGGA ATAAACCTGA TTGGCGGCGT TCGCCGGGAT
ATTTTCAAAG AAGAGCGGGA CCAGGTCTTA CGGCTGATTG CGGAGATCCG GACCGAACTG
GATGAACTAC TTGATATCCT GATAAACACC CCCAACTTCA TCTCGCGGAC CCAGGGGGTA
GGCCGCCTGG AACGCCAGGT AGCCCGTGAC TTCAGCCCGG TTGGTCCTAA TATGCGGGGT
TCCGGGTACG CCCGTGATAC CAGAGCGGAC CACCCCTACG GCGCTTACGA CCGGGTATCC
TGGGAGGTCA TTTCCAAGGA CGGTTGCGAC GTACTTTCCA GGGAACTGGT CCGGGCAGCA
GAGCTGTATG AGTCCTTCAA TATAATCGAG AGGTGCTTGA CCGAAATGCC CCCCGGCCCG
GTTCTCACGG AAGGGTTTGC GTATAAACCG CATACCTTCG CGTTGGGATA TACTGAAGCC
CCGAGGGGAG AAAATGTCCA CTGGCTGATG ACGGGCAACA ACCAGAAACT TTATCGCTGG
CGGGTACGGG CTGCCACCTA CAATAACTGG CCGGCGTTAC GGTATATGTT TAGAGGCAAC
ACCGTGTCTG ATGCCCCCCT GATTGTGGCC AGCATCGATC CCTGCTATTC CTGCACCGAA
AGGGTCACTA TGGTCGATGT GCGTAAAAAG AAGGCGAGAA CGATTGAATA CAAAGAACTG
GAGCGTTACT GCCGGGAAAG AAAATACTCG CCGCTTAAAT TTTAG
 
Protein sequence
MGQGSESKRA QKYIKDLRSK FPGAILEESS QAPDQITVTV KLNDLPGIVE ELYYRHNGWL 
STMLGNDERE LNGCFALYYV LSMEGNSNKR ENTWITVKAL VPEKEPQFPS VTPRVPAAVW
YEREVRDMFG LEPVGIPDAR RLVLPDDWPD NLHPLRKDAM DYRYRPEPVE ENYNFIEVEG
EGIVEVPLGP LHITSDEPGH FRLFVDGEYI VDADYRLFYV HRGMEKLAEN RMDYDQITFL
AERICGICGY AHSVAYAAAV ETANGIEVPP RAQYIRTILL EVERLHSHLL NLGLAAHLVG
FDSGFMHFFR VREKAMQMAE ILTGGRKTYG INLIGGVRRD IFKEERDQVL RLIAEIRTEL
DELLDILINT PNFISRTQGV GRLERQVARD FSPVGPNMRG SGYARDTRAD HPYGAYDRVS
WEVISKDGCD VLSRELVRAA ELYESFNIIE RCLTEMPPGP VLTEGFAYKP HTFALGYTEA
PRGENVHWLM TGNNQKLYRW RVRAATYNNW PALRYMFRGN TVSDAPLIVA SIDPCYSCTE
RVTMVDVRKK KARTIEYKEL ERYCRERKYS PLKF
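
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that spacing, translate the DNA, and compute a GC percentage. It is an illustration, not the database's own pipeline. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. The codon table is the standard one, which translation table 11 shares for amino-acid assignments (table 11 differs in which start codons it permits).

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGGGACAGG GTTCTGAGTC CAAGCGGGCA CAAAAATATA TAAAAGATTT GCGGTCGAAA"
print(translate(first_line))  # prints: MGQGSESKRAQKYIKDLRSK
```

The printed residues match the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same check against the Protein Length and GC content fields.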