Gene Moth_1887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1887 
Symbol 
ID3831232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1949715 
End bp1951493 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content62% 
IMG OID637829820 
ProductNADH dehydrogenase (quinone) 
Protein accessionYP_430730 
Protein GI83590721 
COG category[C] Energy production and conversion 
COG ID[COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit
[COG3411] Ferredoxin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCGTA TACTAGTCTG TGCCGGGACC GGGTGTGTCG CCTCCCACTC TCGCCAGGTT 
ACCGCCCGGC TGAAGGCCGC CTTGCACGCG CATCACCTGG AAGAAAGGTT CCAGGTGGAT
AATACAGGCT GCCACGGTTT CTGCGAGCAG GGGCCTCTGG TGATTATCGA GCCGGAGGGC
ATCCTCTACT GCCGGGTGCG GGAGGAAGAC GTGGAAGCCA TCGTCACCGA ACACCTGGAG
CAGGGTCGCC TCGTTGAACG CCTCCTCTAC CAGGACCCGG TGACGAAGGA AAAGATAGCC
GCCTATAACC AGATTAAGTT CTACCGGCAG CAGTCGCGCC ATGTCCTGAA AAACTGCGGT
CATATCAATC CGGAAAATAT CGACGCCTAC CTGGCCGTCG AAGGCTACCA GGGGTTAAAA
AAGGCCCTCG CCCTGCCCCG GGAAGAAGTC ATCAATATAA TCAAGGAGTC TGGTTTAAGG
GGGCGCGGCG GTGCTGGCTT CCCCACCGGG CTGAAGTGGG AGTACACCTT TAAAGCCCCC
GGCGACCGGA AGTATGTAGT CTGTAACGCC GATGAGGGCG ACCCCGGCGC CTTTATGGAC
CGCAGCGTCC TGGAAGGCGA CCCCCACGCC GTCCTCGAAG GCATGCTCAT CGCCGCCTAC
GCCATCGGCG CCCGGGAGGG TTATATCTAC GTCCGGGCCG AGTATCCCCT GGCCGTGCAG
CGGTTGCGGA TCGCCCTGGC CCAGGCCCGG GAAAGGGGTT TTTGTGGCGA GCGTATCCTG
GGAACCGATT TTAGCTGCGA ACTCTACATC CGGGAAGGGG CCGGGGCCTT CGTCTGCGGG
GAAGAAACAG CCCTCCTGGC CTCCATCCAG GGGGAGAGGG GTATGCCCCG GCCACGGCCG
CCCTTCCCCG CCCGGCAGGG CCTCTGGGGC CAGCCCACCA ACATTAACAA CGTGGAAACC
TATGCCAACG TGCCGTTGAT CTTACGCCGG GGTGCCGGCT GGTATGCTTC CCTGGGTACG
GAGAAAAGCA AGGGCACCAA GATATTCGCC CTGACGGGGA AAGTCAAAAA CACCGGCCTG
GTTGAGGTCC CCATGGGCAT CACCCTGAGG GAGATTATCT TTAACATCGG CGGCGGCATC
CTGGAGGACC GGGGGTTCAA AGCAGTCCAG ATCGGTGGTC CTTCCGGCGG GTGTTTGCCG
GCCGAACACC TGGATCTCCC GGTGGACTAC GATTCCCTTA CCGCGGCCGG GGCCATGATG
GGTTCCGGCG GCCTGGTAGT AATGGACGAT AGTACCTGTA TGGTTGAAGT AGCCCGCTTC
TTCCTCAATT TTACCCAGGC GGAATCCTGC GGTAAATGTA CACCCTGCCG GGAGGGCATC
CAGCAGATGC TGGCCATCCT CACCCGCATC ACCAGGGGGC AGGGCCGGGA GGGCGACCTC
GAGCAACTTG AGCGTCTGGC CCGGGTTATC AAGGGTACGG CCCTTTGCGG CCTGGGGCAG
ACGGCGCCCA ACCCGGTCCT GTCCACCCTG CGCTATTTCC GCGCCGAATA TGAAGCCCAC
ATCCGGGACC ACAGGTGCCC GGCGAAAAGC TGCCGGGAAC TCCTTACCTA CCACATCGAC
CCTGATAAAT GCAACGGTTG CACCCGTTGC CGGCGCCGCT GCCCGGCGGG TGCCATCAGC
GGCGAGGCCA GGGAACCCCA TACCATTGAC CTGGAACTGT GTGCCCGCTG CGGTACCTGC
CTGGATCTAT GCCGCCAGAA AGCTATTTAT GTTGAGTAG
 
Protein sequence
MGRILVCAGT GCVASHSRQV TARLKAALHA HHLEERFQVD NTGCHGFCEQ GPLVIIEPEG 
ILYCRVREED VEAIVTEHLE QGRLVERLLY QDPVTKEKIA AYNQIKFYRQ QSRHVLKNCG
HINPENIDAY LAVEGYQGLK KALALPREEV INIIKESGLR GRGGAGFPTG LKWEYTFKAP
GDRKYVVCNA DEGDPGAFMD RSVLEGDPHA VLEGMLIAAY AIGAREGYIY VRAEYPLAVQ
RLRIALAQAR ERGFCGERIL GTDFSCELYI REGAGAFVCG EETALLASIQ GERGMPRPRP
PFPARQGLWG QPTNINNVET YANVPLILRR GAGWYASLGT EKSKGTKIFA LTGKVKNTGL
VEVPMGITLR EIIFNIGGGI LEDRGFKAVQ IGGPSGGCLP AEHLDLPVDY DSLTAAGAMM
GSGGLVVMDD STCMVEVARF FLNFTQAESC GKCTPCREGI QQMLAILTRI TRGQGREGDL
EQLERLARVI KGTALCGLGQ TAPNPVLSTL RYFRAEYEAH IRDHRCPAKS CRELLTYHID
PDKCNGCTRC RRRCPAGAIS GEAREPHTID LELCARCGTC LDLCRQKAIY VE