Gene Moth_2300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2300 
Symbol 
ID3831332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2415123 
End bp2416850 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content64% 
IMG OID637830220 
Productaldehyde ferredoxin oxidoreductase 
Protein accessionYP_431130 
Protein GI83591121 
COG category[C] Energy production and conversion 
COG ID[COG2414] Aldehyde:ferredoxin oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.189946 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCGGC AAATTTTACG GGTAAGTTTA AGCGACGGCA ATATCCGCCA GGAAGAAATC 
CCGGCACGGG TCTGCCGGGA CTTTATCGGC GGCCGCCCCC TGGCAGCACG TTACCTTTAC
GGCGAAGGGA CCGCCGGGGT AGACCCTCTG GGCCCGGAGA ATATTTTAAT CTTTGTTGCC
GGGCCCCTGG CCGGCAGCGG GGCCAGGGGA GTCAATCGCT GGCTGGTGGT AACAAAAAGC
CCTTTGAGTG GCGGCTTTTT CCGCTCGGTG GGCGGGAGCG ATTTCGGCGC CTGGCTCCAC
GGCGCGGGGT ACGAGATGCT GTTCCTGACG GGCCGCGCGG AGGAACCGGT GTACCTCTAC
CTGGGTCCCC GGGGACGGGT GGAGTTGCGG CCGGCCCGGG ACCTCTGGGG CAAGGGTACA
GGGGATACCC GGGCCGAACT GCGCCGACGC CACGGCCAGG GGGCCCGGGT GGCCTGTATC
GGCCCGGCCG GGGAAAATCT GGTGCGTTAC GCCAGCATTA TTAGCGACCA TAGCACGGCG
GCCCGCGGGG GCGTGGGCAC CGTCATGGGG GCGAAAAGGC TCAAGGCCCT GGTGGTGGAA
GCAGAACCCG ATGTTCCGGT AGCCAGGCAA GACCTCTGGT CGCGCATCCT GGTCCGTCAG
GCGGAATTGG TCGAACGCCG GGACCACCGG GCGCCTTTGC CGGGAGCCCT GAAGGGGGCT
ATTCTACCTA CCCGTAACTT CCAGGGGGCC TGGAAAGCGG ATATCGACCC GGTCGATCTG
CCGCGGTTGC TGGAACGCAA GGGGCCGGAG TTCCAGACTT ACTGGGCCCT GGGGGCCAAC
CCGGGCCACC TGGATCCCGA CCTGGTGACC ATGGCCAATG AACGCTGTAA CGACCTGGGC
CTGGATACGG TCTCGACCGG CGGCAGCATA GCCTTCGCCT ACGAGCTCCG GCAGAGGGGG
TTGCTGCAAG CGGAGGGCTT CGACCTGGGC TGGGAGCGAC CGGAGGCCAC TATGGAGCTA
ATTCGCATGA TAGCCTACCG GGCCGGTATC GGCGATCTTC TCGCGGAAGG CGTGGACCGT
ATGGCGGCAT ATATTGGTTC CGGTGCCCGG GAATACGCCA TGACCATCAA GGGTATTGAG
ATGCCGGGTT ATGACCCCAG GGTGCGACCC CTTCACGGCC TGGGGATGGT GGTGTCGGCC
TTGGGGGGTA GTTACTGCTA CGGCAAGGCC CTCCATGCCG GCCTCTTCGC CGCCGACCGG
GAAGGGGAAG ACCTGGTGGG CCAGGTAAGC CGCATCCAGG AAACGACGGT AGCCCTGGAA
ACAGGCGTCG GTTGCCTCTT TGCCTACCTG GGCGGGTGGT TGAGCCTGGA ACAGATGGCT
GAGATGATGA CGGCAGTAAC AGGTACGGAG GTAACTCCGG AGGACCTAAA GGGGGCGGCT
GAACGAAGCT TGACCATGGA GCGGGCCTTT AACCTGCGGG AAGGCCTGGG CCGGGAAGCC
GATACCCTGC CGGAGCGTTT CCTGCGGGAG GGTGTCGAGG CAGGCGAGAT TACAGCCGGA
CCCCTTACTG AATTACCACG CCTGGTTACG GCCTATTACC GCCGTCGCGG CTGGGATGAA
GAAGGCCGGC CAACACCTGC CACCCTGGAC CGCCTGAGCC TGGAATTGGT GCGCCTGGAC
CAGCCCCGGG AGTTCAAAGT AGCTTTCAGC CGGGAATTGA CCAAATAA
 
Protein sequence
MERQILRVSL SDGNIRQEEI PARVCRDFIG GRPLAARYLY GEGTAGVDPL GPENILIFVA 
GPLAGSGARG VNRWLVVTKS PLSGGFFRSV GGSDFGAWLH GAGYEMLFLT GRAEEPVYLY
LGPRGRVELR PARDLWGKGT GDTRAELRRR HGQGARVACI GPAGENLVRY ASIISDHSTA
ARGGVGTVMG AKRLKALVVE AEPDVPVARQ DLWSRILVRQ AELVERRDHR APLPGALKGA
ILPTRNFQGA WKADIDPVDL PRLLERKGPE FQTYWALGAN PGHLDPDLVT MANERCNDLG
LDTVSTGGSI AFAYELRQRG LLQAEGFDLG WERPEATMEL IRMIAYRAGI GDLLAEGVDR
MAAYIGSGAR EYAMTIKGIE MPGYDPRVRP LHGLGMVVSA LGGSYCYGKA LHAGLFAADR
EGEDLVGQVS RIQETTVALE TGVGCLFAYL GGWLSLEQMA EMMTAVTGTE VTPEDLKGAA
ERSLTMERAF NLREGLGREA DTLPERFLRE GVEAGEITAG PLTELPRLVT AYYRRRGWDE
EGRPTPATLD RLSLELVRLD QPREFKVAFS RELTK