Gene Moth_0722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0722 
Symbol 
ID3830998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp751738 
End bp753537 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content61% 
IMG OID637828653 
Productaldehyde ferredoxin oxidoreductase 
Protein accessionYP_429583 
Protein GI83589574 
COG category[C] Energy production and conversion 
COG ID[COG2414] Aldehyde:ferredoxin oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000472604 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.424629 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGGTT GGACCGGACA GTTACTGCGC GTAAACTTGA GCAACGGTAA GTGCAGGACG 
GAAAGGTTAG ACCCGATCCT GGCCCGGGAT TACGTCGGCG CCAGGGGGCT GGCCAGCAAA
ATTCTCTGGA ATGAAATTGA TCCTCAGGTC GACCCCCTGG CCCCGGAGAA CAAGCTCATA
TTTATGACCG GGCCCCTCAC CGGGACTACG GCTATTTCCG GCAACCGTTA CAACGTCGTC
ACCAAATCGC CCCTGACCGG CGCTATCGCC GCCTCCAGCT CCGGCGGTTA CTTTGGCAGC
GAACTCAAGT ACGCCGGCTT TGATGGGATT ATCTTTGAAG GCCGGGCGCC CGAGCCTGTT
TATCTCTGGA TTGAAGACGG TTCCTTTGAG TTGCGGCCAG CCGGCGAACT TTGGGGGAAA
AACGTCCACG AGACGGAAGA CGCTATAAAA GCAGTCACCT GCCCCCATGC TAAAGTGGCT
TGCATCGGCC CGGCTGGAGA GAAACTGGTT CGTTTTGCCT GTATAATGAA TGATAAAAAC
CGGGCCGCTG GCCGTTCCGG TGTCGGCGCC GTCATGGGGT CCAAGAACCT GAAGGCCATT
GCCGTCCGCG GTCACGGCGG GGTCAAGGTG GCCGATGGGC CGGCGTTCCG GGAAGCGGTC
CTGGCGTCCC TGGCCAAGAT CAAGGCCAAT GATGTCACCC ACGGCGGCCT GCCCGCCTAC
GGCACCGGGG TCCTGGTGAA TGTCATCAAC GCCCATGGAG GCCTGCCTAC CCGAAATTTT
CAGACAGGCA TCTTTCCAGG AGCGGAAAAA ATCAGCGGTG AAGCCCTGGC GGCTACCTAC
CTGGTGCGCA AGAAGGCCTG CCTGGCCTGC CCCATGGCCT GCGGCCGCGC CACGATGGTA
CCTTCCGGTC CCTACGCCGG TCATGGTGAA GGGCCGGAGT ATGAGGCCCA GTGGTCCCTG
GGGGCCGACT GCGGCATTGA TGACCTGGCG GCCATCCTCA AGGCTAACTT CCTGGCTAAC
GAGCTGGGCT ATGACCCCAT TTCCTTCGGC TCTACCCTGG CCTGTGCTAT GGAACTATAT
GAAAAGGGTT ACCTGCCGGC CGGGGATACC GAGGTGCCCC TGGAATTCGG CAATGCCGCC
GTCATGGTGG AAACGGCCCG CAAGGTGGGC TACCGGGAGG GTATCGGCGA TCTGCTGGCG
GAGGGTTCTT ACCGCCTGGC ATCACGCTAC GGTCATCCCG AACTCTCCAT GACCAGCAAA
AAGCAGGAAT ACCCGGCCTA TGACCCGCGG GCCTTCCAGG GTATCGGCCT GAATTATGCC
ACCTCCAACC GCGGCGGCTG CCACGTCCGG GGCTATACCA TCGCTGCCGA GGCCTTGGGT
ACTCCTGTCC AGGCGGATCC CCTTTCTTCT GAGGGCAAAG CGGCCCTGGA TAAGGCCTTC
CAGGATCTGA CCGCCCTGGT GGATGCAAGT GGTATCTGCC TCTTCACCAC CTTTGCCCTG
GGGGCTCCGG ATGTCGCCAG CATGCTGGCG ACGGCCACCG GCGTGCCCTA CACTGAGGAA
AGCGGCCTCC TGGCGGGTGA AAGGATCTAT AACCTGGAGC GTCTCTTTAA TTTCGCCGCC
GGCTTAACTA AAGCCGACGA TACCCTGGCG CCGCGGCTAC TCAATGAACC CATGCCGGAG
GGGCCGGCTA AAGGCAAGAC ATCCGCCCTG ACAAAGATGC TGGCCGAGTA CTACCAGTTG
CGCGGCTGGG ACGAAGAAGG CCGGGTCACA GCAGCTACCA GGGAGAGATT GGGGCTGTAG
 
Protein sequence
MYGWTGQLLR VNLSNGKCRT ERLDPILARD YVGARGLASK ILWNEIDPQV DPLAPENKLI 
FMTGPLTGTT AISGNRYNVV TKSPLTGAIA ASSSGGYFGS ELKYAGFDGI IFEGRAPEPV
YLWIEDGSFE LRPAGELWGK NVHETEDAIK AVTCPHAKVA CIGPAGEKLV RFACIMNDKN
RAAGRSGVGA VMGSKNLKAI AVRGHGGVKV ADGPAFREAV LASLAKIKAN DVTHGGLPAY
GTGVLVNVIN AHGGLPTRNF QTGIFPGAEK ISGEALAATY LVRKKACLAC PMACGRATMV
PSGPYAGHGE GPEYEAQWSL GADCGIDDLA AILKANFLAN ELGYDPISFG STLACAMELY
EKGYLPAGDT EVPLEFGNAA VMVETARKVG YREGIGDLLA EGSYRLASRY GHPELSMTSK
KQEYPAYDPR AFQGIGLNYA TSNRGGCHVR GYTIAAEALG TPVQADPLSS EGKAALDKAF
QDLTALVDAS GICLFTTFAL GAPDVASMLA TATGVPYTEE SGLLAGERIY NLERLFNFAA
GLTKADDTLA PRLLNEPMPE GPAKGKTSAL TKMLAEYYQL RGWDEEGRVT AATRERLGL