Gene Moth_0586 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0586 
Symbol 
ID3830971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp611577 
End bp612725 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content63% 
IMG OID637828527 
Productchaperone DnaJ 
Protein accessionYP_429459 
Protein GI83589450 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain 
TIGRFAM ID[TIGR02349] chaperone protein DnaJ 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAGC GCGATTACTA CGAGGTCCTG GGGGTTTCCC GGGACGCCTC GGAGGCGGAG 
ATCAAGAAGG CTTACCGCCA GCTGGCGCGC AAATACCACC CGGATATGAA CCCCGGGGAT
AAAGAGGCCG AAGAAAAGTT TAAAGAAGTT CAGGAAGCCT ACGAAGTCCT CAGTAACGCC
GAAAAGCGGG CCCGTTACGA CCAGTTCGGC CACGCCGGTA CCGAGGCTGG CGGTGGGCCC
GGATTCGGCG GTTTTGACTT TGGCGGCGCC GGGGCCGATT TTGGCTTCGG CGATATCTTT
GACATGTTCT TTGGTGGAGG TTTCGGCGGT GCGGCCCGGC GTCAGGGCCC CCAGCGAGGC
GATGACCTGC GCCTGGACCT GGAGATCTCC TTTGAGGAAG CCGCCTTCGG CGTAGAGAAG
GAGGTAGGCA TCCCCCGCCA GGAGAAATGT CCGGAGTGTG GCGGTAGCGG GGCGGCGCCC
GGCACCCACC CGAAGACCTG TCCCACCTGC CACGGGACTG GACAGATTCG TATCGCCCAG
CGGACGCCCC TGGGCCAGTT CCAGACCATT CGCACCTGCC ACCAGTGCCA CGGCCAGGGG
ACGATCATTG AGACGCCCTG TCCCAGGTGT CGCGGCCGGG GCGTAGTCCA GCGCACCCGC
AAGATCAGGG TCAAAATTCC TCCCGGCGTG GATACCGGCG CCCGGTTACG CATGGCCGGC
GAGGGCGAGA GCGGCCTGCG CGGCGGCCCG CCCGGGGATC TCTATATCTA TATAAACGTC
CGGCCCCATA AGCTCTTCCG GCGGGACGGC TACGACGTTT TCTGCGAAGT ACCGGTTTCC
ATGGTCCAGG CCGCCCTGGG TGACAGTATC AAAGTACCCA CCCTGGACGG CAAAGAAGAG
CTGCATATCC CGCCCGGTAC CCAGAGCGGT ACCAGTTTCC GCCTGAAGGG CAAGGGCATA
CCCCGCCTAA ACGGAGTCGG CCGGGGCGAC CAGCACGTCC GCATCCACGT CGAGACCCCG
ACCAACCTCA ACGAAAAGCA GAAAGAATTA TTGCGGGAAT TTGCCCGGCT CTACGGCAGC
GAACCCCGTG GCGCCAGGGA ACAGGACAAA GGCTTTTTCA GAAAGGTAAA GGACGCCTTT
ATGGGTTAG
 
Protein sequence
MAKRDYYEVL GVSRDASEAE IKKAYRQLAR KYHPDMNPGD KEAEEKFKEV QEAYEVLSNA 
EKRARYDQFG HAGTEAGGGP GFGGFDFGGA GADFGFGDIF DMFFGGGFGG AARRQGPQRG
DDLRLDLEIS FEEAAFGVEK EVGIPRQEKC PECGGSGAAP GTHPKTCPTC HGTGQIRIAQ
RTPLGQFQTI RTCHQCHGQG TIIETPCPRC RGRGVVQRTR KIRVKIPPGV DTGARLRMAG
EGESGLRGGP PGDLYIYINV RPHKLFRRDG YDVFCEVPVS MVQAALGDSI KVPTLDGKEE
LHIPPGTQSG TSFRLKGKGI PRLNGVGRGD QHVRIHVETP TNLNEKQKEL LREFARLYGS
EPRGAREQDK GFFRKVKDAF MG