Gene Moth_0585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0585 
SymboldnaK 
ID3830970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp609725 
End bp611563 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content58% 
IMG OID637828526 
Productmolecular chaperone DnaK 
Protein accessionYP_429458 
Protein GI83589449 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAAG TAATCGGTAT TGACCTTGGT ACGACTAACT CCTGCGTGGC CGTCATGGAA 
GGCGGTGAGG CAGTAGTTAT TCCCAATGCC GAAGGCGGCC GGACCACTCC TTCCGTGGTC
GCCTTTACCA AAGAGGGCGA GCGGATCGTG GGCCAGGTGG CCAAACGCCA GGCCATAACC
AATCCTGACC GGACGGTAAT ATCCATTAAG CGGCATATGG GGACCAACTA CAAGGTAAAA
ATTGATAATA AAGAGTATAC GCCGCAGGAG ATCTCAGCCA TGATCCTGCA GAAACTGAAA
GCCGACGCGG AAGCCTACCT GGGCGAAAAG GTTACCCAGG CCGTCATCAC CGTGCCGGCT
TACTTCACCG ACAGCCAGCG CCAGGCAACC AAGGACGCCG GGCGCATTGC CGGCCTGGAA
GTCCTGCGCA TCATCAACGA ACCGACAGCA GCTTCCCTGG CCTATGGCCT GGATAAAGGC
GAGGACCAGA CCATCCTGGT TTACGACCTG GGCGGCGGCA CCTTTGATGT TTCCATCCTG
GAGCTGGGAG ACGGCGTCTT TGAGGTTAAA GCCACCAGCG GTAATAACCG CCTGGGAGGC
GACGACTTCG ACCAGCGAAT AATGGACTAT CTGGTGGATA TCTGCCGCCG CGAGCATGGC
GTTGACCTGA CCCAGGATAA GATGGCCATG CAGCGCCTGA AGGAGGCTGC CGAGAAAGCC
AAGATTGAGC TTTCCGGCAT GACCAGCACC AATATCAACC TGCCCTTTAT CTCGGCGACG
CCCAACGGCC CCGTCCACCT GGATGTCAAC CTGACCCGGG CCAAGTTTGA GGAACTCATC
GCCGACCTGG TAGAGAAGAC CGTGGGTCCG ACCAGGCAGG CCCTGGCCGA TGCCGGCCTG
GAACCCAAAG ATATCGATAA AGTATTGCTG GTAGGCGGTT CCACCCGGGT GCCCCTGGTA
CAGGAGACGG TGCGCAAGAT CTTGGGCCAG GAGCCCCATA AAGGCATCAA CCCCGACGAA
TGTGTTGCCC TGGGAGCGGC TATCCAGGGA GGCGTCCTTG CCGGTGAGGT CAAAGACGTG
CTGCTGCTGG ACGTCACGCC CCTTTCTCTG GGCATTGAAA CCCTGGGCGG CGTGTTTACC
AAACTCATTG AGCGCAATAC AACCATTCCT ACTTCCAAGA GCCAGATCTT CTCCACTGCC
GCCGACAACC AGACTACGGT GGAGATTCAT GTCCTCCAGG GCGAGCGGGC TATGGCCGCT
GATAACAAGA CCCTGGGCCG CTTCCAGCTG ACGGGGATCC CTCCGGCGCC CCGGGGAGTG
CCCCAGATTG AGGTCAAGTT CGACATTGAC GCCAATGGTA TCGTCCACGT CTCGGCCAAG
GATCTGGGTA CCGGCAAACA GCAGGCCATT ACCATCACCT CGTCCAGCGG CCTGAGCGAG
GAAGAGATCC AGCGCATGGT CAAGGAAGCC GAGGCTTCGG CTGAGGCCGA CCGCCGCCGT
AAGGAAGAGA TCGAGACCCG CAACCAGGCG GACTCCCTCA TCTACCAGGC CGAGCGCACC
TTGAAGGAGT TCAAGGACAA AGCCGACCAG AATGATGTAG ACCGCATCGA AAAGGCGAAG
AAAGAGCTCC AGGAGGTCCT GGACAGCAAG AATAATGACA AGATCAAAGA AAAGATGGAG
GCCCTCTCCC AGGCCCTTTA TACCCTGACC ACCAAGGTGT ACCAGCAGGC TGGTGCTCAA
GCCGGAGCCC AGGGCCAGGG CGCGGCCGGC GGCCAGAAAC AGGACGGTAA CGTCTACGAC
GCTGACTATA AAGTCGTCGA CGACGACAAG AAAGAATAG
 
Protein sequence
MSKVIGIDLG TTNSCVAVME GGEAVVIPNA EGGRTTPSVV AFTKEGERIV GQVAKRQAIT 
NPDRTVISIK RHMGTNYKVK IDNKEYTPQE ISAMILQKLK ADAEAYLGEK VTQAVITVPA
YFTDSQRQAT KDAGRIAGLE VLRIINEPTA ASLAYGLDKG EDQTILVYDL GGGTFDVSIL
ELGDGVFEVK ATSGNNRLGG DDFDQRIMDY LVDICRREHG VDLTQDKMAM QRLKEAAEKA
KIELSGMTST NINLPFISAT PNGPVHLDVN LTRAKFEELI ADLVEKTVGP TRQALADAGL
EPKDIDKVLL VGGSTRVPLV QETVRKILGQ EPHKGINPDE CVALGAAIQG GVLAGEVKDV
LLLDVTPLSL GIETLGGVFT KLIERNTTIP TSKSQIFSTA ADNQTTVEIH VLQGERAMAA
DNKTLGRFQL TGIPPAPRGV PQIEVKFDID ANGIVHVSAK DLGTGKQQAI TITSSSGLSE
EEIQRMVKEA EASAEADRRR KEEIETRNQA DSLIYQAERT LKEFKDKADQ NDVDRIEKAK
KELQEVLDSK NNDKIKEKME ALSQALYTLT TKVYQQAGAQ AGAQGQGAAG GQKQDGNVYD
ADYKVVDDDK KE