Gene TM1040_0010 details

Gene Information

Locus tag: TM1040_0010
Symbol: dnaK
ID: 4078673
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 10577
End bp: 12505
Gene Length: 1929 bp
Protein Length: 642 aa
Translation table: 11
GC content: 58%
IMG OID: 638005297
Product: molecular chaperone DnaK
Protein accession: YP_612005
Protein GI: 99079851
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0443] Molecular chaperone
TIGRFAM ID: [TIGR02350] chaperone protein DnaK
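
The start and end coordinates, gene length, and protein length listed above are consistent with one another, and a short Python sketch can confirm this. It assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(10577, 12505)
print(length)                  # 1929
print(protein_length(length))  # 642
```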


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0949466
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid covering clones (num): 14
Fosmid unclonability p-value: 0.47597
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAAG TGATCGGGAT TGACCTCGGG ACCACCAACT CCTGCGTTGC CATCATGGAC 
GGCTCTCAGC CGCGCGTGAT CGAAAACGCC GAGGGTGCAC GCACCACCCC TTCGATCGTT
GCCTTCACCG ATGAAGAGCG CCTTGTTGGA CAGCCTGCAA AACGCCAGGC CGTGACCAAC
CCGGACAACA CCATCTTTGG TGTGAAGCGC CTCATTGGCC GTCGGTTTGA CGACAGCGAC
CTCGCCAAGG ACAAGAAAAA CCTGCCCTTC GCCGTAATCA ACGGCGGCAA CGGCGACGCA
TGGGTCGAAG CGAAGTCCGA GAAGTACTCC CCCTCCCAGA TTTCCGCCTT CATCCTTGGT
AAGATGAAGG AAACCGCCGA GAGCTATCTT GGCGAAGAAG TGACGCAAGC GGTCATCACC
GTTCCTGCGT ATTTCAACGA CGCCCAGCGT CAGGCCACCA AAGACGCCGG CAAGATCGCT
GGCCTCGAGG TGCTGCGCAT CATCAACGAG CCGACTGCAG CGGCGCTGGC CTATGGCCTC
GACAAGGCCG AAACCCAGAC CATCGCAGTC TATGACCTTG GTGGCGGTAC CTTTGACGTT
ACCATCCTCG AGATCGACGA TGGCCTGTTT GAAGTGAAAT CCACCAACGG TGACACCTTC
CTCGGTGGTG AAGACTTCGA CATGCGCATC GTGAACTACC TCGCGGATGA GTTCAAAAAA
GAGCATGGCG TCGACCTGAC CAAAGACAAG ATGGCGCTTC AGCGTCTGAA AGAAGCGGCA
GAGAAAGCCA AGATCGAGCT GTCGTCCTCT TCGCAGACCG AGATCAACCA GCCGTTCATC
TCGATGGACC CGTCGTCTGG CCAGCCGCTG CACTTGGTCA TCAAACTGAC CCGCGCAAAG
CTCGAAAGCC TCGTGGGCGA CCTGATCAAG AACTCCATGA AGCCCTGCGC CGCCGCGCTG
AAGGATGCTG GCCTGTCCGC GTCCGACATT GACGAGGTGG TTCTGGTCGG TGGTATGACC
CGCATGCCGA AGGTCATCGA AGAGGTGACC AAATTCTTTG GCAAGGAGCC GCACAAGGGT
GTGAACCCGG ACGAAGTGGT TGCCATGGGC GCCGCCATTC AGGCTGGTGT TCTGCAGGGT
GACGTGAAGG ACGTTGTTCT TCTCGACGTG ACCCCGCTGT CGCTCGGCAT CGAAACCCTT
GGTGGCGTCT TCACCCGCCT GATCGACCGC AACACCACGA TCCCGACGAA GAAGTCTCAG
GTCTTCTCCA CCGCCGAGGA CAACCAGAAC GCCGTGACCA TTCGCGTATT CCAGGGTGAA
CGCGAAATGG CAGCCGACAA CAAGATGCTC GGTCAGTTCA ACCTCGAAGA CATCCCGCCC
GCACCGCGCG GCATGCCTCA GATCGAAGTG ACCTTTGACA TCGACGCCAA CGGTATCGTT
TCGGTGTCCG CCAAAGACAA AGGCACTGGC AAAGAGCAGA AGATCACCAT CCAAGCGTCT
GGTGGCCTGT CCGACGAGGA CATCGAAAAG ATGGTCAAGG ACGCCGAGGA CAATGCCGAG
GCTGACAAGG AACGTCGCGA GCTCATCGAA GCACGCAACC AGGCCGAGAG CCTGATCCAC
TCGACCGAGA AATCGATCGA AGATCACGGC GACAAGGTTG ATCCCTCGAC CATCGAGGCG
ATTGAACTGG CGATTGCGGC GCTCAAAGAC GATCTCGAAG GCGACAAGGC GAATGCCGAG
AAGATCAAAT CCGGCATCCA GAACGTCACC GAAGCGGCGA TGCGTCTGGG CGAGGCGATC
TACAAGGCAC AAGCCGAAGA AGGTGGCGCG GACGAGCCCT CTGCCGCTGA TGAAGATGCT
TCCGCAGGCC CCGGTGACGA CGACATCGTT GATGCCGAGT TTGAAGATCT GGACGACAAC
AAGCGCTAA
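
The gene sequence above is displayed in space-separated blocks of ten bases. A minimal Python sketch of how such a listing could be cleaned and its GC content computed follows. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first three blocks of the listing.

```python
def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence listing, as an example input.
fragment = clean_sequence("ATGAGCAAAG TGATCGGGAT TGACCTCGGG")
print(len(fragment))
print(round(gc_percent(fragment), 1))
```

Run on the full listing, the same functions give a figure that can be compared with the GC content stated under Gene Information.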
 
Protein sequence
MSKVIGIDLG TTNSCVAIMD GSQPRVIENA EGARTTPSIV AFTDEERLVG QPAKRQAVTN 
PDNTIFGVKR LIGRRFDDSD LAKDKKNLPF AVINGGNGDA WVEAKSEKYS PSQISAFILG
KMKETAESYL GEEVTQAVIT VPAYFNDAQR QATKDAGKIA GLEVLRIINE PTAAALAYGL
DKAETQTIAV YDLGGGTFDV TILEIDDGLF EVKSTNGDTF LGGEDFDMRI VNYLADEFKK
EHGVDLTKDK MALQRLKEAA EKAKIELSSS SQTEINQPFI SMDPSSGQPL HLVIKLTRAK
LESLVGDLIK NSMKPCAAAL KDAGLSASDI DEVVLVGGMT RMPKVIEEVT KFFGKEPHKG
VNPDEVVAMG AAIQAGVLQG DVKDVVLLDV TPLSLGIETL GGVFTRLIDR NTTIPTKKSQ
VFSTAEDNQN AVTIRVFQGE REMAADNKML GQFNLEDIPP APRGMPQIEV TFDIDANGIV
SVSAKDKGTG KEQKITIQAS GGLSDEDIEK MVKDAEDNAE ADKERRELIE ARNQAESLIH
STEKSIEDHG DKVDPSTIEA IELAIAALKD DLEGDKANAE KIKSGIQNVT EAAMRLGEAI
YKAQAEEGGA DEPSAADEDA SAGPGDDDIV DAEFEDLDDN KR
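
The protein sequence can be derived from the gene sequence using translation table 11, as the following Python sketch illustrates. Table 11 (bacterial, archaeal and plant plastid code) assigns codons to amino acids in the same way as the standard code and differs only in the start codons it permits, so the sketch builds the standard 64-codon mapping. The `translate` function is an illustrative helper, not the database's own pipeline, and it assumes the input is in frame and starts with ATG.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final block of the gene sequence listing.
print(translate("ATGAGCAAAG TGATCGGGAT TGACCTCGGG"))  # MSKVIGIDLG
print(translate("AAGCGCTAA"))                         # KR
```

The two outputs match the first ten and last two residues of the protein sequence listed above.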