Gene ECH74115_0014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0014 
SymboldnaK 
ID6968061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp12180 
End bp14096 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content51% 
IMG OID643384098 
Productmolecular chaperone DnaK 
Protein accessionYP_002268621 
Protein GI209399575 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAAAA TAATTGGTAT CGACCTGGGT ACTACCAACT CTTGTGTAGC GATTATGGAT 
GGCACCACTC CTCGTGTGCT GGAGAACGCC GAAGGCGATC GCACCACGCC TTCTATCATT
GCCTATACCC AGGATGGTGA AACTCTGGTT GGTCAGCCGG CTAAACGTCA GGCAGTGACG
AACCCGCAAA ACACCCTGTT TGCGATTAAA CGCCTGATTG GCCGCCGCTT CCAGGACGAA
GAAGTACAGC GTGATGTTTC CATCATGCCG TTCAAAATTA TTGCTGCTGA TAACGGCGAC
GCATGGGTCG AAGTTAAAGG CCAGAAAATG GCACCGCCGC AGATTTCTGC TGAAGTGCTG
AAAAAAATGA AGAAAACCGC TGAAGATTAC CTGGGTGAAC CGGTAACTGA AGCTGTTATC
ACCGTACCGG CATACTTTAA CGATGCTCAG CGTCAGGCAA CCAAAGACGC AGGCCGTATC
GCTGGTCTGG AAGTAAAACG TATCATCAAC GAACCGACCG CAGCTGCGCT GGCTTACGGT
CTGGACAAAG GTACTGGCAA CCGTACTATC GCGGTTTATG ACTTGGGTGG TGGTACTTTC
GATATTTCTA TTATCGAAAT CGACGAAGTT GACGGCGAAA AAACCTTCGA AGTTCTGGCA
ACCAACGGTG ATACCCACCT GGGTGGTGAA GACTTCGACA GCCGTCTGAT CAACTATCTG
GTTGAAGAAT TCAAGAAAGA TCAGGGCATT GACCTGCGCA ACGATCCGCT GGCAATGCAG
CGCCTGAAAG AAGCGGCAGA AAAAGCGAAA ATCGAACTGT CTTCCGCTCA GCAGACCGAC
GTTAACCTGC CGTACATCAC TGCAGACGCG ACCGGTCCGA AACACATGAA CATCAAAGTG
ACTCGTGCGA AACTGGAAAG CCTGGTTGAA GATCTGGTTA ACCGTTCCAT CGAGCCGCTG
AAAGTTGCGC TGCAGGACGC TGGCCTGTCC GTATCTGATA TCGACGACGT TATCCTCGTT
GGTGGTCAGA CTCGTATGCC AATGGTTCAG AAGAAAGTTG CTGAGTTCTT TGGTAAAGAG
CCGCGTAAAG ACGTTAACCC GGACGAAGCT GTAGCAATCG GTGCTGCTGT TCAGGGTGGT
GTTCTGACTG GTGACGTAAA AGACGTACTG CTGCTGGATG TTACCCCGCT GTCTCTGGGT
ATCGAAACCA TGGGCGGTGT GATGACGACG CTGATCGCGA AAAACACCAC TATCCCGACC
AAGCACAGCC AGGTGTTCTC TACCGCTGAA GACAACCAGT CTGCGGTAAC CATCCATGTG
CTGCAGGGTG AACGTAAACG TGCGGCTGAC AACAAATCTC TGGGTCAGTT CAACCTGGAT
GGTATCAACC CGGCACCGCG CGGCATGCCG CAGATCGAAG TTACCTTCGA TATCGATGCT
GACGGTATCC TGCACGTTTC CGCGAAAGAT AAAAACAGCG GTAAAGAGCA GAAGATCACC
ATCAAGGCGT CTTCTGGTCT GAACGAAGAT GAAATCCAGA AAATGGTACG CGACGCCGAA
GCTAACGCCG AAGCTGACCG TAAGTTTGAA GAGCTGGTAC AGACTCGCAA CCAGGGCGAC
CATCTGCTGC ACAGCACCCG TAAGCAGGTT GAAGAAGCAG GCGACAAACT GCCGGCTGAC
GACAAAACTG CTATCGAGTC TGCGCTGACT GCACTGGAAA CTGCTCTGAA AGGTGAAGAC
AAAGCCGCTA TCGAAGCGAA AATGCAGGAA CTGGCACAGG TTTCCCAGAA ACTGATGGAA
ATCGCCCAGC AGCAACATGC CCAGCAGCAG ACTGCCGGTG CTGATGCTTC TGCAAACAAC
GCGAAAGATG ACGATGTTGT CGACGCTGAA TTTGAAGAAG TCAAAGACAA AAAATAA
 
Protein sequence
MGKIIGIDLG TTNSCVAIMD GTTPRVLENA EGDRTTPSII AYTQDGETLV GQPAKRQAVT 
NPQNTLFAIK RLIGRRFQDE EVQRDVSIMP FKIIAADNGD AWVEVKGQKM APPQISAEVL
KKMKKTAEDY LGEPVTEAVI TVPAYFNDAQ RQATKDAGRI AGLEVKRIIN EPTAAALAYG
LDKGTGNRTI AVYDLGGGTF DISIIEIDEV DGEKTFEVLA TNGDTHLGGE DFDSRLINYL
VEEFKKDQGI DLRNDPLAMQ RLKEAAEKAK IELSSAQQTD VNLPYITADA TGPKHMNIKV
TRAKLESLVE DLVNRSIEPL KVALQDAGLS VSDIDDVILV GGQTRMPMVQ KKVAEFFGKE
PRKDVNPDEA VAIGAAVQGG VLTGDVKDVL LLDVTPLSLG IETMGGVMTT LIAKNTTIPT
KHSQVFSTAE DNQSAVTIHV LQGERKRAAD NKSLGQFNLD GINPAPRGMP QIEVTFDIDA
DGILHVSAKD KNSGKEQKIT IKASSGLNED EIQKMVRDAE ANAEADRKFE ELVQTRNQGD
HLLHSTRKQV EEAGDKLPAD DKTAIESALT ALETALKGED KAAIEAKMQE LAQVSQKLME
IAQQQHAQQQ TAGADASANN AKDDDVVDAE FEEVKDKK