Gene EcolC_3642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3642 
SymboldnaK 
ID6064767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3988222 
End bp3990138 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content51% 
IMG OID641603057 
Productmolecular chaperone DnaK 
Protein accessionYP_001726580 
Protein GI170021626 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.297376 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAAAA TAATTGGTAT CGACCTGGGT ACTACCAACT CTTGTGTAGC GATTATGGAT 
GGCACCACTC CTCGCGTGCT GGAGAACGCC GAAGGCGATC GCACCACGCC TTCTATCATT
GCCTATACCC AGGATGGTGA AACTCTGGTT GGTCAGCCGG CTAAACGTCA GGCAGTGACA
AACCCGCAAA ACACCCTGTT TGCGATTAAA CGCCTGATTG GCCGCCGCTT CCAGGACGAA
GAAGTACAGC GTGATGTTTC CATCATGCCG TTCAAAATTA TTGCTGCTGA TAACGGCGAC
GCATGGGTCG AAGTTAAAGG CCAGAAAATG GCACCGCCGC AGATTTCTGC TGAAGTGCTG
AAAAAAATGA AGAAAACCGC TGAAGATTAC TTGGGTGAAC CGGTAACTGA AGCTGTTATC
ACCGTACCGG CATACTTTAA CGATGCTCAG CGTCAGGCAA CCAAAGACGC AGGCCGTATC
GCTGGTCTGG AAGTAAAACG TATCATCAAC GAACCGACCG CAGCTGCGCT GGCTTACGGT
CTGGACAAAG GTACTGGCAA CCGTACTATC GCGGTTTATG ACCTGGGTGG TGGTACTTTC
GATATTTCTA TTATCGAAAT CGACGAAGTT GACGGCGAAA AAACCTTCGA AGTTCTGGCT
ACCAACGGTG ATACCCACCT GGGTGGTGAA GACTTCGACA GCCGTCTGAT CAACTACCTG
GTTGAAGAAT TCAAGAAAGA TCAGGGCATT GACCTGCGCA ACGATCCGCT GGCAATGCAG
CGCCTGAAAG AAGCGGCAGA AAAAGCGAAA ATCGAACTGT CTTCCGCTCA GCAGACCGAC
GTTAACCTGC CGTACATCAC TGCAGATGCG ACCGGTCCGA AACACATGAA CATCAAAGTG
ACTCGTGCGA AACTGGAAAG CCTGGTTGAA GATCTGGTAA ACCGTTCCAT TGAGCCGCTG
AAAGTTGCAC TGCAGGACGC TGGCCTGTCC GTATCTGATA TCGACGACGT TATCCTCGTT
GGTGGTCAGA CTCGTATGCC AATGGTTCAG AAGAAAGTTG CTGAGTTCTT TGGTAAAGAG
CCGCGTAAAG ACGTTAACCC GGACGAAGCT GTAGCAATCG GTGCTGCTGT TCAGGGTGGT
GTTCTGACTG GTGACGTGAA AGACGTACTG CTGCTGGACG TTACCCCGCT GTCTCTGGGT
ATCGAAACCA TGGGCGGTGT GATGACGACG CTGATCGCGA AAAACACCAC TATCCCGACC
AAGCACAGCC AGGTGTTCTC TACCGCTGAA GACAACCAGT CTGCGGTAAC CATCCATGTG
CTGCAGGGTG AACGTAAACG TGCGGCTGAT AACAAATCTC TGGGTCAGTT CAACCTGGAT
GGTATCAACC CGGCACCGCG CGGCATGCCG CAGATCGAAG TTACCTTCGA TATCGATGCT
GACGGTATCC TGCACGTTTC CGCGAAAGAT AAAAACAGCG GTAAAGAGCA GAAGATCACC
ATCAAGGCGT CTTCTGGTCT GAACGAAGAT GAAATCCAGA AAATGGTACG CGACGCAGAA
GCTAACGCCG AAGCTGACCG TAAGTTTGAA GAGCTGGTAC AGACTCGCAA CCAGGGCGAC
CATCTGCTGC ACAGCACCCG TAAGCAGGTT GAAGAAGCAG GCGACAAACT GCCGGCTGAC
GACAAAACTG CTATCGAGTC TGCACTGACT GCACTGGAAA CTGCTCTGAA AGGTGAAGAC
AAAGCCGCTA TCGAAGCGAA AATGCAGGAG CTGGCACAGG TTTCCCAGAA ACTGATGGAA
ATCGCCCAGC AGCAACATGC CCAGCAGCAG ACTGCCGGTG CTGATGCTTC TGCAAACAAC
GCGAAAGATG ACGATGTTGT CGACGCTGAA TTTGAAGAAG TCAAAGACAA AAAATAA
 
Protein sequence
MGKIIGIDLG TTNSCVAIMD GTTPRVLENA EGDRTTPSII AYTQDGETLV GQPAKRQAVT 
NPQNTLFAIK RLIGRRFQDE EVQRDVSIMP FKIIAADNGD AWVEVKGQKM APPQISAEVL
KKMKKTAEDY LGEPVTEAVI TVPAYFNDAQ RQATKDAGRI AGLEVKRIIN EPTAAALAYG
LDKGTGNRTI AVYDLGGGTF DISIIEIDEV DGEKTFEVLA TNGDTHLGGE DFDSRLINYL
VEEFKKDQGI DLRNDPLAMQ RLKEAAEKAK IELSSAQQTD VNLPYITADA TGPKHMNIKV
TRAKLESLVE DLVNRSIEPL KVALQDAGLS VSDIDDVILV GGQTRMPMVQ KKVAEFFGKE
PRKDVNPDEA VAIGAAVQGG VLTGDVKDVL LLDVTPLSLG IETMGGVMTT LIAKNTTIPT
KHSQVFSTAE DNQSAVTIHV LQGERKRAAD NKSLGQFNLD GINPAPRGMP QIEVTFDIDA
DGILHVSAKD KNSGKEQKIT IKASSGLNED EIQKMVRDAE ANAEADRKFE ELVQTRNQGD
HLLHSTRKQV EEAGDKLPAD DKTAIESALT ALETALKGED KAAIEAKMQE LAQVSQKLME
IAQQQHAQQQ TAGADASANN AKDDDVVDAE FEEVKDKK