Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0012 |
Symbol | dnaK |
ID | 6144644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 12047 |
End bp | 13963 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641614913 |
Product | molecular chaperone DnaK |
Protein accession | YP_001742129 |
Protein GI | 170682035 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0443] Molecular chaperone |
TIGRFAM ID | [TIGR02350] chaperone protein DnaK |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAAAA TAATTGGTAT CGACCTGGGT ACTACCAACT CTTGTGTAGC GATTATGGAT GGCACCACTC CTCGTGTACT GGAGAACGCC GAAGGCGATC GCACCACGCC TTCTATCATT GCCTATACCC AGGATGGTGA AACTCTGGTT GGTCAGCCGG CTAAACGTCA GGCAGTGACG AACCCGCAAA ACACCCTGTT TGCGATTAAA CGCCTGATTG GCCGCCGCTT CCAGGACGAA GAAGTACAGC GTGATGTTTC CATCATGCCG TTCAAAATTA TTGCTGCTGA TAACGGCGAC GCATGGGTCG AAGTTAAAGG CCAGAAAATG GCACCGCCGC AGATCTCTGC TGAAGTGCTG AAAAAAATGA AGAAAACCGC TGAAGATTAC CTGGGTGAAC CGGTAACTGA AGCTGTTATC ACCGTACCGG CATACTTTAA CGATGCTCAG CGTCAGGCAA CCAAAGACGC AGGCCGTATC GCTGGTCTGG AAGTAAAACG TATCATCAAC GAACCGACCG CAGCTGCGCT GGCTTACGGT CTGGACAAAG GTACTGGCAA CCGTACTATC GCGGTTTATG ACCTGGGTGG TGGTACTTTC GATATTTCTA TTATCGAAAT CGACGAAGTT GACGGCGAAA AAACCTTCGA AGTTCTGGCA ACCAACGGTG ATACCCACCT GGGGGGTGAA GACTTCGACA GCCGTCTGAT CAACTACCTG GTTGAAGAAT TCAAGAAAGA TCAGGGCATT GACCTGCGCA ACGATCCGCT GGCAATGCAG CGCCTGAAAG AAGCGGCAGA AAAAGCGAAA ATCGAACTGT CTTCCGCTCA GCAGACCGAC GTTAACCTGC CGTACATCAC TGCAGACGCG ACCGGTCCGA AACACATGAA CATCAAAGTG ACTCGTGCGA AACTGGAAAG CCTGGTTGAA GATCTGGTAA ACCGTTCCAT TGAGCCGCTG AAAGTTGCAC TGCAGGACGC TGGCCTGTCC GTATCTGATA TCGACGACGT TATCCTCGTT GGTGGTCAGA CTCGTATGCC AATGGTTCAG AAGAAAGTTG CTGAATTCTT TGGTAAAGAG CCGCGTAAAG ACGTTAACCC GGACGAAGCT GTAGCAATCG GTGCTGCTGT TCAGGGTGGT GTTCTGACTG GTGACGTGAA AGACGTACTG CTGCTGGACG TTACCCCGCT GTCTCTGGGT ATCGAAACCA TGGGCGGTGT GATGACGACG CTGATCGCGA AAAACACCAC TATCCCGACC AAGCACAGCC AGGTGTTCTC TACCGCTGAA GACAACCAGT CTGCGGTAAC CATCCACGTG CTGCAGGGTG AACGTAAACG TGCGGCTGAT AACAAATCTC TGGGTCAGTT CAACCTGGAT GGTATCAACC CGGCACCGCG CGGCATGCCG CAGATCGAAG TTACCTTCGA TATCGATGCT GACGGTATCC TGCACGTTTC CGCGAAAGAT AAAAACAGCG GTAAAGAGCA GAAGATCACC ATCAAGGCTT CTTCTGGTCT GAACGAAGAT GAAATCCAGA AAATGGTACG CGACGCAGAA GCTAACGCCG AAGCTGACCG TAAGTTTGAA GAGCTGGTAC AGACTCGCAA CCAGGGCGAC CATCTGCTGC ACAGCACCCG TAAGCAGGTT GAAGAAGCAG GCGACAAACT GCCGGCTGAC GACAAAACTG CTATCGAGTC TGCACTGACT GCACTGGAAA CTGCTCTGAA AGGTGAAGAC AAAGCCGCTA TCGAAGCGAA AATGCAGGAA CTGGCACAGG TTTCCCAGAA ACTGATGGAA ATCGCCCAGC AGCAACATGC CCAGCAGCAG ACTGCCGGTG CTGATGCTTC TGCAAACAAC GCGAAAGATG ACGATGTTGT CGACGCTGAA TTTGAAGAAG TCAAAGACAA AAAATAA
|
Protein sequence | MGKIIGIDLG TTNSCVAIMD GTTPRVLENA EGDRTTPSII AYTQDGETLV GQPAKRQAVT NPQNTLFAIK RLIGRRFQDE EVQRDVSIMP FKIIAADNGD AWVEVKGQKM APPQISAEVL KKMKKTAEDY LGEPVTEAVI TVPAYFNDAQ RQATKDAGRI AGLEVKRIIN EPTAAALAYG LDKGTGNRTI AVYDLGGGTF DISIIEIDEV DGEKTFEVLA TNGDTHLGGE DFDSRLINYL VEEFKKDQGI DLRNDPLAMQ RLKEAAEKAK IELSSAQQTD VNLPYITADA TGPKHMNIKV TRAKLESLVE DLVNRSIEPL KVALQDAGLS VSDIDDVILV GGQTRMPMVQ KKVAEFFGKE PRKDVNPDEA VAIGAAVQGG VLTGDVKDVL LLDVTPLSLG IETMGGVMTT LIAKNTTIPT KHSQVFSTAE DNQSAVTIHV LQGERKRAAD NKSLGQFNLD GINPAPRGMP QIEVTFDIDA DGILHVSAKD KNSGKEQKIT IKASSGLNED EIQKMVRDAE ANAEADRKFE ELVQTRNQGD HLLHSTRKQV EEAGDKLPAD DKTAIESALT ALETALKGED KAAIEAKMQE LAQVSQKLME IAQQQHAQQQ TAGADASANN AKDDDVVDAE FEEVKDKK
|
| |