Gene Information |
Locus tag | ECH_0471 |
Symbol | dnaK |
ID | 3927807 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 448535 |
End bp | 450442 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 637901594 |
Product | molecular chaperone DnaK |
Protein accession | YP_507287 |
Protein GI | 88658092 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0443] Molecular chaperone |
TIGRFAM ID | [TIGR02350] chaperone protein DnaK |
| |
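The coordinate and length fields in the table above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 448535
END_BP = 450442


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1908
print(protein_length(length_bp))  # 635
```

Both results agree with the Gene Length (1908 bp) and Protein Length (635 aa) rows.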
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0555822 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGTTA TAGGTATAGA TTTAGGTACG ACAAATTCTT GTGTTGCAGT TATGGAGGGA GGTGATGCAA AAGCTATTGA AAATAGTGAA GGGGCTAGAA CTACACCTTC AATAGTTGCA TTTACTGATT CAGAAAGGTT AGTTGGGGAT CCAGCAAAAC GTCAAGCTAC TACAAACGCT AAGAATACTA TATATGCTAG TAAAAGGCTT ATAGGACGTA GATATCAAGA TGTGAAAGAT ATAAAATCAT CTTATGATGT GGTGTCTGCT AAAAACGGTG ATGCTTGGAT AAAAGTACTT GGCAAAGAAT ATTCTCCAAG TCAGATTGGT GCATTTGTTT TGGAAAAAAT GAAAGAAACA GCAGAAAGAC ATCTTGGACA TAAAGTTGAG AAGGCTGTGA TTACAGTACC TGCATATTTT AATGATGCGC AACGTCAAGC AACAAAGGAT GCAGGAAGGA TAGCAGGATT AGATGTTATT AGAATAATTA ATGAACCTAC AGCTGCTGCT TTGGCGTATG GGTTAAATAA AAGTGATAAA CAAAAAGTAA TAGCAGTCTA TGATTTAGGT GGTGGTACTT TTGATGTTTC AATATTAGAA ATTGCTGACG GTGTTTTTGA AGTAAAAGCA ACAAATGGTG ATACAATGCT TGGTGGTGAA GATTTTGACC ACGCTATTAT GAACTATTTG ATGGATGATT TTAAAAAGAC TACAGGCATA GATTTACATA ATGATTCTAT GGCAGTACAG AGAATTAAGG AAGCATCAGA AAAGGCAAAG ATAGAATTGT CTAACCGTAT GGAAACTGAT ATAAATCTGC CATTTATTTC TAGTGATAGT ACTGGGCCTA AGCATTTAAG TTTAAAATTG ACTAGAGCAA AGTTTGAAAA TTTAGTTGAT GATCTAATTC AAAGAACTAT TGAACCATGT AAGAAGGCTC TTAAGGATGC TGGAATATCT GCAGATAAAA TAGATGAGGT TGTATTAGTT GGTGGGATGA CTAGGGTTCC TAAAGTAATA CAAAAGGTAA AAGAATTTTT TGGTAGAGAA CCTCATAAGG GTGTTAATCC AGATGAAGTT GTAGCTATAG GTGCTGCTAT ACAGGGAAGT ATTCTTGCTG GTGATGTTAG AGATGTATTA TTATTAGATG TAACTCCACT ATCTTTAGGT ATAGAAACTT TAGGTGGTGT ATTTACTCCA TTGATTGAAA GGAATACTAC AATTCCTACA AAGAAATCTC AAGTATTCTC AACAGCAGAA GATGGTCAAA CTGCAGTTAC TATTAAGGTG TACCAAGGTG AGAGGAAAAT GGCAGCTGAT AATAAATTAT TAGGTCAGTT TAGCTTGGAA GGTATACCTT CAGCTCCTCG TGGTATGCCT CAAATTGAAG TTACCTTTGA CATAGATGCT AATGGTATAG TACATGTTTC AGCAAAAGAT AAAGCTTCAG GTAAAGAGCA AGCTATTAAG ATTCAGTCTT CTGGAGGATT AAGTGATGAT GAAATTCAAC GAATGATAAA AGAAGCTGAG CAAAAAGCTG GAGAGGATGA GAAGCGTAAG AAATTTATTG AACTGAAAAA TAATGGTGAA AATTTAGTGC ACTCTACGGA AAAATCTTTA AACGAATATG GAGATAAGAT TCCAAATTCT GATAGGCTTG AAATTGAGAA TGCTATTAGA GATGTAAAAG ATGGTCTCAG TAGTAGTGAT ATGGAAAGTG TAGATGTTTT ACAGCAAAAA GTTGATCATT TGATGAAAGT ATCAATGAAG CTAGGTGAAG CTTTATATGG TAATGCTGCT 
AATAATCCTT CATCTGCTGA GAACAGTACA GCAAGTAATA ACGAAGAAGA AGATTCTAAA GTTGTTGATT CTGATTATCA AGAGATTGAT AAGAAGGATA GCAAATAG
|
Protein sequence | MAVIGIDLGT TNSCVAVMEG GDAKAIENSE GARTTPSIVA FTDSERLVGD PAKRQATTNA KNTIYASKRL IGRRYQDVKD IKSSYDVVSA KNGDAWIKVL GKEYSPSQIG AFVLEKMKET AERHLGHKVE KAVITVPAYF NDAQRQATKD AGRIAGLDVI RIINEPTAAA LAYGLNKSDK QKVIAVYDLG GGTFDVSILE IADGVFEVKA TNGDTMLGGE DFDHAIMNYL MDDFKKTTGI DLHNDSMAVQ RIKEASEKAK IELSNRMETD INLPFISSDS TGPKHLSLKL TRAKFENLVD DLIQRTIEPC KKALKDAGIS ADKIDEVVLV GGMTRVPKVI QKVKEFFGRE PHKGVNPDEV VAIGAAIQGS ILAGDVRDVL LLDVTPLSLG IETLGGVFTP LIERNTTIPT KKSQVFSTAE DGQTAVTIKV YQGERKMAAD NKLLGQFSLE GIPSAPRGMP QIEVTFDIDA NGIVHVSAKD KASGKEQAIK IQSSGGLSDD EIQRMIKEAE QKAGEDEKRK KFIELKNNGE NLVHSTEKSL NEYGDKIPNS DRLEIENAIR DVKDGLSSSD MESVDVLQQK VDHLMKVSMK LGEALYGNAA NNPSSAENST ASNNEEEDSK VVDSDYQEID KKDSK
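The protein sequence can be derived from the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the tool used to produce this record: it builds the codon table from the standard NCBI base order (TCAG), relies on translation table 11 assigning the same amino acids as the standard code, and ignores table 11's alternative start codons, which do not matter here because the gene starts with ATG. The helper name `translate` and the two excerpts (the first 30 and last 18 bases of the gene sequence) are chosen for illustration.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G at each position.
# Amino acid assignments for translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base groups used in the
    record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "ATGGCTGTTA TAGGTATAGA TTTAGGTACG"  # start of the gene sequence
last_18 = "AAGAAGGATA GCAAATAG"               # end of the gene sequence

print(translate(first_30))  # MAVIGIDLGT
print(translate(last_18))   # KKDSK
```

The two outputs match the first ten and last five residues of the protein sequence listed above.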