Gene ECH_0471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0471 
SymboldnaK 
ID3927807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp448535 
End bp450442 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content35% 
IMG OID637901594 
Productmolecular chaperone DnaK 
Protein accessionYP_507287 
Protein GI88658092 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0555822 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGTTA TAGGTATAGA TTTAGGTACG ACAAATTCTT GTGTTGCAGT TATGGAGGGA 
GGTGATGCAA AAGCTATTGA AAATAGTGAA GGGGCTAGAA CTACACCTTC AATAGTTGCA
TTTACTGATT CAGAAAGGTT AGTTGGGGAT CCAGCAAAAC GTCAAGCTAC TACAAACGCT
AAGAATACTA TATATGCTAG TAAAAGGCTT ATAGGACGTA GATATCAAGA TGTGAAAGAT
ATAAAATCAT CTTATGATGT GGTGTCTGCT AAAAACGGTG ATGCTTGGAT AAAAGTACTT
GGCAAAGAAT ATTCTCCAAG TCAGATTGGT GCATTTGTTT TGGAAAAAAT GAAAGAAACA
GCAGAAAGAC ATCTTGGACA TAAAGTTGAG AAGGCTGTGA TTACAGTACC TGCATATTTT
AATGATGCGC AACGTCAAGC AACAAAGGAT GCAGGAAGGA TAGCAGGATT AGATGTTATT
AGAATAATTA ATGAACCTAC AGCTGCTGCT TTGGCGTATG GGTTAAATAA AAGTGATAAA
CAAAAAGTAA TAGCAGTCTA TGATTTAGGT GGTGGTACTT TTGATGTTTC AATATTAGAA
ATTGCTGACG GTGTTTTTGA AGTAAAAGCA ACAAATGGTG ATACAATGCT TGGTGGTGAA
GATTTTGACC ACGCTATTAT GAACTATTTG ATGGATGATT TTAAAAAGAC TACAGGCATA
GATTTACATA ATGATTCTAT GGCAGTACAG AGAATTAAGG AAGCATCAGA AAAGGCAAAG
ATAGAATTGT CTAACCGTAT GGAAACTGAT ATAAATCTGC CATTTATTTC TAGTGATAGT
ACTGGGCCTA AGCATTTAAG TTTAAAATTG ACTAGAGCAA AGTTTGAAAA TTTAGTTGAT
GATCTAATTC AAAGAACTAT TGAACCATGT AAGAAGGCTC TTAAGGATGC TGGAATATCT
GCAGATAAAA TAGATGAGGT TGTATTAGTT GGTGGGATGA CTAGGGTTCC TAAAGTAATA
CAAAAGGTAA AAGAATTTTT TGGTAGAGAA CCTCATAAGG GTGTTAATCC AGATGAAGTT
GTAGCTATAG GTGCTGCTAT ACAGGGAAGT ATTCTTGCTG GTGATGTTAG AGATGTATTA
TTATTAGATG TAACTCCACT ATCTTTAGGT ATAGAAACTT TAGGTGGTGT ATTTACTCCA
TTGATTGAAA GGAATACTAC AATTCCTACA AAGAAATCTC AAGTATTCTC AACAGCAGAA
GATGGTCAAA CTGCAGTTAC TATTAAGGTG TACCAAGGTG AGAGGAAAAT GGCAGCTGAT
AATAAATTAT TAGGTCAGTT TAGCTTGGAA GGTATACCTT CAGCTCCTCG TGGTATGCCT
CAAATTGAAG TTACCTTTGA CATAGATGCT AATGGTATAG TACATGTTTC AGCAAAAGAT
AAAGCTTCAG GTAAAGAGCA AGCTATTAAG ATTCAGTCTT CTGGAGGATT AAGTGATGAT
GAAATTCAAC GAATGATAAA AGAAGCTGAG CAAAAAGCTG GAGAGGATGA GAAGCGTAAG
AAATTTATTG AACTGAAAAA TAATGGTGAA AATTTAGTGC ACTCTACGGA AAAATCTTTA
AACGAATATG GAGATAAGAT TCCAAATTCT GATAGGCTTG AAATTGAGAA TGCTATTAGA
GATGTAAAAG ATGGTCTCAG TAGTAGTGAT ATGGAAAGTG TAGATGTTTT ACAGCAAAAA
GTTGATCATT TGATGAAAGT ATCAATGAAG CTAGGTGAAG CTTTATATGG TAATGCTGCT
AATAATCCTT CATCTGCTGA GAACAGTACA GCAAGTAATA ACGAAGAAGA AGATTCTAAA
GTTGTTGATT CTGATTATCA AGAGATTGAT AAGAAGGATA GCAAATAG
 
Protein sequence
MAVIGIDLGT TNSCVAVMEG GDAKAIENSE GARTTPSIVA FTDSERLVGD PAKRQATTNA 
KNTIYASKRL IGRRYQDVKD IKSSYDVVSA KNGDAWIKVL GKEYSPSQIG AFVLEKMKET
AERHLGHKVE KAVITVPAYF NDAQRQATKD AGRIAGLDVI RIINEPTAAA LAYGLNKSDK
QKVIAVYDLG GGTFDVSILE IADGVFEVKA TNGDTMLGGE DFDHAIMNYL MDDFKKTTGI
DLHNDSMAVQ RIKEASEKAK IELSNRMETD INLPFISSDS TGPKHLSLKL TRAKFENLVD
DLIQRTIEPC KKALKDAGIS ADKIDEVVLV GGMTRVPKVI QKVKEFFGRE PHKGVNPDEV
VAIGAAIQGS ILAGDVRDVL LLDVTPLSLG IETLGGVFTP LIERNTTIPT KKSQVFSTAE
DGQTAVTIKV YQGERKMAAD NKLLGQFSLE GIPSAPRGMP QIEVTFDIDA NGIVHVSAKD
KASGKEQAIK IQSSGGLSDD EIQRMIKEAE QKAGEDEKRK KFIELKNNGE NLVHSTEKSL
NEYGDKIPNS DRLEIENAIR DVKDGLSSSD MESVDVLQQK VDHLMKVSMK LGEALYGNAA
NNPSSAENST ASNNEEEDSK VVDSDYQEID KKDSK