Gene ECH_0055 details


Gene Information

Locus tag: ECH_0055
Symbol: pgk
ID: 3926974
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 47910
End bp: 49061
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 31%
IMG OID: 637901179
Product: phosphoglycerate kinase
Protein accession: YP_506886
Protein GI: 88658248
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0126] 3-phosphoglycerate kinase
TIGRFAM ID:
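
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(47910, 49061)
print(length)                  # 1152
print(protein_length(length))  # 383
```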


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.186403
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA TACAAGATTT TAGTTGTAGT GGTAAAACTG TATTGCTGCG TGCGGATTTA 
AATGTTCCAG TAGATAATGG GATAGTTTTG GATGATACTA GGATTGTTAG ATTAACTACA
ACTATAAAAT ATTTATTGAG TAATGATGCG AAAATTGTCA TAATGTCGCA CTATGGATCT
CCTAAATCTT ATGATAAAGA ATTTTCTTTA AAGTTTGTGG TTGAGTATTT GAATAAAATA
TTTGCAACAA ATGTAGTATT TATAGATGGT GTAATTGGAG ATTATGTAGA ACAGACTATT
CAATCTGTTC CAGCAGGGAC TATATTGCTG CTAGAAAATT TGAGATTTTA CGCAGAAGAA
GAAAAGAATG ATTTGAATTT TGCAAAACAA CTTGCGTTGT TGGCTGATAT ATATGTTAAT
GATGCTTTTT CTTGTTTACA TCGTAAGCAT GCTTCTATAG ATGCGATTAC TAGAGTTATG
CCGTCTTTTA TTGGCTTTAA TTTTCAAGAA GAAATGAAAT ATTTGAGTTG TGTTGTTTCA
AATAGTGAGA AGCCAGTAGC TGTTATAGTT GGTGGTTCAA AAATATCAAC AAAAGTTCAT
ATGTTAAAAA ATTTGATTAA AAAAATAGAT TTTTTGATAG TGGGAGGAGC CATTGCGAAT
AATTTTTTGT TATCACAAGG TTTAAAAATA GGTAAGTCGT TATACGAAGA GTTAGAAAAA
GATCTTGTAA CAGAAATTGT AGATCTTGCT AAGAGATATG AATGTAAGAT AATTGTCCCT
GTTGATTACG TAGTGGCTAA AAATTACATT TGTGGGGATA GTACAATAAA AGACAATGAC
ACTTTAGAGT CTGATGATAT GATATTAGAT GTAGGACCTC AAACTGTTAA CATGATTGCT
GCTACGATAA ATAAATGTAG AACAGTGCTA TGGAATGGTC CGTGTGGTAT GTTTGAAAAA
GAACCTTTTT CTAAAGGAAC ATTTAGTGTT GCGAACTTGT TGTCAAAATT GACTAAGGTA
GGAAAGCTAA AAAGTATTGT TGGAGGTGGA GATAGTATAT GTGCAATAAA ATTATCTGGA
CTTTCAAATG AAGACTTTAC TTATATTTCT ACAGGAGGAG GAGCTTTATT GCATTTTTTG
AGTATCGCAT GA
 
Protein sequence
MKKIQDFSCS GKTVLLRADL NVPVDNGIVL DDTRIVRLTT TIKYLLSNDA KIVIMSHYGS 
PKSYDKEFSL KFVVEYLNKI FATNVVFIDG VIGDYVEQTI QSVPAGTILL LENLRFYAEE
EKNDLNFAKQ LALLADIYVN DAFSCLHRKH ASIDAITRVM PSFIGFNFQE EMKYLSCVVS
NSEKPVAVIV GGSKISTKVH MLKNLIKKID FLIVGGAIAN NFLLSQGLKI GKSLYEELEK
DLVTEIVDLA KRYECKIIVP VDYVVAKNYI CGDSTIKDND TLESDDMILD VGPQTVNMIA
ATINKCRTVL WNGPCGMFEK EPFSKGTFSV ANLLSKLTKV GKLKSIVGGG DSICAIKLSG
LSNEDFTYIS TGGGALLHFL SIA
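
The protein sequence follows from the gene sequence through translation table 11, and the GC content is the share of G and C bases in the gene sequence. Below is a minimal sketch of both calculations using only the Python standard library. It assumes the sequence is pasted in as text with the spaces and line breaks shown above, and it does not handle the alternative start codons that table 11 allows, which is sufficient here because this gene starts with ATG. The helper names `clean_sequence`, `translate`, and `gc_percent` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional TCAG order.
# Translation table 11 uses the same assignments as the standard code;
# it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(raw.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of bases that are G or C."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base groups of the gene sequence above.
first_codons = clean_sequence("ATGAAAAAAA TACAAGATTT TAGTTGTAGT")
print(translate(first_codons))  # MKKIQDFSCS
```

Passing the full gene sequence through `clean_sequence` and then to `translate` and `gc_percent` allows the result to be compared with the protein sequence and the GC content listed in the record.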