Gene ECH_0235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0235 
Symbol 
ID3927482 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp221811 
End bp223076 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content31% 
IMG OID637901359 
ProductM16 family peptidase 
Protein accessionYP_507056 
Protein GI88657608 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACCAA AAATTACACA ACTTAGCAAC AATTTCACTA TAATAACTGA CACAATGCCA 
TATGTAGAAT CCGTATCTAT CAACATTTGG GTAAACGTCG GAAGTAGGTA TGAGAATATA
AACATAACAG GTATCTCTCA TTTTTTAGAA CACATGGCTT TTAAAGGCAC TAAAACTCGC
ACTGCACTTG ATATAGCACA AATTTTTGAT GATATAGGTG GAAATTTTAA CGCTCACACA
GACAGAGAAC ATACTGTTTA CCATGTAAAA ACACTAAAAA GAGACATTAA AATAGCTATA
GAAGTACTTG CAGACATAAT ACTAAATTCA CAATTTCCGG AAGAAGAAAT ATACAAAGAA
AAAGGAGTAG TGTTACAAGA GATATATCAA ACAAATGATT CTCCTACTAG TATAATTTTT
GATAAGTATA TAGAAGCTGC GTATCCTAAT CAAATATTTG GTAAATCCAT TTTAGGTACC
CCAGAATCAG TAAATAGCCT ATCTAAAGCA GATTTACACA TCTACATGAG TGAATATTAT
CACGCTGGCA ACATGTTACT ATCAGTAGCT GGAAACATAT CACATGAAGA AGTCATTGAT
TTAGTATCTC AGTATTTTTC TCATATGAAA AAATCACAAC GTAAAATAGC AGATCCATCA
ATTTATCGCA GCGGAGAATA TAGAGAAATA AGAAACTTAG AACAAGTACA TCTTGTCATA
GGATTCCCTA GTGTTTCATA TAAAGATGAC TTGTTTTATA CTATACAAAT TTTAGATTCA
ATCTTAGGAA ATGGCATGTC ATCACGTCTT TTCCAAAAAA TCCGTGAACA ATTAGGATTA
GTCTATACTA TTTCATCTTT CAACTCAAGT TACAGTGATA ACGGCATTTT CTCTATATAT
GCAGCAACAG ATAAAAGTAA TTTAAGTCAA TTACTTTCCA CTATAGCTTC TGAAGTAAAA
AATATCATAA CAAACTTACA AGAAAACGAG ATAACAAGAG CAAAAGGTAA ATTAACATCT
GAAATATTAA TGTCAAGAGA AAGCACTACT GCACGCGCTG AATCCTTAGG GTACTATTAT
TCCCATTACA ATCGGTACAT TTCAAAAGAA GAATTAATAA AGAAAATATC TACAATTACA
GTCACAGACA TTCAGAACTG TATTAATAAT CTACTAGGTA GCAACAACAA AATAACCTTA
GCAGCTATAG GTCAAATTGA AAACCTACCT TCTTATGATG ATATAGCCCA AATGTTTTAC
ATATAA
 
Protein sequence
MSPKITQLSN NFTIITDTMP YVESVSINIW VNVGSRYENI NITGISHFLE HMAFKGTKTR 
TALDIAQIFD DIGGNFNAHT DREHTVYHVK TLKRDIKIAI EVLADIILNS QFPEEEIYKE
KGVVLQEIYQ TNDSPTSIIF DKYIEAAYPN QIFGKSILGT PESVNSLSKA DLHIYMSEYY
HAGNMLLSVA GNISHEEVID LVSQYFSHMK KSQRKIADPS IYRSGEYREI RNLEQVHLVI
GFPSVSYKDD LFYTIQILDS ILGNGMSSRL FQKIREQLGL VYTISSFNSS YSDNGIFSIY
AATDKSNLSQ LLSTIASEVK NIITNLQENE ITRAKGKLTS EILMSRESTT ARAESLGYYY
SHYNRYISKE ELIKKISTIT VTDIQNCINN LLGSNNKITL AAIGQIENLP SYDDIAQMFY
I