Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0235 |
Symbol | |
ID | 3927482 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | - |
Start bp | 221811 |
End bp | 223076 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 637901359 |
Product | M16 family peptidase |
Protein accession | YP_507056 |
Protein GI | 88657608 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACCAA AAATTACACA ACTTAGCAAC AATTTCACTA TAATAACTGA CACAATGCCA TATGTAGAAT CCGTATCTAT CAACATTTGG GTAAACGTCG GAAGTAGGTA TGAGAATATA AACATAACAG GTATCTCTCA TTTTTTAGAA CACATGGCTT TTAAAGGCAC TAAAACTCGC ACTGCACTTG ATATAGCACA AATTTTTGAT GATATAGGTG GAAATTTTAA CGCTCACACA GACAGAGAAC ATACTGTTTA CCATGTAAAA ACACTAAAAA GAGACATTAA AATAGCTATA GAAGTACTTG CAGACATAAT ACTAAATTCA CAATTTCCGG AAGAAGAAAT ATACAAAGAA AAAGGAGTAG TGTTACAAGA GATATATCAA ACAAATGATT CTCCTACTAG TATAATTTTT GATAAGTATA TAGAAGCTGC GTATCCTAAT CAAATATTTG GTAAATCCAT TTTAGGTACC CCAGAATCAG TAAATAGCCT ATCTAAAGCA GATTTACACA TCTACATGAG TGAATATTAT CACGCTGGCA ACATGTTACT ATCAGTAGCT GGAAACATAT CACATGAAGA AGTCATTGAT TTAGTATCTC AGTATTTTTC TCATATGAAA AAATCACAAC GTAAAATAGC AGATCCATCA ATTTATCGCA GCGGAGAATA TAGAGAAATA AGAAACTTAG AACAAGTACA TCTTGTCATA GGATTCCCTA GTGTTTCATA TAAAGATGAC TTGTTTTATA CTATACAAAT TTTAGATTCA ATCTTAGGAA ATGGCATGTC ATCACGTCTT TTCCAAAAAA TCCGTGAACA ATTAGGATTA GTCTATACTA TTTCATCTTT CAACTCAAGT TACAGTGATA ACGGCATTTT CTCTATATAT GCAGCAACAG ATAAAAGTAA TTTAAGTCAA TTACTTTCCA CTATAGCTTC TGAAGTAAAA AATATCATAA CAAACTTACA AGAAAACGAG ATAACAAGAG CAAAAGGTAA ATTAACATCT GAAATATTAA TGTCAAGAGA AAGCACTACT GCACGCGCTG AATCCTTAGG GTACTATTAT TCCCATTACA ATCGGTACAT TTCAAAAGAA GAATTAATAA AGAAAATATC TACAATTACA GTCACAGACA TTCAGAACTG TATTAATAAT CTACTAGGTA GCAACAACAA AATAACCTTA GCAGCTATAG GTCAAATTGA AAACCTACCT TCTTATGATG ATATAGCCCA AATGTTTTAC ATATAA
|
Protein sequence | MSPKITQLSN NFTIITDTMP YVESVSINIW VNVGSRYENI NITGISHFLE HMAFKGTKTR TALDIAQIFD DIGGNFNAHT DREHTVYHVK TLKRDIKIAI EVLADIILNS QFPEEEIYKE KGVVLQEIYQ TNDSPTSIIF DKYIEAAYPN QIFGKSILGT PESVNSLSKA DLHIYMSEYY HAGNMLLSVA GNISHEEVID LVSQYFSHMK KSQRKIADPS IYRSGEYREI RNLEQVHLVI GFPSVSYKDD LFYTIQILDS ILGNGMSSRL FQKIREQLGL VYTISSFNSS YSDNGIFSIY AATDKSNLSQ LLSTIASEVK NIITNLQENE ITRAKGKLTS EILMSRESTT ARAESLGYYY SHYNRYISKE ELIKKISTIT VTDIQNCINN LLGSNNKITL AAIGQIENLP SYDDIAQMFY I
|
| |