Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_1057 |
Symbol | |
ID | 3927287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 1084236 |
End bp | 1085555 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 637902171 |
Product | M16 family peptidase |
Protein accession | YP_507842 |
Protein GI | 88658542 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTAAAT TTTTTACTTG TTTTTTTACA ATCTTCTTCA CAATAGCTAA TCATGCTTTA TCTTTTAACA TTAAAGTTAC ACATGAAAAG CTAGATAATG GCATGGAAGT ATATGTTATC CCTAATCATC GCGCACCCGC AGTTATGCAT ATGGTATTAT ATAAAGTTGG TGGGACTGAT GATCCAGTAG GTTATTCTGG ACTTGCACAT TTTTTTGAAC ATTTAATGTT CAGTGGTACA GAAAAATTCC CTAATCTTAT AACTACTCTA AGTGATATAG GCGGAAACTT CAATGCAAGT ACATCTGAAT TTTGCACTAT ATATTATGAA CTAATACCAA AACAACATTT ATCTCTTGCA ATGGATATTG AATCAGACAG AATGCAAAAT TTTAAAATTA CTGATAAGGC ATTAATAAGA GAGCAAAAGG TAGTATTAGA AGAAAGAAAA ATGAGAGTTG AAAGTCAGGC AAAAAACATA CTGCAAGAAG AGATGGAAAA CACATTCTAT TATAATGGAT ATGGTAGACC AGTAGTAGGG TGGGAACATG AAATCAGCAA TTACAATAGA GAAGTTGCTG AAGCATTTTA TAAACTTCAC TACAGCCCTA ACAACGCTAT ATTAGTTGTA ACTGGAGATG TCGATCCACA GGAAACAATC AACCTTGCAC AACAGTACTA TGGGAAGATA GAACCTAATC ACAAAAAATC CACACGTGTT TTTAGAGCAG AACCTTCACA CAAAGCAAAC ATTACATTAA CATTAGAAGA TAGTTCAGTA GAAATCCCAG AATTATTTTT AATGTATCAA ATACCAAGCG GTATCGCAAA TAAAAACTAT ATACTCAATA TGATGGCAGC AGAAATACTT GGTAACGGTA AATTCAGTTT GCTTTACAAT GATCTAGTAA TGAATAACTC AATAGTCACA TCAATAGGCA CCAATTATAA CTATTTAACT GATAGTGATA ACTACCTCTT TATAGAAGCC GTACCTAAAG ATGGGATCTC TACAGAAACT GTAGAAAAAG AAATCCACAA ATGTATAAAT AGTTATCTTG AAAATGGCAT TTCACCAGAA TATTTAGAAA GTGCAAAACA AAAAGTAAAA GCACACTTAA CTTATTCTCT TGATGGATTA AGCTTTATAT CATATTTCTA CGGCATGAAC TTAATTCTAG GAGTACCATT ATCAGAAATT AACAATATTT ACGATACAAT AGATAAAATA AAGATTGAGG ATATTGATTC CACTATGGAA AACATCTTCT TAAAGAACGT AAGATTAGCT GGACATTTAT TACCTAAATT GGGAGAATAG
|
Protein sequence | MVKFFTCFFT IFFTIANHAL SFNIKVTHEK LDNGMEVYVI PNHRAPAVMH MVLYKVGGTD DPVGYSGLAH FFEHLMFSGT EKFPNLITTL SDIGGNFNAS TSEFCTIYYE LIPKQHLSLA MDIESDRMQN FKITDKALIR EQKVVLEERK MRVESQAKNI LQEEMENTFY YNGYGRPVVG WEHEISNYNR EVAEAFYKLH YSPNNAILVV TGDVDPQETI NLAQQYYGKI EPNHKKSTRV FRAEPSHKAN ITLTLEDSSV EIPELFLMYQ IPSGIANKNY ILNMMAAEIL GNGKFSLLYN DLVMNNSIVT SIGTNYNYLT DSDNYLFIEA VPKDGISTET VEKEIHKCIN SYLENGISPE YLESAKQKVK AHLTYSLDGL SFISYFYGMN LILGVPLSEI NNIYDTIDKI KIEDIDSTME NIFLKNVRLA GHLLPKLGE
|
| |