Gene Information |
Locus tag | Ecaj_0845 |
Symbol | |
ID | 3617766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia canis str. Jake |
Kingdom | Bacteria |
Replicon accession | NC_007354 |
Strand | + |
Start bp | 1192902 |
End bp | 1194317 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 637698793 |
Product | peptidase S1 |
Protein accession | YP_303474 |
Protein GI | 73667458 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
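The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1192902, 1194317)
print(length, protein_length_aa(length))  # prints: 1416 471
```

Both results match the Gene Length and Protein Length fields listed for Ecaj_0845.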
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAGAT TTTTATTATT ACTGACACTA ATATTAGCAA ACATTCCTAT AAGTAATTTA ATAGCTCAGG GCACAGAACA ATATGACCCT AGGTCAGGTT TTTCTAAACT AATCAAAGAA TCTACACCAG CAGTTGTTAA TGTTAGTATA GTACATGATG TAACAAATGA ACAGTTTCCA CTAATAACTA TTGAAGAATT GTTACGTAGT ATATTAGAAG GTAAACAAAT AAAAAAAGAT GTTCCTCAAG AAATATTAAG TGCAGGCTCA GGGTTTGTAG TAGATGAATC CGGCATAATT GTTACTAATT ATCATGTTGT ACATAATGCA AAAGAGGTAT ATATTACATT CAGTAATAAT AAATCAATTC CCGCTAAAAT TTTAGGAGTA GATCCACAAA CAGACCTAGC AGTATTAAAA GTTGAAGTTA ACGAAAAACT TCCTTATTTA GATTTTGGAG ATTCTGACAC AGCAATGGTG GGAGATTGGG TAGTCGCAAT AGGTAATCCA TTTGGTCTTG GTGGTTCTGC AAGTATTGGT ATCATATCTG CACGAGCAAG AGATCTTAAT ATTGGAACAG CAACAGAGTT CTTACAAACT GATGCAGCTA TTAATAAAGG TAATTCTGGA GGACCACTAT TCAACGTAGA CGGCAAAGTT ATTGGTATTA ACACAGCAAT ATTATCAACA CAAAAAGGAG GAGGCAATAT AGGAGTTGGA TTTGCCATAC CATCAAACAG CGCTGTTCCT ATTATCAAAG TATTATCTCA AGGTAAAAAA GTAGAGCACG GTTGGCTCGG CGTAGTTATG CAACCAATTA CTGAAGAGTT AGTAGAACCA TTTAAATTAA AAGAAGTAAG TGGAGCTTTA ATTACAAACA TCGTTAAAGG CAGCCCAGCA GATAAAGCAA AATTACTTCC AGGTGATATC ATTCTTGAAT TTAATGGTAC TAAAATTAAT TCAATATCAC AATTACATCA ATTAGTGTTA AGATCAGAAG CAAATAATGA AGTGACATTA GTTGTATCAC GCAATGGTAG TATTATAAAT ATATCAGTCA AAATAGGAAA ATTTGAAAAT CCTGATCCTT CAGAAAATGA ACTTCCTAAA GATTCAGTTC AATCACATGA ATTGGGGCTA ACAGTAGGTA ACATAAAACA TAACCAAATC ATGTCTAATG ATACTACAGA AGAAGAAGTT AAAGGAGTAA TGATATTAAA TGTAGACTAT ACAAGCAATG CTTCAACTAA AAATATAAGA AAAGGTGATA TTATATTACA AATCAATCAA TCACCAATTA ACAATCTCGA AGACTTTAAA AATGTTATGA AAAAAGTACG TAAAAATAAA TCTGCAGCAT TACTAATAAG CAGAGACAAT ATATCAATGT TCGTTACAGT CAAGCTAAAA CAGTAA
|
Protein sequence | MRRFLLLLTL ILANIPISNL IAQGTEQYDP RSGFSKLIKE STPAVVNVSI VHDVTNEQFP LITIEELLRS ILEGKQIKKD VPQEILSAGS GFVVDESGII VTNYHVVHNA KEVYITFSNN KSIPAKILGV DPQTDLAVLK VEVNEKLPYL DFGDSDTAMV GDWVVAIGNP FGLGGSASIG IISARARDLN IGTATEFLQT DAAINKGNSG GPLFNVDGKV IGINTAILST QKGGGNIGVG FAIPSNSAVP IIKVLSQGKK VEHGWLGVVM QPITEELVEP FKLKEVSGAL ITNIVKGSPA DKAKLLPGDI ILEFNGTKIN SISQLHQLVL RSEANNEVTL VVSRNGSIIN ISVKIGKFEN PDPSENELPK DSVQSHELGL TVGNIKHNQI MSNDTTEEEV KGVMILNVDY TSNASTKNIR KGDIILQINQ SPINNLEDFK NVMKKVRKNK SAALLISRDN ISMFVTVKLK Q
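The sequences above are displayed in space-separated groups of ten characters. A minimal sketch of how such a field might be cleaned up before further analysis follows: it removes the grouping whitespace, computes the GC fraction, and checks for a terminal stop codon. The helper names are hypothetical, and the stop codons used are those of translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def normalize(seq_field: str) -> str:
    """Remove the display grouping (spaces, line breaks) and upper-case."""
    return "".join(seq_field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def ends_with_stop(dna: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


# First two display groups of the gene sequence, used here as a small example.
fragment = normalize("ATGAGAAGAT TTTTATTATT")
print(fragment, gc_fraction(fragment))
```

Applied to the full Gene sequence field, `gc_fraction` would be expected to land near the 32% reported in the GC content field, and `ends_with_stop` should hold because the sequence ends in TAA.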
|
| |