Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_1052 |
Symbol | |
ID | 3927606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 1080495 |
End bp | 1081910 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637902166 |
Product | serine protease |
Protein accession | YP_507837 |
Protein GI | 88658017 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.72618 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAT TTCTATTACT AATATTAGTA CTAATATTGG CAAATGTTCC TATAGGTAGT TTTGCTGATC AGAAAGCAGA ACAATATGAT CCAAGGTCAG GCTTTTCTAA ATTAATCAAA GAATCTACAC CAGCAGTAGT TAATATCAGT ATAGTACATG ATTTAATACA GGAACAATTT CCTTTAATAA CTCTCGAAGA ACTTTTAAGG AATATATTAG AAGGCAAGCC AGTAAAAAAA GACATACCAC AAGAAGTATT AAGCGCAGGG TCGGGATTTG TTGTAGATGA ATCAGGTATT ATCGTCACCA ACTATCATGT TGTACACAAT GCAAAAGAAG TTTATGTCAC ATTTAGTGAC AACAAGTCAA TTCCTGCTAA GATTTTAGGA GTTGACCCAC AAACAGACCT AGCAGTGTTA AAAGTTGAAG TTAATGAAAA ACTTCCTTAT CTAGAATTTG GAGATTCTGA CAAGACTATG GTTGGAGACT GGGTAGTTGC CATAGGTAAC CCATTTGGTC TTGGTGGTTC TGCAAGTATT GGTATTATAT CTGCACGCGC AAGAGATCTT AACATTGGCA CAGCAACAGA ATTTTTACAA ACTGATGCTG CAATTAATAA GGGTAACTCT GGTGGTCCTC TATTTAATAT AGATGGTAAA GTAATTGGTA TTAACACAGC AATATTATCT ACACAAAAAG GTGGTGGTAA CATAGGAGTT GGATTTGCTA TCCCATCAAA TAATGCTGTT TCTATAATAA AAGTTTTATC CCAAGGGAAA AAAGTAGAAC ATGGTTGGCT CGGTGTAGTG ATGCAACCGA TAACTGAAGA ACTAGTAGAA CCATTACAAC TAAAAGAAGT GGGTGGAGCT TTAATTACTA ATGTAGTCAA AGGTAGTCCA GCAAGTAAAG CAAACTTGCT TCCAGGGGAT ATTATACTTG AGTTTAATGG TACTAAGATT AATTCAATAT CACAACTACA TCAATTAGTA CTAAGATCAG AAGCAGACAA TGAAGTAAAA TTACTTGTGT CACGTAATGG TAGTATTATA AGTATACTAG TCAAAATAGG GAAATTTGAG AATCCTGATA TTTCAGAAAA CGGAATGCCT AAGGATGCTA TCCAATCCCC GGAGTTAGGG TTAACAGTAG GTAGTGTACA ACGCAATAAC ATATATAATA ATATTGAGGA AGAAGAAGCA AAAGGAGTGG TGATATTAGA TATAGATAGT ACAAGTAATG CTTCAACCAG AAATATAAGA AAAGGCGATA TAATATTACA AATCAATCAA TCACCAGTAA ATAATCTCGA AGATTTTAAA AATGTTATGA AAAAAGTACG TAAAAATAAA TCTGTAGCAT TGTTAATCAG TAGGGATAAT ATATCAGCTT TTGTGACTGT CAAGCTGAAG CAGTAG
|
Protein sequence | MKRFLLLILV LILANVPIGS FADQKAEQYD PRSGFSKLIK ESTPAVVNIS IVHDLIQEQF PLITLEELLR NILEGKPVKK DIPQEVLSAG SGFVVDESGI IVTNYHVVHN AKEVYVTFSD NKSIPAKILG VDPQTDLAVL KVEVNEKLPY LEFGDSDKTM VGDWVVAIGN PFGLGGSASI GIISARARDL NIGTATEFLQ TDAAINKGNS GGPLFNIDGK VIGINTAILS TQKGGGNIGV GFAIPSNNAV SIIKVLSQGK KVEHGWLGVV MQPITEELVE PLQLKEVGGA LITNVVKGSP ASKANLLPGD IILEFNGTKI NSISQLHQLV LRSEADNEVK LLVSRNGSII SILVKIGKFE NPDISENGMP KDAIQSPELG LTVGSVQRNN IYNNIEEEEA KGVVILDIDS TSNASTRNIR KGDIILQINQ SPVNNLEDFK NVMKKVRKNK SVALLISRDN ISAFVTVKLK Q
|
| |