Gene Information |
Locus tag | ECH_0347 |
Symbol | |
ID | 3926978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 336854 |
End bp | 338161 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 637901471 |
Product | M48 family peptidase |
Protein accession | YP_507167 |
Protein GI | 88658252 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
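
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and are not part of the database record.

```python
# Hypothetical helpers; the numbers come from the Gene Information table.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(336854, 338161)
print(length_bp, protein_length(length_bp))  # prints: 1308 435
```

Both values match the Gene Length (1308 bp) and Protein Length (435 aa) fields of this record.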
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATTA AAAGCGTGAT ATTATTTCTA AGTATTTTGC TTTATAGCAA TGTATCATTT TGTCAAGATG GTTACATGAT ATTTAGAGAT AGTGAAGTCG AGGCTGTAAT AAAGAAAATA GCATTTCCAA TATTTATTGC AGCTAAAATT AATCCTGAGA CTGTTAGAGT GTTTATTGTT AATGATAAAA TGGTAAATGC TTATGTTGAT GGTAATAATA ATGATGTGTT TTTGAATTAT GGGTTATTTG AGTTTTCAAA TGATCCTAGT GTACTTATTG GGGTTTTAGC CCATGAAGTT GGTCATATAT CTCAAAAACA TGTGTTATTC CGTAGAAGTA AAGTACAAAA TTCTATGATT TTGTCTGGGA TAGGATATGT TCTAGGTATT ATTACTGCAA TTACAGTAAA TCCTGATATG GGACAGGCAA TAGCACTTGC TACTAATGAT ATTAGTAAAA AAATGTTTTT TCTTTATAGT CGTTTACAGG AGGCGTCTGC AGATCAATGT GCATTAAGAT ATTTAGATGA AGCTGGGTAT AGCAACGATG GATTAATTAA AATGTTTAAG CATTTTTATT CACTGGAAGC ACAATATCGA GGAAATATTG ATCAATACTT ATTATCGCAT CCTCTTAGTT ATGATAGGCT GTTGCAAATA CAAAATTATC GCAATCGTAA TGAGGTTCAT GGTTTTTCTG ATGAAGATGT ACAGAAATTT AAGCGAGTAG TAGAAAAAAT TAATGCGTTT TTTAACCCAG TAGAACGTTT GGTTAATGAT AAAAATGATA TAAATCAATT ATCTCCATAC ATACAATCTA TTATTTTTTA TAAGCAATCT GATGTTTCAA AAGCCTTAGA AAAACTTGAT AATCTAATAC TACAATCCCC TGAAGATCCT TATCTTTATG AGCTGAAAGC ACAAATTTTG TATAAGGCAG GTGACATTAA AAAGTCTGTA GAAAATTATA AATTAGCGCT TAAGTTTTCT TTCGATGATG TTTTAATAAA ACTTGAAACA TCACAAGCTT TGTTATTGTA TGATCAGAAG GAAGCAGTAA ATTATTTGGA ACAAGTGACA TACCAAGAAC CAGATAATGT TTTTGCTTGG AAGCAATTGG CTGTAGCTTA TGGTAAAATA GGGGATTTGG GAATGTCGTA TTTTTCACTG GCAAATAAAT CTTTTTTTGA AAATAATAGA AGAGATTTTG ATAAATACTT TAGCTTAGCA AGAAAGTATT TACCAAAAGA TAGCGTACAC TTAGAACGTA TGCGTGATCT AAGGATAAAT TTATTAAGTA ATACATAA
|
Protein sequence | MNIKSVILFL SILLYSNVSF CQDGYMIFRD SEVEAVIKKI AFPIFIAAKI NPETVRVFIV NDKMVNAYVD GNNNDVFLNY GLFEFSNDPS VLIGVLAHEV GHISQKHVLF RRSKVQNSMI LSGIGYVLGI ITAITVNPDM GQAIALATND ISKKMFFLYS RLQEASADQC ALRYLDEAGY SNDGLIKMFK HFYSLEAQYR GNIDQYLLSH PLSYDRLLQI QNYRNRNEVH GFSDEDVQKF KRVVEKINAF FNPVERLVND KNDINQLSPY IQSIIFYKQS DVSKALEKLD NLILQSPEDP YLYELKAQIL YKAGDIKKSV ENYKLALKFS FDDVLIKLET SQALLLYDQK EAVNYLEQVT YQEPDNVFAW KQLAVAYGKI GDLGMSYFSL ANKSFFENNR RDFDKYFSLA RKYLPKDSVH LERMRDLRIN LLSNT
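
The sequences above are printed in blocks of ten characters separated by spaces. As a minimal sketch, the code below strips that spacing, computes a GC fraction, and translates the first 30 bases of the gene sequence. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ in which codons may act as start codons, and this gene starts with ATG). All names are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "ATGAATATTA AAAGCGTGAT ATTATTTCTA"
print(translate(prefix))  # prints: MNIKSVILFL
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the GC content and Protein sequence fields.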
|
| |