Gene Information |
Locus tag | ECH_1058 |
Symbol | |
ID | 3927992 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 1085558 |
End bp | 1086913 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 637902172 |
Product | M16 family peptidase |
Protein accession | YP_507843 |
Protein GI | 88658196 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
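The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical and chosen for illustration only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
gene_length = span_length(1085558, 1086913)
print(gene_length)                           # 1356
print(expected_protein_length(gene_length))  # 451
```

Both results agree with the Gene Length (1356 bp) and Protein Length (451 aa) fields listed above.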
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAATA CATTATTCTA CATAATAACA TTGATTTTTT TTGCATATAA TGCATATGCA GATGATATTA ATATCAACAT AAAGGAAGCC ACTATTAATA ATAACATACG CTACTTATAT GTTGAACATC ACGACTTACC AACAATTTCT TTAACACTTG CATTCAAAAA AGCAGGATAT GCATATGATG CTTCTGACAA ACAAGGACTT GCACATTTCA CATCACAAAT ATTACAAGAA GGATCAGAAA GTAATCATGC TCTAGAATTT GCAAAACAAT TAGAAGGTAA AGGTATAGAC TTAAAATTTC ATGTAGACAT AGATAATTTC TATATATCTA TAAAAACACT ATCAGAAAAC TTTGAAGAAG CTTTAACTTT ATTAAGTGAT TGCTTATTCA ATCCAGTAAC TGACCCAGAA ATATTTCATA GGGTAATAGC AGAACAAAGT GCACATGTAA AATCTTTATA TGGATCTCCT AAATTCATAG CAGCAACTGA AATTAATCAT GCCATATTTA AAGGACACCC ATACTCTAAT AAAATCTATG GTACATTAAA TACTATTAAC AACATTACCC AAGAAGATGT ATCATCATAC ATAAAAAACA GTTTTGATAA GGACCAAATC GTCATTAGTG CAGCAGGAGA TATAGATTCA GCAAAACTAT CGAATTTATT AGATAAATAT ATCCTATCAA AATTACCGTC TGGTAATAAC AAAAATACTA TACCAGATGC CACCGTTAAC AGAGAACAGA AACTCTTATA TGTAAGAAGA AATGTACCAC AAAGTGTCAT AATGTTTGCT ACAGACACAG TATCATACAA TGACGAGGAT TATTATGCAT CCAACTTATT CAACAATATG CTAGGAGGAT TAAGCCTTAA TTCAATATTG ATGATAGAGT TACGAGACAA ATTAGGACTA ACTTACCATG CTAGTAGCAT GCTAGATAAT ATGAATCATA GTAACGTGTT ACTCGGCATA ATAACTACTG ATAATACTAC AGTAACAAAA TGCATATCTG TATTAAAAGA AATTATAGAA AATATTAAAA ATAATGGAAT TAATCAGGAA ACTTTTTTAA CTGCAAAATC TAGTATTACT AATTCTTTCA TTTTATCAAT GTTAAACAAT GATAATGTTG CAAATACATT ATTAAACCTA CAATTACGCG GTCTAGATCC AAGTTATATA AACAAACATA ATTCCTACTA TAAAACCCTC ACAATAGAAG AAGTAACCAA AGTTGCTAGG AAAATCTTAT CTAATGATTT AGTAATAATT GAAGTGGGAA AAAACAATAA TATAAACGGT AAACAGATAG AAGCTAAAGA AAACATACTT GGCTAA
|
Protein sequence | MRNTLFYIIT LIFFAYNAYA DDININIKEA TINNNIRYLY VEHHDLPTIS LTLAFKKAGY AYDASDKQGL AHFTSQILQE GSESNHALEF AKQLEGKGID LKFHVDIDNF YISIKTLSEN FEEALTLLSD CLFNPVTDPE IFHRVIAEQS AHVKSLYGSP KFIAATEINH AIFKGHPYSN KIYGTLNTIN NITQEDVSSY IKNSFDKDQI VISAAGDIDS AKLSNLLDKY ILSKLPSGNN KNTIPDATVN REQKLLYVRR NVPQSVIMFA TDTVSYNDED YYASNLFNNM LGGLSLNSIL MIELRDKLGL TYHASSMLDN MNHSNVLLGI ITTDNTTVTK CISVLKEIIE NIKNNGINQE TFLTAKSSIT NSFILSMLNN DNVANTLLNL QLRGLDPSYI NKHNSYYKTL TIEEVTKVAR KILSNDLVII EVGKNNNING KQIEAKENIL G
|
| |
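The gene and protein sequences above can be cross-checked with a short script. This is a minimal sketch rather than the database's own method: it assumes the sequences are pasted as plain strings (the space-separated blocks of ten are stripped), and it relies on the fact that translation table 11 uses the same amino-acid assignments as the standard genetic code, differing only in the set of permitted start codons. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. the blocks of ten) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Note: an alternative start codon (e.g. GTG or TTG) is not converted to M
    here; this record starts with ATG, so that case does not arise.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last codons of the gene sequence in this record.
print(translate("ATGAGGAATA CATTATTCTA C"))  # MRNTLFY
print(translate("GAAAACATAC TTGGCTAA"))      # ENILG
```

Passing the full Gene sequence string to `translate` and comparing the result with the Protein sequence field (after applying `clean` to both) gives a quick consistency check, and `gc_percent` on the same string can be compared with the GC content field.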