Gene Information |
Locus tag | ECH_0369 |
Symbol | pepA |
ID | 3928040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 361011 |
End bp | 362513 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 637901493 |
Product | leucyl aminopeptidase |
Protein accession | YP_507189 |
Protein GI | 88658513 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
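
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length (the function names are illustrative, not part of the record):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(361011, 362513)
print(length_bp)                  # 1503
print(protein_length(length_bp))  # 500
```

Both results agree with the Gene Length (1503 bp) and Protein Length (500 aa) rows.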
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGATA TATCGTTTTC AAGTTTAATG CCAGGGGTAT CTTTGTTTTT AAAGACAACA GCAATAGTTG TGGGTATCTT TGAAGGTAGT AATCACTTGG AAGACTATAG TGCCTTAGCT GAGCGTAGTG AGCAAATTAT GAAGGTGGTA GAAGGTTACA AGTCTTTTGA TGGTAAGTTT GCAGAAGTAT TGCCAATTAC TGGATTAGAT GCAGGGTATC CTATAGTAAT AGTAGTAGGG TTGGGTAAAC CTGAAGAGTT TGATGAAAAC AAATCTTTAA GGATTGGTGG TGTTATATAT TCTGAACTTA ATAGGATGAA AATATCAGAG GCCTCAATTA TAAGTAGTAA TGATAGTGAT ATTATGGCTA ATGTTGCCTA TGGGGCATTT TTACGCAGTT TTAAGTTTGA TAAATATTTT GTTCAGAAAA AAGATGAAAA TGCAACTTAT GTGCGTAAGT TAGAGTTCTT TTCAAAAACT AATCCTCAAA AAACAGCTGT TTTGTTTGAT AATTTAAAAG CCGAAGGTGA GTCAGTATTT TTAGCTCGTT CTTTTGTTTC AGAACCTCCT AATATTCTTT ATCCAGAAAT TTATGCTCAG ATGATATATG AGGAATTAAG TAAAGTAGAT GTTAAAGTTG AAATATTTGA TGAAGATTAT ATGAAAGCAA ATCAGATGAT GGCACTTCTA GGTGTAGGAC AAGGTAGTGC TAAGAAATCT AGGCTTGTGG TTATGAGGTG GAATGGAGGA AAAGAAACAG ACAGTCCGAT AGCATTTGTT GGAAAAGGAG TGACGTTTGA TACTGGTGGA ATATCTTTAA AACCTTCTAG GGGTATGTGG GATATGAAAT ATGATATGGC TGGTTCTGCG TCTGTTGTAG GTATTATGCG GACTCTTGCA GCAAGAAAAG CAAAAGTTAA TGCTGTAGGT GTAGTAGGGT TAGTTGAAAA TGCTGTAGGA GGAAATGCGC AAAGACCAAG TGATGTTGTA ACTTCAATGT CTGGGCAAAC TATTGAAGTA TTGAATACTG ATGCAGAAGG TAGACTAGTT TTGGCTGATG CTTTATGGTA TACTCAGAAA ATGTTTTCTC CAAAATTAAT GATAGATTTA GCAACATTAA CTGGTGCAGT GGTTGTAGCA CTAGGAAATA ATCAATATGG TGGGATTTTT TCAAACGATG ATGCAATTGC CAATCAATTA ATTGTTGCTG GTAATGAGTC CGGTGAGAAA TTGTGGAGAT TACCTTTAGA TGATGCATAT GATAAACTTA TAGATTCTTC GATTGCTGAT GTGCAAAATA TTTCAACAAA AGGGTATGGT GCAGATAGCA TTACTGCTGC ACAATTTTTA CAGAGATTTG TAAATAAGAC TCCTTGGGTG CATCTGGATA TTGCTGGAAT GGCGTGGGAT AATGAAGGTA ATGAAATATG CCCTAAAGGT GCAACTGGGT TTGGTGTAAG GTTACTGAAT AGGTTGATTT TAAAATACTA TGAGGCTAAT TAA
|
Protein sequence | MIDISFSSLM PGVSLFLKTT AIVVGIFEGS NHLEDYSALA ERSEQIMKVV EGYKSFDGKF AEVLPITGLD AGYPIVIVVG LGKPEEFDEN KSLRIGGVIY SELNRMKISE ASIISSNDSD IMANVAYGAF LRSFKFDKYF VQKKDENATY VRKLEFFSKT NPQKTAVLFD NLKAEGESVF LARSFVSEPP NILYPEIYAQ MIYEELSKVD VKVEIFDEDY MKANQMMALL GVGQGSAKKS RLVVMRWNGG KETDSPIAFV GKGVTFDTGG ISLKPSRGMW DMKYDMAGSA SVVGIMRTLA ARKAKVNAVG VVGLVENAVG GNAQRPSDVV TSMSGQTIEV LNTDAEGRLV LADALWYTQK MFSPKLMIDL ATLTGAVVVA LGNNQYGGIF SNDDAIANQL IVAGNESGEK LWRLPLDDAY DKLIDSSIAD VQNISTKGYG ADSITAAQFL QRFVNKTPWV HLDIAGMAWD NEGNEICPKG ATGFGVRLLN RLILKYYEAN
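
The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch, not tied to any particular tool, of how the gene sequence could be cleaned, checked for GC content, and translated. It assumes the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons); the helper names `clean`, `gc_fraction`, and `translate` are hypothetical:

```python
from itertools import product

# Standard codon assignments, codons enumerated in T, C, A, G order.
# Assumption: translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
print(translate("ATGATAGATA TATCGTTTTC AAGTTTAATG"))  # MIDISFSSLM
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence rows to be checked in the same way.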