Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4201 |
Symbol | pepP |
ID | 6970341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3895041 |
End bp | 3896366 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643387945 |
Product | proline aminopeptidase P II |
Protein accession | YP_002272384 |
Protein GI | 209398591 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000052463 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 84 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAGA TATCCCGGCA AGAGTTTCAG CGTCGCCGTC AGGCCCTGGT GGAGCAAATG CAACCCGGCA GCGCCGCGCT GATTTTTGCT GCACCAGAAG TAACACGTAG CGCCGACAGC GAATACCCCT ATCGTCAGAA CAGTGACTTC TGGTACTTCA CCGGCTTTAA CGAACCGGAA GCGGTGTTGG TGCTGATTAA AAGCGATGAC ACTCATAACC ACAGCGTTCT GTTTAACCGC GTTCGCGACC TGACGGCGGA GATCTGGTTT GGCCGTCGCT TAGGCCAGGA TGCCGCGCCA GAGAAACTGG GCGTTGACCG CGCACTGGCA TTCAGCGAAA TTAATCAGCA ACTTTATCAA CTACTTAATG GTCTGGATGT GGTTTACCAT GCTCAGGGCG AATATGCATA TGCTGATGAA ATCGTGAACA GTGCGCTGGA AAAACTGCGT AAAGGTTCGC GGCAAAATCT CACCGCACCG GCAACGATGA TCGACTGGCG TCCTGTTGTT CATGAAATGC GCCTGTTCAA ATCGCCAGAA GAGATTGCCG TACTCCGCCG CGCGGGAGAA ATCACCGCCA TGGCACATAC ACGGGCGATG GAAAAATGCC GTCCGGGAAT GTTCGAGTAC CATCTGGAAG GCGAAATTCA CCACGAATTT AACCGCCACG GTGCGCGCTA TCCGTCCTAT AACACCATTG TAGGCAGCGG TGAAAACGGC TGCATTTTAC ACTACACCGA AAACGAGTGT GAAATGCGCG ACGGCGACCT GGTGTTGATT GACGCAGGCT GCGAATACAA AGGTTACGCA GGCGATATCA CCCGTACCTT CCCGGTCAAC GGCAAATTCA CCCAGGCCCA GCGCGAAATC TACGACATTG TGCTGGAGTC TCTTGAAACC AGCCTGCGCC TGTATCGTCC GGGAACCTCC ATTCTGGAAG TCACCGGTGA AGTGGTGCGC ATTATGGTTA GCGGCCTGGT AAAACTCGGC ATCCTGAAAG GTGATATTGA TGAACTGATC GCTCAGAACG CCCATCGTCC TTTCTTTATG CATGGCCTTA GCCACTGGTT AGGACTGGAT GTCCATGACG TTGGCGTTTA TGGTCAGGAT CGCTCGCGCA TTCTGGAACC GGGGATGGTA CTGACCGTAG AGCCTGGTCT GTATATCGCG CCGGATGCGG ATGTGCCAGA ACAATATCGC GGTATCGGCA TTCGTATTGA AGACGACATT GTGATTACCG AAACCGGCAA CGAAAACCTC ACCGCCAGCG TGGTGAAAAA GCCGGAAGAA ATCGAAGCGT TGATGGCTGC TGCGAGAAAG CAATGA
|
Protein sequence | MSEISRQEFQ RRRQALVEQM QPGSAALIFA APEVTRSADS EYPYRQNSDF WYFTGFNEPE AVLVLIKSDD THNHSVLFNR VRDLTAEIWF GRRLGQDAAP EKLGVDRALA FSEINQQLYQ LLNGLDVVYH AQGEYAYADE IVNSALEKLR KGSRQNLTAP ATMIDWRPVV HEMRLFKSPE EIAVLRRAGE ITAMAHTRAM EKCRPGMFEY HLEGEIHHEF NRHGARYPSY NTIVGSGENG CILHYTENEC EMRDGDLVLI DAGCEYKGYA GDITRTFPVN GKFTQAQREI YDIVLESLET SLRLYRPGTS ILEVTGEVVR IMVSGLVKLG ILKGDIDELI AQNAHRPFFM HGLSHWLGLD VHDVGVYGQD RSRILEPGMV LTVEPGLYIA PDADVPEQYR GIGIRIEDDI VITETGNENL TASVVKKPEE IEALMAAARK Q
|
| |