Gene Information |
Locus tag | EcSMS35_3042 |
Symbol | pepP |
ID | 6145089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3131293 |
End bp | 3132618 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617911 |
Product | proline aminopeptidase P II |
Protein accession | YP_001745062 |
Protein GI | 170680916 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
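The gene and protein lengths in the record follow from the start and end coordinates. The sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed but are not stated in the record. The function names are hypothetical, chosen for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the final codon is the stop and encodes no residue.
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(3131293, 3132618)
print(length, protein_length_aa(length))
```

With the record's coordinates this gives 1326 bp and 441 aa, matching the Gene Length and Protein Length fields.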
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000751227 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGTGATA TATCCCGGCA AGAGTTTCAG CGTCGCCGTC AGGCCCTGGT GGAGCAAATG CAACCCGGCA GCGCCGCGCT GATTTTTGCT GCACCAGAAG TAACACGTAG CGCCGACAGC GAATACCCCT ATCGTCAGAA CAGTGACTTC TGGTACTTCA CCGGCTTTAA TGAACCGGAA GCGGTGCTGG TGCTGATTAA AAGCGATGAC ACTCATAACC ACAGCGTTTT GTTTAACCGC GTTCGCGACC TGACGGCAGA GATCTGGTTT GGTCGTCGCT TAGGCCAGGA TGCCGCGCCA GAGAAACTGG GCGTTGACCG CGCACTGGCA TTCAGTGAAA TCAATCAGCA ACTTTATCAA CTACTTAACG GCCTGGATGT GGTTTACCAT GCTCAGGGCG AATATGCATA TGCTGATGAA ATCGTGAACA GTGCGCTGGA AAAACTGCGT AAAGGCTCGC GACAAAATCT CACCGCCCCG GCAACGATGA TCGACTGGCG TCCCATGGTA CATGAAATGC GCCTGTTCAA ATCACCAGAA GAGATTGCCG TACTTCGCCG CGCGGGGGAA ATCACCGCTC TGGCGCATAC CCGGGCGATG GAAAAATGCC GTCCGGGAAT GTTCGAGTAC CATCTGGAAG GCGAAATTCA CCACGAATTT AACCGCCACG GTGCGCGCTA TCCGTCCTAC AACACCATTG TCGGCAGCGG TGAAAACGGC TGTATTCTGC ACTACACCGA AAACGAGTGT GAACTGCGTG ACGGTGACCT GGTGTTGATT GACGCGGGCT GTGAATACAA AGGTTACGCG GGCGATATTA CCCGTACCTT CCCGGTCAAC GGCAAATTCA CCCAGGCCCA GCGTGAAATC TACGACATTG TGCTGGAGTC TCTTGAAACC AGCCTGCGCC TGTATCGTCC GGGAACCTCC ATTCTGGAAG TCACCAGTGA AGTGGTGCGT ATCATGGTTA GCGGCCTGGT AAAACTCGGC ATCCTGAAAG GTGATGTTGA TGAACTGATC GCTCAGAACG CCCATCGTCC TTTCTTTATG CATGGCCTTA GCCACTGGTT AGGACTGGAT GTCCATGACG TTGGCGTTTA TGGTCAGGAT CGCTCGCGCA TTCTGGAACC GGGGATGGTA CTGACCGTAG AGCCTGGTCT GTATATCGCG CCGGATGCGG ATGTGCCAGA ACAATATCGC GGTATCGGCA TTCGTATTGA AGACGACATT GTGATTACCG AAACCGGCAA CGAAAACCTC ACCGCCAGCG TGGTGAAAAA GCCGGAAGAA ATCGAAGCGT TGATGGCTGC TGCGAGAAAG CAATGA
|
Protein sequence | MSDISRQEFQ RRRQALVEQM QPGSAALIFA APEVTRSADS EYPYRQNSDF WYFTGFNEPE AVLVLIKSDD THNHSVLFNR VRDLTAEIWF GRRLGQDAAP EKLGVDRALA FSEINQQLYQ LLNGLDVVYH AQGEYAYADE IVNSALEKLR KGSRQNLTAP ATMIDWRPMV HEMRLFKSPE EIAVLRRAGE ITALAHTRAM EKCRPGMFEY HLEGEIHHEF NRHGARYPSY NTIVGSGENG CILHYTENEC ELRDGDLVLI DAGCEYKGYA GDITRTFPVN GKFTQAQREI YDIVLESLET SLRLYRPGTS ILEVTSEVVR IMVSGLVKLG ILKGDVDELI AQNAHRPFFM HGLSHWLGLD VHDVGVYGQD RSRILEPGMV LTVEPGLYIA PDADVPEQYR GIGIRIEDDI VITETGNENL TASVVKKPEE IEALMAAARK Q
|
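The sequences above are printed in space-separated blocks of 10 characters, so they need normalizing before any downstream use. The following minimal sketch joins the blocks and computes a GC fraction. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not the gene-wide GC content reported in the record.

```python
def normalize_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used as a small example.
first_blocks = "ATGAGTGATA TATCCCGGCA"
seq = normalize_sequence(first_blocks)
print(seq, len(seq), gc_fraction(seq))
```

Passing the full gene sequence through the same two helpers should give a value comparable to the GC content field, subject to how the database rounds it.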