Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5286 |
Symbol | pepQ |
ID | 6970191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4930396 |
End bp | 4931727 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643388950 |
Product | proline dipeptidase |
Protein accession | YP_002273364 |
Protein GI | 209398417 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0321906 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.0291317 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATCAC TGGCCTCGCT CTATAAAAAT CATATAGCTA CATTGCAGGA ACGAACTCGC GATGCGCTGA CGCGCTTCAA GCTGGATGCG TTACTTATTC ACTCCGGCGA ACTGTTCAAC GTTTTTCTCG ACGATCATCC CTATCCGTTT AAAGTGAACC CGCAATTCAA AGCGTGGGTG CCGGTAACTC AGGTGCCAAA CTGCTGGCTG CTGGTGGATG GCGTGAATAA GCCGAAACTG TGGTTCTATC TGCCGGTTGA TTACTGGCAC AACGTCGAAC CGCTGCCGAC CTCCTTCTGG ACTGAAGATG TAGAAGTGAT CGCGCTGCCG AAAGCCGATG GCATTGGTAG TCTGTTGCCT GCTGCGCGCG GCAATATCGG TTATATCGGT CCGGTGCCGG AACGTGCGCT GCAACTGGGT ATTGAGGCCA GCAATATCAA CCCGAAAGGG GTTATCGACT ACCTGCATTA CTACCGCTCC TTCAAAACCG AGTACGAGCT GGCCTGTATG CGTGAAGCGC AGAAAATGGC GGTCAACGGT CATCGCGCGG CAGAAGAAGC GTTCCGTTCT GGCATGAGCG AGTTCGATAT CAATATTGCC TATCTGACTG CGACCGGTCA TCGTGATACC GACGTACCTT ACAGCAACAT TGTGGCACTC AACGAACACG CTGCGGTGCT GCATTACACC AAACTGGATC ATCAGGCGTC GGAAGAGATG CGCAGCTTCC TGCTGGATGC CGGGGCCGAA TATAACGGCT ATGCCGCTGA CCTGACCCGT ACCTGGTCGG CAAAAAGTGA CAACGATTAC GCACAGCTGG TGAAAGACGT AAATGATGAA CAACTGGCGC TGATCGCGAC CATGAAAGCT GGCGTTAGCT ATGTGGATTA CCACATCCAG TTCCATCAGC GCATCGCCAA ATTGCTGCGT AAACATCAAA TCATCACCGA TATGAGTGAA GAGGCGATGG TCGAAAACGA TCTTACCGGG CCGTTTATGC CGCATGGTAT CGGCCATCCG CTGGGCCTGC AGGTGCATGA CGTCGCCGGT TTTATGCAGG ATGATAGCGG TACGCACCTC GCGGCACCGG CAAAATATCC GTACCTGCGC TGCACCCGTA TTCTCCAGCC GGGCATGGTG TTAACCATCG AACCGGGTAT CTACTTCATT GAATCGCTGC TGGCACCGTG GCGTGAAGGG CAGTTCAGCA AGCACTTCAA CTGGCAGAAA ATTGAAGCAC TGAAACCGTT CGGCGGCATT CGTATCGAAG ACAACGTGGT GATCCACGAA AATAACGTGG AAAACATGAC CCGGGATCTG AAACTGGCGT GA
|
Protein sequence | MESLASLYKN HIATLQERTR DALTRFKLDA LLIHSGELFN VFLDDHPYPF KVNPQFKAWV PVTQVPNCWL LVDGVNKPKL WFYLPVDYWH NVEPLPTSFW TEDVEVIALP KADGIGSLLP AARGNIGYIG PVPERALQLG IEASNINPKG VIDYLHYYRS FKTEYELACM REAQKMAVNG HRAAEEAFRS GMSEFDINIA YLTATGHRDT DVPYSNIVAL NEHAAVLHYT KLDHQASEEM RSFLLDAGAE YNGYAADLTR TWSAKSDNDY AQLVKDVNDE QLALIATMKA GVSYVDYHIQ FHQRIAKLLR KHQIITDMSE EAMVENDLTG PFMPHGIGHP LGLQVHDVAG FMQDDSGTHL AAPAKYPYLR CTRILQPGMV LTIEPGIYFI ESLLAPWREG QFSKHFNWQK IEALKPFGGI RIEDNVVIHE NNVENMTRDL KLA
|
| |