Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4228 |
Symbol | pepQ |
ID | 6143415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4326207 |
End bp | 4327538 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641619051 |
Product | proline dipeptidase |
Protein accession | YP_001746179 |
Protein GI | 170683404 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000144079 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.0133545 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATCAC TGGCCTCGCT CTATAAAAAT CATATAGCTA CCTTACAGGA ACGGACTCGC GATGCGCTGG CGCGCTTCAA GCTGGATGCG TTACTTATTC ACTCCGGCGA ACTGTTCAAC GTTTTTCTCG ACGATCATCC CTATCCGTTT AAAGTGAACC CGCAATTCAA AGCGTGGGTG CCGGTAACTC AGGTGCCAAA CTGCTGGCTG CTGGTGGATG GCGTGAACAA GCCGAAACTG TGGTTTTATC TGCCGGTTGA TTACTGGCAC AACGTCGAAC CGCTGCCGAA CTCCTTCTGG ACTGAAGATG TGGAAGTGAT CGCGCTGCCG AAAGCCGATG GCATTGGTAG CCTGCTGCCT GCTGCGCGCG GCAATATCGG TTATATCGGT CCGGTGCCGG AACGTGCGCT GCAACTGGGT ATTGAGGCCA GCAACATCAA TCCGAAAGGG GTGATCGACT ACCTGCATTA CTACCGCTCC TTCAAAACCG AGTACGAACT GGCCTGTATG CGTGAAGCGC AGAAAATGGC GGTCAACGGT CATCGCGCGG CAGAAGAAGC GTTCCGTTCT GGCATGAGCG AGTTTGATAT CAATATTGCC TATCTGACCG CGACCGGTCA TCGTGATACC GACGTACCTT ACAGCAACAT TGTGGCGCTT AACGAACACG CTGCGGTGCT GCATTACACC AAATTGGACC ACCAGGCACC GGAAGAGATG CGCAGCTTCC TGCTGGATGC CGGGGCCGAA TATAACGGCT ATGCGGCTGA CCTGACCCGT ACCTGGTCGG CAAAAAGCGA CAACGACTAC GCACAGCTGG TGAAAGACGT AAATGATGAA CAACTTGCGC TGATCGCCAC CATGAAAGCT GGCGTCAGCT ATGTGGATTA CCACATCCAG TTCCATCAGC GCATCGCCAA ATTGCTGCGT AAACATCAAA TCATCACCGA TATGAGTGAA GAAGCGATGG TCGAAAACGA TCTCACCGGA CCGTTTATGC CGCACGGTAT CGGTCATCCG CTGGGCCTGC AGGTGCATGA CGTAGCTGGC TTTATGCAAG ATGATAGCGG TACGCACCTC GCGGCACCGG CAAAATATCC GTACCTGCGC TGCACCCGTA TTCTCCAGCC AGGCATGGTG TTAACCATCG AACCGGGTAT CTACTTCATC GAATCGCTGC TGGCGCCGTG GCGTGAAGGG CAGTTCAGCA AACACTTCAA CTGGCAGAAA ATTGAAGCAC TGAAACCGTT CGGCGGCATT CGTATCGAAG ACAACGTGGT GATCCACGAA AACAACGTGG AAAACATGAC CCGGGATCTG AAACTGGCGT GA
|
Protein sequence | MESLASLYKN HIATLQERTR DALARFKLDA LLIHSGELFN VFLDDHPYPF KVNPQFKAWV PVTQVPNCWL LVDGVNKPKL WFYLPVDYWH NVEPLPNSFW TEDVEVIALP KADGIGSLLP AARGNIGYIG PVPERALQLG IEASNINPKG VIDYLHYYRS FKTEYELACM REAQKMAVNG HRAAEEAFRS GMSEFDINIA YLTATGHRDT DVPYSNIVAL NEHAAVLHYT KLDHQAPEEM RSFLLDAGAE YNGYAADLTR TWSAKSDNDY AQLVKDVNDE QLALIATMKA GVSYVDYHIQ FHQRIAKLLR KHQIITDMSE EAMVENDLTG PFMPHGIGHP LGLQVHDVAG FMQDDSGTHL AAPAKYPYLR CTRILQPGMV LTIEPGIYFI ESLLAPWREG QFSKHFNWQK IEALKPFGGI RIEDNVVIHE NNVENMTRDL KLA
|
| |