Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECD_03738 |
Symbol | pepQ |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21(DE3) |
Kingdom | Bacteria |
Replicon accession | CP001509 |
Strand | + |
Start bp | 3937974 |
End bp | 3939305 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | proline dipeptidase |
Protein accession | ACT45531 |
Protein GI | 253979861 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.24078 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATCAC TGGCCTCGCT CTATAAAAAT CATATAGCTA CCTTACAGGA ACGGACTCGC GATGCGCTGG CGCGCTTCAA GCTGGATGCG TTACTTATTC ACTCCGGCGA GCTGTTCAAT GTTTTTCTCG ACGATCATCC CTATCCGTTT AAAGTGAACC CGCAATTCAA AGCGTGGGTG CCGGTAACTC AGGTGCCAAA CTGCTGGTTG CTGGTGGATG GCGTGAACAA GCCGAAACTG TGGTTCTATC TGCCGGTTGA TTACTGGCAC AACGTCGAAC CGCTGCCGAC CTCCTTCTGG ACTGAAGATG TGGAAGTGAT CGCACTGCCG AAAGCCGATG GCATTGGTAG CCTGCTGCCC GCTGCGCGCG GCAATATCGG TTATATCGGT CCGGTGCCGG AGCGTGCGCT GCAACTGGGT ATTGAGGCCA GCAACATCAA CCCGAAAGGG GTTATCGACT ACCTGCATTA CTATCGCTCC TTCAAAACCG AGTACGAACT GGCCTGTATG CGTGAAGCGC AGAAAATGGC GGTCAACGGT CATCGTGCGG CAGAAGAAGC GTTCCGTTCT GGCATGAGCG AGTTCGATAT CAACATTGCC TATCTGACCG CGACCGGTCA TCGTGATACC GACGTACCTT ACAGCAACAT TGTGGCGCTT AACGAACACG CTGCGGTGCT GCATTACACC AAACTGGATC ATCAGGCGCC GGAAGAGATG CGCAGCTTCC TGCTGGATGC CGGGGCCGAA TATAACGGCT ATGCGGCTGA CCTGACCCGT ACCTGGTCGG CAAAAAGTGA CAACGATTAC GCACAGCTGG TGAAAGACGT AAATGATGAA CAACTGGCGC TGATCGCGAC CATGAAAGCT GGCGTTAGCT ATGTGGATTA CCACATCCAG TTCCATCAGC GCATCGCCAA ACTGCTGCGT AAACATCAAA TCATCACCGA TATGAGTGAA GAAGCGATGG TCGAAAACGA TCTCACCGGA CCGTTTATGC CGCACGGTAT CGGCCATCCG CTGGGCCTGC AGGTGCATGA CGTCGCCGGT TTTATGCAGG ATGATAGCGG TACGCACCTC GCGGCACCGG CAAAATATCC GTACCTGCGC TGCACCCGTA TTCTCCAGCC GGGCATGGTG TTAACCATCG AACCGGGTAT CTACTTCATT GAATCGCTAC TGGCACCGTG GCGTGAAGGG CAGTTCAGCA AGCACTTCAA CTGGCAGAAA ATTGAAGCAC TGAAACCGTT CGGCGGCATT CGTATCGAAG ACAACGTGGT GATCCACGAA AACAACGTGG AAAACATGAC CCGGGATCTG AAACTGGCGT GA
|
Protein sequence | MESLASLYKN HIATLQERTR DALARFKLDA LLIHSGELFN VFLDDHPYPF KVNPQFKAWV PVTQVPNCWL LVDGVNKPKL WFYLPVDYWH NVEPLPTSFW TEDVEVIALP KADGIGSLLP AARGNIGYIG PVPERALQLG IEASNINPKG VIDYLHYYRS FKTEYELACM REAQKMAVNG HRAAEEAFRS GMSEFDINIA YLTATGHRDT DVPYSNIVAL NEHAAVLHYT KLDHQAPEEM RSFLLDAGAE YNGYAADLTR TWSAKSDNDY AQLVKDVNDE QLALIATMKA GVSYVDYHIQ FHQRIAKLLR KHQIITDMSE EAMVENDLTG PFMPHGIGHP LGLQVHDVAG FMQDDSGTHL AAPAKYPYLR CTRILQPGMV LTIEPGIYFI ESLLAPWREG QFSKHFNWQK IEALKPFGGI RIEDNVVIHE NNVENMTRDL KLA
|
| |