Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_4163 |
Symbol | |
ID | 6067067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 4599364 |
End bp | 4600695 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641603591 |
Product | proline dipeptidase |
Protein accession | YP_001727087 |
Protein GI | 170022133 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.215567 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000134113 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAATCAC TGGCCTCGCT CTATAAAAAT CATATAGCTA CCTTACAAGA ACGGACTCGC GATGCGCTGG CGCGCTTCAA GCTGGATGCG TTACTTATTC ACTCCGGCGA GCTGTTCAAT GTTTTTCTCG ACGATCATCC CTATCCGTTT AAAGTGAACC CGCAATTCAA AGCGTGGGTG CCGGTAACTC AGGTGCCAAA CTGCTGGTTG CTGGTGGATG GCGTGAACAA GCCGAAACTG TGGTTTTATC TGCCGGTTGA TTACTGGCAC AACGTCGAAC CGCTGCCGAC CTCCTTCTGG ACTGAAGATG TGGAAGTGAT CGCGCTGCCG AAAGCCGATG GCATTGGTAG CCTGCTGCCT GCTGCGCGCG GCAATATCGG TTATATCGGT CCGGTGCCGG AGCGTGCGCT GCAACTGGGT ATTGAGGCCA GCAACATCAA CCCGAAAGGG GTTATCGACT ACCTGCATTA CTATCGCTCC TTCAAAACCG AGTACGAACT GGCCTGTATG CGTGAAGCGC AGAAAATGGC GGTCAACGGT CATCGTGCGG CAGAAGAAGC GTTCCGTTCT GGCATGAGCG AGTTCGATAT CAACATTGCC TATCTGACCG CGACCGGTCA TCGTGATACC GACGTACCTT ACAGCAACAT TGTGGCGCTT AACGAACACG CTGCGGTGCT GCATTACACC AAACTGGATC ATCAGGCGCC GGAAGAGATG CGCAGCTTCC TGCTGGATGC CGGGGCCGAA TATAACGGCT ATGCGGCTGA CCTGACTCGT ACCTGGTCGG CAAAAAGCGA CAACGACTAC GCACAGCTGG TGAAAGACGT AAATGATGAA CAACTGGCGC TGATCGCCAC CATGAAAGCA GGCGTTAGCT ATGTGGATTA CCACATCCAG TTCCATCAGC GCATCGCCAA ACTGCTGCGT AAACATCAAA TCATCACCGA TATGAGTGAA GAAGCGATGG TCGAAAACGA TCTCACCGGA CCGTTTATGC CGCACGGTAT CGGTCATCCG CTGGGCCTGC AGGTGCATGA CGTAGCCGGT TTTATGCAGG ATGATAGCGG TACACACCTC GCGGCACCGG CAAAATATCC GTACCTGCGC TGCACCCGTA TTCTCCAGCC GGGCATGGTG TTAACCATCG AACCGGGTAT CTACTTCATT GAATCGCTAC TGGCACCGTG GCGTGAAGGG CAGTTCAGCA AGCACTTCAA CTGGCAGAAA ATTGAAGCAC TGAAACCGTT CGGCGGCATT CGTATCGAAG ACAACGTGGT GATCCACGAA AACAACGTGG AAAACATGAC CCGGGATCTG AAACTGGCGT GA
|
Protein sequence | MESLASLYKN HIATLQERTR DALARFKLDA LLIHSGELFN VFLDDHPYPF KVNPQFKAWV PVTQVPNCWL LVDGVNKPKL WFYLPVDYWH NVEPLPTSFW TEDVEVIALP KADGIGSLLP AARGNIGYIG PVPERALQLG IEASNINPKG VIDYLHYYRS FKTEYELACM REAQKMAVNG HRAAEEAFRS GMSEFDINIA YLTATGHRDT DVPYSNIVAL NEHAAVLHYT KLDHQAPEEM RSFLLDAGAE YNGYAADLTR TWSAKSDNDY AQLVKDVNDE QLALIATMKA GVSYVDYHIQ FHQRIAKLLR KHQIITDMSE EAMVENDLTG PFMPHGIGHP LGLQVHDVAG FMQDDSGTHL AAPAKYPYLR CTRILQPGMV LTIEPGIYFI ESLLAPWREG QFSKHFNWQK IEALKPFGGI RIEDNVVIHE NNVENMTRDL KLA
|
| |