Gene Information |
Locus tag | SbBS512_E4317 |
Symbol | pepQ |
ID | 6273273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 4035752 |
End bp | 4037083 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641728127 |
Product | proline dipeptidase |
Protein accession | YP_001882547 |
Protein GI | 187732539 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
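The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not say so explicitly, but the listed Gene Length only works out under that convention) and that the final codon is a stop codon that is not counted in Protein Length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4035752
end_bp = 4037083

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# the stop codon, which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1332, matches "Gene Length | 1332 bp"
print(protein_length)  # 443, matches "Protein Length | 443 aa"
```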
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00026447 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATCAC TGGCCTCGCT CTATAAAAAT CATATAGCTA CCTTACAAGA ACGGACTCGC GATGCGCTGG CGCGCTTCAA GCTGGATGCA TTACTTATTC ACTCCGGCGA GCTGTTCAAC GTTTTTCTCG ACGATCATCC CTATCCGTTT AAAGTGAACC CGCAATTCAA AGCGTGGGTG CCGGTAACTC AGGTGCCAAA CTGCTGGTTG CTGGTGGATG GCGTGAACAA GCCGAAACTG TGGTTCTATC TGCCGGTTGA TTACTGGCAC AACGTCGAAC CGCTGCCGAC CTCCTTCTGG ACTGAAGATG TGGAAGTGAT CGCACTGCCG AAAGCCGATG GCATTGGTAG CCTGCTGCCC GCTGCACGCG GCAATATCGG TTATATCGGT CCGGTGCCGG AACGTGCGCT GCAACTGGGT ATTGAGGCCA GCAACATCAA CCCGAAAGGG GTGATCGACT ACCTGCATTA CTACCGCTCC TTCAAAACCG AGTACGAACT GGCCTGTATG CGTGAAGCGC AGAAAATGGC GGTCAACGGT CATCGTGCGG CAGAAGAAGC GTTCCGTTCT GGCATGAGCG AGTTCGATAT CAACATCGCC TATCTGACCG CGACCGGTCA TCGTGATACC GACGTACCTT ACAGCAACAT TGTGGCGCTT AACGAACACG CTTCGGTGCT GCATTACACC AAACTGGATC ATCAGGCACC GGAAGAGATG CGCAGCTTCC TGCTGGATGC CGGGGCCGAA TATAACGGCT ATGCGGCTGA CCTGACTCGT ACCTGGTCGG CAAAAAGCGA CAACGACTAC GCACAGCTGG TGAAAGACGT AAATGATGAA CAACTTGCGC TGATCGCCAC CATGAAAGCT GGCGTCAGCT ATGTGGATTA CCACCTCCAG TTCCATCAGC GCATTGCCAA ATTGCTGCGT AAACATCAAA TCATCACCGA TATGAGTGAA GAAGCGATGG TCGAAAACGA TCTCACCGGA CCGTTTATGC CGCACGGTAT CGGCCATCCG CTGGGCCTGC AGGTGCATGA CGTAGCCGGT TTTATGCAGG ATGATAGCGG TACACACCTC GCGGCACCGG CAAAATATCC GTACCTGCGC TGCACCCGTA TTCTCCAGCC GGGCATGGTG TTAACCATCG AACCGGGTAT CTACTTCATC GAATCGCTAC TGGCACCGTG GCGTGAAGGG CAGTTCAGCA AGCACTTCAA CTGGCAGAAA ATTGAAGCAC TGAAACCGTT CAGCGGCATT CGTATCGAAG ACAACGTGGT GATCCACGAA AATAACGTGG AAAACATGAC CCGGGATCTG AAACTGGCGT GA
|
Protein sequence | MESLASLYKN HIATLQERTR DALARFKLDA LLIHSGELFN VFLDDHPYPF KVNPQFKAWV PVTQVPNCWL LVDGVNKPKL WFYLPVDYWH NVEPLPTSFW TEDVEVIALP KADGIGSLLP AARGNIGYIG PVPERALQLG IEASNINPKG VIDYLHYYRS FKTEYELACM REAQKMAVNG HRAAEEAFRS GMSEFDINIA YLTATGHRDT DVPYSNIVAL NEHASVLHYT KLDHQAPEEM RSFLLDAGAE YNGYAADLTR TWSAKSDNDY AQLVKDVNDE QLALIATMKA GVSYVDYHLQ FHQRIAKLLR KHQIITDMSE EAMVENDLTG PFMPHGIGHP LGLQVHDVAG FMQDDSGTHL AAPAKYPYLR CTRILQPGMV LTIEPGIYFI ESLLAPWREG QFSKHFNWQK IEALKPFSGI RIEDNVVIHE NNVENMTRDL KLA
|
| |
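The protein sequence can in principle be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline: `translate_cds` is a hypothetical helper, and it uses the standard amino acid assignments, which translation table 11 shares (table 11 differs from the standard code in which start codons it permits, not in the amino acid each codon encodes). It strips the spaces that the record uses to group bases in blocks of ten and drops a trailing stop codon.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; hypothetical helper for illustration."""
    # Remove the block-of-ten spacing used in the record and normalise case.
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    # Drop a terminal stop codon, which is not part of the protein.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 18 bases of the gene sequence listed above.
print(translate_cds("ATGGAATCAC TGGCCTCG"))  # MESLAS
```

Passing the full Gene sequence value to `translate_cds` should give a string that can be compared against the Protein sequence value once the spaces in the latter are removed.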