Gene Information |
Locus tag | SbBS512_E3329 |
Symbol | pepP |
ID | 6272836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3100251 |
End bp | 3101576 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641727226 |
Product | proline aminopeptidase P II |
Protein accession | YP_001881678 |
Protein GI | 187730091 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
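
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(3100251, 3101576)
print(length_bp)                  # 1326
print(protein_length(length_bp))  # 441
```

Both values match the Gene Length and Protein Length fields of this record.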
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00000100618 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAGA TATCCCGGCA AGAGTTTCAG CGTCGCCGTC AGGCCCTGGT GGAGCAAATG CAACCCGGCA GCGCCGCGCT GATTTTTGCT GCACCAGAAG TAACACGTAG CGCCGACAGC GAATACCCCT ATCGTCAGAA CAGTGACTTC TGGTACTTCA CCGGCTTTAA CGAACCGGAA GCGGTGCTGG TGCTGATTAA AAGCGATGAC ACTCATAACC ACAGCGTTCT GTTTAACCGC GTTCGCGACC TGACGGCGGA GATCTGGTTT GGCCGTCGCT TAGGCCAGGA TGCCGCGCCA GAGAAACTGG GCGTTGACCG CGCACTGGCA TTCAGCGAAA TCAATCAGCA ACTTTATCAA CTACTTAACG GCCTGGATGT GGTTTACCAT GCCCAGGGCG AATATGCATA TGCTGATGAA ATCATGAACA GTGCGCTGGA AAAACTGCGT AAAGGTTCGC GGCAAAATCT CACCGCACCG GCAACGATGA TCGACTGGCG TCCTGTTGTT CATGAAATGC GCCTGTTCAA ATCGCCAGAA GAGATTGCCG TACTCCGCCG CGCGGGAGAA ATCACCGCCA TGGCACATAC ACGGGCGATG GAAAAATGCC GTCCGGGAAT GTTCGAGTAC CATCTGGAAG GCGAAATTCA CCACGAATTT AACCGCCACG GTGCGCGCTA TCCGTCCTAT AACACCATTG TCGGCAGCGG TGAAAACGGC TGCATTCTGC ACTACACCGA AAACGAGAGT GAACTGCGCG ACGGTGACCT GGTGTTGATT GACGCGGGTT GTGAATACAA AGGTTACGCG GGCGATATTA CCCGCACCTT CCCGGTCAAC GGCAAATTCA CCCAGGCCCA GCGTGAAATC TACGACATTG TGCTGGAGTC TCTCGAAACC AGCCTGCGCC TGTATCGTCC GGGAACTTCC ATTCTGGAAG TCACTGGTGA AGTGGTGCGC ATCATGGTTA GCGGCCTGGT AAAACTCGGC ATTCTGAAAG GTGATGTTGA TGAACTGATC GCTCAGAATG CCCATCGTCC TTTCTTTATG CATGGCCTTA GCCACTGGTT AGGGCTAGAT GTCCATGACG TTGGCGTTTA TGGTCAGGAT CGCTCGCGCA TTCTGGAACC GGGCATGGTA CTGACCGTAG AGCCAGGGCT GTATATTGCG CCGGATGCAG AAGTGCCAGA ACAATATCGC GGTATCGGCA TTCGTATTGA AGACGACATT GTGATTACCG AAACCGGTAA CGAAAACCTC ACCGCCAGCG TGGTGAAAAA GCCGGAAGAA ATCGAAGCGT TGATGGCTGC TGCGAGAAAG CAATGA
|
Protein sequence | MSEISRQEFQ RRRQALVEQM QPGSAALIFA APEVTRSADS EYPYRQNSDF WYFTGFNEPE AVLVLIKSDD THNHSVLFNR VRDLTAEIWF GRRLGQDAAP EKLGVDRALA FSEINQQLYQ LLNGLDVVYH AQGEYAYADE IMNSALEKLR KGSRQNLTAP ATMIDWRPVV HEMRLFKSPE EIAVLRRAGE ITAMAHTRAM EKCRPGMFEY HLEGEIHHEF NRHGARYPSY NTIVGSGENG CILHYTENES ELRDGDLVLI DAGCEYKGYA GDITRTFPVN GKFTQAQREI YDIVLESLET SLRLYRPGTS ILEVTGEVVR IMVSGLVKLG ILKGDVDELI AQNAHRPFFM HGLSHWLGLD VHDVGVYGQD RSRILEPGMV LTVEPGLYIA PDAEVPEQYR GIGIRIEDDI VITETGNENL TASVVKKPEE IEALMAAARK Q
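
A GC content figure such as the one in the Gene Information table can be recomputed from a gene sequence in this layout. The following is a minimal sketch, assuming the sequence is pasted as plain text with whitespace between the 10-base blocks; `gc_percent` is a hypothetical helper, and the rounding the database applies to its reported value is not known.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100 * gc_count / len(bases)


# First 10-base block of the gene sequence above: 4 G, 0 C out of 10 bases.
print(gc_percent("ATGAGTGAGA"))  # 40.0
```

Passing the full Gene sequence string to the same function gives the percentage for the whole CDS.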
|
| |