Gene Information |
Locus tag | EcHS_A3067 |
Symbol | pepP |
ID | 5592845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3081026 |
End bp | 3082351 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640922184 |
Product | proline aminopeptidase P II |
Protein accession | YP_001459686 |
Protein GI | 157162368 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
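The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; the record states neither convention, so both are assumptions. The variable and function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3081026
end_bp = 3082351
gene_length_bp = 1326
protein_length_aa = 441

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as a residue.
codons = span_bp // 3
expected_protein_aa = codons - 1


def record_is_consistent() -> bool:
    """Return True if coordinates, gene length and protein length agree."""
    return (
        span_bp == gene_length_bp
        and span_bp % 3 == 0
        and expected_protein_aa == protein_length_aa
    )


print(span_bp, expected_protein_aa, record_is_consistent())
```

Under these assumptions the span works out to 1326 bp and 441 codons plus a stop, which matches the Gene Length and Protein Length fields.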
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.000000106171 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAGA TATCCCGGCA AGAGTTTCAG CGTCGCCGTC AGGCCCTGGT GGAGCAAATG CAACCCGGCA GCGCCGCGCT GATTTTTGCT GCACCAGAAG TAACACGTAG CGCCGACAGC GAATACCCCT ATCGTCAGAA CAGTGACTTC TGGTACTTCA CCGGCTTTAA CGAACCGGAA GCGGTGCTGG TGCTGATTAA AAGCGATGAC ACTCATAACC ACAGCGTTCT GTTTAACCGC GTTCGCGACC TGACGGCGGA GATCTGGTTT GGCCGTCGCT TAGGCCAGGA TGCCGCGCCA GAGAAACTGG GCGTTGACCG CGCACTGGCA TTCAGCGAAA TCAATCAGCA ACTTTATCAA CTACTTAACG GCCTGGATGT GGTTTACCAT GCCCAGGGCG AATATGCATA TGCTGATGAA ATCGTGAACA GTGCGCTGGA AAAACTGCGT AAAGGTTCGC GGCAAAATCT CACCGCACCG GCAACGATGA TCGACTGGCG TCCTGTTGTT CATGAAATGC GCCTGTTCAA ATCGCCAGAA GAGATTGCCG TACTCCGCCG CGCGGGAGAA ATCACCGCCA TGGCACATAC CCGGGCGATG GAAAAATGCC GTCCGGGAAT GTTCGAGTAC CATCTGGAAG GCGAAATTCA CCACGAATTT AACCGCCACG GTGCGCGCTA TCCGTCCTAT AACACCATTG TCGGCAGCGG TGAAAACGGC TGCATTCTGC ACTACACCGA AAACGAGTGT GAAATGCGCG ACGGCGACCT GGTGTTGATT GACGCGGGTT GCGAATACAA AGGTTACGCT GGCGATATTA CCCGCACCTT CCCGGTCAAC GGCAAATTCA CCCAGGCCCA GCGTGAAATC TACGACATTG TGCTGGAGTC TCTCGAAACC AGCCTGCGCC TGTATCGTCC GGGAACTTCC ATTCTGGAAG TCACTGGTGA AGTGGTGCGC ATCATGGTTA GCGGCCTGGT AAAACTCGGC ATCCTGAAAG GTGATGTTGA TGAACTGATC GCTCAGAACG CCCATCGTCC TTTCTTTATG CATGGCCTTA GCCACTGGTT AGGACTGGAT GTCCATGACG TTGGCGTTTA TGGTCAGGAT CGCTCGCGCA TTCTGGAACC GGGGATGGTA CTGACCGTAG AGCCTGGTCT GTATATCGCG CCGGATGCGG ATGTGCCAGA ACAATATCGC GGTATCGGCA TTCGTATTGA AGACGACATT GTGATTACCG AAACCGGTAA CGAAAACCTC ACCGCCAGCG TGGTGAAAAA GCCGGAAGAA ATCGAAGCGT TGATGGCTGC TGCGAGAAAG CAATGA
|
Protein sequence | MSEISRQEFQ RRRQALVEQM QPGSAALIFA APEVTRSADS EYPYRQNSDF WYFTGFNEPE AVLVLIKSDD THNHSVLFNR VRDLTAEIWF GRRLGQDAAP EKLGVDRALA FSEINQQLYQ LLNGLDVVYH AQGEYAYADE IVNSALEKLR KGSRQNLTAP ATMIDWRPVV HEMRLFKSPE EIAVLRRAGE ITAMAHTRAM EKCRPGMFEY HLEGEIHHEF NRHGARYPSY NTIVGSGENG CILHYTENEC EMRDGDLVLI DAGCEYKGYA GDITRTFPVN GKFTQAQREI YDIVLESLET SLRLYRPGTS ILEVTGEVVR IMVSGLVKLG ILKGDVDELI AQNAHRPFFM HGLSHWLGLD VHDVGVYGQD RSRILEPGMV LTVEPGLYIA PDADVPEQYR GIGIRIEDDI VITETGNENL TASVVKKPEE IEALMAAARK Q
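The relationship between the gene sequence, the GC content field and the protein sequence can be illustrated with a small script. This is a minimal sketch, not the database's own pipeline. It assumes the sequence is already given in coding orientation (it starts with ATG even though the gene is on the minus strand) and that the standard codon assignments apply, which to my knowledge is the case for translation table 11. The helper names `clean`, `gc_fraction` and `translate` are hypothetical, and only the first three 10-base blocks of the gene sequence are used as input.

```python
# Codon table built from the compact standard layout (base order T, C, A, G).
# Assumption: translation table 11 uses these same codon-to-residue assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT characters
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
fragment = "ATGAGTGAGA TATCCCGGCA AGAGTTTCAG"
print(translate(fragment))  # MSEISRQEFQ
print(round(gc_fraction(fragment), 3))
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein Length fields.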
|
| |