Gene Information |
Locus tag | SeAg_B3217 |
Symbol | pepP |
ID | 6792642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 3136641 |
End bp | 3137957 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642777367 |
Product | proline aminopeptidase P II |
Protein accession | YP_002147973 |
Protein GI | 197249618 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
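The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS is complete and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3136641
END_BP = 3137957
GENE_LENGTH_BP = 1317
PROTEIN_LENGTH_AA = 438


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)
print(computed_length, computed_protein)  # 1317 438
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields of the record.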
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00100894 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCAGC AGGAATACCA ACGCCGTCGC CAGGCATTAC TGGCGCAAAT GCAGCCCGGC AGCGCCGCGC TGATCTTTGC CGCGCCGGAG GCGACGCGCA GCGCAGACAG TGAATATCCG TATCGCCAGA GTAGCGACTT CTGGTATTTC ACCGGTTTTA ACGAACCGGA AGCCGTGCTG GTACTGATTA AGAGTGATGA CACCCACAAC CACAGCGTTT TGTTCAACCG CGTTCGCGAC CTGACGGCGG AAATCTGGTT TGGTCGCCGT TTAGGACAGG ATGCCGCGCC GGAAAAACTG GGCGTTGACC GGGCGCTGGC GTTTAGCGAA ATCAACCAGC AACTCTTTCA GTTGCTTAAT GGTCTGGATG TGGTGTACCA CGCGCAGGGC GAATATGCGT ATGCCGACGA AATTGTTCTG GCTGCGCTGG AGAAGCTGCG TAAAGGCTCC CGCCAGAATC TGACCGCGCC GGCCACCATG ACTGACTGGC GACCGATCGT CCATGAGATG CGCCTGTTCA AATCGCCGGA AGAGATTGCT GTCCTGCGCC GTGCCGGGGA AATTAGCGCG CTGGCGCATA TCCGCGCGAT GGAAAAATGC CGTCCGGGGA TGTTTGAGTA TCAGTTGGAG GGGGAAATTC ACCACGAATT TAATCGCCAC GGCGCGCGCT ATCCCTCCTA TAACACCATT GTCGGCAGCG GCGAAAATGG CTGTATTCTG CATTACACTG AAAACGAAAG TGAAATGCGC GACGGCGATT TAGTGCTTAT CGACGCGGGC TGTGAATATA AAGGTTACGC GGGCGACATC ACGCGTACTT TCCCGGTGAA CGGGAAATTT ACGCCAGCTC AGCGTGAAAT TTATGACATC GTTCTGGAAT CGCTGGAGAC CAGCCTGCGA CTGTTCCGTC CTGGTACCTC TATTCAGGAG GTGACCGGCG AAGTCGTGCG CATCATGATA ACCGGGCTGG TGAAGCTGGG GATTTTGCAA GGAGAGGTTG ATCAACTGAT TGCCGAAAAT GCGCATCGTC CTTTCTTTAT GCATGGCTTG AGCCACTGGC TGGGGCTGGA TGTTCATGAT GTCGGCGTTT ATGGGCCGGA TCGCTCCCGT ACCCTGGAGC CGGGCATGGT GCTGACCGTA GAGCCAGGCC TCTATATCGC GCCGGATGCC GACGTGCCGG AAGCGTATCG CGGCATTGGC GTTCGAATTG AAGATGACAT TGTCATTACC GAAACCGGTA ATGAAAACCT GACCGCTGGC GTTGTGAAGA AGGCGGATGA CATTGAAGCA TTAATGGCGG CGGCGCGGCA GCAATGA
|
Protein sequence | MTQQEYQRRR QALLAQMQPG SAALIFAAPE ATRSADSEYP YRQSSDFWYF TGFNEPEAVL VLIKSDDTHN HSVLFNRVRD LTAEIWFGRR LGQDAAPEKL GVDRALAFSE INQQLFQLLN GLDVVYHAQG EYAYADEIVL AALEKLRKGS RQNLTAPATM TDWRPIVHEM RLFKSPEEIA VLRRAGEISA LAHIRAMEKC RPGMFEYQLE GEIHHEFNRH GARYPSYNTI VGSGENGCIL HYTENESEMR DGDLVLIDAG CEYKGYAGDI TRTFPVNGKF TPAQREIYDI VLESLETSLR LFRPGTSIQE VTGEVVRIMI TGLVKLGILQ GEVDQLIAEN AHRPFFMHGL SHWLGLDVHD VGVYGPDRSR TLEPGMVLTV EPGLYIAPDA DVPEAYRGIG VRIEDDIVIT ETGNENLTAG VVKKADDIEA LMAAARQQ
|
| |
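The sequences above are displayed in space-separated blocks of 10 residues. A minimal sketch of how such a display string could be normalized and sanity-checked is given below; the helper names are hypothetical, and the stop codon set assumes the standard stops used by translation table 11. The example string is only the first and last block of the gene sequence, not the full record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumed stop codons for translation table 11


def normalize(seq: str) -> str:
    """Remove whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def has_start_and_stop(seq: str) -> bool:
    """True if the sequence begins with ATG and ends with a stop codon."""
    seq = normalize(seq)
    return seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Excerpt only: first and last blocks of the gene sequence shown above.
excerpt = "ATGACTCAGC GCAATGA"
print(normalize(excerpt))
print(has_start_and_stop(excerpt))
```

Applying `gc_fraction` to the full gene sequence and rounding to a percentage would allow a comparison with the GC content field of the record.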