Gene Information |
Locus tag | SNSL254_A3294 |
Symbol | pepP |
ID | 6484327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 3203170 |
End bp | 3204486 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642738590 |
Product | proline aminopeptidase P II |
Protein accession | YP_002042311 |
Protein GI | 194443636 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
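The coordinate and length fields above are mutually consistent, and a short sketch can check this. It assumes that Start bp and End bp are 1-based and inclusive (as the reported gene length implies) and that the CDS includes its stop codon. The function names are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends in a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(3203170, 3204486)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1317 438
```

The printed values match the Gene Length (1317 bp) and Protein Length (438 aa) fields.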
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0229287 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 81 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCAGC AGGAATACCA ACGCCGTCGC CAGGCATTAC TGGCGCAAAT GCAGCCCGGC AGCGCCGCGC TGATCTTTGC CGCGCCGGAG GCGACGCGCA GCGCAGACAG TGAATATCCG TATCGCCAGA GTAGCGACTT CTGGTATTTC ACCGGTTTTA ACGAACCGGA AGCCGTGCTG GTACTGATTA AGAGTGATGA CACCCACAAC CACAGCGTTT TGTTCAACCG CGTTCGCGAC CTGACGGCGG AAATCTGGTT TGGTCGCCGT TTAGGACAGG ATGCCGCGCC GGAAAAACTG GGCGTTGACC GGGCGCTGGC GTTTAGCGAA ATCAACCAGC AACTCTTTCA GTTGCTTAAT GGTCTGGATG TGGTGTACCA CGCGCAGGGC GAATATGCGT ATGCCGACGA GATTGTTCTG GCTGCGCTGG AGAAGCTGCG TAAAGGCTCC CGCCAGAATC TGACCGCGCC GGCCACTATG ACTGACTGGC GACCGATCGT CCATGAGATG CGCCTGTTCA AATCGCCGGA AGAGATTGCT GTCCTGCGCC GCGCCGGGGA AATTAGCGCG CTGGCGCATA TCCGCGCGAT GGAAAAATGC CGTCCGGGGA TGTTTGAGTA TCAACTGGAG GGGGAAATTC ACCACGAATT TAATCGCCAC GGCGCGCGCT ATCCCTCCTA TAACACCATT GTCGGCAGCG GCGAAAATGG CTGTATCCTG CATTACACTG AAAACGAAAG TGAAATGCGC GACGGCGATT TAGTGCTTAT CGACGCGGGT TGTGAATATA AAGGTTACGC GGGCGACATC ACGCGTACTT TCCCGGTGAA CGGGAAATTT ACGCCAGCCC AGCGTGAAAT TTATGACATC GTTCTGGAAT CGCTGGAGAC CAGCCTGCGA CTGTTCCGTC CTGGTACCTC TATTCAGGAG GTGACCGGCG AAGTCGTGCG CATCATGATA ACCGGGCTGG TGAAGCTGGG GATTTTGCAA GGGGAGGTTG ATCAACTGAT TGCCGAAAAT GCGCATCGTC CTTTCTTTAT GCATGGCTTG AGCCACTGGC TGGGGCTGGA TGTTCATGAT GTCGGCGTTT ATGGGCCGGA TCGCTCCCGC ATCCTGGAGC CGGGCATGGT GCTGACCGTA GAGCCAGGCC TCTATATCGC GCCGGATGCC GACGTGCCGG AAGCGTATCG CGGCATTGGC GTTCGAATTG AAGATGACAT TGTCATTACC GAAACCGGTA ATGAAAACCT GACCGCTGGC GTTGTGAAGA AGGCGGATGA CATTGAGGCA TTAATGGCGG CGGCGCGGCA GCAATGA
|
Protein sequence | MTQQEYQRRR QALLAQMQPG SAALIFAAPE ATRSADSEYP YRQSSDFWYF TGFNEPEAVL VLIKSDDTHN HSVLFNRVRD LTAEIWFGRR LGQDAAPEKL GVDRALAFSE INQQLFQLLN GLDVVYHAQG EYAYADEIVL AALEKLRKGS RQNLTAPATM TDWRPIVHEM RLFKSPEEIA VLRRAGEISA LAHIRAMEKC RPGMFEYQLE GEIHHEFNRH GARYPSYNTI VGSGENGCIL HYTENESEMR DGDLVLIDAG CEYKGYAGDI TRTFPVNGKF TPAQREIYDI VLESLETSLR LFRPGTSIQE VTGEVVRIMI TGLVKLGILQ GEVDQLIAEN AHRPFFMHGL SHWLGLDVHD VGVYGPDRSR ILEPGMVLTV EPGLYIAPDA DVPEAYRGIG VRIEDDIVIT ETGNENLTAG VVKKADDIEA LMAAARQQ
|
| |
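The gene sequence can be related to the protein sequence and to the GC content field with a minimal sketch like the one below. It assumes that the listed gene sequence is the coding strand (it begins with ATG even though the gene lies on the minus strand) and uses the codon assignments of translation table 11, which match the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only short excerpts of the sequence are used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Codon assignments in
# translation table 11 are the same as in the standard genetic code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


gene_start = "ATGACTCAGC AGGAATACCA ACGCCGTCGC"  # first 30 bases of the record
gene_end = "CAGCAATGA"                           # last 9 bases of the record

print(translate(gene_start))  # MTQQEYQRRR
print(translate(gene_end))    # QQ*
```

The two translated excerpts agree with the first ten and last two residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.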