Gene SNSL254_A3294 details


Gene Information

Locus tag: SNSL254_A3294
Symbol: pepP
ID: 6484327
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3203170
End bp: 3204486
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 56%
IMG OID: 642738590
Product: proline aminopeptidase P II
Protein accession: YP_002042311
Protein GI: 194443636
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
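
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in one untranslated stop codon. The variable names are hypothetical and are not part of the database record.

```python
# Values copied from the Gene Information fields.
start_bp = 3203170
end_bp = 3204486

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a single stop codon that is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1317 0 438
```

Under these assumptions the computed values agree with the reported 1317 bp and 438 aa.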


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0229287
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 81
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCAGC AGGAATACCA ACGCCGTCGC CAGGCATTAC TGGCGCAAAT GCAGCCCGGC 
AGCGCCGCGC TGATCTTTGC CGCGCCGGAG GCGACGCGCA GCGCAGACAG TGAATATCCG
TATCGCCAGA GTAGCGACTT CTGGTATTTC ACCGGTTTTA ACGAACCGGA AGCCGTGCTG
GTACTGATTA AGAGTGATGA CACCCACAAC CACAGCGTTT TGTTCAACCG CGTTCGCGAC
CTGACGGCGG AAATCTGGTT TGGTCGCCGT TTAGGACAGG ATGCCGCGCC GGAAAAACTG
GGCGTTGACC GGGCGCTGGC GTTTAGCGAA ATCAACCAGC AACTCTTTCA GTTGCTTAAT
GGTCTGGATG TGGTGTACCA CGCGCAGGGC GAATATGCGT ATGCCGACGA GATTGTTCTG
GCTGCGCTGG AGAAGCTGCG TAAAGGCTCC CGCCAGAATC TGACCGCGCC GGCCACTATG
ACTGACTGGC GACCGATCGT CCATGAGATG CGCCTGTTCA AATCGCCGGA AGAGATTGCT
GTCCTGCGCC GCGCCGGGGA AATTAGCGCG CTGGCGCATA TCCGCGCGAT GGAAAAATGC
CGTCCGGGGA TGTTTGAGTA TCAACTGGAG GGGGAAATTC ACCACGAATT TAATCGCCAC
GGCGCGCGCT ATCCCTCCTA TAACACCATT GTCGGCAGCG GCGAAAATGG CTGTATCCTG
CATTACACTG AAAACGAAAG TGAAATGCGC GACGGCGATT TAGTGCTTAT CGACGCGGGT
TGTGAATATA AAGGTTACGC GGGCGACATC ACGCGTACTT TCCCGGTGAA CGGGAAATTT
ACGCCAGCCC AGCGTGAAAT TTATGACATC GTTCTGGAAT CGCTGGAGAC CAGCCTGCGA
CTGTTCCGTC CTGGTACCTC TATTCAGGAG GTGACCGGCG AAGTCGTGCG CATCATGATA
ACCGGGCTGG TGAAGCTGGG GATTTTGCAA GGGGAGGTTG ATCAACTGAT TGCCGAAAAT
GCGCATCGTC CTTTCTTTAT GCATGGCTTG AGCCACTGGC TGGGGCTGGA TGTTCATGAT
GTCGGCGTTT ATGGGCCGGA TCGCTCCCGC ATCCTGGAGC CGGGCATGGT GCTGACCGTA
GAGCCAGGCC TCTATATCGC GCCGGATGCC GACGTGCCGG AAGCGTATCG CGGCATTGGC
GTTCGAATTG AAGATGACAT TGTCATTACC GAAACCGGTA ATGAAAACCT GACCGCTGGC
GTTGTGAAGA AGGCGGATGA CATTGAGGCA TTAATGGCGG CGGCGCGGCA GCAATGA
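
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before anything is computed from it. A minimal sketch of that cleanup and of a GC-fraction calculation follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as printed (six blocks of 10 bases).
first_line = "ATGACTCAGC AGGAATACCA ACGCCGTCGC CAGGCATTAC TGGCGCAAAT GCAGCCCGGC"
cleaned = clean_sequence(first_line)
print(len(cleaned))                      # 60
print(round(gc_fraction(cleaned) * 100))  # GC percentage of this line only
```

Applying the same two functions to the full sequence would give a value that can be compared with the reported GC content.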
 
Protein sequence
MTQQEYQRRR QALLAQMQPG SAALIFAAPE ATRSADSEYP YRQSSDFWYF TGFNEPEAVL 
VLIKSDDTHN HSVLFNRVRD LTAEIWFGRR LGQDAAPEKL GVDRALAFSE INQQLFQLLN
GLDVVYHAQG EYAYADEIVL AALEKLRKGS RQNLTAPATM TDWRPIVHEM RLFKSPEEIA
VLRRAGEISA LAHIRAMEKC RPGMFEYQLE GEIHHEFNRH GARYPSYNTI VGSGENGCIL
HYTENESEMR DGDLVLIDAG CEYKGYAGDI TRTFPVNGKF TPAQREIYDI VLESLETSLR
LFRPGTSIQE VTGEVVRIMI TGLVKLGILQ GEVDQLIAEN AHRPFFMHGL SHWLGLDVHD
VGVYGPDRSR ILEPGMVLTV EPGLYIAPDA DVPEAYRGIG VRIEDDIVIT ETGNENLTAG
VVKKADDIEA LMAAARQQ
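
The start of the protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is illustrative: it builds the codon-to-amino-acid mapping of the standard genetic code, which translation table 11 shares for elongation codons (table 11 differs in the set of permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a cleaned DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence with the block spacing removed.
print(translate("ATGACTCAGCAGGAATACCAACGCCGTCGC"))  # MTQQEYQRRR
```

The result matches the first ten residues of the protein sequence listed above.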