Gene SeD_A3396 details


Gene Information

Locus tag: SeD_A3396
Symbol: pepP
ID: 6871009
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3266726
End bp: 3268042
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 56%
IMG OID: 642786399
Product: proline aminopeptidase P II
Protein accession: YP_002217037
Protein GI: 198243720
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
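
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3266726
END_BP = 3268042


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(n_bp):
    """Residues encoded by a CDS of n_bp bases, assuming one terminal stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1317, matching Gene Length
print(protein_length(length_bp))  # 438, matching Protein Length
```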


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00011206
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value: 0.758857
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCAGC AGGAATACCA ACGCCGTCGC CAGGCATTAC TGGCGCAAAT GCAGCCCGGC 
AGCGCCGCGC TGATCTTTGC CGCGCCGGAG GCGACGCGCA GCGCAGACAG TGAATATCCG
TATCGCCAGA GTAGCGACTT CTGGTATTTC ACCGGTTTTA ACGAACCGGA AGCCGTGCTG
GTACTGATTA AGAGTGATGA CACCCACAAC CACAGCGTTT TGTTCAACCG CGTTCGCGAC
CTGACTGCGG AAATCTGGTT TGGTCGCCGT TTAGGACAGG ATGCCGCACC GGAAAAACTG
GGCGTTGACC GGGCGCTGGC GTTTAGCGAA ATCAACCAGC AACTCTTTCA GTTGCTTAAT
GGTCTGGATG TGGTGTACCA CGCGCAGGGC GAATATGCGT ATGCCGACGA GATTGTTCTG
GCTGCGCTGG AGAAGCTGCG TAAAGGCTCC CGCCAGAATC TGACCGCGCC GGCCACCATG
ACTGACTGGC GACCGATCGT CCATGAGATG CGCCTGTTCA AATCGCCGGA AGAGATTGCG
GTCATGCGCC GCGCCGGGGA AATTAGCGCT CTGGCGCATA TCCGCGCGAT GGAAAAATGC
CGTCCGGGGA TGTTTGAGTA TCAGCTGGAG GGTGAAATTC ACCACGAATT TAATCGCCAT
GGCGCGCGCT ATCCCTCCTA TAACACCATT GTCGGCAGCG GCGAAAATGG CTGTATCCTG
CATTACACTG AAAACGAAAG TGAAATGCGC GACGGCGATT TAGTGCTTAT CGACGCGGGT
TGTGAATATA AAGGTTACGC GGGCGACATC ACGCGTACTT TCCCGGTGAA CGGGAAATTT
ACGCCAGCTC AGCGTGAAAT TTATGACATC GTTCTGGAAT CGCTGGAGAC CAGCCTGCGA
CTGTTCCGTC CTGGTACCTC TATTCAGCAG GTGACCGGCG AAGTCGTGCG CATCATGATA
ACCGGGCTGG TGAAGTTGGG GATTTTGCAA GGAGAGGTTG ATCAACTGAT TGCCGAGAAT
GCGCATCGTC CTTTCTTTAT GCATGGCTTG AGCCACTGGC TGGGGCTGGA TGTTCATGAT
GTCGGCGTTT ATGGGCCGGA TCGCTCCCGC ATCCTGGAGC CGGGCATGGT GCTGACCGTA
GAGCCAGGCC TCTATATCGC GCCGGATGCC GACGTGCCGG AAGCGTATCG CGGCATTGGC
GTTCGAATTG AAGATGACAT TGTCATTACC GAAACCGGTA ATGAAAACCT GACCGCTGGC
GTTGTGAAGA AGGCGGATGA CATTGAGGCA TTAATGGCGG CGGCGCGGCA GCAATGA
 
Protein sequence
MTQQEYQRRR QALLAQMQPG SAALIFAAPE ATRSADSEYP YRQSSDFWYF TGFNEPEAVL 
VLIKSDDTHN HSVLFNRVRD LTAEIWFGRR LGQDAAPEKL GVDRALAFSE INQQLFQLLN
GLDVVYHAQG EYAYADEIVL AALEKLRKGS RQNLTAPATM TDWRPIVHEM RLFKSPEEIA
VMRRAGEISA LAHIRAMEKC RPGMFEYQLE GEIHHEFNRH GARYPSYNTI VGSGENGCIL
HYTENESEMR DGDLVLIDAG CEYKGYAGDI TRTFPVNGKF TPAQREIYDI VLESLETSLR
LFRPGTSIQQ VTGEVVRIMI TGLVKLGILQ GEVDQLIAEN AHRPFFMHGL SHWLGLDVHD
VGVYGPDRSR ILEPGMVLTV EPGLYIAPDA DVPEAYRGIG VRIEDDIVIT ETGNENLTAG
VVKKADDIEA LMAAARQQ
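
Both sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, not the database's own tooling, of how one might strip that layout, compute a GC fraction, and translate the gene sequence for comparison with the listed protein. The names `clean`, `gc_fraction`, and `translate` are hypothetical. The codon table uses the amino-acid assignments of the standard genetic code, which translation table 11 shares; alternative start codons permitted by table 11 are not handled, which is adequate here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    s = clean(seq)
    if not s:
        return 0.0
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate complete codons until the first stop codon; the stop is omitted."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence above.
head = "ATGACTCAGC AGGAATACCA"
print(gc_fraction(head))  # 0.45
print(translate(head))    # MTQQEY
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the full sequence can be compared with the GC content field.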