Gene SeHA_C3291 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3291 
SymbolpepP 
ID6489806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3210829 
End bp3212145 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content56% 
IMG OID642743428 
Productproline aminopeptidase P II 
Protein accessionYP_002047044 
Protein GI194448745 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.133072 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones93 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAGC AGGAATACCA ACGCCGTCGC CAGGCATTAC TGGCGCAAAT GCAGCCCGGC 
AGCGCCGCGC TGATCTTTGC CGCGCCAGAG GCGACGCGCA GCGCAGACAG TGAATATCCG
TATCGCCAGA GTAGCGACTT CTGGTATTTC ACCGGTTTTA ACGAACCGGA AGCCGTGCTG
GTACTGATTA AGAGTGATGA CACCCACAAC CACAGCGTTT TGTTCAACCG CGTTCGCGAC
CTGACGGCGG AAATCTGGTT TGGTCGCCGT TTAGGACAGG ATGCCGCGCC GGAAAAACTG
GGCGTTGACC GGGCGCTGGC GTTTAGCGAA ATCAACCAGC AACTCTTTCA GTTGCTTAAT
GGTCTGGATG TGGTGTACCA CGCGCAGGGC GAATATGCGT ATGCCGACGA GATTGTTCTG
GCTGCGCTGG AGAAGCTGCG TAAAGGCTCC CGCCAGAATC TGACCGCGCC GGCCACCATG
ACTGACTGGC GACCGATCGT CCATGAGATG CGCCTGTTCA AATCGCCGGA AGAGATTGCT
GTCCTGCGCC GCGCCGGGGA AATTAGCGCG CTGGCGCATA TCCGCGCGAT GGAAAAATGC
CGTCCGGGGA TGTTTGAGTA TCAGCTGGAG GGGGAAATTC ACCACGAATT TAATCGCCAC
GGCGCGCGCT ATCCCTCCTA TAACACCATT GTCGGCAGCG GCGAAAATGG CTGTATCCTG
CATTACACTG AAAACGAAAG TGAAATGCGC GACGGCGATT TAGTGCTTAT CGACGCGGGC
TGTGAATATA AAGGTTACGC GGGCGACATC ACGCGTACTT TCCCGGTGAA CGGGAAATTT
ACGCCAGCTC AGCGTGAAAT TTATGACATC GTTCTGGAAT CGCTGGAGAC CAGCCTGCGA
CTGTTCCGTC CTGGTACCTC TATTCAGGAG GTGACCGGCG AAGTCGTGCG CATCATGATA
ACCGGGCTGG TGAAGCTGGG GATTTTGCAA GGAGAGGTTG ATCAACTGAT TGCCGAAAAT
GCGCATCGTC CTTTCTTTAT GCATGGCTTG AGCCACTGGC TGGGGCTGGA TGTTCATGAT
GTCGGCGTTT ATGGGCCGGA TCGCTCCCGT ACCCTGGAGC CGGGCATGGT GCTGACCGTA
GAGCCAGGCC TCTATATCGC GCCGGATGCC GATGTGCCGG AAGCGTATCG CGGCATTGGC
GTTCGAATTG AAGATGACAT TGTCATTACC GAAACCGGTA ATGAAAACCT GACCGCTGGC
GTTGTGAAGA AGGCGGATGA CATTGAAGCA TTAATGGCGG CGGCGCGGCA GCAATGA
 
Protein sequence
MTQQEYQRRR QALLAQMQPG SAALIFAAPE ATRSADSEYP YRQSSDFWYF TGFNEPEAVL 
VLIKSDDTHN HSVLFNRVRD LTAEIWFGRR LGQDAAPEKL GVDRALAFSE INQQLFQLLN
GLDVVYHAQG EYAYADEIVL AALEKLRKGS RQNLTAPATM TDWRPIVHEM RLFKSPEEIA
VLRRAGEISA LAHIRAMEKC RPGMFEYQLE GEIHHEFNRH GARYPSYNTI VGSGENGCIL
HYTENESEMR DGDLVLIDAG CEYKGYAGDI TRTFPVNGKF TPAQREIYDI VLESLETSLR
LFRPGTSIQE VTGEVVRIMI TGLVKLGILQ GEVDQLIAEN AHRPFFMHGL SHWLGLDVHD
VGVYGPDRSR TLEPGMVLTV EPGLYIAPDA DVPEAYRGIG VRIEDDIVIT ETGNENLTAG
VVKKADDIEA LMAAARQQ