Gene SNSL254_A4263 details

Gene Information

Locus tag: SNSL254_A4263
Symbol: pepQ
ID: 6485387
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 4156427
End bp: 4157758
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 54%
IMG OID: 642739513
Product: proline dipeptidase
Protein accession: YP_002043212
Protein GI: 194443268
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
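
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 4156427
end_bp = 4157758
gene_length_bp = 1332
protein_length_aa = 443


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 4157758 - 4156427 + 1 gives 1332 bp, and 1332 / 3 - 1 gives 443 aa, which agrees with the reported gene and protein lengths.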


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 80
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATCAC TGGCCGCGCT CTATAAAAAT CATATTGTTA CCTTACAAGA ACGGACGCGC 
GATGTTCTGG CGCGCTTTAA GCTGGATGCG TTACTTATTC ATTCTGGCGA GCTTTTCAAC
GTCTTTCTCG ACGATCACCC TTATCCGTTT AAGGTCAATC CACAGTTTAA AGCGTGGGTG
CCGGTAACTC AGGTTCCAAA TTGCTGGCTG TTGGTCGATG GCGTCAACAA ACCCAAATTG
TGGTTTTATC TGCCGGTCGA TTACTGGCAT AACGTTGAAC CGCTGCCAAC GTCCTTCTGG
ACAGAAGAAG TCGAGGTCGT CGCCTTACCG AAAGCGGATG GCATCGGCAG CCAACTGCCT
GCCGCGCGTG GCAATATCGG CTATATCGGC CCGGTTCCTG AGCGCGCGCT ACAATTGGAT
ATCGCCGCCA GCAACATCAA CCCGAAAGGT GTTATCGACT ATCTGCATTA CTACCGCGCC
TATAAAACGG ATTATGAACT GGCCTGTATG CGCGAAGCGC AGAAAATGGC GGTGAGCGGT
CATCGGGCGG CGGAAGAGGC CTTCCGTTCC GGCATGAGCG AGTTCGACAT CAACCTGGCG
TACCTGACCG CCACGGGACA TCGCGATACC GATGTTCCAT ACAGCAACAT TGTGGCGCTG
AACGAACATG CCGCCGTGCT GCATTACACG AAACTGGATC ATCAGGCACC GTCTGAAATG
CGCAGTTTCC TGCTGGATGC GGGCGCGGAA TACAACGGCT ACGCGGCGGA TCTGACGCGG
ACCTGGTCGG CGAAAAGCGA TAACGACTAC GCCCACCTGG TGAAAGATGT TAACGACGAA
CAGTTGGCGC TGATCGCTAC CATGAAGGCG GGCGTCAGCT ATGTGGATTA TCATATTCAA
TTCCATCAGC GCATCGCGAA GCTGCTGCGT AAACATCAAA TCATTACCGA CATGAGTGAA
GAGGCGATGG TGGAAAATGA TCTCACCGGG CCGTTTATGC CGCACGGTAT TGGTCATCCG
TTGGGTCTGC AGGTACACGA TGTGGCCGGG TTTATGCAGG ATGATTCCGG TACGCATCTC
GCCGCGCCGT CCAAATACCC GTATCTGCGC TGCACGCGTG TGTTACAGCC GCGAATGGTG
TTGACCATCG AACCGGGGAT TTACTTCATC GAATCGCTGT TAGCGCCATG GCGCGAAGGC
CCGTTCAGCA AGCACTTCAA CTGGCAGAAA ATTGAAGCGC TCAAGCCTTT CGGCGGTATT
CGCATTGAAG ATAACGTGGT CATCCACGAA AACGGCGTGG AAAACATGAC GCGGGATTTA
AAACTGGCGT AA
 
Protein sequence
MESLAALYKN HIVTLQERTR DVLARFKLDA LLIHSGELFN VFLDDHPYPF KVNPQFKAWV 
PVTQVPNCWL LVDGVNKPKL WFYLPVDYWH NVEPLPTSFW TEEVEVVALP KADGIGSQLP
AARGNIGYIG PVPERALQLD IAASNINPKG VIDYLHYYRA YKTDYELACM REAQKMAVSG
HRAAEEAFRS GMSEFDINLA YLTATGHRDT DVPYSNIVAL NEHAAVLHYT KLDHQAPSEM
RSFLLDAGAE YNGYAADLTR TWSAKSDNDY AHLVKDVNDE QLALIATMKA GVSYVDYHIQ
FHQRIAKLLR KHQIITDMSE EAMVENDLTG PFMPHGIGHP LGLQVHDVAG FMQDDSGTHL
AAPSKYPYLR CTRVLQPRMV LTIEPGIYFI ESLLAPWREG PFSKHFNWQK IEALKPFGGI
RIEDNVVIHE NGVENMTRDL KLA
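
The sequences above are printed in blocks of ten residues separated by spaces, so they need cleaning before any analysis. The following is a minimal sketch, assuming plain-string input. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGAATCAC TGGCCGCGCT CTATAAAAAT CATATTGTTA CCTTACAAGA ACGGACGCGC"
last_line = "AAACTGGCGT AA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

# Does the CDS begin with the ATG start codon?
print(head.startswith("ATG"))
# Does it end with one of the stop codons used by translation table 11?
print(tail[-3:] in {"TAA", "TAG", "TGA"})
# GC fraction of the first line only, not of the whole gene.
print(f"GC fraction of the first line: {gc_fraction(head):.2f}")
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value that can be compared with the GC content reported in the Gene Information section.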