Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4263 |
Symbol | pepQ |
ID | 6485387 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 4156427 |
End bp | 4157758 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642739513 |
Product | proline dipeptidase |
Protein accession | YP_002043212 |
Protein GI | 194443268 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 80 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATCAC TGGCCGCGCT CTATAAAAAT CATATTGTTA CCTTACAAGA ACGGACGCGC GATGTTCTGG CGCGCTTTAA GCTGGATGCG TTACTTATTC ATTCTGGCGA GCTTTTCAAC GTCTTTCTCG ACGATCACCC TTATCCGTTT AAGGTCAATC CACAGTTTAA AGCGTGGGTG CCGGTAACTC AGGTTCCAAA TTGCTGGCTG TTGGTCGATG GCGTCAACAA ACCCAAATTG TGGTTTTATC TGCCGGTCGA TTACTGGCAT AACGTTGAAC CGCTGCCAAC GTCCTTCTGG ACAGAAGAAG TCGAGGTCGT CGCCTTACCG AAAGCGGATG GCATCGGCAG CCAACTGCCT GCCGCGCGTG GCAATATCGG CTATATCGGC CCGGTTCCTG AGCGCGCGCT ACAATTGGAT ATCGCCGCCA GCAACATCAA CCCGAAAGGT GTTATCGACT ATCTGCATTA CTACCGCGCC TATAAAACGG ATTATGAACT GGCCTGTATG CGCGAAGCGC AGAAAATGGC GGTGAGCGGT CATCGGGCGG CGGAAGAGGC CTTCCGTTCC GGCATGAGCG AGTTCGACAT CAACCTGGCG TACCTGACCG CCACGGGACA TCGCGATACC GATGTTCCAT ACAGCAACAT TGTGGCGCTG AACGAACATG CCGCCGTGCT GCATTACACG AAACTGGATC ATCAGGCACC GTCTGAAATG CGCAGTTTCC TGCTGGATGC GGGCGCGGAA TACAACGGCT ACGCGGCGGA TCTGACGCGG ACCTGGTCGG CGAAAAGCGA TAACGACTAC GCCCACCTGG TGAAAGATGT TAACGACGAA CAGTTGGCGC TGATCGCTAC CATGAAGGCG GGCGTCAGCT ATGTGGATTA TCATATTCAA TTCCATCAGC GCATCGCGAA GCTGCTGCGT AAACATCAAA TCATTACCGA CATGAGTGAA GAGGCGATGG TGGAAAATGA TCTCACCGGG CCGTTTATGC CGCACGGTAT TGGTCATCCG TTGGGTCTGC AGGTACACGA TGTGGCCGGG TTTATGCAGG ATGATTCCGG TACGCATCTC GCCGCGCCGT CCAAATACCC GTATCTGCGC TGCACGCGTG TGTTACAGCC GCGAATGGTG TTGACCATCG AACCGGGGAT TTACTTCATC GAATCGCTGT TAGCGCCATG GCGCGAAGGC CCGTTCAGCA AGCACTTCAA CTGGCAGAAA ATTGAAGCGC TCAAGCCTTT CGGCGGTATT CGCATTGAAG ATAACGTGGT CATCCACGAA AACGGCGTGG AAAACATGAC GCGGGATTTA AAACTGGCGT AA
|
Protein sequence | MESLAALYKN HIVTLQERTR DVLARFKLDA LLIHSGELFN VFLDDHPYPF KVNPQFKAWV PVTQVPNCWL LVDGVNKPKL WFYLPVDYWH NVEPLPTSFW TEEVEVVALP KADGIGSQLP AARGNIGYIG PVPERALQLD IAASNINPKG VIDYLHYYRA YKTDYELACM REAQKMAVSG HRAAEEAFRS GMSEFDINLA YLTATGHRDT DVPYSNIVAL NEHAAVLHYT KLDHQAPSEM RSFLLDAGAE YNGYAADLTR TWSAKSDNDY AHLVKDVNDE QLALIATMKA GVSYVDYHIQ FHQRIAKLLR KHQIITDMSE EAMVENDLTG PFMPHGIGHP LGLQVHDVAG FMQDDSGTHL AAPSKYPYLR CTRVLQPRMV LTIEPGIYFI ESLLAPWREG PFSKHFNWQK IEALKPFGGI RIEDNVVIHE NGVENMTRDL KLA
|
| |