Gene SNSL254_A1624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1624 
Symbol 
ID6483942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1589148 
End bp1591190 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content54% 
IMG OID642737010 
Productdipeptidyl carboxypeptidase II 
Protein accessionYP_002040762 
Protein GI194444337 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones90 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATGA ATCCTTTGTT AGACCAGAGT ATGTTGCCCT ATCAGGCACC GCGTTTTGAT 
CGGATCAAAG ATTGCCATTA TCGTCCTGCT TTTGATGAGG GCGTGCGGCA AAAACGCGTG
GAAATCGAGG CCATCGTCAA TCATCCGGCG GCCCCTGATT TTACGAATAC GCTTCTGGCG
CTGGAGCAAA GCGGGGCGCT TCTGTCACGC GTCACCAGCG TTTTTTTCGC GATGACGGCC
GCGCACACTA ACGATGAACT CCAGCGGTTG GATGAGGCCT TTTCTGCCGA GCTGGCGGCG
CTCTCCAACG ATATTTATCT GAATAGCGCG TTATTCGCTC GCGTGGATGC CGTCTGGCAA
CAGCGTCACT CACTGGGGCT GGATGATGAG TCGCTACGGT TGGTCGATGT GATCCATCAG
CGTTTTGTGT TGGCAGGCGC GCAGCTTGCC GAAGAGGATA AAGCGCGACT GAAAGTATTG
AATACTGAAT CCGCGACCTT AATGAGTCAG TTTAATCAGC GTCTGCTGGC GGCAAGTAAA
GCGGGCGGGC TGGCGGTCGA TGACGCGCAT TGCCTGGCAG GATTAAGCCC GGAAGAAATG
ACCGTCGCTG CTGAAGCGGC GCGTGAAAAA GGCCTGGAGG AGCGCTGGTT CATTCCGCTC
CTTAATACGA CGCAACAGCC TGCGCTTGCT ACGCTGCGCG ATCGCCAGAC TCGCGAAAAT
TTATTCGCAG CGTCATGGAC TCGGGCGGAA AAAGGAGATG CCCACGATAC CCGCGCTATC
GTTCAGCGTC TGGTAGAGAT TCGTCGCTGT CAGGCAAAAC TGCTGGGTTT CCCTAATTAT
GCCGCATGGA AAATGGCCGA TCAGATGGCG AAAACGCCGC AAGCCGCACT GAGCTTTATG
CGTGGCATTG TGCCGCCAGC GCGTCAGCGT GTACTCAATG AACAGGCGGA AATTCAGAAC
GTCATTGATG GTGAGCAGGG CGGCTACACC GTTCAGGCCT GGGACTGGAT GTTCTATGCC
GAACAGGTAC GGCGGGAAAA ATATGCGTTA GATGAAGCGC AACTGAAGCC CTATTTTGCC
TTAAATACGG TCTTGCAAGA AGGGGTTTTC TGGACCGCCA ACCAGCTATT CGGCATCACC
TTCGTCGAGC GTTTTGACAT TCCGGTGTAT CACCCTGATG TTCGGGTGTG GGAGATTTTC
GATTCCGATG GTGTCGGCAT GGCGTTATTT TATGGCGACT TCTTCGCGCG GGACTCGAAA
AGCGGCGGCG CGTGGATGGG GAATTTTGTC GAGCAATCCA CACTCAATGA AACCCGGCCC
GTTATATACA ATGTATGTAA CTATCAGAAA CCGGTTGATG GGCAGCCTGC ATTACTGCTT
TGGGACGACG TTATTACGCT CTTCCATGAG TTTGGTCATA CGTTACATGG TTTGTTTGCC
GTTCAGCGCT ATGCCACGCT TTCAGGTACC AATACGCCCC GCGACTTTGT CGAGTTTCCC
TCCCAGATTA ATGAGCATTG GGCGAGCCAT CCACGCGTGT TCGAACGCTA CGCGCGTCAT
GTCGACAGCG GTGAAAAAAT GCCCGCTGAT TTACAGGAAA AAATGCGCAA GGCGAGTTTA
TTTAATAAAG GTTACGATAT GACTGAATTG CTCGGCGCCG CATTGCTGGA TATGCGCTGG
CATATGCTGG AGGAGAGCGT AGCAGAGCAG TCTGTCGCTG AGTTCGAGCA GCAGGCGCTG
GCCGCCGAGC ATCTTGATTT ACCGGCAGTG CCGCCCCGCT ATCGCAGCAG TTATTTTGCC
CATATCTTCG GCGGCGGCTA CGCAGCGGGA TACTACGCCT ACCTGTGGAC GCAAATGTTA
GCGGATGACG GTTATCAATG GTTTGTAGAG CAGGGCGGCC TGACGCGTGA AAACGGGCAG
CGTTTTCGTG ATGCTATCCT TTCCCGGGGG AATAGTACTG ATTTAGAAAC GCTTTATTCA
GCCTGGCGTG GACATGAACC GCATATTGAC CCCATGTTGC AATATCGTGG GCTCGATCGC
TAA
 
Protein sequence
MSMNPLLDQS MLPYQAPRFD RIKDCHYRPA FDEGVRQKRV EIEAIVNHPA APDFTNTLLA 
LEQSGALLSR VTSVFFAMTA AHTNDELQRL DEAFSAELAA LSNDIYLNSA LFARVDAVWQ
QRHSLGLDDE SLRLVDVIHQ RFVLAGAQLA EEDKARLKVL NTESATLMSQ FNQRLLAASK
AGGLAVDDAH CLAGLSPEEM TVAAEAAREK GLEERWFIPL LNTTQQPALA TLRDRQTREN
LFAASWTRAE KGDAHDTRAI VQRLVEIRRC QAKLLGFPNY AAWKMADQMA KTPQAALSFM
RGIVPPARQR VLNEQAEIQN VIDGEQGGYT VQAWDWMFYA EQVRREKYAL DEAQLKPYFA
LNTVLQEGVF WTANQLFGIT FVERFDIPVY HPDVRVWEIF DSDGVGMALF YGDFFARDSK
SGGAWMGNFV EQSTLNETRP VIYNVCNYQK PVDGQPALLL WDDVITLFHE FGHTLHGLFA
VQRYATLSGT NTPRDFVEFP SQINEHWASH PRVFERYARH VDSGEKMPAD LQEKMRKASL
FNKGYDMTEL LGAALLDMRW HMLEESVAEQ SVAEFEQQAL AAEHLDLPAV PPRYRSSYFA
HIFGGGYAAG YYAYLWTQML ADDGYQWFVE QGGLTRENGQ RFRDAILSRG NSTDLETLYS
AWRGHEPHID PMLQYRGLDR