Gene SeAg_B1659 details


Gene Information

Locus tag: SeAg_B1659
Symbol:
ID: 6792899
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 1623659
End bp: 1625701
Gene Length: 2043 bp
Protein Length: 680 aa
Translation table: 11
GC content: 54%
IMG OID: 642775894
Product: dipeptidyl carboxypeptidase II
Protein accession: YP_002146530
Protein GI: 197249959
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0339] Zn-dependent oligopeptidases
TIGRFAM ID:
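
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information record.
START_BP = 1623659
END_BP = 1625701
GENE_LENGTH_BP = 2043
PROTEIN_LENGTH_AA = 680


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the record: 1625701 - 1623659 + 1 = 2043 bp, and 2043 / 3 - 1 = 680 aa.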


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.037506
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGACGA ATCCTTTGTT AGACCAGAGT ATGTTGCCTT ATCAGGCACC GCGTTTTGAT 
CGGATCAAAG ATTGCCATTA TCGTCCTGCT TTTGATGAGG GCGTGCGGCA AAAACGCGTG
GAAATCGAGG CCATCGTCAA TCATCCGGCG GCCCCTGATT TTACGAATAC GCTTCTGGCG
CTGGAGCAAA GCGGGGCGCT TCTGTCACGC GTCACCAGCG TTTTTTTCGC GATGACGGCC
GCGCACACTA ACGATGAACT CCAGCGGTTG GATGAGGCCT TTTCTGCCGA GCTGGCGGCG
CTCTCCAACG ATATTTATCT GAATAGCGCG TTATTCGCTC GCGTGGATGC CGTCTGGCAA
CAGCGTCACT CACTGGGGCT GGATGATGAG TCGCTACGGT TGGTCGATGT TATCCATCAG
CGTTTTGTGT TGGCAGGCGC GCAGCTTGCC GAAGAGGATA AAGCGCGACT GAAGGTATTG
AATACTGAAT CCGCGACCTT AATGAGTCAG TTTAATCAGC GTCTGCTGGC GGCAAGTAAA
GCGGGCGGGC TGGCGGTCGA TGACGCGCAT TGCCTGGCAG GATTAAGCCC GGAAGAAATG
ACCGTCGCTG CTGAAGCGGC GCGTGAAAAA GGCCTGGAGG AGCGCTGGTT CATTCCGCTC
CTTAATACGA CGCAACAGCC TGCGCTTGCT ACACTGCGCG ATCGCCAGAC CCGCGAAAAT
TTATTCGCAG CGTCATGGAC TCGGGCGGAA AAAGGAGATG TCCACGATAC CCGCGCTATC
GTTCAGCGTC TGGTAGAGAT TCGTCGCTGT CAGGCAAAAC TGCTGGGTTT CCCCAATTAT
GCCGCATGGA AAATGGCCGA TCAGATGGCG AAAACGCCGC AAGCCGCACT GAGCTTTATG
CGTGGCATTG TGCCGCCAGC GCGTCAGCGT GTACTCAATG AACAGGCGGA AATTCAGAAC
GTCATTGATG GTGAGCAGGG CGGCTACACC GTTCAGGCCT GGGACTGGAT GTTCTATGCC
GAACAGGTAC GGCGGGAAAA ATATGCGTTA GATGAAGCGC AACTGAAGCC CTATTTTGCC
TTAAATACGG TGTTGCAAGA GGGGGTTTTC TGGACCGCCA ACCAGCTATT CGGCATCACC
TTCGTCGAGC GTTTTGACAT TCCGGTGTAT CACCCTGATG TTCGGGTGTG GGAGATTTTC
GACTCCGATG GTGTCGGCAT GGCGTTATTT TATGGCGACT TCTTCGCGCG GGACTCGAAA
AGCGGCGGCG CGTGGATGGG GAATTTTGTC GAGCAATCCA CACTCAATGA AACCCGGCCC
GTTATATACA ATGTATGTAA CTATCAGAAA CCGGTTGATG GGCAGCCTGC ATTACTGCTT
TGGGACGACG TTATTACGCT CTTCCATGAG TTTGGTCATA CGTTACATGG TTTGTTTGCC
GTTCAGCGCT ATGCCACGCT TTCAGGTACC AATACGCCCC GCGACTTTGT CGAGTTTCCC
TCCCAGATTA ATGAGCATTG GGCGAGCCAT CCACGCGTGT TCGAACGCTA CGCGCGTCAT
GTCGACAGCG GTGAAAAAAT GCCCGCTGAT TTACAGGAAA GAATGCGCAA GGCGAGTTTA
TTTAATAAAG GTTACGATAT GACTGAATTG CTCGGCGCCG CATTGCTGGA TATGCGCTGG
CATATGCTGG AGGAGAGCGT AGCAGAGCAG TCTGTCGCTG AGTTCGAGCA GCAGGCGCTG
GCCGCCGAGC ATCTTGATTT ACCGGCAGTG CCGCCCCGCT ATCGCAGCAG TTATTTTGCC
CATATCTTCG GCGGCGGCTA CGCAGCGGGA TACTACGCCT ACCTGTGGAC GCAAATGTTA
GCGGATGACG GTTATCAATG GTTTGTAGAG CAGGGCGGCC TGACGCGTGA AAACGGGCAG
CGTTTTCGTG ATGCTATCCT TTCCCGGGGG AATAGTACTG ATTTAGAAAC GCTTTATTCA
GCCTGGCGGG GACATGAACC GCATATTGAC CCCATGTTGC AATATCGTGG GCTCGATCGC
TAA
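
The record reports a GC content of 54% for this gene. A figure of that kind can be recomputed from the sequence as laid out here, in space-separated blocks of ten bases. The following is a minimal sketch; the helper names are hypothetical, and the rounding convention used by the database is not stated, so the result may differ from the reported value in the last digit.

```python
def clean_sequence(text: str) -> str:
    """Join space- and newline-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


if __name__ == "__main__":
    # First line of the gene sequence, as an example input.
    first_line = "ATGTCGACGA ATCCTTTGTT AGACCAGAGT ATGTTGCCTT ATCAGGCACC GCGTTTTGAT"
    seq = clean_sequence(first_line)
    print(len(seq), gc_percent(seq))
```

To check the whole gene, pass the full sequence text to `clean_sequence` before calling `gc_percent`.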
 
Protein sequence
MSTNPLLDQS MLPYQAPRFD RIKDCHYRPA FDEGVRQKRV EIEAIVNHPA APDFTNTLLA 
LEQSGALLSR VTSVFFAMTA AHTNDELQRL DEAFSAELAA LSNDIYLNSA LFARVDAVWQ
QRHSLGLDDE SLRLVDVIHQ RFVLAGAQLA EEDKARLKVL NTESATLMSQ FNQRLLAASK
AGGLAVDDAH CLAGLSPEEM TVAAEAAREK GLEERWFIPL LNTTQQPALA TLRDRQTREN
LFAASWTRAE KGDVHDTRAI VQRLVEIRRC QAKLLGFPNY AAWKMADQMA KTPQAALSFM
RGIVPPARQR VLNEQAEIQN VIDGEQGGYT VQAWDWMFYA EQVRREKYAL DEAQLKPYFA
LNTVLQEGVF WTANQLFGIT FVERFDIPVY HPDVRVWEIF DSDGVGMALF YGDFFARDSK
SGGAWMGNFV EQSTLNETRP VIYNVCNYQK PVDGQPALLL WDDVITLFHE FGHTLHGLFA
VQRYATLSGT NTPRDFVEFP SQINEHWASH PRVFERYARH VDSGEKMPAD LQERMRKASL
FNKGYDMTEL LGAALLDMRW HMLEESVAEQ SVAEFEQQAL AAEHLDLPAV PPRYRSSYFA
HIFGGGYAAG YYAYLWTQML ADDGYQWFVE QGGLTRENGQ RFRDAILSRG NSTDLETLYS
AWRGHEPHID PMLQYRGLDR
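
The protein sequence corresponds to the gene sequence translated with translation table 11, as listed in the record. A minimal sketch of that translation follows. It uses the compact codon-table layout with bases ordered T, C, A, G; the amino acid assignments in table 11 match the standard code, and alternative start codons are not handled here. The function name is illustrative.

```python
BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence.
    print(translate("ATGTCGACGA ATCCTTTGTT AGACCAGAGT"))  # MSTNPLLDQS
```

The ten residues produced from the first 30 bases match the first block of the protein sequence, MSTNPLLDQS.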