Gene SeHA_C1683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C1683 
Symbol 
ID6490083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp1636827 
End bp1638869 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content54% 
IMG OID642741906 
Productdipeptidyl carboxypeptidase II 
Protein accessionYP_002045551 
Protein GI194447867 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.897722 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones83 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGACGA ATCCTTTGTT AGAACAGAGT ATGTTGCCTT ATCAGGCACC GCGTTTTGAT 
CGGATCAAAG ATTGCCATTA TCGTCCTGCT TTTGATGAGG GCGTGCGGCA AAAACGCGTG
GAAATCGAGG CCATCGTCAA TCATCCGGCG GTCCCTGATT TTACGAATAC GCTTCTGGCG
CTGGAGCAAA GCGGGGCGCT TCTGTCACGC GTCACCAGCG TTTTTTTCGC GATGACGGCC
GCGCACACTA ACGATGAACT CCAGCGGTTG GATGAGGCCT TTTCTGCCGA GCTGGCGGCG
CTCTCCAACG ATATTTATCT GAATAGCGCG TTATTCGCTC GCGTGGACGC CGTCTGGCAA
CAGCGTCACT CACTGGGGCT GGATGATGAG TCGCTACGGT TGGTCGATGT GATCCATCAG
CGTTTTGTGT TGGCAGGCGC GCAGCTTGCC GAAGAGGATA AAGCGCGACT GAAGGTATTG
AATACTGAAT CCGCGACCTT AATGAGTCAG TTTAATCAGC GTCTGCTGGC GGCAAGTAAA
GCGGGCGGGC TGGCGGTCGA TGACGCGCAT TGCCTGGCAG GATTAAGCCC GGAAGAAATG
ACCGTCGCTG CTGAAGCGGC GCGTGAAAAA GGCCTGGAGG AGCGTTGGTT CATTCCGCTC
CTTAATACGA CGCAACAGCC TGCGCTTGCT ACGCTGCGCG ATCGCCAGAT CCGCGAAAAT
TTATTCGCAG CGTCATGGAC TCGGGCGGAA AAAGGAGATG CCCACGATAC CCGCGCTATC
GTTCAGCGTC TGGTAGAGAT TCGTCGCTGT CAGGCAAAAC TGCTGGGTTT CCCCAATTAT
GCCGCATGGA AAATGGCCGA TCAGATGGCG AAAACGCCGC AAGCCGCACT GAGCTTTATG
CGTGGCATTG TGCCGCCAGC GCGTCAGCGT GTACTCAATG AACAGGCGGA AATTCAGAAC
GTCATTGATG GTGAGCAGGG CGGCTACACC GTTCAGGCCT GGGACTGGAT GTTCTATGCC
GAACAGGTAC GGCGGGAAAA ATATGCGTTA GATGAAGCGC AACTGAAGCC CTATTTTGCC
TTAAATACGG TGTTGCAAGA GGGGGTTTTC TGGACCGCCA ACCAGCTATT CGGCATCACC
TTCGTCGAGC GTTTTGACAT TCCGGTGTAT CACCCTGATG TTCGGGTGTG GGAGATTTTC
GATTCCGATG GTGTCGGCAT GGCGTTATTT TATGGCGACT TCTTCGCGCG GGACTCGAAA
AGCGGCGGCG CGTGGATGGG GAATTTTGTC GAGCAATCCA CACTCAATGA AACCCGGCCC
GTTATATACA ACGTATGTAA CTATCAGAAA CCTGTTGATA GGCAGCCTGC ATTACTGCTT
TGGGACGACG TTATTACACT CTTCCATGAG TTTGGTCATA CGTTACATGG TTTGTTTGCC
GTCCAGCGTT ATGCCACGCT TTCAGGTACC AATACGCCCC GCGACTTTGT CGAGTTTCCC
TCCCAGATTA ATGAGCATTG GGCGAGCCAT CCACGCGTGT TCGAACGCTA CGCGCGTCAT
GTCAACAGCG GTGAAAAAAT GCCTGCTGAT TTACAGGAAA GAATGCGCAA GGCGAGTTTA
TTTAATAAAG GTTACGATAT GACTGAATTG CTCGGCGCCG CATTGCTGGA TATGCGCTGG
CATATGCTGG AGAAGAGCGT AGCAGAGCAG TCTGTCGCTG AGTTCGAGCA GCAGGCGCTG
GCCGCCGAGC ATCTTGATTT ACCGGCGGTG CCGCCCCGCT ATCGCAGCAG TTATTTTGCC
CATATCTTCG GCGGCGGCTA CGCAGCGGGA TACTACGCCT ACCTGTGGAC GCAAATGTTA
GCGGATGACG GTTATCAATG GTTTGTAGAG CAGGGCGGCC TGACGCGTGA AAACGGGCAG
CGTTTTCGTG ATGCTATCCT TTCCCGGGGG AATAGTACTG ATTTAGAAAC GCTTTATTCA
GCCTGGCGTG GACATGAACC GCATATTGAC CCCATGTTGC AATATCGTGG GCTCGATCGC
TAA
 
Protein sequence
MSTNPLLEQS MLPYQAPRFD RIKDCHYRPA FDEGVRQKRV EIEAIVNHPA VPDFTNTLLA 
LEQSGALLSR VTSVFFAMTA AHTNDELQRL DEAFSAELAA LSNDIYLNSA LFARVDAVWQ
QRHSLGLDDE SLRLVDVIHQ RFVLAGAQLA EEDKARLKVL NTESATLMSQ FNQRLLAASK
AGGLAVDDAH CLAGLSPEEM TVAAEAAREK GLEERWFIPL LNTTQQPALA TLRDRQIREN
LFAASWTRAE KGDAHDTRAI VQRLVEIRRC QAKLLGFPNY AAWKMADQMA KTPQAALSFM
RGIVPPARQR VLNEQAEIQN VIDGEQGGYT VQAWDWMFYA EQVRREKYAL DEAQLKPYFA
LNTVLQEGVF WTANQLFGIT FVERFDIPVY HPDVRVWEIF DSDGVGMALF YGDFFARDSK
SGGAWMGNFV EQSTLNETRP VIYNVCNYQK PVDRQPALLL WDDVITLFHE FGHTLHGLFA
VQRYATLSGT NTPRDFVEFP SQINEHWASH PRVFERYARH VNSGEKMPAD LQERMRKASL
FNKGYDMTEL LGAALLDMRW HMLEKSVAEQ SVAEFEQQAL AAEHLDLPAV PPRYRSSYFA
HIFGGGYAAG YYAYLWTQML ADDGYQWFVE QGGLTRENGQ RFRDAILSRG NSTDLETLYS
AWRGHEPHID PMLQYRGLDR