Gene SeHA_C3646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3646 
Symbol 
ID6491692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3529830 
End bp3531197 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content54% 
IMG OID642743764 
Productserine endoprotease 
Protein accessionYP_002047376 
Protein GI194447307 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones86 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC ACACCCAGCT GTTAAGTGCA TTAGCGTTAA GTGTCGGGTT AACTCTTTCG 
GCGCCGTTTC CAGCCCTTGC ATCGATACCA GGCCAGGTGC CAGGCCAGGC GACGCTGCCA
AGCCTTGCCC CTATGCTGGA GAAAGTGCTG CCTGCTGTCG TCAGCGTAAA AGTCGAGGGA
ACCGCCGCCC AGAGCCAAAA AGTGCCGGAG GAGTTTAAAA AATTCTTTGG CGAGGATCTG
CCAGACCAGC CGTCCCAGCC GTTTGAAGGA CTCGGTTCGG GGGTGATTAT CGATGCCGCG
AAAGGCTATG TATTAACCAA TAATCATGTG ATTAATCAGG CACAGAAGAT CAGTATTCAA
CTGAATGACG GACGCGAATT CGACGCGAAG CTGATCGGCG GCGACGACCA GAGCGATATC
GCTCTGTTAC AAATTCAGAA TCCCAGCAAG TTAACGCAAA TTGCCATCGC CGATTCCGAC
AAACTCCGCG TCGGCGATTT CGCCGTGGCG GTCGGTAATC CGTTTGGTCT TGGACAAACC
GCCACCTCCG GGATTATTTC AGCGCTGGGA CGCAGCGGGC TTAATCTGGA AGGGCTTGAG
AACTTTATTC AAACCGATGC CTCTATTAAC CGCGGCAACT CCGGCGGCGC GCTGCTTAAC
CTGAACGGCG AGCTGATCGG GATTAATACC GCGATCCTCG CGCCAGGGGG CGGGAGCATC
GGCATTGGCT TTGCTATTCC TTCCAATATG GCGCAGACGC TGGCGCAGCA GTTGATTCAG
TTCGGCGAAA TCAAACGCGG ATTGCTGGGA ATTAAAGGCA CTGAAATGAC CGCTGATATC
GCCAAGGCAT TCAAACTGAA CGTTCAGCGT GGCGCTTTTG TCAGCGAGGT TTTACCCAAT
TCAGGTTCGG CGAAGGCCGG GGTGAAATCC GGAGACGTGA TTATCAGTCT TAACGGTAAG
CCGCTGAATA GCTTTGCCGA ACTGCGTTCA CGTATCGCCA CCACCGAACC GGGCACGAAA
GTGAAGCTGG GCCTGTTGCG CGATGGTAAG CCGCTGGAGG TGGACGTCAC GCTGGATTCC
AATACCTCTT CTTCCGCCAG CGCCGAAATG ATCGCCCCGG CGTTGCAAGG CGCGACGTTG
AGCGACGGCC AGCTGAAAGA CGGGACGAAA GGCGTTAAGG TTGATAGCGT CGAAAAAAGC
AGTCCTGCCG CGCAGGCTGG TTTGCAAAAA GATGATGTTA TCATCGGCGT TAATCGCGAT
CGCATCAGTT CTATCGCCGA AATGCGCAAA GTGATGGCGG CAAAACCGTC CATCATTGCT
CTTCAGGTAG TACGCGGCAA CGAGAACATT TATCTATTGC TGCGCTAA
 
Protein sequence
MKKHTQLLSA LALSVGLTLS APFPALASIP GQVPGQATLP SLAPMLEKVL PAVVSVKVEG 
TAAQSQKVPE EFKKFFGEDL PDQPSQPFEG LGSGVIIDAA KGYVLTNNHV INQAQKISIQ
LNDGREFDAK LIGGDDQSDI ALLQIQNPSK LTQIAIADSD KLRVGDFAVA VGNPFGLGQT
ATSGIISALG RSGLNLEGLE NFIQTDASIN RGNSGGALLN LNGELIGINT AILAPGGGSI
GIGFAIPSNM AQTLAQQLIQ FGEIKRGLLG IKGTEMTADI AKAFKLNVQR GAFVSEVLPN
SGSAKAGVKS GDVIISLNGK PLNSFAELRS RIATTEPGTK VKLGLLRDGK PLEVDVTLDS
NTSSSASAEM IAPALQGATL SDGQLKDGTK GVKVDSVEKS SPAAQAGLQK DDVIIGVNRD
RISSIAEMRK VMAAKPSIIA LQVVRGNENI YLLLR