Gene SeHA_C4930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4930 
Symbol 
ID6491171 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4808952 
End bp4810361 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content41% 
IMG OID642744975 
Producttype I restriction enzyme StySJI specificity protein 
Protein accessionYP_002048547 
Protein GI194449649 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones81 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGGGG GGAAATTGCC GGAGGGGTGG GCTACAAGCA CAATCAATGA AATGTGCAAC 
CTCAATCCCA AACTTAAACT TGATGATGAT TTAGATGTTG GATTTATGCC GATGGCCGGT
GTTCCAACAA CCTATTTAGG TAAATGTAAC TTTGAAACCA AAAAGTGGAG TGAAGTAAAA
AAAGGATTTA CTCAATTTCA GAACGATGAT GTTATTTTTG CCAAAATCAC GCCATGCTTC
GAAAATGGAA AAGCTGTTGT AATCAAAGAA TTCCCTAACG GCTATGGTGC CGGTAGCACT
GAATATTATG TTCTTCGGTC TATTAATGGG TTAATTAATC CCCATTGGTT GTTTGCTTTA
GTTAAAACTA AAGATTTTTT AACTAATGGA GCACTTAATA TGTCCGGTTC AGTCGGACAT
AAACGTGTTA CTAAAGAATT TCTTGAGAAC TATGGTGTTC CTGTCCCACC TCTTGCCGAA
CAAAAAGTCA TCGCCGAAAA ACTCGATACG CTGCTGGCGC AGGTAGACAG CACCAAAGCA
CGTCTTGAGC AAATCCCACA AATCCTGAAA CGTTTTCGCC AATCAGTGAT AGTTGCAGCA
GTAAACGGGC AACTGACAAA AGAACTTCAT AAAAAAAATA AATTCAAGTT AACAGAATTG
AATATTTCTA TTCCATCTTT ATGGAAAATC AGTGAGATTG GTCAATTTGC TGATGTCAAA
GGTGGCAAGA GATTACCTAA AGGTGAATCA TTAATAGCTG AGAATACAGG GTTTCCATAT
ATTAGAGCAG GGCAGCTAAA AAATGGAACT GTTCTTCCTG AAGGACAACT ATACTTAGAA
GAATATATAC AAAAAAGTAT TTCTAGATAT ACAGTCTCAT CTGGAGATCT TTATATAACT
ATTGTTGGTG CATGTATTGG TGATGCTGGT ATAATTCCAG ATGTTTATAA TAATGCAAAT
TTAACTGAAA ATGCTGCCAA GATATGTAAT TTAAACGAAA ATATTTTCAA TAGATTTCTT
TCTTTATGGC TGAGAAGTAG TTATCTTCAG GATATTATCA ATTCAGAAAT AAAATCAGGA
GCTCAAGGGA AGTTGGCTTT GGCAAGGATA AAATCACTCC CATTGATACT ACCACCACTC
CAAGAACAAC ACGAAATCGT CCGCCGCGTC GAACAACTCT TCGCCTACGC CGACACCATT
GAAAAGCAGG TCAACAACGC CCTGACCCGC GTCAACAGCC TCACCCAGTC GATCCTGGCG
AAGGCCTTCC GCGGCGAGCT GACCGCCCAG TGGCGTGCGG AAAACCCTGA ACTTATCAGC
GGTGAAAACA GCGCCGCCGC CCTGCTGGAA AAAATTAAGG CCGAACGCGC CGCCAGCGGC
GGTAAAAAAA CCTCGCGTAA AAAAGCCTGA
 
Protein sequence
MSGGKLPEGW ATSTINEMCN LNPKLKLDDD LDVGFMPMAG VPTTYLGKCN FETKKWSEVK 
KGFTQFQNDD VIFAKITPCF ENGKAVVIKE FPNGYGAGST EYYVLRSING LINPHWLFAL
VKTKDFLTNG ALNMSGSVGH KRVTKEFLEN YGVPVPPLAE QKVIAEKLDT LLAQVDSTKA
RLEQIPQILK RFRQSVIVAA VNGQLTKELH KKNKFKLTEL NISIPSLWKI SEIGQFADVK
GGKRLPKGES LIAENTGFPY IRAGQLKNGT VLPEGQLYLE EYIQKSISRY TVSSGDLYIT
IVGACIGDAG IIPDVYNNAN LTENAAKICN LNENIFNRFL SLWLRSSYLQ DIINSEIKSG
AQGKLALARI KSLPLILPPL QEQHEIVRRV EQLFAYADTI EKQVNNALTR VNSLTQSILA
KAFRGELTAQ WRAENPELIS GENSAAALLE KIKAERAASG GKKTSRKKA