Gene SeHA_C4712 details


Gene Information

Locus tag: SeHA_C4712
Symbol:
ID: 6488622
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 4593159
End bp: 4596572
Gene Length: 3414 bp
Protein Length: 1137 aa
Translation table: 11
GC content: 51%
IMG OID: 642744769
Product: type III restriction enzyme, res subunit
Protein accession: YP_002048346
Protein GI: 194448462
COG category: [V] Defense mechanisms
COG ID: [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID:
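
The coordinate and length fields above are related by simple arithmetic, which the short sketch below checks. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4593159
end_bp = 4596572

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not
# counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (3414 bp) and Protein Length (1137 aa) fields.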


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 92
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCAGT CTCTCAATTT TGAAATGTTG CGCAGCCAGT GGCCGGAACT GGCTGAACTC 
GCGTGTATGG CGGAACGCTA TGTTCACTCC GATCCGGAAA GTTGTCTGGT CAAGCTGCGC
AACTACACCG AATTGATGGT GCGCTGGTTG TATCGTCAGG AGCGGTTGCC GGAAGGTATT
AAGGCTAATC TTTACGATTT AATGAATGCT GATGTCTTTA CCAGCATGAT GCCGGAAGCC
ATCATCATGA AAATGGATGC GCTGCGTATC CATGGCAACC GTGCCGCGCA CGGCGGACGT
ATCAAAGCTA AAGATACTTA CTGGCTGCTC AAAGAAGCGT ATTTGTTGGG AATTTGGCTG
TATGTTCGCT ACGCCCACGG TAATGTTGAT GACTGCCCCA AATTTACACT CCCTCCATTA
ACACAATCTT CAGGTCGTGC AGATGAAAAA CGTCTGGAAG ATGCAATCAG GGCTCAGGAT
GAAAGCCGTG AGCGCGAACT GGCGCTGCAA CGCGCACTAC AGCAAGAACA GGAAAAGGCC
GAACACCTCA CTCAACGCCT GAATGAAGCC AGAGCGCGTA ACCAGCATGT TGCCGATATT
CTCTCTATTG ATGAAGCGGA AACCCGCCGT CGCCTGATTG ACTCTCGCCT GCTTGCTGCT
GACTGGAATG TAGGCGAAGG CCTTAAAAAT ACCGATCAGG TTACGCAGGA ACATCCCGTT
AAAGAACAAC CTACCGCGAC CGGAGACGGT TATGCGGACT ATGTCTTGTG GGATGAGGCA
CACAAACCGC TGGCGGTGGT GGAAGCAAAA AAAACCAGCG TCAATGCCGA GCAAGGACGA
ATTCAGGCCC GGCTGTATGC AGACTGGCTG GAAAAAGAAT ACGACCAGCG TCCGATCATC
TTCTACACCA ACGGCTATGA TATCTGGCTG TGGGATGACC ATAAAACTCA TGGTTATCCC
CCGCGTCGGG TGTTTGGCTT CTACAGCAAG GAAAGCCTGC AATATCTTAT TCAACAGCGT
GAAACCCGTC TTCCGCTGAA CAGTGTGCCG CACGTAAAAG ATAACGAAGG TAAGGCCGTT
GCCGGACGTT TGTACCAGCT TGAAACCATT GCCCGTGTCA GCGAGCGGTT TACCAACAAA
TACCGGCAAT CGCTAATTGT TCAGGCTACG GGTACAGGCA AAACCCGCGT GGCAATAGCG
CTAAGCAAAC TAATGATTGA TGCCCGCTGG GTAAAACGCG TCCTGTTTCT TTGCGACCGC
AAGGAACTAC GTAAACAGGC GGCGAATGCC TTTAATCAAT TCACCAATGA GCCGCTGTAT
GTGGTCGGGA AATCGAAAAA AGCAGACAGG CAGAATGCTA GGATCTACAT CGCCACCTAT
CCTGGTATGA TGAAAATCAT GGACCATTTC GATGTTGGTT ATTTCGATTT GATCATCGCC
GATGAATCCC ATCGCTCTAT CTACAACGTT TACGGCGATC TGTTTAAGTA TTTTGATGCC
CTCCAGATTG GCCTGACGGC CACGCCGATT GACATGGTGA GTAAAACCAC TTTCGGCCTG
TTTGGTTGTG AAGGACGTAT TCCTACCGCC AACTACAGCC TGGAAGATGC CATCGCCGAT
AACAATCTGG TTCCCTATGA GGTTGTCACG CACACCACAG AATTCCTGCG CGAAGGCATC
AAACGGGAGA AATTAAGCGA CGCACAAATC CGCGAGCTGG AAGAACAAGG CATAGATCCC
AATACGCTGG AGTTTGATGG CAAGGCGCTG GATGAGGCGA TCTACAACAA AGACACCAAC
CGCTATATCC TGCGTAACCT GATGGAAAAT GGCCTGAAAG ATCGGGACGG CCAGTTGCCA
GGTAAAACCA TCATTTTCGC CCGTAACCAC AAACACGCGC TGCTGTTGAA TGAATTATTC
GACGATATGT ACCCGCAGTT TGCCGGGCGC TTCTGCCAGG TCATCGACAA TTACGATCCC
CGCGCCGAGC AACTGATTGA TGATTTTAAA GGGCTGGATG AAAGCACCAA CAAAGAGCTG
ACCATCGCTA TCTCCGTTGA CATGCTCGAT ACCGGTATTG ATGTCCCGGA AATTGTGAAT
CTGGTCTTTG CCAAACCGGT CAAATCAAAA GTGAAGTTCT GGCAGATGAT TGGCCGTGGT
ACGCGCCTCT GTCCGGGGTT GTACGGTTAC GACGATAACG GCAAACCTCT GGATAAACAG
AAATTCCGCA TTTTCGATCA CTGGGGCAAT TTTGAGTATC ACGAACTGCA TACCGAAGAG
GCCGAAGTCA CAGCGACAAA ATCGCTTGCA CAAAAACGCT TCGAAGCGTG GGTTATGCTG
GGGGCCGCTG CGCAACGTAA ATTCGACAAA CAGGCGGTGG ATTTAGTTGC TCACCAGCTA
CGCGAGCAAA TCAATGCACT GGATGAAAAG TCGATTGCCG TCCAGGAAAA GTGGCATCAA
AAAGCGCAGT ACAGCGATGA AAAAGTGCTG CGCCAGCTTT CCCCGAAAAC ACAGCAGGAT
TTGCTATCTG TCCTGGCCCC GCTGATGCAG TGGCTGGACG TGCGCGGGCA AAGCGACGCT
ATGCGCTTTG ATATGGATAT TCTGGCGGCG CAAACTGCCC GTTATACCAA CCCGGAAGAG
CTGGATGTGC TCTGGCCGGT CATCGTCGAA AAAGTGGAGC GCCTGCCGCC GCATTTGGCG
CAGGTCCAAC AGCAAGGACC GCGGATCAAT CAGCTTCGTG ATTTAGGCTG GTGGAAGCAG
GCCAGCCTGG AAGAGCTGGA AGATATCCGC ATTCATCTGC GCGGCATCAT GCACCTGATG
GAAAAAGACG CGACGCCGAA ATTTGGTTCG ATACAGGTGG ATATTACCGA AGATGCGAAC
CTGATTCAGA CGGAAAACCG CAAAACCAAC ATCCGCTCGA TTGACTTCAA ACTCTATCGC
CAGCAGGTGC AGGGAGCGCT GGAGCCGCTG TTCCAGCAAA ATCCGGTGCT GAAGAAAATC
CGCAACGGCG AGCCTGTGAC GCAAAGCGAG CTGGATGAAC TGGCTAAACT GGTGCTGATC
CAGAACCCTA ACGTTGATAT CCGGGCACTG AAAGAGTTCT ACCCGCAGGC GACCGCCAGC
CTGGATAAAC TATTGCGTAC CATCATCGGG ATGGACAGCG ACGCGGTGGA AGTGCGCTTT
GCTCAGTTCG CCGCTGATAA CAGCCTGACC AGCCAGCAAT TGCGCTTCCT GTCATTGCTG
AAAAACCACA TTCGCGATTA CGGCACCATT GAAATGCGGC AGCTCTTTGA ACAGCCTTTT
ACGCATATCC ACAACGAAGG CGTTACCGGT GTGTTCCCTG ATATAGAGCA GATCGCCCGC
CTACAAAAGA TAGTCGAAGA GCTGGGTGTT GTGACCGACG CAGCAACGGT ATAA
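
The gene sequence above is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before computing a statistic such as the GC content field. The sketch below is one minimal way to do that. The function name `gc_fraction` is hypothetical, and the example input is only the first line of the sequence, not the whole gene.

```python
def gc_fraction(seq_text: str) -> float:
    """Return the fraction of G and C bases, ignoring all whitespace."""
    # Join the 10-base blocks and lines into one uppercase string.
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence above, used as an example input.
first_line = (
    "ATGGAGCAGT CTCTCAATTT TGAAATGTTG "
    "CGCAGCCAGT GGCCGGAACT GGCTGAACTC"
)
print(f"{gc_fraction(first_line):.2f}")
```

Passing the full gene sequence text to the same function should give a value comparable to the reported GC content, assuming that field is computed over the whole gene.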
 
Protein sequence
MEQSLNFEML RSQWPELAEL ACMAERYVHS DPESCLVKLR NYTELMVRWL YRQERLPEGI 
KANLYDLMNA DVFTSMMPEA IIMKMDALRI HGNRAAHGGR IKAKDTYWLL KEAYLLGIWL
YVRYAHGNVD DCPKFTLPPL TQSSGRADEK RLEDAIRAQD ESRERELALQ RALQQEQEKA
EHLTQRLNEA RARNQHVADI LSIDEAETRR RLIDSRLLAA DWNVGEGLKN TDQVTQEHPV
KEQPTATGDG YADYVLWDEA HKPLAVVEAK KTSVNAEQGR IQARLYADWL EKEYDQRPII
FYTNGYDIWL WDDHKTHGYP PRRVFGFYSK ESLQYLIQQR ETRLPLNSVP HVKDNEGKAV
AGRLYQLETI ARVSERFTNK YRQSLIVQAT GTGKTRVAIA LSKLMIDARW VKRVLFLCDR
KELRKQAANA FNQFTNEPLY VVGKSKKADR QNARIYIATY PGMMKIMDHF DVGYFDLIIA
DESHRSIYNV YGDLFKYFDA LQIGLTATPI DMVSKTTFGL FGCEGRIPTA NYSLEDAIAD
NNLVPYEVVT HTTEFLREGI KREKLSDAQI RELEEQGIDP NTLEFDGKAL DEAIYNKDTN
RYILRNLMEN GLKDRDGQLP GKTIIFARNH KHALLLNELF DDMYPQFAGR FCQVIDNYDP
RAEQLIDDFK GLDESTNKEL TIAISVDMLD TGIDVPEIVN LVFAKPVKSK VKFWQMIGRG
TRLCPGLYGY DDNGKPLDKQ KFRIFDHWGN FEYHELHTEE AEVTATKSLA QKRFEAWVML
GAAAQRKFDK QAVDLVAHQL REQINALDEK SIAVQEKWHQ KAQYSDEKVL RQLSPKTQQD
LLSVLAPLMQ WLDVRGQSDA MRFDMDILAA QTARYTNPEE LDVLWPVIVE KVERLPPHLA
QVQQQGPRIN QLRDLGWWKQ ASLEELEDIR IHLRGIMHLM EKDATPKFGS IQVDITEDAN
LIQTENRKTN IRSIDFKLYR QQVQGALEPL FQQNPVLKKI RNGEPVTQSE LDELAKLVLI
QNPNVDIRAL KEFYPQATAS LDKLLRTIIG MDSDAVEVRF AQFAADNSLT SQQLRFLSLL
KNHIRDYGTI EMRQLFEQPF THIHNEGVTG VFPDIEQIAR LQKIVEELGV VTDAATV