Gene ECH74115_5861 details


Gene Information

Locus tag: ECH74115_5861
Symbol:
ID: 6969406
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 5514767
End bp: 5517199
Gene Length: 2433 bp
Protein Length: 810 aa
Translation table: 11
GC content: 49%
IMG OID: 643389480
Product: type III restriction enzyme domain protein
Protein accession: YP_002273872
Protein GI: 209398938
COG category: [V] Defense mechanisms
COG ID: [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID:
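
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 5514767
END_BP = 5517199
GENE_LENGTH_BP = 2433
PROTEIN_LENGTH_AA = 810


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)     # True
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, 5517199 - 5514767 + 1 = 2433 bp, and 2433 / 3 = 811 codons, of which one is the stop codon, leaving 810 amino acids.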


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGAAC TGAACCTAAG TAACCTGACG GAAGCAGACA TCATTACCAA ATGCGTTATG 
CCAGCCATTC TCAATGCAGG CTGGGACAAC ACAACGCAAA TCAGACAGGA GGTCAAACTC
CGGGACGGTA AAGTCATCGT ACGTGGTAAA GTTGCGGCAC GCAGAACGGT AAAATCTGCT
GACATCGTGC TGTATCACAA ACCTGGCATT CCCTTAGCCG TGATTGAAGC AAAAGCCAAC
AAACATGAAA TTGGCAAAGG GATGCAACAG GGCATTGAAT ATGCGCGCCT GCTGGACGTT
CCTTTTGTTT TTGCCACCAA CGGTGATGGC TTTATCTTCC GCGATGCCAC CGCAGCCGAA
GGTGAATGCC TTGAAAAGCA AATAACGCTG GATGACTTCC CCTCCCCCGC TGAACTCTGG
CAAAAATTCT GCCTCTGGAA AGGATACACA CAAGCTCAGC TTCCGGTGAT TACTCAGGAC
TATTACGACG ATGGCAGCGG TAAATCGCCA CGTTATTACC AGCTTCAGGC AATCAACAAA
ACCATTGAAG CCGTCTCCAA CGGGCAAAAC CGCGTTCTGC TGGTCATGGC GACCGGAACA
GGGAAAACCT ATACCGCATT CCAGATCATC TGGCGCCTGT GGAAATCAAA AAATAAAAAA
CGCATTTTGT TCCTTGCCGA TCGCAATATT CTGGTCGACC AAACCAAAAA TAATGATTTC
CAGCCATTTG GTACGGCAAT GACCAAAGTC AGCGGACGCA CCATTGATCC CGCTTATGAA
ATTCACCTCG CGCTCTATCA GGCTATAACT GGCCCGGAGG AAGACCAAAA AGCGTTTAAA
CAAGTCGCAC CAGATTTCTT CGATCTGATC GTGATCGACG AATGCCATCG CGGCAGCGCA
TCCGAAGACA GCGCCTGGCG AGAAATCCTT GATTATTTTA GTTCCGCCAC CCAAATTGGC
TTAACCGCCA CGCCAAAAGA GACGCATGAA GTCTCCAGCA CGGATTACTT CGGCGATCCG
GTTTACGTCT ACTCACTAAA AGAAGGGATC GAAGACGGCT TCCTCGCCCC TTATAAAGTT
GTCCGTGTTG ATATTGATGT TGATCTGCAA GGCTGGCGCC CAACCAAAGG GCAAACTGAC
TTAAACGGCG AAGTGATCGA CGATCGTATC TATAACCAGA AAGATTTCGA TCGCACGATG
GTAATCGACG AACGCACAGA ACTGGTTGCC AGAACCATTA CCGACTACCT CAAGCGTACC
AATCCGATGG ATAAAACCAT CGTCTTCTGT AACGACATCG ATCATGCAGA ACGTATGCGC
CGCGCCCTGG TTAATCTCAA CCCGGAGCAG GTGAAAAAGA ACGACAAATA CGTCATGAAA
ATCACCGGCG ATGATGAAAT TGGCAAAGCT CAGTTGGATA ACTTCATCAA CCCGAAAAAA
CCGTACCCGG TTATCGCGAC CACTTCAGAG CTGATGACCA CCGGTGTGGA TGCTAAAACC
TGCAAACTGG TAGTACTGGA CCAGAACATC CAGTCGATGA CCAAATTCAA GCAGATTATC
GGTCGTGGTA CACGCATCGA CGAACGTTAC GGCAAACTCT GGTTTACCAT CCTCGACTTT
AAAAAAGCCA CCGAACTGTT TGCCGATGAG CGTTTCGATG GCATTCCCGA AAAAGTCATG
GATACCACAC CAGAGGATAT CGCCGATCCA GAATCTGATT TTGAAGAGAA ACTCGAAGAA
ATCAGCGAAC ATGACGACGA ACAGGTAACA GGCGTTGATG AACCGCCTGC GCCACCATAC
CAGGTTAAAG ATACCGATGA TGTCGGCCCA CTTCCGGAAG AAGACGAGAA GAAAATCCGC
AAGTTTCACG TCAACGGTGT AGCAGTGGGC GTTATTGCCC AGCGTGTTCA GTATTACGAC
GCCGACGGTA AACTGGTTAC CGAATCCTTT AAAGATTACA CCCGCAAAAC ACTGCTCAAA
GAATATGCCT CGCTGGATGA CTTTACCCGC AAGTGGCAGG ACGCCGATCG CAAAGAAGCG
ATCATTCACG AGCTGGAGCA ACAGGGGATC ATCTGGGAAG TACTGGCAGA AGAAGTCGGT
AAAGATCTCG ACCCGTTCGA CATGCTTTGC CACGTAGTGT ATGGTCAGCC GCCGTTAACC
CGCAAAGAGC GCGCCGAGAA CGTGCGCAAG CGGAACTACT TCACAAAATA CTCTGAAGCA
GCGCAAGCCG TGCTCGATAA TCTGCTGGAT AAATACGCCG ATGCGGGCGT ACAGGAGATC
GAAAGTATTC AGGTGCTGAA ACTTAAGCCA TTCGACAGCA TGGGCACCTT ACCGGAGATT
ATTAAAACCG GATTTGGCGA CCGTAACGGG TATAATCAGG CGCTCAGCGA GCTGGAAAAC
GAAATCTACC AATTACCGCC CCGCTCTGCT TAA
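
The gene sequence above is printed in space-separated blocks of ten bases. A minimal sketch of how such a listing could be cleaned up and its GC percentage computed is shown below. The helper names are hypothetical, and the sample is only the first line of the listing, so its value is not the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence above, used as a short sample.
    sample = "ATGGCGGAAC TGAACCTAAG TAACCTGACG GAAGCAGACA TCATTACCAA ATGCGTTATG"
    seq = clean_sequence(sample)
    print(len(seq), round(gc_percent(seq), 1))
```

Passing the full listing (all lines joined) to `clean_sequence` and then to `gc_percent` would give a figure comparable to the "GC content" field.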
 
Protein sequence
MAELNLSNLT EADIITKCVM PAILNAGWDN TTQIRQEVKL RDGKVIVRGK VAARRTVKSA 
DIVLYHKPGI PLAVIEAKAN KHEIGKGMQQ GIEYARLLDV PFVFATNGDG FIFRDATAAE
GECLEKQITL DDFPSPAELW QKFCLWKGYT QAQLPVITQD YYDDGSGKSP RYYQLQAINK
TIEAVSNGQN RVLLVMATGT GKTYTAFQII WRLWKSKNKK RILFLADRNI LVDQTKNNDF
QPFGTAMTKV SGRTIDPAYE IHLALYQAIT GPEEDQKAFK QVAPDFFDLI VIDECHRGSA
SEDSAWREIL DYFSSATQIG LTATPKETHE VSSTDYFGDP VYVYSLKEGI EDGFLAPYKV
VRVDIDVDLQ GWRPTKGQTD LNGEVIDDRI YNQKDFDRTM VIDERTELVA RTITDYLKRT
NPMDKTIVFC NDIDHAERMR RALVNLNPEQ VKKNDKYVMK ITGDDEIGKA QLDNFINPKK
PYPVIATTSE LMTTGVDAKT CKLVVLDQNI QSMTKFKQII GRGTRIDERY GKLWFTILDF
KKATELFADE RFDGIPEKVM DTTPEDIADP ESDFEEKLEE ISEHDDEQVT GVDEPPAPPY
QVKDTDDVGP LPEEDEKKIR KFHVNGVAVG VIAQRVQYYD ADGKLVTESF KDYTRKTLLK
EYASLDDFTR KWQDADRKEA IIHELEQQGI IWEVLAEEVG KDLDPFDMLC HVVYGQPPLT
RKERAENVRK RNYFTKYSEA AQAVLDNLLD KYADAGVQEI ESIQVLKLKP FDSMGTLPEI
IKTGFGDRNG YNQALSELEN EIYQLPPRSA
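
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The function and constant names are illustrative, and codons containing ambiguous bases such as N are not handled.

```python
from itertools import product

# Standard amino-acid assignments in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # remove block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGGCGGAAC TGAACCTAAG TAACCTGACG"))  # MAELNLSNLT
```

The first thirty bases translate to MAELNLSNLT, which matches the first block of the protein sequence listed above.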