Gene ECH74115_5809 details

Gene Information

Locus tag: ECH74115_5809
Symbol:
ID: 6969870
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 5450290
End bp: 5455224
Gene Length: 4935 bp
Protein Length: 1644 aa
Translation table: 11
GC content: 52%
IMG OID: 643389437
Product: hypothetical protein
Protein accession: YP_002273829
Protein GI: 209400195
COG category: [V] Defense mechanisms
COG ID: [COG1002] Type II restriction enzyme, methylase subunits
TIGRFAM ID:
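
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; both are assumptions about the record's conventions, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5450290
end_bp = 5455224
gene_length_bp = 4935
protein_length_aa = 1644


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming the CDS ends with a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 5455224 - 5450290 + 1 gives 4935 bp, and 4935 / 3 - 1 gives 1644 aa, matching the listed values.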


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCTGG TCGGTATTAA TAACGAAAAC GAATTTTACT CTAACCACTA TTTGGGTGAG 
GTTTTCACCA GTGATATCCG CGATGTGCTG GAACCCTGGA TAGCCCAGGA AAATGCAGCG
CGTGAAGCGG AGCGTGCCGC TCGTGAACAG GGCAAAGACG TGGAGCCGGG ATACCGCGCT
CCGTGGAACC AGTTTAACAG TCTGGCGACT GAGTTTTTCC GCAAACTTGC CGAGCACGAA
AAACAGCGTC AGATCCCGCA GCGTCTGGCC GATCAACGTA ATCGCTGGCA GCCATTGTTA
AAGGCGCTGG GCTACGAAAT TACGCCACAG ATCCAAATGC TGGATGACGA TACACCGCTG
CCGGTACTGG CGCGTTACAA CAGCACTGAC GGTAGCCCGT GGCTGTGGAT TGTTGAAGCA
CACGATCAGG AAGAAGGAAC GCTGGATCCG CTGGCGCTCT CCTTACTGAC CGCGCAATTC
CCGGCGGATA CCGACAAACA TAAGCACGAC AGCCTGCGCA AAAAAGCCAA CGGTGAATAT
CGCAGCTGGC AGGATTTGCT CTCTACGGCG GTCTTCACCC AAAATGAACC GCCGCGTTTT
GTGCTGCTGC TCGGTAACCG TCAGCTATTG TTGTTGGACC GTACTAAGTG GGCGCAAAAC
CGTCTGCTAC GTTTTGATTT TGAAGAGATT TTAAGCCGTC GTGAAACGGA TACGCTGAAA
GCGACTGCGG TGTTGTTACA TAAAGATTCT CTGCTGCCGG GCAGTGGGGC ACCTTATCTT
GACTCGCTGG ATGACAATTC GCACAAACAT GCGTTTGGTG TTTCGGAAGA TCTGAAATAT
GCCCTGCGCG AAAGCATTGA GTTGCTGGGC AACGAAGCGA TGCATTATCT GATCGACCGT
GGCCTGGCAA ACTATACAGG TAATCGTGCG GTGGACCCGG ATGAACTGAG CCGCGAATGT
CTGCGTTACA TGTACCGCCT GCTGTTCCTG TTCTACATTG AAGCGCGCCC GGAGCTGGGT
TATGCGCCAA TGACCGCCAA AACCTATCTG CAAGGTTACA GCCTGGAAAC GTTGCGCGAT
CTGGAGATGA TCCCGCTGAC CAGCGAAGAA GATCGCAACG GGCGCTACTT CCACGACAGC
CTGAATATGC TGTTTAAACT GGTGCGCGAA GGCTACAACG GCGGCGTGAA AATGCAGAGT
GACCTGGAGA GCGGCGACCG GATCACCATC CATAGTCATC AGTTCAGCGT CCCGCGTCTG
GAAAGTCATC TGTTTGATGC CAACAACACG CGCATTCTTA ACCGCGTGGT ATTCCGTAAC
GAAACCCTGC AACAGATTAT CCAGGCGATG TCGTTAAGCC GCCCGGCCAA AGGGCGCTTT
AACCGCCGCG GACGTATTTC TTATCGCCAG TTGGGTATCA ACCAGTTGGG TGCGGTGTAT
GAGGCGCTGC TCTCCTATCG CGGATTCTTC GCCAGCGAAG ATCTCTACGA GGTGAAGAAA
GCCGGGGAAG AGTTTAACGA GCTGGAGACG GGTTACTTCG TCAGTAAGGA TGAGATTAGC
AAATACCACG AAGACGAGAA GGTCTACGAG AAAGACGGCA GTCTGCGCAT TCACCGCAAA
GGCAGCTTTA TCTACCGTAT GGCCGGGCGC GACCGTGAGA AATCTGCCTC TTATTACACC
CCGGAAGTGC TGACCCGCTC ACTGGTTAAA TATGCCCTGA AAGAACTGTT TAAGGAGCAG
ATTGATCCCA TTAGCGATCC GCACGCCAAA GCTGATGCCA TCTTAAACCT CACCGTGTGC
GAACCGGCGA TGGGCAGCGC GGCGTTCCTT AACGAAGCCA TCAACCAGCT GGCGGAAGCG
TATCTGTTCC ACAAGCAGCA GGCGGAAGGT CGCCGTATTC CGCAGGATCG TTACACCCAG
GAGTTACAGC GGGTGAAAAT GTACATTGCC GACAACAACG TTTTCGGCGT GGACTTAAAC
CCGGTGGCGG TGGAACTGGC GGAAGTGTCG CTGTGGCTGA ACGCCATTAG TGGCGATGCG
TTTGTACCGT GGTTTGGTTA CCAGCTGCAC TGCGGTAACT CGCTGGTGGG CGCGCGCCGT
CAGGTGTTCA ACAAGAGCGA ACTGACCTAC AAAAAAGCCA AAGATCCGAG CTGGCTTAAC
AGCGAGCCGG TCGAACTGGC GATGAACACG CCGCGTGAAG AGACGCAGAT TTTCCACTTC
CTGCTGCCCG ACGGCGGTAT GGCTAACTAC AGCGATAAAA CTGTTAAGCA GCGTTATCCG
GATGACTTCA AAGCGCTGGA CAGCTGGCGC AAAGAGTTTA TTAAAAGCTT TGCCGGGCAT
GAGATTGCTG ATGTGCAGCG TATCAGCGAA AAGGTGGAAG CACTGTGGAA CACCTATCGC
CAGCAACTTA AAGCAGAACG TCTGAAAACC GCCGACAGCT ACCCGGTGTG GCCGGCAGAA
AACAGCGAGC AGACGCGTTC TTCGCTGAGC AGTAAAGATG AAACCTTTAG CGGTCGTCTT
GAAGATAACA GCGCCTACCA GAAGCTGCGT TGGGTAATGG ACTACTGGTG CGCGCTGTGG
TTCTGGCCGA TCGACAAAGC CGATGAGCTA CCGGATCGCG GCACCTGGTT GTTTGAGATT
GAAACCCTGC TTGACGGGAT TGTAATCACG GAAAAAGTCA CTGAAGTTGC GGAGCACACC
ACCGGCGATC TGTTTGCCGA AGAAGGCCTG CTGCGAGAAG AGTCTTCGCT GTTTTCTGTT
GCTGGTCGTC TGAAAACCGA GGTGTTGTTC CGTCATTTGC CGCGTCTGGC GATTGTCGAT
GCCCTGAGAA AGCAGCACCG TTTCTTCCAC TGGGATCTGG AGTTCTGCGA CCTGTTTGCC
GAGCGCGGCG GTTTTGACCT GATGCTCGGA AACCCGCCGT GGCTGAAAGT GGAATGGCAG
GAAGCGGGCG TGCTGGGTGA TTACGAGCCG GAATTTGTGC TGCGTAAGCT GAGCGCCTCG
AAGCTGGCAA CGTTGCGTAT TGATACCTTT AACCAGATCC CGGCGCTGGA AGCGGCCTGG
CGCAGCGAGT ATGAAGGCTG TGAAGGGATG CAAAACTTCC TGAATGCGCA GCAGAACTAC
CCGGTACTGC GCGGGGTGCA GACCAACTTG TATAAATGCT TTCTGCCGCA GGCCTGGCGA
TTAGGGGCGC AGAAAGGCGT GGCAGGTTTC CTGCACCCGG AAGGGATTTA TGATGACCCG
AAAGGCGGGC AATTACGTGC GGCGGTATAT CCGAGGCTGA GGGCGCATTT TCAGTTTCAG
AATGAGTTAA ATTTGTTTGT TGAAGTTGAT CACCATGCGA AGTTTAGCAG CAATATTTAT
TCTGCTAGCC CTAGCACAGT GGGATTTGAA CATATATCTA ATTTGTATGC TCCGCAAACT
ATTGATGCAT GTTTTGAACA TTCTGGCAGT GGGGACATTC CCGGTCTCAA AGACGAGATT
GAGAGCGAGG GAAAATTAAA AGTTGTATGG AACACATCTG GCCACCGTTC TCGATTAATA
AGTATCGCCA CTCATGAGCT AGAATTATTT GCTCGTCTAT ATGACAGCGA AGGAACGCCA
GCCTGGCAGG CACGTTTGCC AGCCTTACAT GCTAAACAAC TTGTTGCTGT ACTGGAAAAG
TTTGCTAATC AGCCGAATAG ATTAGGTGAT TTGCAGGGGC AGTATTTTTC AACGGTTATG
TTCGATGAAA CATATGCTCA GAGGGATGGG ACAATTTTAC GGCAGACTCA ATTCCCTCAA
GATTCATCAC AATGGGTACT GTCTGGCCCT CATTTCTTTG TTGGGACGCC GTTCTACAAG
ACTCCGCGCG AAAACTGTAC GCTTAACAGC GATTATGACT GCCTGGACTT GCTAACTCTG
CCTGACGACT ATCTGCCGCG CACTAACTAC ATTCCGGCAT GTGATGCACA GGAGTATGCA
AAACGTACTC CATGCGTTAC ATGGACTGAA CTGGCTGAAG ATGAACCGAA GAAGGTAACA
GATTATTATC GCTTAGCTAT CAGAGCCATG TTGGCTCAAT CGGGGGAACG TACACTAATT
AGTGCTATTT ATCCGCCAGA AATAAGTCAC ATGAACGCAG TACGTTCTTA CTGCTATAGC
TCACAGAATC TGTTACTCGA ACATTCAGGT ATGTGTTTTT CTTTACCTTT TGATTTTATT
TGTAAATCTA CTGGCAAGGC AAACTTACAT CAGATGCTTG ATGGTTTCTC ATACGTATTA
TTCAATCCGA GACAAAAGGC ATTATTATAC TGCTTAGTAT TATCATTAAA TTCTGTAAAT
GATGTATATG CTGGCCTTTG GCAATCCTGC TACACCCCAG ACTTCAACAC CCAGCGTTGG
AGCCGCGATC TCCCGCAGCT CCCCCAGGAT TTCTTCGCCA AACTGACCCC AGAGTGGCAG
CGTAACTGCG CTTTACGCTC TGACTACAGT CGTCGTCAGG CGCTGGTGGA AATCGACGTA
TTGGTGGCGC AGGCGCTGGG GTTAACTCTC GAAGAGCTGC TTACCATTTA TCGCGTTCAG
TTCCCGGTGA TGCGCCAGTA CGAAGCGGAT ACCTGGTACG ATCAAAACGG TCGCATTATC
TTTACCCCAA GCAAAGGGCT GGTGGGCGTT GGCTTGCCTC GCACCGCGCG TAAAGCTGAC
CTGAAAAACG GCTTTGTCTT TAACGTCGAC AGCCCGGAGT GGACCGGCGG TGACTGCACC
GATCAAGCTA TCGGTTGGGA TGATGTCAAA CATCTTAAAA CCGGTACCGT CAGCGTCACC
TTTGATGATT ATACCCGCAG CGACGAAGGT GAGCGCCGTA CCGTCACCTG GCAGGCTCCG
TTTATCAAGC CAGATCGCGA AGATGACTAC AAAGTGGCCT GGGCGTTCTT TGCACAAGAT
AAGGAGAGCG CCTGA
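
The GC content reported in the Gene Information section can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as text in the 10-base block layout shown above; the helper names `clean_sequence` and `gc_fraction` are illustrative and do not come from any database tool. Only the first row of the sequence is used here to keep the example short.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks of the block layout and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGGCGCTGG TCGGTATTAA TAACGAAAAC GAATTTTACT CTAACCACTA TTTGGGTGAG"

seq = clean_sequence(first_row)
print(len(seq))
print(f"{gc_fraction(seq):.0%}")
```

Running the same two functions on the full gene sequence gives a value that can be compared with the GC content field.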
 
Protein sequence
MALVGINNEN EFYSNHYLGE VFTSDIRDVL EPWIAQENAA REAERAAREQ GKDVEPGYRA 
PWNQFNSLAT EFFRKLAEHE KQRQIPQRLA DQRNRWQPLL KALGYEITPQ IQMLDDDTPL
PVLARYNSTD GSPWLWIVEA HDQEEGTLDP LALSLLTAQF PADTDKHKHD SLRKKANGEY
RSWQDLLSTA VFTQNEPPRF VLLLGNRQLL LLDRTKWAQN RLLRFDFEEI LSRRETDTLK
ATAVLLHKDS LLPGSGAPYL DSLDDNSHKH AFGVSEDLKY ALRESIELLG NEAMHYLIDR
GLANYTGNRA VDPDELSREC LRYMYRLLFL FYIEARPELG YAPMTAKTYL QGYSLETLRD
LEMIPLTSEE DRNGRYFHDS LNMLFKLVRE GYNGGVKMQS DLESGDRITI HSHQFSVPRL
ESHLFDANNT RILNRVVFRN ETLQQIIQAM SLSRPAKGRF NRRGRISYRQ LGINQLGAVY
EALLSYRGFF ASEDLYEVKK AGEEFNELET GYFVSKDEIS KYHEDEKVYE KDGSLRIHRK
GSFIYRMAGR DREKSASYYT PEVLTRSLVK YALKELFKEQ IDPISDPHAK ADAILNLTVC
EPAMGSAAFL NEAINQLAEA YLFHKQQAEG RRIPQDRYTQ ELQRVKMYIA DNNVFGVDLN
PVAVELAEVS LWLNAISGDA FVPWFGYQLH CGNSLVGARR QVFNKSELTY KKAKDPSWLN
SEPVELAMNT PREETQIFHF LLPDGGMANY SDKTVKQRYP DDFKALDSWR KEFIKSFAGH
EIADVQRISE KVEALWNTYR QQLKAERLKT ADSYPVWPAE NSEQTRSSLS SKDETFSGRL
EDNSAYQKLR WVMDYWCALW FWPIDKADEL PDRGTWLFEI ETLLDGIVIT EKVTEVAEHT
TGDLFAEEGL LREESSLFSV AGRLKTEVLF RHLPRLAIVD ALRKQHRFFH WDLEFCDLFA
ERGGFDLMLG NPPWLKVEWQ EAGVLGDYEP EFVLRKLSAS KLATLRIDTF NQIPALEAAW
RSEYEGCEGM QNFLNAQQNY PVLRGVQTNL YKCFLPQAWR LGAQKGVAGF LHPEGIYDDP
KGGQLRAAVY PRLRAHFQFQ NELNLFVEVD HHAKFSSNIY SASPSTVGFE HISNLYAPQT
IDACFEHSGS GDIPGLKDEI ESEGKLKVVW NTSGHRSRLI SIATHELELF ARLYDSEGTP
AWQARLPALH AKQLVAVLEK FANQPNRLGD LQGQYFSTVM FDETYAQRDG TILRQTQFPQ
DSSQWVLSGP HFFVGTPFYK TPRENCTLNS DYDCLDLLTL PDDYLPRTNY IPACDAQEYA
KRTPCVTWTE LAEDEPKKVT DYYRLAIRAM LAQSGERTLI SAIYPPEISH MNAVRSYCYS
SQNLLLEHSG MCFSLPFDFI CKSTGKANLH QMLDGFSYVL FNPRQKALLY CLVLSLNSVN
DVYAGLWQSC YTPDFNTQRW SRDLPQLPQD FFAKLTPEWQ RNCALRSDYS RRQALVEIDV
LVAQALGLTL EELLTIYRVQ FPVMRQYEAD TWYDQNGRII FTPSKGLVGV GLPRTARKAD
LKNGFVFNVD SPEWTGGDCT DQAIGWDDVK HLKTGTVSVT FDDYTRSDEG ERRTVTWQAP
FIKPDREDDY KVAWAFFAQD KESA
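
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation written from scratch, not the database's own pipeline. It relies on table 11 assigning the same amino acids to codons as the standard genetic code, and it assumes the input is already in frame and contains only A, C, G and T. The name `translate` is illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG order. Table 11 uses the same
# amino acid assignments as the standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGGCGCTGG TCGGTATTAA TAACGAAAAC"))  # MALVGINNEN
print(translate("AAGGAGAGCG CCTGA"))                  # KESA
```

The two outputs match the first ten and last four residues of the protein sequence above.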