Gene ECH74115_0417 details


Gene Information

Locus tag: ECH74115_0417
Symbol: lacZ
ID: 6971849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 423606
End bp: 426680
Gene Length: 3075 bp
Protein Length: 1024 aa
Translation table: 11
GC content: 56%
IMG OID: 643384469
Product: beta-D-galactosidase
Protein accession: YP_002268983
Protein GI: 209400479
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
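
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a stop codon that is not counted in the protein length; neither assumption is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 423606
end_bp = 426680
gene_length_bp = 3075
protein_length_aa = 1024


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 3075
print(expected_protein_length(gene_length_bp))  # 1024
```

Under these assumptions both results agree with the reported Gene Length and Protein Length.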


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000241482
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTATGA TTACAGATTC ACTGGCCGTC GTATTACAAC GTCGTGACTG GGAAAACCCT 
GGCGTTACCC AACTTAATCG CCTTGCAGCA CATCCCCCTT TCGCCAGCTG GCGTAATAGC
GAAGAGGCCC GCACCAATCG CCCTTCCCAG CAGTTGCGCA GCCTGAATGG TGAGTGGCAA
TTTGTCTGGT TTCCGGCACC AGAAGCGGTT CCGGAAAGCT GGCTGGAGTG CGATCTTCCT
GACGCCGATA CTGTCGTCGT CCCCTCAAAC TGGCAGATGC ACGGTTACGA CGCGCCCATC
TACACCAACG TGACATATCC CATTACGGTC AATCCGCCAT TTGTTCCCAC GGAGAATCCG
ACGGGTTGTT ACTCGCTCAC ATTTAATGTT GATGAAAGCT GGCTACAGGA AGGCCAGACG
CGAATTATTT TTGATGGCGT TAACTCGGCG TTTCATCTGT GGTGCAACGG GCGCTGGGTC
GGTTACGGCC AGGACAGTCG TTTGCTGTCT GAATTTGACC TGAGCGCATT TTTACGCGCC
GGAGAAAACC GCCTCGCGGT GATGGTGCTG CGCTGGAGTG ACGGCAGTTA TCTGGAAGAT
CAGGATATGT GGCGGATGAG CGGCATTTTC CGTGACGTCT CGTTGCTGCA CAAACCGACC
ACACAAATCA GCGATTTCCA TGTTGCCACT CTCTTTAATG ATGATTTTAG CCGCGCGGTA
CTGGAGGCAG AAGTTCAGAT GTACGGCGAG CTGCGCGATG AGCTGCGGGT GACGGTTTCT
TTGTGGCAGG GTGAAACGCA GGTCGCCAGC GGCACCGCGC CTTTCGGCGG TGAAATTATC
GATGAGCGTG GCGGTTATGC CGATCGCGTC ACACTAGGTC TGAACGTCGA AAACCCGAAA
CTGTGGAGCG CCGAAATCCC GAATATCTAT CGTGCGGTGG TTGAACTGCA CACCGCCGAC
GGCACGCTGA TTGAAGCAGA AGCCTGCGAT GTCGGTTTCC GCGAGGTGCG GATTGAAAAT
GGTCTGCTGC TGCTGAACGG CAAGCCGTTG CTGATTCGCG GCGTTAACCG TCACGAGCAT
CATCCTCTGC ATGGTCAGGT CATGGATGAG CAGACGATGG TGCAGGATAT CCTGCTAATG
AAGCAGAACA ACTTTAACGC CGTGCGCTGT TCGCATTATC CGAACCATCC GCTGTGGTAC
ACCCTGTGCG ACCGCTACGG CCTGTATGTG GTGGATGAAG CCAATATTGA AACCCACGGC
ATGGTGCCAA TGAATCGTCT GACCGATGAT CCGCGCTGGT TACCGGCCAT GAGCGAACGA
GTAACACGAA TGGTGCAGCG CGATCGTAAT CACCCGAGTG TGATCATCTG GTCGCTGGGG
AATGAGTCAG GCCACGGCGC TAATCACGAC GCACTCTATC GCTGGATTAA ATCTGTCGAT
CCATCCCGCC CGGTGCAGTA TGAAGGCGGC GGAGCCGACA CCTCCGCAAC CGATATTATT
TGCCCGATGT ACGCGCGCGT GGATGAAGAC CAGCCCTTCC CGGCTGTGCC GAAATGGTCC
ATCAAAAAAT GGCTTTCGCT GCCTGGAGAA ATGCGCCCAC TGATCCTTTG CGAATACGCC
CACGCGATGG GTAACAGTCT TGGCGGCTTC GCTAAATACT GGCAGGCGTT TCGTCAGTAC
CCCCGTTTAC AGGGCGGCTT CGTCTGGGAC TTGGTGGATC AGTCGCTGAT TAAATATGAT
GAAAATGGCA ATCCGTGGTC GGCTTACGGC GGTGATTTTG GCGATACGCC GAATGATCGC
CAGTTCTGCA TGAACGGTCT GGTCTTTGCC GACCGCACGC CGCATCCGGC GCTGACGGAA
GCAAAACACC AGCAGCAGTT TTTCCAGTTC CGTTTATCCG GGCGAACCAT CGAAGTGACC
AGCGAATACC TGTTCCATCA TAGCGATAAC GAGCTCCTGC ACTGGACGGT GGCGCTGGAT
GGTAAGCCGC TGGCAAGCGG TGAAGTGCCT CTGGATGTCG CTCCGCAAGG TAAACAGGTA
ATTGAATTGC CTGAACTACC GCGACTGGAG AGCACCGGGC AACTCTGGCT AACGGTACAC
GTAGTGCAAC CGAACGCGAC CGCATGGTCA GAAGCCGGAC ACATCAGCGC CTGGCAGCAG
TGGCGTCTGG CGGAAAACCT CAGCGTGACA CTCCCCTCCG CGCCCCACGC CATCCCGCAA
CTGACCACCA GCGAAACGGA TTTTTGCATC GAGCTGGATA ATAAGCGTTG GCAATTTAAC
CGCCAGTCAG GCTTTCTTTC ACAGATGTGG ATTGGCGATA AAAAACAACT GCTGACGCCG
CTGCGCGATC AGTTCACCCG CGCACCGCTG GATAACGACA TTGGCGTAAG TGAAGCGACC
CGCATTGACC CTAACGCCTG GGTCGAACGC TGGAAGGCTG CGGGCCATTA CCAGGCAGAA
GCGGCGTTGT TGCAGTGCAC GGCAGATACA CTTGCCGACG CGGTGCTGAT TACCACTGTC
CACGCATGGC AGCATCAGGG AAAAACCTTA TTTATTAGCC GGAAAACCTA CCGGATTGAT
GGTAGTGGTC AAATGGCGAT TACCGTTGAT GTTGAAGTAG CGAGCGATAC ACCGCATCCG
GCACGGATTG GCCTGACCTG CCAGCTGGCG CAGGTAGCAG AGCGGGTAAA CTGGCTCGGA
TTAGGGCCGC AAGAAAACTA TCCCGACCGC CTTACTGCCG CCTGTTTTGA CCGCTGGGAT
CTGCCATTGT CAGACATGTA TACCCCGTAC GTCTTCCCGA GCGAAAACGG TCTGCGCTGC
GGGACGCGCG AATTGAATTA TGGCCCACAC CAGTGGCGCG GCGACTTCCA GTTCAACATC
AGCCGCTACA GCCAACAACA ACTGATGGAA ACCAGCCATC GCCATCTGCT GCACGCGGAA
GAAGGAACAT GGCTGAATAT CGACGGTTTC CATATGGGGA TTGGTGGCGA CGACTCCTGG
AGCCCGTCAG TATCGGCGGA ATTCCAGCTG AGCGCCGGTC GCTACCATTA CCAGTTGGTC
TGGTGTCAAA AATAA
 
Protein sequence
MTMITDSLAV VLQRRDWENP GVTQLNRLAA HPPFASWRNS EEARTNRPSQ QLRSLNGEWQ 
FVWFPAPEAV PESWLECDLP DADTVVVPSN WQMHGYDAPI YTNVTYPITV NPPFVPTENP
TGCYSLTFNV DESWLQEGQT RIIFDGVNSA FHLWCNGRWV GYGQDSRLLS EFDLSAFLRA
GENRLAVMVL RWSDGSYLED QDMWRMSGIF RDVSLLHKPT TQISDFHVAT LFNDDFSRAV
LEAEVQMYGE LRDELRVTVS LWQGETQVAS GTAPFGGEII DERGGYADRV TLGLNVENPK
LWSAEIPNIY RAVVELHTAD GTLIEAEACD VGFREVRIEN GLLLLNGKPL LIRGVNRHEH
HPLHGQVMDE QTMVQDILLM KQNNFNAVRC SHYPNHPLWY TLCDRYGLYV VDEANIETHG
MVPMNRLTDD PRWLPAMSER VTRMVQRDRN HPSVIIWSLG NESGHGANHD ALYRWIKSVD
PSRPVQYEGG GADTSATDII CPMYARVDED QPFPAVPKWS IKKWLSLPGE MRPLILCEYA
HAMGNSLGGF AKYWQAFRQY PRLQGGFVWD LVDQSLIKYD ENGNPWSAYG GDFGDTPNDR
QFCMNGLVFA DRTPHPALTE AKHQQQFFQF RLSGRTIEVT SEYLFHHSDN ELLHWTVALD
GKPLASGEVP LDVAPQGKQV IELPELPRLE STGQLWLTVH VVQPNATAWS EAGHISAWQQ
WRLAENLSVT LPSAPHAIPQ LTTSETDFCI ELDNKRWQFN RQSGFLSQMW IGDKKQLLTP
LRDQFTRAPL DNDIGVSEAT RIDPNAWVER WKAAGHYQAE AALLQCTADT LADAVLITTV
HAWQHQGKTL FISRKTYRID GSGQMAITVD VEVASDTPHP ARIGLTCQLA QVAERVNWLG
LGPQENYPDR LTAACFDRWD LPLSDMYTPY VFPSENGLRC GTRELNYGPH QWRGDFQFNI
SRYSQQQLME TSHRHLLHAE EGTWLNIDGF HMGIGGDDSW SPSVSAEFQL SAGRYHYQLV
WCQK
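
The sequences above are printed in groups of ten characters, so the spaces and line breaks have to be removed before any length or composition check. The following is a minimal sketch of that cleanup, with a GC percentage helper and a stop codon check; the helper names are hypothetical, and only the first and last gene sequence lines are embedded here for brevity.

```python
# Stop codons in translation table 11 (the table listed in this record).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text):
    """Remove the spaces and line breaks used to group residues for display."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def ends_with_stop(dna):
    """True if the last three bases form a stop codon."""
    return dna[-3:] in STOP_CODONS


# First and last lines of the gene sequence as printed above.
first_line = "ATGACTATGA TTACAGATTC ACTGGCCGTC GTATTACAAC GTCGTGACTG GGAAAACCCT"
last_line = "TGGTGTCAAA AATAA"

print(len(clean_sequence(first_line)))            # 60
print(ends_with_stop(clean_sequence(last_line)))  # True
```

Passing the full gene sequence block through `clean_sequence` should give 3075 bases if the listing is complete, and `gc_percent` on that string can be compared against the reported GC content of 56%.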