Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0417 |
Symbol | lacZ |
ID | 6971849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 423606 |
End bp | 426680 |
Gene Length | 3075 bp |
Protein Length | 1024 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643384469 |
Product | beta-D-galactosidase |
Protein accession | YP_002268983 |
Protein GI | 209400479 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000241482 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTATGA TTACAGATTC ACTGGCCGTC GTATTACAAC GTCGTGACTG GGAAAACCCT GGCGTTACCC AACTTAATCG CCTTGCAGCA CATCCCCCTT TCGCCAGCTG GCGTAATAGC GAAGAGGCCC GCACCAATCG CCCTTCCCAG CAGTTGCGCA GCCTGAATGG TGAGTGGCAA TTTGTCTGGT TTCCGGCACC AGAAGCGGTT CCGGAAAGCT GGCTGGAGTG CGATCTTCCT GACGCCGATA CTGTCGTCGT CCCCTCAAAC TGGCAGATGC ACGGTTACGA CGCGCCCATC TACACCAACG TGACATATCC CATTACGGTC AATCCGCCAT TTGTTCCCAC GGAGAATCCG ACGGGTTGTT ACTCGCTCAC ATTTAATGTT GATGAAAGCT GGCTACAGGA AGGCCAGACG CGAATTATTT TTGATGGCGT TAACTCGGCG TTTCATCTGT GGTGCAACGG GCGCTGGGTC GGTTACGGCC AGGACAGTCG TTTGCTGTCT GAATTTGACC TGAGCGCATT TTTACGCGCC GGAGAAAACC GCCTCGCGGT GATGGTGCTG CGCTGGAGTG ACGGCAGTTA TCTGGAAGAT CAGGATATGT GGCGGATGAG CGGCATTTTC CGTGACGTCT CGTTGCTGCA CAAACCGACC ACACAAATCA GCGATTTCCA TGTTGCCACT CTCTTTAATG ATGATTTTAG CCGCGCGGTA CTGGAGGCAG AAGTTCAGAT GTACGGCGAG CTGCGCGATG AGCTGCGGGT GACGGTTTCT TTGTGGCAGG GTGAAACGCA GGTCGCCAGC GGCACCGCGC CTTTCGGCGG TGAAATTATC GATGAGCGTG GCGGTTATGC CGATCGCGTC ACACTAGGTC TGAACGTCGA AAACCCGAAA CTGTGGAGCG CCGAAATCCC GAATATCTAT CGTGCGGTGG TTGAACTGCA CACCGCCGAC GGCACGCTGA TTGAAGCAGA AGCCTGCGAT GTCGGTTTCC GCGAGGTGCG GATTGAAAAT GGTCTGCTGC TGCTGAACGG CAAGCCGTTG CTGATTCGCG GCGTTAACCG TCACGAGCAT CATCCTCTGC ATGGTCAGGT CATGGATGAG CAGACGATGG TGCAGGATAT CCTGCTAATG AAGCAGAACA ACTTTAACGC CGTGCGCTGT TCGCATTATC CGAACCATCC GCTGTGGTAC ACCCTGTGCG ACCGCTACGG CCTGTATGTG GTGGATGAAG CCAATATTGA AACCCACGGC ATGGTGCCAA TGAATCGTCT GACCGATGAT CCGCGCTGGT TACCGGCCAT GAGCGAACGA GTAACACGAA TGGTGCAGCG CGATCGTAAT CACCCGAGTG TGATCATCTG GTCGCTGGGG AATGAGTCAG GCCACGGCGC TAATCACGAC GCACTCTATC GCTGGATTAA ATCTGTCGAT CCATCCCGCC CGGTGCAGTA TGAAGGCGGC GGAGCCGACA CCTCCGCAAC CGATATTATT TGCCCGATGT ACGCGCGCGT GGATGAAGAC CAGCCCTTCC CGGCTGTGCC GAAATGGTCC ATCAAAAAAT GGCTTTCGCT GCCTGGAGAA ATGCGCCCAC TGATCCTTTG CGAATACGCC CACGCGATGG GTAACAGTCT TGGCGGCTTC GCTAAATACT GGCAGGCGTT TCGTCAGTAC CCCCGTTTAC AGGGCGGCTT CGTCTGGGAC TTGGTGGATC AGTCGCTGAT TAAATATGAT GAAAATGGCA ATCCGTGGTC GGCTTACGGC GGTGATTTTG GCGATACGCC GAATGATCGC CAGTTCTGCA TGAACGGTCT GGTCTTTGCC GACCGCACGC CGCATCCGGC GCTGACGGAA GCAAAACACC AGCAGCAGTT TTTCCAGTTC CGTTTATCCG GGCGAACCAT CGAAGTGACC AGCGAATACC TGTTCCATCA TAGCGATAAC GAGCTCCTGC ACTGGACGGT GGCGCTGGAT GGTAAGCCGC TGGCAAGCGG TGAAGTGCCT CTGGATGTCG CTCCGCAAGG TAAACAGGTA ATTGAATTGC CTGAACTACC GCGACTGGAG AGCACCGGGC AACTCTGGCT AACGGTACAC GTAGTGCAAC CGAACGCGAC CGCATGGTCA GAAGCCGGAC ACATCAGCGC CTGGCAGCAG TGGCGTCTGG CGGAAAACCT CAGCGTGACA CTCCCCTCCG CGCCCCACGC CATCCCGCAA CTGACCACCA GCGAAACGGA TTTTTGCATC GAGCTGGATA ATAAGCGTTG GCAATTTAAC CGCCAGTCAG GCTTTCTTTC ACAGATGTGG ATTGGCGATA AAAAACAACT GCTGACGCCG CTGCGCGATC AGTTCACCCG CGCACCGCTG GATAACGACA TTGGCGTAAG TGAAGCGACC CGCATTGACC CTAACGCCTG GGTCGAACGC TGGAAGGCTG CGGGCCATTA CCAGGCAGAA GCGGCGTTGT TGCAGTGCAC GGCAGATACA CTTGCCGACG CGGTGCTGAT TACCACTGTC CACGCATGGC AGCATCAGGG AAAAACCTTA TTTATTAGCC GGAAAACCTA CCGGATTGAT GGTAGTGGTC AAATGGCGAT TACCGTTGAT GTTGAAGTAG CGAGCGATAC ACCGCATCCG GCACGGATTG GCCTGACCTG CCAGCTGGCG CAGGTAGCAG AGCGGGTAAA CTGGCTCGGA TTAGGGCCGC AAGAAAACTA TCCCGACCGC CTTACTGCCG CCTGTTTTGA CCGCTGGGAT CTGCCATTGT CAGACATGTA TACCCCGTAC GTCTTCCCGA GCGAAAACGG TCTGCGCTGC GGGACGCGCG AATTGAATTA TGGCCCACAC CAGTGGCGCG GCGACTTCCA GTTCAACATC AGCCGCTACA GCCAACAACA ACTGATGGAA ACCAGCCATC GCCATCTGCT GCACGCGGAA GAAGGAACAT GGCTGAATAT CGACGGTTTC CATATGGGGA TTGGTGGCGA CGACTCCTGG AGCCCGTCAG TATCGGCGGA ATTCCAGCTG AGCGCCGGTC GCTACCATTA CCAGTTGGTC TGGTGTCAAA AATAA
|
Protein sequence | MTMITDSLAV VLQRRDWENP GVTQLNRLAA HPPFASWRNS EEARTNRPSQ QLRSLNGEWQ FVWFPAPEAV PESWLECDLP DADTVVVPSN WQMHGYDAPI YTNVTYPITV NPPFVPTENP TGCYSLTFNV DESWLQEGQT RIIFDGVNSA FHLWCNGRWV GYGQDSRLLS EFDLSAFLRA GENRLAVMVL RWSDGSYLED QDMWRMSGIF RDVSLLHKPT TQISDFHVAT LFNDDFSRAV LEAEVQMYGE LRDELRVTVS LWQGETQVAS GTAPFGGEII DERGGYADRV TLGLNVENPK LWSAEIPNIY RAVVELHTAD GTLIEAEACD VGFREVRIEN GLLLLNGKPL LIRGVNRHEH HPLHGQVMDE QTMVQDILLM KQNNFNAVRC SHYPNHPLWY TLCDRYGLYV VDEANIETHG MVPMNRLTDD PRWLPAMSER VTRMVQRDRN HPSVIIWSLG NESGHGANHD ALYRWIKSVD PSRPVQYEGG GADTSATDII CPMYARVDED QPFPAVPKWS IKKWLSLPGE MRPLILCEYA HAMGNSLGGF AKYWQAFRQY PRLQGGFVWD LVDQSLIKYD ENGNPWSAYG GDFGDTPNDR QFCMNGLVFA DRTPHPALTE AKHQQQFFQF RLSGRTIEVT SEYLFHHSDN ELLHWTVALD GKPLASGEVP LDVAPQGKQV IELPELPRLE STGQLWLTVH VVQPNATAWS EAGHISAWQQ WRLAENLSVT LPSAPHAIPQ LTTSETDFCI ELDNKRWQFN RQSGFLSQMW IGDKKQLLTP LRDQFTRAPL DNDIGVSEAT RIDPNAWVER WKAAGHYQAE AALLQCTADT LADAVLITTV HAWQHQGKTL FISRKTYRID GSGQMAITVD VEVASDTPHP ARIGLTCQLA QVAERVNWLG LGPQENYPDR LTAACFDRWD LPLSDMYTPY VFPSENGLRC GTRELNYGPH QWRGDFQFNI SRYSQQQLME TSHRHLLHAE EGTWLNIDGF HMGIGGDDSW SPSVSAEFQL SAGRYHYQLV WCQK
|
| |