Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3281 |
Symbol | lacZ |
ID | 6066946 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3590734 |
End bp | 3593808 |
Gene Length | 3075 bp |
Protein Length | 1024 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641602696 |
Product | beta-D-galactosidase |
Protein accession | YP_001726230 |
Protein GI | 170021276 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000245928 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATGA TTACGGATTC ACTGGCCGTC GTTTTACAAC GTCGTGACTG GGAAAACCCT GGCGTTACCC AACTTAATCG CCTTGCAGCA CATCCCCCTT TCGCCAGCTG GCGTAATAGC GAAGAGGCCC GCACCGATCG CCCTTCCCAA CAGTTGCGCA GCCTGAATGG CGAATGGCGC TTTGCCTGGT TTCCGGCACC AGAAGCGGTG CCGGAAAGCT GGCTGGAGTG CGATCTTCCT GAGGCCGATA CTGTCGTCGT CCCCTCAAAC TGGCAGATGC ACGGTTACGA TGCGCCCATC TACACCAACG TGACCTATCC CATTACGGTC AATCCGCCGT TTGTTCCCAC GGAGAATCCG ACGAGTTGTT ACTCGCTCAC ATTTAATGTT GATGAAAGCT GGCTACAGGA AGGCCAGACG CGAATTATTT TTGATGGCGT TAACTCGGCG TTTCATCTGT GGTGCAACGG GCGCTGGGTC GGTTACGGCC AGGACAGTCG TTTGCCGTCT GAATTTGACC TGAGCGCATT TTTACGCGCC GGAAAAAACC GCCTCGCGGT GATGGTGCTG CGCTGGAGTG ACGGTAGTTA TCTGGAAGAT CAGGATATGT GGCGAATGAG CGGCATTTTC CGTGACGTCT CGTTGCTGCA CAAACCGACT ACACAAATCA GCGATTTCCA TGTTGCCACT CGCTTTAATG ATGATTTCAG CCGCGCTGTA CTGGAGGCTG AAGTTCAGAT GTGCGGCGAG TTGCGTGACT ACCTACGGGT AACAGTTTCT TTATGGCAGG GTGAAACGCA GGTCGCCAGC GGCACCGCGC CTTTCGGCGG TGAAATTATC GATGAGCGTG GTGGTTATGC CGATCGCGTC ACACTACGTC TGAACGTCGA AAACCCGAAA CTGTGGAGCG CCGAAATCCC GAATCTCTAT CGTGCGGTGG TTGAACTGCA CACCGCCGAC GGCACGCTGA TTGAAGCAGA AGCCTGCGAT GTCGGTTTCC GCGAGGTGCG GATTGAAAAT GGTCTGCTGC TGCTGAACGG CAAGCCGTTG CTGATTCGAG GCGTTAACCG TCACGAGCAT CATCCTCTGC ATGGTCAGGT CATGGATGAG CAGACGATGG TGCAGGATAT CCTGCTGATG AAGCAGAACA ACTTTAACGC CGTGCGCTGT TCGCATTATC CGAACCATCC GCTGTGGTAC ACGCTGTGCG ACCGCTACGG CCTGTATGTG GTGGATGAAG CCAATATTGA AACCCACGGC ATGGTGCCAA TGAATCGTCT GACCGATGAT CCGCGCTGGC TACCGGCGAT GAGCGAACGC GTAACGCGAA TGGTGCAGCG CGATCGTAAT CACCCGAGTG TGATCATCTG GTCGCTGGGG AATGAATCAG GCCACGGCGC TAATCACGAC GCGCTGTATC GCTGGATCAA ATCTGTCGAT CCTTCCCGCC CGGTGCAGTA TGAAGGCGGC GGAGCCGACA CCACGGCCAC CGATATTATT TGCCCGATGT ACGCGCGCGT GGATGAAGAC CAGCCCTTCC CGGCTGTGCC GAAATGGTCC ATCAAAAAAT GGCTTTCGCT ACCTGGAGAG ACGCGCCCGC TGATCCTTTG CGAATACGCC CACGCGATGG GTAACAGTCT TGGCGGTTTC GCTAAATACT GGCAGGCGTT TCGTCAGTAT CCCCGTTTAC AGGGCGGCTT CGTCTGGGAC TGGGTGGATC AGTCGCTGAT TAAATATGAT GAAAACGGCA ACCCGTGGTC GGCTTACGGC GGTGATTTTG GCGATACGCC GAACGATCGC CAGTTCTGTA TGAACGGTCT GGTCTTTGCC GACCGCACGC CGCATCCAGC GCTGACGGAA GCAAAACACC AGCAGCAGTT TTTCCAGTTC CGTTTATCCG GGCAAACCAT CGAAGTGACC AGCGAATACC TGTTCCGTCA TAGCGATAAC GAGCTCCTGC ACTGGATGGT GGCGCTGGAT GGTAAGCCGC TGGCAAGCGG TGAAGTGCCT CTGGATGTCG CTCCACAAGG TAAACAGTTG ATTGAACTGC CTGAACTACC GCAGCCGGAG AGCGCCGGGC AACTCTGGCT CACAGTACGC GTAGTGCAAC CGAACGCGAC CGCATGGTCA GAAGCCGGAC ACATCAGCGC CTGGCAGCAG TGGCGTCTGG CTGAAAACCT CAGCGTGACA CTCCCCGCCG CGTCCCACGC CATCCCGCAT CTGACCACCA GCGAAATGGA TTTTTGCATC GAGCTGGGTA ATAAGCGTTG GCAATTTAAC CGCCAGTCAG GCTTTCTTTC ACAGATGTGG ATTGGCGATA AAAAACAACT GCTGACGCCG CTGCGCGATC AGTTCACCCG TGCACCGCTG GATAACGACA TTGGCGTAAG TGAAGCGACC CGCATTGACC CTAACGCCTG GGTCGAACGC TGGAAGGCGG CGGGCCATTA CCAGGCCGAA GCAGCGTTGT TGCAGTGCAC GGCAGATACA CTTGCTGATG CGGTGCTGAT TACGACCGCT CACGCGTGGC AGCATCAGGG GAAAACCTTA TTTATCAGCC GGAAAACCTA CCGGATTGAT GGTAGTGGTC AAATGGCGAT TACCGTTGAT GTTGAAGTGG CGAGCGATAC ACCGCATCCG GCGCGGATTG GCCTGAACTG CCAGCTGGCG CAGGTAGCAG AGCGGGTAAA CTGGCTCGGA TTAGGGCCGC AAGAAAACTA TCCCGACCGC CTTACTGCCG CCTGTTTTGA CCGCTGGGAT CTGCCATTGT CAGACATGTA TACCCCGTAC GTCTTCCCGA GCGAAAACGG TCTGCGCTGC GGGACGCGCG AATTGAATTA TGGCCCACAC CAGTGGCGCG GCGACTTCCA GTTCAACATC AGCCGCTACA GTCAACAGCA ACTGATGGAA ACCAGCCATC GCCATCTGCT GCACGCGGAA GAAGGCACAT GGCTGAATAT CGACGGTTTC CATATGGGGA TTGGTGGCGA CGACTCCTGG AGCCCGTCAG TATCGGCGGA ATTCCAGCTG AGCGCCGGTC GCTACCATTA CCAGTTGGTC TGGTGTCAAA AATAA
|
Protein sequence | MTMITDSLAV VLQRRDWENP GVTQLNRLAA HPPFASWRNS EEARTDRPSQ QLRSLNGEWR FAWFPAPEAV PESWLECDLP EADTVVVPSN WQMHGYDAPI YTNVTYPITV NPPFVPTENP TSCYSLTFNV DESWLQEGQT RIIFDGVNSA FHLWCNGRWV GYGQDSRLPS EFDLSAFLRA GKNRLAVMVL RWSDGSYLED QDMWRMSGIF RDVSLLHKPT TQISDFHVAT RFNDDFSRAV LEAEVQMCGE LRDYLRVTVS LWQGETQVAS GTAPFGGEII DERGGYADRV TLRLNVENPK LWSAEIPNLY RAVVELHTAD GTLIEAEACD VGFREVRIEN GLLLLNGKPL LIRGVNRHEH HPLHGQVMDE QTMVQDILLM KQNNFNAVRC SHYPNHPLWY TLCDRYGLYV VDEANIETHG MVPMNRLTDD PRWLPAMSER VTRMVQRDRN HPSVIIWSLG NESGHGANHD ALYRWIKSVD PSRPVQYEGG GADTTATDII CPMYARVDED QPFPAVPKWS IKKWLSLPGE TRPLILCEYA HAMGNSLGGF AKYWQAFRQY PRLQGGFVWD WVDQSLIKYD ENGNPWSAYG GDFGDTPNDR QFCMNGLVFA DRTPHPALTE AKHQQQFFQF RLSGQTIEVT SEYLFRHSDN ELLHWMVALD GKPLASGEVP LDVAPQGKQL IELPELPQPE SAGQLWLTVR VVQPNATAWS EAGHISAWQQ WRLAENLSVT LPAASHAIPH LTTSEMDFCI ELGNKRWQFN RQSGFLSQMW IGDKKQLLTP LRDQFTRAPL DNDIGVSEAT RIDPNAWVER WKAAGHYQAE AALLQCTADT LADAVLITTA HAWQHQGKTL FISRKTYRID GSGQMAITVD VEVASDTPHP ARIGLNCQLA QVAERVNWLG LGPQENYPDR LTAACFDRWD LPLSDMYTPY VFPSENGLRC GTRELNYGPH QWRGDFQFNI SRYSQQQLME TSHRHLLHAE EGTWLNIDGF HMGIGGDDSW SPSVSAEFQL SAGRYHYQLV WCQK
|
| |