Gene ECD_00298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_00298 
SymbollacZ 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp332766 
End bp335840 
Gene Length3075 bp 
Protein Length1024 aa 
Translation table11 
GC content56% 
IMG OID 
Productbeta-D-galactosidase 
Protein accessionACT42197 
Protein GI253976527 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value2.6518e-05 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATGA TTACGGATTC ACTGGCCGTC GTTTTACAAC GTCGTGACTG GGAAAACCCT 
GGCGTTACCC AACTTAATCG CCTTGCAGCA CATCCCCCTT TCGCCAGCTG GCGTAATAGC
GAAGAGGCCC GCACCGATCG CCCTTCCCAA CAGTTGCGCA GCCTGAATGG CGAATGGCGC
TTTGCCTGGT TTCCGGCACC AGAAGCGGTG CCGGAAAGCT GGCTGGAGTG CGATCTTCCT
GAGGCCGATA CTGTCGTCGT CCCCTCAAAC TGGCAGATGC ACGGTTACGA TGCGCCCATC
TACACCAACG TGACCTATCC CATTACGGTC AATCCGCCGT TTGTTCCCAC GGAGAATCCG
ACGGGTTGTT ACTCGCTCAC ATTTAATGTT GATGAAAGCT GGCTACAGGA AGGCCAGACG
CGAATTATTT TTGATGGCGT TAACTCGGCG TTTCATCTGT GGTGCAACGG GCGCTGGGTC
GGTTACGGCC AGGACAGTCG TTTGCCGTCT GAATTTGACC TGAGCGCATT TTTACGCGCC
GGAGAAAACC GCCTCGCGGT GATGGTGCTG CGCTGGAGTG ACGGCAGTTA TCTGGAAGAT
CAGGATATGT GGCGGATGAG CGGCATTTTC CGTGACGTCT CGTTGCTGCA TAAACCGACT
ACACAAATCA GCGATTTCCA TGTTGCCACT CGCTTTAATG ATGATTTCAG CCGCGCTGTA
CTGGAGGCTG AAGTTCAGAT GTGCGGCGAG TTGCGTGACT ACCTACGGGT AACAGTTTCT
TTATGGCAGG GTGAAACGCA GGTCGCCAGC GGCACCGCGC CTTTCGGCGG TGAAATTATC
GATGAGCGTG GTGGTTATGC CGATCGCGTC ACACTACGTC TGAACGTCGA AAACCCGAAA
CTGTGGAGCG CCGAAATCCC GAATCTCTAT CGTGCGGTGG TTGAACTGCA CACCGCCGAC
GGCACGCTGA TTGAAGCAGA AGCCTGCGAT GTCGGTTTCC GCGAGGTGCG GATTGAAAAT
GGTCTGCTGC TGCTGAACGG CAAGCCGTTG CTGATTCGAG GCGTTAACCG TCACGAGCAT
CATCCTCTGC ATGGTCAGGT CATGGATGAG CAGACGATGG TGCAGGATAT CCTGCTGATG
AAGCAGAACA ACTTTAACGC CGTGCGCTGT TCGCATTATC CGAACCATCC GCTGTGGTAC
ACGCTGTGCG ACCGCTACGG CCTGTATGTG GTGGATGAAG CCAATATTGA AACCCACGGC
ATGGTGCCAA TGAATCGTCT GACCGATGAT CCGCGCTGGC TACCGGCGAT GAGCGAACGC
GTAACGCGAA TGGTGCAGCG CGATCGTAAT CACCCGAGTG TGATCATCTG GTCGCTGGGG
AATGAATCAG GCCACGGCGC TAATCACGAC GCGCTGTATC GCTGGATCAA ATCTGTCGAT
CCTTCCCGCC CGGTGCAGTA TGAAGGCGGC GGAGCCGACA CCACGGCCAC CGATATTATT
TGCCCGATGT ACGCGCGCGT GGATGAAGAC CAGCCCTTCC CGGCTGTGCC GAAATGGTCC
ATCAAAAAAT GGCTTTCGCT ACCTGGAGAG ACGCGCCCGC TGATCCTTTG CGAATACGCC
CACGCGATGG GTAACAGTCT TGGCGGTTTC GCTAAATACT GGCAGGCGTT TCGTCAGTAT
CCCCGTTTAC AGGGCGGCTT CGTCTGGGAC TGGGTGGATC AGTCGCTGAT TAAATATGAT
GAAAACGGCA ACCCGTGGTC GGCTTACGGC GGTGATTTTG GCGATACGCC GAACGATCGC
CAGTTCTGTA TGAACGGTCT GGTCTTTGCC GACCGCACGC CGCATCCAGC GCTGACGGAA
GCAAAACACC AGCAGCAGTT TTTCCAGTTC CGTTTATCCG GGCAAACCAT CGAAGTGACC
AGCGAATACC TGTTCCGTCA TAGCGATAAC GAGCTCCTGC ACTGGATGGT GGCGCTGGAT
GGTAAGCCGC TGGCAAGCGG TGAAGTGCCT CTGGATGTCG CTCCACAAGG TAAACAGTTG
ATTGAACTGC CTGAACTACC GCAGCCGGAG AGCGCCGGGC AACTCTGGCT CACAGTACGC
GTAGTGCAAC CGAACGCGAC CGCATGGTCA GAAGCCGGAC ACATCAGCGC CTGGCAGCAG
TGGCGTCTGG CTGAAAACCT CAGCGTGACA CTCCCCGCCG CGTCCCACGC CATCCCGCAT
CTGACCACCA GCGAAATGGA TTTTTGCATC GAGCTGGGTA ATAAGCGTTG GCAATTTAAC
CGCCAGTCAG GCTTTCTTTC ACAGATGTGG ATTGGCGATA AAAAACAACT GCTGACGCCG
CTGCGCGATC AGTTCACCCG TGCACCGCTG GATAACGACA TTGGCGTAAG TGAAGCGACC
CGCATTGACC CTAACGCCTG GGTCGAACGC TGGAAGGCGG CGGGCCATTA CCAGGCCGAA
GCAGCGTTGT TGCAGTGCAC GGCAGATACA CTTGCTGATG CGGTGCTGAT TACGACCGCT
CACGCGTGGC AGCATCAGGG GAAAACCTTA TTTATCAGCC GGAAAACCTA CCGGATTGAT
GGTAGTGGTC AAATGGCGAT TACCGTTGAT GTTGAAGTGG CGAGCGATAC ACCGCATCCG
GCGCGGATTG GCCTGAACTG CCAGCTGGCG CAGGTAGCAG AGCGGGTAAA CTGGCTCGGA
TTAGGGCCGC AAGAAAACTA TCCCGACCGC CTTACTGCCG CCTGTTTTGA CCGCTGGGAT
CTGCCATTGT CAGACATGTA TACCCCGTAC GTCTTCCCGA GCGAAAACGG TCTGCGCTGC
GGGACGCGCG AATTGAATTA TGGCCCACAC CAGTGGCGCG GCGACTTCCA GTTCAACATC
AGCCGCTACA GTCAACAGCA ACTGATGGAA ACCAGCCATC GCCATCTGCT GCACGCGGAA
GAAGGCACAT GGCTGAATAT CGACGGTTTC CATATGGGGA TTGGTGGCGA CGACTCCTGG
AGCCCGTCAG TATCGGCGGA ATTCCAGCTG AGCGCCGGTC GCTACCATTA CCAGTTGGTC
TGGTGTCAAA AATAA
 
Protein sequence
MTMITDSLAV VLQRRDWENP GVTQLNRLAA HPPFASWRNS EEARTDRPSQ QLRSLNGEWR 
FAWFPAPEAV PESWLECDLP EADTVVVPSN WQMHGYDAPI YTNVTYPITV NPPFVPTENP
TGCYSLTFNV DESWLQEGQT RIIFDGVNSA FHLWCNGRWV GYGQDSRLPS EFDLSAFLRA
GENRLAVMVL RWSDGSYLED QDMWRMSGIF RDVSLLHKPT TQISDFHVAT RFNDDFSRAV
LEAEVQMCGE LRDYLRVTVS LWQGETQVAS GTAPFGGEII DERGGYADRV TLRLNVENPK
LWSAEIPNLY RAVVELHTAD GTLIEAEACD VGFREVRIEN GLLLLNGKPL LIRGVNRHEH
HPLHGQVMDE QTMVQDILLM KQNNFNAVRC SHYPNHPLWY TLCDRYGLYV VDEANIETHG
MVPMNRLTDD PRWLPAMSER VTRMVQRDRN HPSVIIWSLG NESGHGANHD ALYRWIKSVD
PSRPVQYEGG GADTTATDII CPMYARVDED QPFPAVPKWS IKKWLSLPGE TRPLILCEYA
HAMGNSLGGF AKYWQAFRQY PRLQGGFVWD WVDQSLIKYD ENGNPWSAYG GDFGDTPNDR
QFCMNGLVFA DRTPHPALTE AKHQQQFFQF RLSGQTIEVT SEYLFRHSDN ELLHWMVALD
GKPLASGEVP LDVAPQGKQL IELPELPQPE SAGQLWLTVR VVQPNATAWS EAGHISAWQQ
WRLAENLSVT LPAASHAIPH LTTSEMDFCI ELGNKRWQFN RQSGFLSQMW IGDKKQLLTP
LRDQFTRAPL DNDIGVSEAT RIDPNAWVER WKAAGHYQAE AALLQCTADT LADAVLITTA
HAWQHQGKTL FISRKTYRID GSGQMAITVD VEVASDTPHP ARIGLNCQLA QVAERVNWLG
LGPQENYPDR LTAACFDRWD LPLSDMYTPY VFPSENGLRC GTRELNYGPH QWRGDFQFNI
SRYSQQQLME TSHRHLLHAE EGTWLNIDGF HMGIGGDDSW SPSVSAEFQL SAGRYHYQLV
WCQK