Gene EcE24377A_0368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_0368 
SymbollacZ 
ID5589745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp397825 
End bp400899 
Gene Length3075 bp 
Protein Length1024 aa 
Translation table11 
GC content56% 
IMG OID640924093 
Productbeta-D-galactosidase 
Protein accessionYP_001461520 
Protein GI157154977 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value2.19694e-09 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATGA TTACGGATTC ACTGGCCGTC GTTTTACAAC GTCGTGACTG GGAAAACCCT 
GGCGTTACCC AACTTAATCG CCTTGCAGCA CATCCCCCTT TCGCCAGCTG GCGTAATAGC
GAAGAGGCCC GCACCGATCG CCCTTCCCAA CAGTTGCGCA GCCTGAATGG CGAATGGCGC
TTTGCCTGGT TTCCGGCACC AGAAGCGGTG CCGGAAAGCT GGCTGGAGTG CGATCTTCCT
GAGGCCGATA CTGTCGTCGT CCCCTCAAAC TGGCAGATGC ACGGTTACGA TGCGCCCATC
TACACCAACG TGACCTATCC CATTACGGTC AATCCGCCGT TTGTTCCCAC GGAGAATCCG
ACGGGTTGTT ACTCGCTCAC ATTTAATGTT GATGAAAGCT GGCTACAGGA AGGCCAGACG
CGAATTATTT TTGATGGCGT TAACTCGGCG TTTCATCTGT GGTGCAACGG TCGCTGGGTC
GGTTACGGTC AGGACAGTCG TTTGCCGTCT GAATTTGACC TGAGCGCATT TTTACGCGCC
GGAGAAAACC GCCTCGCGGT GATGGTGCTG CGCTGGAGTG ACGGCAGTTA TCTGGAAGAT
CAGGATATGT GGCGGATGAG CGGCATTTTC CGTGACGTCT CGTTGCTGCA TAAACCGACT
ACACAAATCA GCGATTTCCA TGTTGCCACT CGCTTTAATG ATGATTTCAG CCGCGCTGTA
CTGGAGGCTG AAGTTCAGAT GTGCGGCGAG TTGCGTGACT ACCTACGGGT AACAGTTTCT
TTATGGCAGG GTGAAACGCA GGTCGCCAGC GGCACCGCGC CTTTCGGCGG TGAAATTATC
GATGAGCGTG GTAGTTATGC CGATCGCGTC ACACTACGTC TGAACGTCGA AAACCCGAAA
CTGTGGAGCG CCGAAATCCC GAATCTCTAT CGTGCGGTGG TTGAACTGCA CACCGCCGAC
GGCACGCTGA TTGAAGCAGA AGCCTGCGAT GTCGGTTTCC GCGAGGTGCG GATTGAAAAT
GGTCTGCTGC TGCTGAACGG CAAGCCGTTG CTGATTCGAG GCGTTAACCG TCACGAGCAT
CATCCTCTGC ATGGTCAGGT CATGGATGAG CAGACGATGG TGCAGGATAT CCTGCTGATG
AAGCAGAACA ACTTTAACGC CGTGCGCTGT TCGCATTATC CGAACCATCC GCTGTGGTAC
ACGCTGTGCG ACCGCTACGG CCTGTATGTG GTGGATGAAG CCAATATTGA AACCCACGGC
ATGGTGCCAA TGAATCGTCT GACCGATGAT CCGCGCTGGC TACCGGCGAT GAGCGAACGC
GTAACGCGAA TGGTGCAGCG CGATCGTAAT CACCCGAGTG TGATCATCTG GTCGCTGGGG
AATGAATCAG GCCACGGCGC TAATCACGAC GCGCTGTATC GCTGGATCAA ATCTGTCGAT
CCTTCCCGCC CGGTGCAGTA TGAAGGCGGC GGAGCCGACA CCACGGCCAC CGATATTATT
TGCCCGATGT ACGCGCGCGT GGATGAAGAC CAGCCCTTCC CGGCTGTGCC GAAATGGTCC
ATCAAAAAAT GGCTTTCGCT ACCTGGAGAG ACGCGCCCGC TGATCCTTTG CGAATACGCC
CACGCGATGG GTAACAGTCT TGGCGGTTTC GCTAAATACT GGCAGGCGTT TCGTCAGTAT
CCCCGTTTAC AGGGCGGCTT CGTCTGGGAC TGGGTGGATC AGTCGCTGAT TAAATATGAT
GAAAACGGCA ACCCGTGGTC GGCTTACGGC GGTGATTTTG GCGATACGCC GAACGATCGC
CAGTTCTGTA TGAACGGTCT GGTCTTTGCC GACCGCACGC CGCATCCAGC GCTGACGGAA
GCAAAACACC AGCAGCAGTT TTTCCAGTTC CGTTTATCCG GGCAAACCAT CGAAGTGACC
AGCGAATACC TGTTCCGTCA TAGCGATAAC GAGCTCCTGC ACTGGATGGT GGCGCTGGAT
GGTAAGCCGC TGGCAAGCGG TGAAGTGCCT CTGGATGTCG CTCCACAAGG TAAACAGTTG
ATTGAACTGC CTGAACTACC GCAGCCGGAG AGCGCCGGGC AACTCTGGCT CACAGTACGC
GTAGTGCAAC CGAACGCGAC CGCATGGTCA GAAGCCGGGC ACATCAGCGC CTGGCAGCAG
TGGCGTCTGG CGGAAAACCT CAGTGTGACG CTCCCCTCCG CGTCCCATAT CATCCCGCAA
CTAACCACCA GCGAAACGGA TTTTTGCATC GAGCTGGGTA ATAAGCGTTG GCAATTTAAC
CGTCAGTCAG GCCTTCTTTC ACAGATGTGG ATTGGCGATG AAAAACAACT GCTGACGCCG
CTGCGCGATC AGTTCACCCG TGCACCGCTG GATAACGACA TTGGCGTAAG TGAAGCGACC
CGCATTGACC CTAACGCCTG GGTCGAACGC TGGAAGGCGG CGGGCCATTA CCAGGCCGAA
GCAGCGTTGT TGCAGTGCTC GGCAGATACA CTTGCCGACG CGGTGCTGAT TACGACCGCT
CACGCGTGGC AGCATCAGGG AAAAACCTTA TTTATCAGCC GGAAAACCTA CCGGATTGAT
GGTAGTGGTC AAATGGCGAT TACCGTTGAT GTTGAAGTAG CGAGCGATAC ACCGCATCCG
GCACGGATTG GCCTGACCTG CCAGCTGGCG CAGGTAGCAG AGCGGGTAAA CTGGCTCGGA
TTAGGGCCGC AAGAAAACTA TCCCGACCGC CTTACTGCCG CCTGTTTTGA CCGCTGGGAT
CTGCCATTGT CAGACATGTA TACCCCGTAC GTCTTCCCGA GCGAAAACGG TCTGCGCTGC
GGGACGCGCG AATTGAATTA TGGCTCACAC CAGTGGCGCG GCGACTTTCA GTTCAACATC
AGCCGCTACA GTCAACAGCA ACTGATGGAA ACCAGCCATC GCCATCTGCT GCACGCGGAA
GAAGGCACAT GGCTGAATAT CGACGGTTTC CATATGGGGA TTGGTGGCGA CGACTCCTGG
AGCCCGTCAG TGTCGGCGGA ATTCCAGCTG AGCGCCGGTC GCTACCATTA CCAGTTGGTC
TGGTGTCAAA AATAA
 
Protein sequence
MTMITDSLAV VLQRRDWENP GVTQLNRLAA HPPFASWRNS EEARTDRPSQ QLRSLNGEWR 
FAWFPAPEAV PESWLECDLP EADTVVVPSN WQMHGYDAPI YTNVTYPITV NPPFVPTENP
TGCYSLTFNV DESWLQEGQT RIIFDGVNSA FHLWCNGRWV GYGQDSRLPS EFDLSAFLRA
GENRLAVMVL RWSDGSYLED QDMWRMSGIF RDVSLLHKPT TQISDFHVAT RFNDDFSRAV
LEAEVQMCGE LRDYLRVTVS LWQGETQVAS GTAPFGGEII DERGSYADRV TLRLNVENPK
LWSAEIPNLY RAVVELHTAD GTLIEAEACD VGFREVRIEN GLLLLNGKPL LIRGVNRHEH
HPLHGQVMDE QTMVQDILLM KQNNFNAVRC SHYPNHPLWY TLCDRYGLYV VDEANIETHG
MVPMNRLTDD PRWLPAMSER VTRMVQRDRN HPSVIIWSLG NESGHGANHD ALYRWIKSVD
PSRPVQYEGG GADTTATDII CPMYARVDED QPFPAVPKWS IKKWLSLPGE TRPLILCEYA
HAMGNSLGGF AKYWQAFRQY PRLQGGFVWD WVDQSLIKYD ENGNPWSAYG GDFGDTPNDR
QFCMNGLVFA DRTPHPALTE AKHQQQFFQF RLSGQTIEVT SEYLFRHSDN ELLHWMVALD
GKPLASGEVP LDVAPQGKQL IELPELPQPE SAGQLWLTVR VVQPNATAWS EAGHISAWQQ
WRLAENLSVT LPSASHIIPQ LTTSETDFCI ELGNKRWQFN RQSGLLSQMW IGDEKQLLTP
LRDQFTRAPL DNDIGVSEAT RIDPNAWVER WKAAGHYQAE AALLQCSADT LADAVLITTA
HAWQHQGKTL FISRKTYRID GSGQMAITVD VEVASDTPHP ARIGLTCQLA QVAERVNWLG
LGPQENYPDR LTAACFDRWD LPLSDMYTPY VFPSENGLRC GTRELNYGSH QWRGDFQFNI
SRYSQQQLME TSHRHLLHAE EGTWLNIDGF HMGIGGDDSW SPSVSAEFQL SAGRYHYQLV
WCQK