Gene EcSMS35_0434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0434 
SymbolmalZ 
ID6144822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp444790 
End bp446607 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content55% 
IMG OID641615330 
Productmaltodextrin glucosidase 
Protein accessionYP_001742537 
Protein GI170681194 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTTAA ATGCATGGCA CCTGCCGGTG CCCCCATTTG TTAAACAAAG CAAAGATCAA 
CTGCTTATTA CACTGTGGCT GACGGGCGAA GACCCACCGC AGCGCATTAT GCTGCGTACA
GAACACGACA ACGAAGAAAC GTCAGTAGCG ATGCATAAGC AGCGCAGTCA GCCGCAGCCT
GGCGTTACCG CATGGCGGGC GGCGATCGAT CTCTCCAGCG GGCAACCTCG GCGGCGTTAT
AGTTTCAAAT TACTGTGGCA CGATCGCCAG CGTTGGTTTA CACCGCAGGG CTTCAGCCGA
ATGCCGCCAG CACGACTGGA GCAGTTTGCT GTCGATGTCC CGGATATCGG CCCACAATGG
GCTGCGGATC AGATTTTTTA TCAGATCTTC CCTGATCGTT TCGCCCGTAG TCTTCCTCGT
GAAGCTGAAC AGGATCATGT TTATTACCAT CACGCAGCCG GACAAGAGAT CATATTGCGT
GACTGGGATG AACCGGTCAC GGCGCAGGCG GGCGGATCAA CGTTCTATGG CGGCGATCTG
GACGGTATCA GCGAAAAACT GCCGTATCTG AAAAAGCTCG GCGTGACGGC GCTGTATCTT
AACCCGGTGT TTAAAGCCCC CAGCGTACAT AAATACGATA CAGAGGATTA TCGCCACGTC
GATCCGCAGT TTGGCGGCGA CGGAGCGTTG CTGCGTTTGC GACACCATAC GCAGCAGCTG
GGAATGCGGC TGGTGCTGGA TGGCGTGTTT AACCACAGTG GCGATTCCCA TGCCTGGTTT
GACAGGCACA ATCGTGGCAC GGGTGGTGCT TGTCACAACC CCGAATCGCC CTGGCGCGAC
TGGTACTCGT TTAGTGACGA TGGCACGGCG CTCGACTGGC TGGGCTATGC CAGCTTGCCG
AAACTGGATT ATCAGTCGGA AAGTCTGGTG AATGAAATTT ATCGCGGGGA AGACAGTATT
GTCCGCCACT GGCTGAAAGC GCCGTGGAAT ATGGACGGCT GGCGGCTGGA TGTGGTGCAT
ATGCTGGGGG AAGCGGGTGG GGCGCGCAAT AATTTGCAGC ACGTTGCCGG GATCACCGAA
GCGGCGAAAG AAACCCAGCC GGAAGCGTAT ATCGTTGGAG AGCATTTTGG CGATGCACGG
CAATGGTTAC AGGCCGATGT GGAAGATGCC GCCATGAACT ATCGCGGCTT CACATTCCCG
TTGTGGGGAT TTCTTGCCAA TACCGACATC TCTTATGATC CGCAGCAAAT TGACGCCCAA
ACCTGTATGG CCTGGATGGA TAATTACCGC GCCGGACTTT CTCATCAACA GCAACTGCGT
ATGTTTAATC AGCTCGATAG CCACGATACT GCGCGATTTA AAACGCTGCT CGGTCGGGAT
ATTGCGCGCC TGCCGCTGGC GGTGGTCTGG TTGTTCACCT GGCCTGGTGT ACCGTGCATT
TATTACGGTG ATGAAGTGGG GCTGGATGGT AAAAACGATC CGTTTTGCCG TAAACCGTTC
CCCTGGCAGG TGGAAAAGCA GGATACGGCG TTATTCGCGC TGTACCAGCG AATGATTGCG
CTGCGTAAGA AAAGTCAGGC GCTGCGTCGT GGCGGCTGTC AGGTGCTGTA TGCGGAAGAT
AACGTGGTGG TATTTGGCCG CGTGCTGAAT CAGCAGCGTG TACTGGTGGC AATCAACCGT
GGAGAAGCCT GTGAAGTGGT GCTACCCGCG TCACCGTTGC TTAATGTTGC GCAATGGCAA
CGCAAAGAAG GCCATGGGCA ACTGACAGAC GGGATTCTGG CTTTGCCTGC CATTTCGGCT
ACGGTGTGGA TCAACTAA
 
Protein sequence
MMLNAWHLPV PPFVKQSKDQ LLITLWLTGE DPPQRIMLRT EHDNEETSVA MHKQRSQPQP 
GVTAWRAAID LSSGQPRRRY SFKLLWHDRQ RWFTPQGFSR MPPARLEQFA VDVPDIGPQW
AADQIFYQIF PDRFARSLPR EAEQDHVYYH HAAGQEIILR DWDEPVTAQA GGSTFYGGDL
DGISEKLPYL KKLGVTALYL NPVFKAPSVH KYDTEDYRHV DPQFGGDGAL LRLRHHTQQL
GMRLVLDGVF NHSGDSHAWF DRHNRGTGGA CHNPESPWRD WYSFSDDGTA LDWLGYASLP
KLDYQSESLV NEIYRGEDSI VRHWLKAPWN MDGWRLDVVH MLGEAGGARN NLQHVAGITE
AAKETQPEAY IVGEHFGDAR QWLQADVEDA AMNYRGFTFP LWGFLANTDI SYDPQQIDAQ
TCMAWMDNYR AGLSHQQQLR MFNQLDSHDT ARFKTLLGRD IARLPLAVVW LFTWPGVPCI
YYGDEVGLDG KNDPFCRKPF PWQVEKQDTA LFALYQRMIA LRKKSQALRR GGCQVLYAED
NVVVFGRVLN QQRVLVAINR GEACEVVLPA SPLLNVAQWQ RKEGHGQLTD GILALPAISA
TVWIN