Gene Information |
Locus tag | EcSMS35_0434 |
Symbol | malZ |
ID | 6144822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 444790 |
End bp | 446607 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641615330 |
Product | maltodextrin glucosidase |
Protein accession | YP_001742537 |
Protein GI | 170681194 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
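The coordinate and length fields above are mutually consistent, which a short arithmetic check can confirm. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration and are not part of the record or of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 444790
END_BP = 446607


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1818 605
```

Under these assumptions the computed values match the Gene Length (1818 bp) and Protein Length (605 aa) fields listed above.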
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGTTAA ATGCATGGCA CCTGCCGGTG CCCCCATTTG TTAAACAAAG CAAAGATCAA CTGCTTATTA CACTGTGGCT GACGGGCGAA GACCCACCGC AGCGCATTAT GCTGCGTACA GAACACGACA ACGAAGAAAC GTCAGTAGCG ATGCATAAGC AGCGCAGTCA GCCGCAGCCT GGCGTTACCG CATGGCGGGC GGCGATCGAT CTCTCCAGCG GGCAACCTCG GCGGCGTTAT AGTTTCAAAT TACTGTGGCA CGATCGCCAG CGTTGGTTTA CACCGCAGGG CTTCAGCCGA ATGCCGCCAG CACGACTGGA GCAGTTTGCT GTCGATGTCC CGGATATCGG CCCACAATGG GCTGCGGATC AGATTTTTTA TCAGATCTTC CCTGATCGTT TCGCCCGTAG TCTTCCTCGT GAAGCTGAAC AGGATCATGT TTATTACCAT CACGCAGCCG GACAAGAGAT CATATTGCGT GACTGGGATG AACCGGTCAC GGCGCAGGCG GGCGGATCAA CGTTCTATGG CGGCGATCTG GACGGTATCA GCGAAAAACT GCCGTATCTG AAAAAGCTCG GCGTGACGGC GCTGTATCTT AACCCGGTGT TTAAAGCCCC CAGCGTACAT AAATACGATA CAGAGGATTA TCGCCACGTC GATCCGCAGT TTGGCGGCGA CGGAGCGTTG CTGCGTTTGC GACACCATAC GCAGCAGCTG GGAATGCGGC TGGTGCTGGA TGGCGTGTTT AACCACAGTG GCGATTCCCA TGCCTGGTTT GACAGGCACA ATCGTGGCAC GGGTGGTGCT TGTCACAACC CCGAATCGCC CTGGCGCGAC TGGTACTCGT TTAGTGACGA TGGCACGGCG CTCGACTGGC TGGGCTATGC CAGCTTGCCG AAACTGGATT ATCAGTCGGA AAGTCTGGTG AATGAAATTT ATCGCGGGGA AGACAGTATT GTCCGCCACT GGCTGAAAGC GCCGTGGAAT ATGGACGGCT GGCGGCTGGA TGTGGTGCAT ATGCTGGGGG AAGCGGGTGG GGCGCGCAAT AATTTGCAGC ACGTTGCCGG GATCACCGAA GCGGCGAAAG AAACCCAGCC GGAAGCGTAT ATCGTTGGAG AGCATTTTGG CGATGCACGG CAATGGTTAC AGGCCGATGT GGAAGATGCC GCCATGAACT ATCGCGGCTT CACATTCCCG TTGTGGGGAT TTCTTGCCAA TACCGACATC TCTTATGATC CGCAGCAAAT TGACGCCCAA ACCTGTATGG CCTGGATGGA TAATTACCGC GCCGGACTTT CTCATCAACA GCAACTGCGT ATGTTTAATC AGCTCGATAG CCACGATACT GCGCGATTTA AAACGCTGCT CGGTCGGGAT ATTGCGCGCC TGCCGCTGGC GGTGGTCTGG TTGTTCACCT GGCCTGGTGT ACCGTGCATT TATTACGGTG ATGAAGTGGG GCTGGATGGT AAAAACGATC CGTTTTGCCG TAAACCGTTC CCCTGGCAGG TGGAAAAGCA GGATACGGCG TTATTCGCGC TGTACCAGCG AATGATTGCG CTGCGTAAGA AAAGTCAGGC GCTGCGTCGT GGCGGCTGTC AGGTGCTGTA TGCGGAAGAT AACGTGGTGG TATTTGGCCG CGTGCTGAAT CAGCAGCGTG TACTGGTGGC AATCAACCGT GGAGAAGCCT GTGAAGTGGT GCTACCCGCG TCACCGTTGC TTAATGTTGC GCAATGGCAA CGCAAAGAAG GCCATGGGCA ACTGACAGAC GGGATTCTGG CTTTGCCTGC CATTTCGGCT 
ACGGTGTGGA TCAACTAA
|
Protein sequence | MMLNAWHLPV PPFVKQSKDQ LLITLWLTGE DPPQRIMLRT EHDNEETSVA MHKQRSQPQP GVTAWRAAID LSSGQPRRRY SFKLLWHDRQ RWFTPQGFSR MPPARLEQFA VDVPDIGPQW AADQIFYQIF PDRFARSLPR EAEQDHVYYH HAAGQEIILR DWDEPVTAQA GGSTFYGGDL DGISEKLPYL KKLGVTALYL NPVFKAPSVH KYDTEDYRHV DPQFGGDGAL LRLRHHTQQL GMRLVLDGVF NHSGDSHAWF DRHNRGTGGA CHNPESPWRD WYSFSDDGTA LDWLGYASLP KLDYQSESLV NEIYRGEDSI VRHWLKAPWN MDGWRLDVVH MLGEAGGARN NLQHVAGITE AAKETQPEAY IVGEHFGDAR QWLQADVEDA AMNYRGFTFP LWGFLANTDI SYDPQQIDAQ TCMAWMDNYR AGLSHQQQLR MFNQLDSHDT ARFKTLLGRD IARLPLAVVW LFTWPGVPCI YYGDEVGLDG KNDPFCRKPF PWQVEKQDTA LFALYQRMIA LRKKSQALRR GGCQVLYAED NVVVFGRVLN QQRVLVAINR GEACEVVLPA SPLLNVAQWQ RKEGHGQLTD GILALPAISA TVWIN
|
| |