Gene EcE24377A_0433 details


Gene Information

Locus tag: EcE24377A_0433
Symbol: malZ
ID: 5588632
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 455390
End bp: 457207
Gene Length: 1818 bp
Protein Length: 605 aa
Translation table: 11
GC content: 55%
IMG OID: 640924157
Product: maltodextrin glucosidase
Protein accession: YP_001461584
Protein GI: 157156114
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
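
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name is hypothetical and not part of any database tool.

```python
def lengths_from_coordinates(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the record above.
gene_length, protein_length = lengths_from_coordinates(455390, 457207)
print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.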


Plasmid Coverage information

Num covering plasmid clones: 43
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGTTAA ATGCATGGCA CCTGCCGGTG CCCCCATTTG TTAAACAAAG CAAAGATCAA 
CTGCTCATTA CACTGTGGCT GACGGGCGAA GACCCACCGC AGCGCATTAT GCTGCGTACA
GAACACGATA ACGAAGAAAT GTCAGTATCG ATGCATAAGC AGCGCAGTCA GCAGCAGCCT
GGCGTCACCG CATGGCGTGC GGCGATTGAT CTCTCCAGCG GACAACCCCG GCGGCGTTAC
AGTTTCAAAC TGCTGTGGCA CGATCGCCAG CGTTGGTTTA CACCGCAGGG CTTCAGCCGA
ATGCCGCCGG CACGACTGGA GCAGTTTGCC GTCGATGTAC CGGATATCGG CCCACAATGG
GCTGCGGATC AGATTTTTTA TCAGATCTTC CCTGATCGTT TTGCGCGTAG TCTTCCTCGT
GAAGCTGAAC AGGATCATGT CTATTACCAT CATGCAGCCG GACAAGAGAT CATCTTGCGT
GACTGGGATG AACCGGTCAC GGCGCAGGCG GGCGGATCAA CGTTCTATGG CGGCGATCTG
GACGGGATAA GCGAAAAACT GCCGTATCTG AAAAAGCTTG GCGTGACGGC GCTGTATCTC
AATCCGGTGT TTAAAGCTCC CAGCGTACAT AAATACGATA CCGAGGATTA TCGCCATGTC
GATCCGCAGT TTGGCGGTGA TGGGGCGTTG CTGCGTTTGC GACACAATAC GCAGCAGCTG
GGAATGCGGC TGGTGCTGGA CGGCGTGTTT AACCACAGTG GCGATTCCCA TGCCTGGTTT
GACAGGCACA ATCGTGGCAC GGGGGGAGCT TGTCACAACC CTGAATCGCC CTGGCGCGAC
TGGTACTCGT TTAGTGATGA TGGCACGGCG CTCGACTGGC TTGGCTATGC CAGCTTGCCG
AAGCTGGATT ATCAGTCGGA AAGTCTGGTG AATGAAATTT ATCGCGGGGA AGACAGTATT
GTCCGCCATT GGCTGAAAGC GCCGTGGAGT ATGGACGGCT GGCGGCTGGA TGTGGTGCAT
ATGCTGGGGG AGGCGGGTGG GGCGCGCAAT AATATGCAGC ACGTTGCTGG GATCACCGAA
GCGGCGAAAG AAACCCAGCC GGAAGCGTAT ATTGTCGGCG AACATTTTGG CGATGCACGG
CAATGGTTAC AGGCCGATGT GGAAGATGCC GCCATGAACT ATCGTGGCTT CACATTCCCG
TTGTGGGGAT TTCTTGCCAA TACCGATATC TCTTACGATC CGCAGCAAAT TGATGCCCAA
ACCTGTATGG CCTGGATGGA TAATTACCGC GCAGGGCTTT CTCATCAACA ACAATTACGT
ATGTTTAATC AGCTCGACAG CCACGATACT GCGCGATTTA AAACGCTGCT CGGTCGGGAT
ATTGCGCGCC TGCCGCTGGC GGTGGTCTGG CTGTTCACCT GGCCTGGTGT ACCGTGCATT
TATTACGGTG ATGAAGTAGG ACTGGATGGC AAAAACGATC CGTTTTGCCG TAAACCGTTC
CCCTGGCAGG TGGAAAAGCA GGATACGGCG TTATTCGCGC TGTACCAGCG AATGATTGCG
CTGCGTAAGA AAAGTCAGGC GCTACGTCAT GGCGGCTGTC AGGTGCTGTA TGCGGAAGAT
AACGTGGTGG TATTTGTCCG CGTGCTGAAT CAGCAGCGTG TACTGGTGGC AATCAACCGT
GGCGAAGCTT GTGAAGTCGT GCTACCCGCG TCACCGTTGC TCAATGCCGT GCAATGGCAA
TGCAAAGAAG GGCATGGGCA ACTGACTGAC GGGATTCTGG CTTTGCCTGC CATTTCGGCT
ACGGTATGGA TGAACTAA
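
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before any computation. The following is a minimal sketch of that cleanup together with a GC fraction calculation, which could be used to compare against the GC content field. The helper names are hypothetical, and the example input is only the first line of the sequence, not the whole gene.

```python
def clean_sequence(text):
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(sequence):
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return gc / len(sequence)


# First line of the gene sequence as printed above (illustration only).
first_line = "ATGATGTTAA ATGCATGGCA CCTGCCGGTG CCCCCATTTG TTAAACAAAG CAAAGATCAA"
cleaned = clean_sequence(first_line)
print(len(cleaned), round(gc_fraction(cleaned), 3))
```

Passing the full sequence block through the same two functions gives the length and GC fraction of the whole gene.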
 
Protein sequence
MMLNAWHLPV PPFVKQSKDQ LLITLWLTGE DPPQRIMLRT EHDNEEMSVS MHKQRSQQQP 
GVTAWRAAID LSSGQPRRRY SFKLLWHDRQ RWFTPQGFSR MPPARLEQFA VDVPDIGPQW
AADQIFYQIF PDRFARSLPR EAEQDHVYYH HAAGQEIILR DWDEPVTAQA GGSTFYGGDL
DGISEKLPYL KKLGVTALYL NPVFKAPSVH KYDTEDYRHV DPQFGGDGAL LRLRHNTQQL
GMRLVLDGVF NHSGDSHAWF DRHNRGTGGA CHNPESPWRD WYSFSDDGTA LDWLGYASLP
KLDYQSESLV NEIYRGEDSI VRHWLKAPWS MDGWRLDVVH MLGEAGGARN NMQHVAGITE
AAKETQPEAY IVGEHFGDAR QWLQADVEDA AMNYRGFTFP LWGFLANTDI SYDPQQIDAQ
TCMAWMDNYR AGLSHQQQLR MFNQLDSHDT ARFKTLLGRD IARLPLAVVW LFTWPGVPCI
YYGDEVGLDG KNDPFCRKPF PWQVEKQDTA LFALYQRMIA LRKKSQALRH GGCQVLYAED
NVVVFVRVLN QQRVLVAINR GEACEVVLPA SPLLNAVQWQ CKEGHGQLTD GILALPAISA
TVWMN
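
The protein sequence can be related back to the gene sequence by codon translation. The sketch below builds a codon table by hand and translates until the first stop codon. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the alternative start codons it permits, which does not matter here because this gene starts with ATG). The names CODON_TABLE and translate are illustrative, not from any particular library.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGATGTTAA ATGCATGGCA CCTGCCGGTG"))  # MMLNAWHLPV
print(translate("ACGGTATGGA TGAACTAA"))               # TVWMN
```

The two fragments reproduce the first and last residues of the protein sequence listed above; the terminal TAA is the stop codon and yields no residue.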