Gene ECH74115_0480 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0480 
SymbolmalZ 
ID6967572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp488084 
End bp489898 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content55% 
IMG OID643384528 
Productmaltodextrin glucosidase 
Protein accessionYP_002269042 
Protein GI209395918 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAATG CATGGCACCT GCCGGTGCCC CCATTTGTTA AACAAAGCAA AGATCAACTG 
CTCATTACAC TGTGGCTGAC GGGCGAAGAC CCACCGCAGC GCATTATGCT GCGTACAGAA
CACGATAACG AAGAAATGTC AGTACCAATG CATAAGCAGC GCAGTCAGCC GCAGCCAGGC
GTCACCGCAT GGCGTGCGGC GATTGATCTC TCCAGCGGAC AACCCCGGCG GCGTTACAGT
TTCAAACTGC TGTGGCACGA TCGCCAGCGT TGGTTTACAC CGCAGGGCTT CAGCCGAATG
CCGCCGGCAC GACTGGAGCA GTTTGCCGTC GATGTACCGG ATATCGGCCC ACAATGGGCT
GCGGATCAGA TTTTTTATCA GATCTTCCCT GATCGTTTTG CGCGTAGTCT TCCTCGTGAA
GCTGAACAGG ATCATGTCTA TTACCATCAT GCAGCCGGAC AAGAGATCAT CTTGCGTGAC
TGGGATGAAC CGGTCACGGC GCAGGCGGGC GGATCAACGT TCTATGGCGG CGATCTGGAC
GGGATAAGCG AAAAACTGCC GTATCTGAAA AAGCTTGGCG TGACGGCGCT GTATCTCAAT
CCGGTGTTTA AAGCTCCCAG CGTACATAAA TACGATACCG AGGATTATCG CCATGTCGAT
CCGCAGTTTG GCGGTGATGG GGCGTTGCTG CGCTTACGCC ACAATACGCA GCAGCTGGGA
ATGCGGCTGG TGCTGGATGG CGTGTTTAAC CACAGTGGCG ATTCCCATGC CTGGTTTGAC
AGGCACAATC GTGGCACGGG GGGAGCTTGT CACAACCCCG AATCGCCCTG GCGCGACTGG
TACTCGTTTA GTGATGATGG CACGGCGCTC GACTGGCTTG GCTATGCCAG CTTGCCGAAG
CTGGATTATC AGTCGGAAAG TCTGGTGAAT GAAATTTATC GCGGGGAAGA CAGTATTGTC
CGCCACTGGC TGAAAGCGCC GTGGAATATG GACGGCTGGC GGCTGGATGT GGTGCATATG
CTGGGGGAAG CGGGTGGGGC GCGCAATAAT TTGCAGCACG TTGCTGGGAT CACCGAAGCG
GCGAAAGAAA CCCAGCCGGA AGCGTATATA GTTGGAGAGC ATTTTGGCGA TGCACGGCAA
TGGTTACAGG CCGATGTGGA AGATGCCGCC ATGAACTATC GCGGCTTCAC ATTCCCGTTG
TGGGGATTTC TTGCCAATAC CGACATCTCT TACGATCCGC AGCAAATTGA CGCTCAAACC
TGTATGGCCT GGATGGATAA TTACCGCGCA GGGCTTTCTC ATCAACAGCA ATTACGTATG
TTTAATCAGC TCGACAGCCA CGATACTGCG CGATTTAAAA CGCTGCTCGG TCGGGAGGTT
GCGCGCCTGC CCCTGGCAGT GGTCTGGCTG TTCACCTGGC CTGGTGTACC GTGCATTTAT
TACGGTGATG AAGTGGGACT GGATGGCAAA AATGATCCGT TTTGCCGTAA GCCGTTCCCC
TGGCAGGTGG AAAAACAGGA TATGGCGTTA TTCGCGTTGT ACCAGCGAAT GATTGCGCTG
CGTAAGAAAA GTCAGGCGCT GCGTCGTGGC GGCTGTCAGG TGCTGTATGC GGAAGATAAC
GTGGTGGTAT TTGTCCGCGT GCTGAATCAG CAGCGTGTAC TGGTGGCAAT TAACCGTGGC
GAAGCCTGTG AAGTGGTGCT ACCCGCGTCA CCGTTTCTTA ATGTCGTGCA ATGGCAACGC
AAAGAAGGCC ATGGGCAACT GACAGACGGG ATTCTGGCTT TGCCTGCCAT TTCGGCTACG
GTGTGGATGA ACTAA
 
Protein sequence
MLNAWHLPVP PFVKQSKDQL LITLWLTGED PPQRIMLRTE HDNEEMSVPM HKQRSQPQPG 
VTAWRAAIDL SSGQPRRRYS FKLLWHDRQR WFTPQGFSRM PPARLEQFAV DVPDIGPQWA
ADQIFYQIFP DRFARSLPRE AEQDHVYYHH AAGQEIILRD WDEPVTAQAG GSTFYGGDLD
GISEKLPYLK KLGVTALYLN PVFKAPSVHK YDTEDYRHVD PQFGGDGALL RLRHNTQQLG
MRLVLDGVFN HSGDSHAWFD RHNRGTGGAC HNPESPWRDW YSFSDDGTAL DWLGYASLPK
LDYQSESLVN EIYRGEDSIV RHWLKAPWNM DGWRLDVVHM LGEAGGARNN LQHVAGITEA
AKETQPEAYI VGEHFGDARQ WLQADVEDAA MNYRGFTFPL WGFLANTDIS YDPQQIDAQT
CMAWMDNYRA GLSHQQQLRM FNQLDSHDTA RFKTLLGREV ARLPLAVVWL FTWPGVPCIY
YGDEVGLDGK NDPFCRKPFP WQVEKQDMAL FALYQRMIAL RKKSQALRRG GCQVLYAEDN
VVVFVRVLNQ QRVLVAINRG EACEVVLPAS PFLNVVQWQR KEGHGQLTDG ILALPAISAT
VWMN