Gene ECH74115_5515 details


Gene Information

Locus tag: ECH74115_5515
Symbol: malE
ID: 6972030
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 5162857
End bp: 5164047
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 51%
IMG OID: 643389158
Product: maltose ABC transporter periplasmic protein
Protein accession: YP_002273555
Protein GI: 209400060
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2182] Maltose-binding periplasmic proteins/domains
TIGRFAM ID:
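
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but this is consistent with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5162857, 5164047)
print(length)                  # 1191
print(protein_length(length))  # 396
```

Both values agree with the Gene Length and Protein Length fields of this record.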


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.241432
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATAA AAACAGGTGC ACGCATCCTC GCATTATCCG CATTAACGAC GATGATGTTT 
TCCGCCTCGG CTCTCGCCAA AATCGAAGAA GGTAAACTGG TAATCTGGAT TAACGGCGAT
AAAGGCTATA ACGGTCTCGC TGAAGTCGGT AAGAAATTCG AGAAAGATAC CGGAATTAAA
GTCACCGTTG AGCATCCGGA TAAACTGGAA GAGAAATTCC CTCAGGTTGC GGCAACTGGC
GATGGCCCTG ACATTATCTT CTGGGCGCAC GACCGCTTTG GTGGCTACGC TCAATCTGGC
CTGTTGGCTG AAATCACCCC GGACAAAGCG TTCCAGGACA AGCTGTATCC GTTTACCTGG
GATGCCGTAC GTTACAACGG CAAGCTGATT GCTTACCCGA TCGCTGTTGA AGCGTTATCG
CTGATTTATA ACAAAGATCT GCTACCGAAC CCGCCAAAAA CCTGGGAAGA GATCCCGGCG
CTGGATAAAG AACTGAAAGC GAAAGGTAAG AGCGCGCTGA TGTTCAACCT GCAAGAACCG
TACTTCACCT GGCCGCTGAT TGCTGCTGAC GGGGGTTATG CGTTCAAGTA TGAAAACGGC
AAGTACGACA TTAAAGACGT GGGCGTGGAT AACGCTGGCG CGAAAGCGGG TCTGACCTTC
CTGGTTGACC TGATTAAAAA CAAACACATG AATGCAGACA CCGATTACTC CATCGCAGAA
GCTGCCTTTA ATAAAGGCGA AACAGCGATG ACTATCAACG GCCCGTGGGC ATGGTCCAAC
ATCGACACCA GCAAAGTGAA TTATGGTGTA ACGGTACTGC CGACCTTCAA GGGTCAACCA
TCTAAACCGT TCGTTGGCGT GCTGAGCGCC GGTATTAACG CCGCCAGTCC GAACAAAGAG
CTGGCGAAAG AGTTCCTCGA AAACTATCTG CTGACTGACG AAGGTCTGGA AGCGGTTAAT
AAAGACAAAC CGCTGGGTGC CGTAGCGCTG AAGTCTTACG AGGAAGAGTT GGCGAAAGAT
CCACGTATTG CAGCCACCAT GGAAAACGCC CAGAAAGGTG AAATCATGCC GAACATCCCG
CAGATGTCCG CTTTCTGGTA TGCCGTGCGT ACTGCGGTGA TCAACGCCGC CAGCGGTCGT
CAGACTGTCG ATGAAGCCCT GAAAGACGCG CAGACTCGTA TCACCAAGTA A
 
Protein sequence
MKIKTGARIL ALSALTTMMF SASALAKIEE GKLVIWINGD KGYNGLAEVG KKFEKDTGIK 
VTVEHPDKLE EKFPQVAATG DGPDIIFWAH DRFGGYAQSG LLAEITPDKA FQDKLYPFTW
DAVRYNGKLI AYPIAVEALS LIYNKDLLPN PPKTWEEIPA LDKELKAKGK SALMFNLQEP
YFTWPLIAAD GGYAFKYENG KYDIKDVGVD NAGAKAGLTF LVDLIKNKHM NADTDYSIAE
AAFNKGETAM TINGPWAWSN IDTSKVNYGV TVLPTFKGQP SKPFVGVLSA GINAASPNKE
LAKEFLENYL LTDEGLEAVN KDKPLGAVAL KSYEEELAKD PRIAATMENA QKGEIMPNIP
QMSAFWYAVR TAVINAASGR QTVDEALKDA QTRITK
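
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ only in which codons may act as start codons). Alternative start codons are not handled here, which is sufficient for this gene because it starts with ATG. The names `clean` and `translate` are illustrative helpers, not functions from any library.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of the genetic code table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq_text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 nt) and its final 12 nt.
first_line = "ATGAAAATAA AAACAGGTGC ACGCATCCTC GCATTATCCG CATTAACGAC GATGATGTTT"
last_codons = "ATCACCAAGTAA"

print(translate(clean(first_line)))  # MKIKTGARILALSALTTMMF
print(translate(last_codons))        # ITK
```

The first output matches the opening 20 residues of the protein sequence, and the second matches its final three residues, with the terminal TAA read as the stop codon.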