Gene ECH74115_5516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5516 
SymbolmalK 
ID6971239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5164412 
End bp5165527 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content55% 
IMG OID643389159 
Productmaltose/maltodextrin transporter ATP-binding protein 
Protein accessionYP_002273556 
Protein GI209399920 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.301255 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGCG TACAGCTGCA AAATGTAACG AAAGCCTGGG GCGAGGTCGT GGTATCGAAA 
GATATCAATC TCGATATCCA TGAAGGTGAA TTCGTGGTGT TTGTCGGACC GTCTGGCTGC
GGTAAATCGA CTTTACTGCG CATGATTGCC GGGCTTGAGA CGATCACCAG CGGCGACCTG
TTCATCGGTG AGAAACGGAT GAATGACACT CCGCCAGCAG AACGTGGCGT TGGTATGGTG
TTTCAGTCCT ACGCGCTCTA TCCCCACCTG TCAGTAGCAG AAAACATGTC ATTTGGCCTG
AAACTGGCAG GCGCAAAAAA AGAGGTGATT AACCAACGCG TCAACCAGGT GGCGGAAGTG
CTACAACTGG CGCATTTGCT GGATCGCAAA CCGAAAGCGC TCTCCGGTGG TCAGCGTCAG
CGTGTGGCGA TTGGCCGTAC GCTGGTGGCC GAGCCAAGCG TATTTTTGCT CGATGAACCG
CTCTCCAACC TCGATGCTGC ACTGCGTGTG CAAATGCGTA TCGAAATCTC CCGTCTGCAT
AAACGCCTGG GCCGCACAAT GATTTACGTC ACCCACGATC AGGTCGAAGC GATGACGCTG
GCCGACAAAA TCGTGGTGCT GGACGCCGGT CGCGTGGCGC AGGTTGGGAA ACCGCTGGAG
CTGTACCACT ATCCGGCAGA CCGTTTTGTC GCCGGATTTA TCGGTTCGCC AAAGATGAAC
TTCCTGCCGG TAAAAGTGAC CGCCACCGCA ATCGATCAAG TGCAGGTGGA GCTGCCGATG
CCAAATCGTC AGCAAGTCTG GCTGCCAGTT GAAAGCCGTG ATGTCCAGGT TGGAGCCAAT
ATGTCGCTGG GTATTCGCCC GGAACATCTA CTGCCGAGTG ATATCGCTGA CGTCATCCTT
GAGGGTGAAG TTCAGGTCGT CGAGCAACTC GGCAACGAAA CCCAAATCCA TATCCAGATC
CCTTCCATTC GTCAAAACCT GGTGTACCGC CAGAACGACG TGGTGTTGGT AGAAGAAGGT
GCCACATTCG CTATCGGCCT GCCGCCAGAG CGTTGCCATC TGTTCCGTGA GGATGGCACT
GCATGTCGTC GACTGCATAA GGAGCCGGGC GTTTAA
 
Protein sequence
MASVQLQNVT KAWGEVVVSK DINLDIHEGE FVVFVGPSGC GKSTLLRMIA GLETITSGDL 
FIGEKRMNDT PPAERGVGMV FQSYALYPHL SVAENMSFGL KLAGAKKEVI NQRVNQVAEV
LQLAHLLDRK PKALSGGQRQ RVAIGRTLVA EPSVFLLDEP LSNLDAALRV QMRIEISRLH
KRLGRTMIYV THDQVEAMTL ADKIVVLDAG RVAQVGKPLE LYHYPADRFV AGFIGSPKMN
FLPVKVTATA IDQVQVELPM PNRQQVWLPV ESRDVQVGAN MSLGIRPEHL LPSDIADVIL
EGEVQVVEQL GNETQIHIQI PSIRQNLVYR QNDVVLVEEG ATFAIGLPPE RCHLFREDGT
ACRRLHKEPG V