Gene ECH74115_3282 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3282 
SymbolmglA 
ID6968451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3014125 
End bp3015645 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content45% 
IMG OID643387095 
Productgalactose/methyl galaxtoside transporter ATP-binding protein 
Protein accessionYP_002271559 
Protein GI209397905 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.684279 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCAGCT CAACGACTCC GTCTTCCGGG GAATACTTGT TGGAAATGAG CGGTATCAAC 
AAGTCTTTTC CTGGTGTTAA GGCACTTGAT AACGTTAATT TAAAAGTCCG GCCACATTCT
ATCCATGCAT TAATGGGGGA AAACGGCGCA GGAAAATCAA CATTATTAAA ATGCCTGTTT
GGTATTTATA AAAAAGACTC CGGCACCATT TTATTCCAGG GTAAAGAGAT CGATTTCCAT
TCGGCCAAAG AAGCACTGGA AAATGGTATT TCGATGGTAC ACCAGGAGTT AAACCTGGTA
TTACAACGTT CGGTGATGGA CAACATGTGG CTTGGGCGAT ACCCCACCAA AGGCATGTTT
GTCGATCAGG ACAAAATGTA CCGCGAAACC AAAGCGATTT TTGATGAACT GGATATTGAT
ATCGATCCGC GTGCGCGCGT CGGCACATTA TCTGTTTCGC AAATGCAGAT GATCGAAATC
GCCAAAGCGT TTTCCTATAA CGCGAAAATT GTGATTATGG ATGAACCGAC TTCTTCGTTA
ACCGAAAAAG AGGTCAATCA TCTGTTCACT ATTATTCGTA AATTAAAAGA GCGCGGCTGC
GGTATTGTTT ATATCTCGCA TAAAATGGAG GAAATCTTCC AGTTATGTGA TGAAGTTACC
GTATTGCGCG ACGGTCAGTG GATCGCCACC GAACCGCTGG CAGGGCTGAC GATGGACAAG
ATTATCGCCA TGATGGTTGG GCGTTCTCTT AACCAGCGTT TTCCTGACAA AGAAAACAAG
CCGGGCGAAG TCATCCTCGA GGTACGTAAC CTGACGTCAC TGCGCCAGCC GTCGATTCGC
GATGTCTCGT TTGATCTGCA TAAAGGGGAG ATCCTCGGTA TTGCCGGTCT GGTGGGGGCG
AAACGTACCG ATATTGTTGA GACGTTATTT GGTATTCGCG AGAAATCGGC TGGCACCATT
ACGTTGCACG GCAAAAAGAT CAATAACCAT AATGCCAACG AAGCCATAAA CCACGGATTT
GCACTGGTAA CTGAGGAGCG CCGCTCAACG GGAATTTATG CCTATCTGGA TATTGGTTTT
AACTCGTTAA TTTCCAATAT TCGCAACTAC AAAAATAAAG TTGGTTTACT GGATAACTCG
CGGATGAAAC GCGATACCCA GTGGGTGATT GATTCGATGC GGGTAAAAAC ACCGGGGCAT
CGGACGCAAA TTGGTTCGCT CTCAGGTGGT AATCAGCAAA AGGTGATTAT TGGGCGCTGG
CTGTTAACGC AACCAGAAAT ATTAATGCTC GATGAACCGA CGCGCGGTAT TGATGTTGGG
GCGAAGTTTG AGATTTATCA GTTAATTGCC GAACTGGCGA AGAAAGGCAA GGGGATTATT
ATTATCTCCT CTGAAATGCC AGAGTTGTTA GGGATAACAG ACCGTATTCT GGTTATGAGC
AATGGCCTCG TTTCCGGAAT TGTCGATACA AAAACAACAA CGCAAAACGA AATTCTACGT
CTTGCGTCTT TGCACCTTTA A
 
Protein sequence
MVSSTTPSSG EYLLEMSGIN KSFPGVKALD NVNLKVRPHS IHALMGENGA GKSTLLKCLF 
GIYKKDSGTI LFQGKEIDFH SAKEALENGI SMVHQELNLV LQRSVMDNMW LGRYPTKGMF
VDQDKMYRET KAIFDELDID IDPRARVGTL SVSQMQMIEI AKAFSYNAKI VIMDEPTSSL
TEKEVNHLFT IIRKLKERGC GIVYISHKME EIFQLCDEVT VLRDGQWIAT EPLAGLTMDK
IIAMMVGRSL NQRFPDKENK PGEVILEVRN LTSLRQPSIR DVSFDLHKGE ILGIAGLVGA
KRTDIVETLF GIREKSAGTI TLHGKKINNH NANEAINHGF ALVTEERRST GIYAYLDIGF
NSLISNIRNY KNKVGLLDNS RMKRDTQWVI DSMRVKTPGH RTQIGSLSGG NQQKVIIGRW
LLTQPEILML DEPTRGIDVG AKFEIYQLIA ELAKKGKGII IISSEMPELL GITDRILVMS
NGLVSGIVDT KTTTQNEILR LASLHL