Gene Information |
Locus tag | EcSMS35_2296 |
Symbol | mglA |
ID | 6144584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2323483 |
End bp | 2325003 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641617170 |
Product | galactose/methyl galactoside transporter ATP-binding protein |
Protein accession | YP_001744343 |
Protein GI | 170679772 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1129] ABC-type sugar transport system, ATPase component |
TIGRFAM ID | |
|
|
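The Gene Length and Protein Length rows can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are inclusive, 1-based positions (the record does not state this, but the assumption reproduces the listed values) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(2323483, 2325003)
print(length, protein_length(length))  # 1521 506
```

Both numbers agree with the Gene Length (1521 bp) and Protein Length (506 aa) rows. The strand does not affect the length calculation.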
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0020685 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTCAGCT CAACGACTCC GTCTTCCGGG GAATACTTGT TGGAAATGAG CGGTATCAAC AAGTCTTTTC CTGGTGTTAA GGCACTTGAT AACGTTAATT TAAAAGTCCG GCCACATTCT ATCCATGCAT TAATGGGGGA AAACGGCGCA GGAAAATCAA CATTATTAAA ATGCCTGTTT GGTATTTATC AAAAAGACTC CGGCACCATT TTATTCCAGG GTAAAGAGAT CGATTTCCAT TCTGCAAAAG AAGCGCTGGA AAATGGTATT TCGATGGTAC ACCAGGAGTT AAACCTGGTA TTACAACGTT CGGTGATGGA TAACATGTGG CTGGGGCGAT ATCCCACCAA AGGCATGTTT GTCGATCAGG ACAAAATGTA CCGCGAAACC AAAGCGATTT TTGATGAACT GGATATTGAT ATCGATCCGC GTGCGCGCGT CGGCACATTA TCCGTTTCGC AAATGCAGAT GATCGAAATC GCCAAAGCGT TTTCCTATAA CGCGAAAATT GTGATTATGG ATGAACCGAC TTCTTCGTTA ACCGAAAAAG AGGTCAATCA TCTGTTCACT ATTATTCGTA AATTAAAAGA GCGCGGCTGC GGTATTGTTT ATATCTCACA TAAAATGGAA GAAATCTTCC AGTTATGTGA TGAAGTTACC GTATTGCGCG ACGGTCAGTG GATCGCCACC GAACCGCTGG CAGGGCTGAC GATGGACAAG ATCATCGCCA TGATGGTTGG GCGTTCTCTT AACCAGCGTT TCCCTGACAA AGAAAACAAG CCGGGCGAAG TGATTCTGGA AGTGCGTAAC CTGACGTCGC TGCGCCAGCC GTCGATCCGT GATGTCTCGT TTGATCTGCA TAAAGGGGAG ATTCTCGGTA TAGCCGGTCT GGTGGGAGCG AAACGTACCG ATATTGTTGA GACGTTATTT GGTATTCGCG AGAAATCGGC TGGCACCATT ACGTTGCACG GCAAAAAGAT CAATAACCAT AATGCCAACG AAGCCATAAA CCACGGATTT GCACTGGTAA CTGAAGAGCG CCGCTCAACG GGAATTTATG CCTATCTGGA TATTGGCTTT AACTCGTTAA TTTCCAATAT TCGCAATTAC AAAAACAAAG TTGGCTTGCT GGATAATTCG CGGATGAAAA GCGATACCCA GTGGGTCATT GATTCGATGC GGGTAAAAAC GCCAGGTCAT CGGACGCAAA TAGGTTCGCT CTCCGGTGGT AATCAGCAAA AAGTGATTAT TGGTCGCTGG CTGTTAACGC AACCAGAAAT ATTAATGCTC GATGAACCGA CGCGCGGTAT TGACGTTGGG GCTAAGTTTG AGATTTATCA GTTAATTGCC GAACTGGCGA AGAAAGGCAA GGGGATTATT ATTATCTCCT CTGAAATGCC TGAGTTGTTA GGGATAACAG ACCGTATTCT GGTCATGAGC AATGGTCTCG TTTCCGGAAT TGTCGATACA AAAACAACAA CGCAAAACGA AATTCTGCGT CTTGCGTCTT TGCACCTTTA A
|
Protein sequence | MVSSTTPSSG EYLLEMSGIN KSFPGVKALD NVNLKVRPHS IHALMGENGA GKSTLLKCLF GIYQKDSGTI LFQGKEIDFH SAKEALENGI SMVHQELNLV LQRSVMDNMW LGRYPTKGMF VDQDKMYRET KAIFDELDID IDPRARVGTL SVSQMQMIEI AKAFSYNAKI VIMDEPTSSL TEKEVNHLFT IIRKLKERGC GIVYISHKME EIFQLCDEVT VLRDGQWIAT EPLAGLTMDK IIAMMVGRSL NQRFPDKENK PGEVILEVRN LTSLRQPSIR DVSFDLHKGE ILGIAGLVGA KRTDIVETLF GIREKSAGTI TLHGKKINNH NANEAINHGF ALVTEERRST GIYAYLDIGF NSLISNIRNY KNKVGLLDNS RMKSDTQWVI DSMRVKTPGH RTQIGSLSGG NQQKVIIGRW LLTQPEILML DEPTRGIDVG AKFEIYQLIA ELAKKGKGII IISSEMPELL GITDRILVMS NGLVSGIVDT KTTTQNEILR LASLHL
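The relationship between the Gene sequence and Protein sequence rows can be illustrated with a small translation routine. This is a minimal sketch, not the pipeline that produced the record: it assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TAA), strips the spaces that separate the 10-base blocks, and uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons. Alternative start codons permitted by table 11 are not handled, and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (such as the 10-base block separators) and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGTCAGCT CAACGACTCC GTCTTCCGGG"))  # MVSSTTPSSG
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would give values to compare against the Protein sequence and GC content rows.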