Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2295 |
Symbol | mglC |
ID | 6147082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2322457 |
End bp | 2323467 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641617169 |
Product | beta-methylgalactoside transporter inner membrane component |
Protein accession | YP_001744342 |
Protein GI | 170680415 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4211] ABC-type glucose/galactose transport system, permease component |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.575488 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.00421208 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTGCGT TAAATAAGAA AAGTTTTCTT ACTTACCTGA AAGAAGGCGG TATTTACGTC GTTCTTTTAG TTTTGCTGGC AATTATTATT TTCCAGGACC CAACATTTTT AAGTCTGTTG AACTTAAGTA ATATTCTCAC CCAGTCATCG GTGCGTATTA TTATCGCGCT CGGTGTGGCA GGGTTAATTG TCACCCAGGG GACCGACCTT TCTGCAGGTC GCCAGGTGGG GCTGGCGGCA GTAGTAGCGG CGACGTTGTT GCAGTCCATG GATAACGCCA ACAAAGTGTT CCCGGAAATG GCGACGATGC CGATTGCGCT GGTTATTCTG ATTGTCTGTG CCATTGGTGC GGTGATCGGT TTGATTAACG GCCTGATTAT CGCTTATCTC AACGTGACGC CATTCATTAC CACGCTCGGC ACGATGATCA TCGTCTATGG CATCAACTCG CTCTATTACG ACTTTGTCGG GGCGTCGCCA ATTTCTGGTT TTGACAGTGG CTTCTCTACC TTTGCTCAGG GCTTTGTCGC GCTGGGGAGT TTCCGTCTCT CTTACATCAC CTTCTATGCG TTGATTGCGG TGGTGTTTGT CTGGGTGTTG TGGAACAAAA CCCGCTTCGG TAAGAACATT TTTGCCATTG GCGGTAACCC GGAAGCGGCG AAAGTATCTG GTGTCAACGT CGGCCTGAAC CTGCTGATGA TCTACGCGTT GTCTGGTGTG TTCTATGCCT TCGGCGGGAT GTTAGAAGCC GGACGTATCG GCTCAGCCAC CAATAACCTC GGCTTTATGT ATGAGCTGGA TGCTATCGCG GCGTGCGTGG TAGGCGGCGT ATCGTTCAGC GGCGGTGTGG GGACGGTGAT TGGCGTGGTG ACCGGGGTAA TTATTTTTAC CGTCATCAAC TATGGCCTGA CGTATATCGG CGTAAACCCA TACTGGCAGT ACATCATCAA AGGGGCGATT ATTATCTTCG CCGTAGCGCT GGATTCACTG AAATACGCGC GTAAGAAATG A
|
Protein sequence | MSALNKKSFL TYLKEGGIYV VLLVLLAIII FQDPTFLSLL NLSNILTQSS VRIIIALGVA GLIVTQGTDL SAGRQVGLAA VVAATLLQSM DNANKVFPEM ATMPIALVIL IVCAIGAVIG LINGLIIAYL NVTPFITTLG TMIIVYGINS LYYDFVGASP ISGFDSGFST FAQGFVALGS FRLSYITFYA LIAVVFVWVL WNKTRFGKNI FAIGGNPEAA KVSGVNVGLN LLMIYALSGV FYAFGGMLEA GRIGSATNNL GFMYELDAIA ACVVGGVSFS GGVGTVIGVV TGVIIFTVIN YGLTYIGVNP YWQYIIKGAI IIFAVALDSL KYARKK
|
| |