Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1103 |
Symbol | |
ID | 6142867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1117938 |
End bp | 1119332 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641615987 |
Product | major facilitator family transporter |
Protein accession | YP_001743179 |
Protein GI | 170682915 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2271] Sugar phosphate permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.338347 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACGC ATTCATCCAC TGTTTCCCTG ACAGATAACC AACAGGCAGC TCAGGAGCGG CTGGAAACCT CCGAGGGACG CCGCGAGTTT TGGCGAGCAA CAGTTTCCTG CTGGTTAGGC ACCACGATGG AATATGCCGA TTTTGCCCTG TACGGCCTTG CCGCAGGTAT TATCTTTGGC GATGTCTTTT TTCCGGAGGC GACACCCGTC ATGGCATTAC TTTCCAGCTT TGCCGCCTAT TCTGTTGGGT TTATTGCCCG CCCTATTGGT GCATTACTAT TCGGCTGGAT AGGCGATAAA CATGGTCGGA AAATTGTCAT GGTTATCACT ATTGGATTGA TGGGCATGTC GACCATGCTA ATAGGATTAA TCCCCAGTTA CGCCCAGATA GGCGTCTGGG CACCGATATG TCTGGTTATC CTTCGATTTT CTCAGGGGCT GGGAGCAGGA GCGGAACTTT CAGGCGGTAC TGTGATGCTT GGTGAATATG CTCCCGTTAA ACGACGTGGA CTGGTTTCAT CTGTTATTGG TCTGGGTTCG AACAGCGGAA CATTACTGGC TTCGCTGGTT TGGCTCATCG TCCTGCAAAT GGACAAGGAT GACTTATTAA GCTGGGGATG GCGTATTCCT TTTCTTTGCA GCATTCTTAT TGCTGCTGCA GCTCTATTAA TTCGTCGTCA TATACGCGAA ACACCAGTCT TTGAACGTCA AAAAGCCCTT CTGCAGGCTG AACGAGAAAA GGTTATTCGT GAGGAAAAAG CACAGCAACA ACATGACAGT CGTAGCTTCT GGAAACGGAC CCGTGCCTTC TGGACCATGG TCGGATTACG CATAGGAGAG AATGGTCCTT CTTATCTCGC TCAGGGATTT ATCATTGGCT ATGTCGCGAA AGTACTGATG GTGGATAAGT CCGTACCCAC GGCAGCTGTA CTTATTGCAT CCGTTCTGGG ATTTGCCATT ATTCCTCTGG CGGGTTGGCT GTCCGATAGA TTCGGTAGAC GTATCATCTA TCGTTGGTTC TGCTTGTTAC TGATCCTGTA TGCCTTTCCG GCATTTATGT TGCTGGATTC TCGTGAGCCG TGGATTGTTA TCCCGACGAT CATTACCGGG ATGGGGCTGG CTTCACTGGG TATTTTTGGT GTTCAGGCTG CGTGGGGCGT TGAGCTTTTC GGTGTCACTA ATCGTTATAC CAAAATGGCA TTTGCAAAAG AGCTCGGTTC CATTCTGTCT GGCGGGACTG CACCACTTAT CGCCTCTGCG CTACTCTCGT ATTACGGGCA CTGGTGGCCA ATCGCTATCT ATTTCGCCTT TATGGCCGCG ATTGGACTGG TGACCACTTT CTTTGCACCA GAGACTCGCG GACGGGATCT CAACTTACCC GAGGATGCAA TTTAA
|
Protein sequence | MNTHSSTVSL TDNQQAAQER LETSEGRREF WRATVSCWLG TTMEYADFAL YGLAAGIIFG DVFFPEATPV MALLSSFAAY SVGFIARPIG ALLFGWIGDK HGRKIVMVIT IGLMGMSTML IGLIPSYAQI GVWAPICLVI LRFSQGLGAG AELSGGTVML GEYAPVKRRG LVSSVIGLGS NSGTLLASLV WLIVLQMDKD DLLSWGWRIP FLCSILIAAA ALLIRRHIRE TPVFERQKAL LQAEREKVIR EEKAQQQHDS RSFWKRTRAF WTMVGLRIGE NGPSYLAQGF IIGYVAKVLM VDKSVPTAAV LIASVLGFAI IPLAGWLSDR FGRRIIYRWF CLLLILYAFP AFMLLDSREP WIVIPTIITG MGLASLGIFG VQAAWGVELF GVTNRYTKMA FAKELGSILS GGTAPLIASA LLSYYGHWWP IAIYFAFMAA IGLVTTFFAP ETRGRDLNLP EDAI
|
| |