Gene EcSMS35_1103 details

Gene Information

Locus tag: EcSMS35_1103
Symbol:
ID: 6142867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1117938
End bp: 1119332
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 50%
IMG OID: 641615987
Product: major facilitator family transporter
Protein accession: YP_001743179
Protein GI: 170682915
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
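
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: the variable names are hypothetical, and it assumes that the start and end coordinates are inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the record; variable names are illustrative only.
start_bp = 1117938
end_bp = 1119332

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and yields no amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1395 464
```

Both results match the Gene Length and Protein Length fields listed in the record.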


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.338347
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACACGC ATTCATCCAC TGTTTCCCTG ACAGATAACC AACAGGCAGC TCAGGAGCGG 
CTGGAAACCT CCGAGGGACG CCGCGAGTTT TGGCGAGCAA CAGTTTCCTG CTGGTTAGGC
ACCACGATGG AATATGCCGA TTTTGCCCTG TACGGCCTTG CCGCAGGTAT TATCTTTGGC
GATGTCTTTT TTCCGGAGGC GACACCCGTC ATGGCATTAC TTTCCAGCTT TGCCGCCTAT
TCTGTTGGGT TTATTGCCCG CCCTATTGGT GCATTACTAT TCGGCTGGAT AGGCGATAAA
CATGGTCGGA AAATTGTCAT GGTTATCACT ATTGGATTGA TGGGCATGTC GACCATGCTA
ATAGGATTAA TCCCCAGTTA CGCCCAGATA GGCGTCTGGG CACCGATATG TCTGGTTATC
CTTCGATTTT CTCAGGGGCT GGGAGCAGGA GCGGAACTTT CAGGCGGTAC TGTGATGCTT
GGTGAATATG CTCCCGTTAA ACGACGTGGA CTGGTTTCAT CTGTTATTGG TCTGGGTTCG
AACAGCGGAA CATTACTGGC TTCGCTGGTT TGGCTCATCG TCCTGCAAAT GGACAAGGAT
GACTTATTAA GCTGGGGATG GCGTATTCCT TTTCTTTGCA GCATTCTTAT TGCTGCTGCA
GCTCTATTAA TTCGTCGTCA TATACGCGAA ACACCAGTCT TTGAACGTCA AAAAGCCCTT
CTGCAGGCTG AACGAGAAAA GGTTATTCGT GAGGAAAAAG CACAGCAACA ACATGACAGT
CGTAGCTTCT GGAAACGGAC CCGTGCCTTC TGGACCATGG TCGGATTACG CATAGGAGAG
AATGGTCCTT CTTATCTCGC TCAGGGATTT ATCATTGGCT ATGTCGCGAA AGTACTGATG
GTGGATAAGT CCGTACCCAC GGCAGCTGTA CTTATTGCAT CCGTTCTGGG ATTTGCCATT
ATTCCTCTGG CGGGTTGGCT GTCCGATAGA TTCGGTAGAC GTATCATCTA TCGTTGGTTC
TGCTTGTTAC TGATCCTGTA TGCCTTTCCG GCATTTATGT TGCTGGATTC TCGTGAGCCG
TGGATTGTTA TCCCGACGAT CATTACCGGG ATGGGGCTGG CTTCACTGGG TATTTTTGGT
GTTCAGGCTG CGTGGGGCGT TGAGCTTTTC GGTGTCACTA ATCGTTATAC CAAAATGGCA
TTTGCAAAAG AGCTCGGTTC CATTCTGTCT GGCGGGACTG CACCACTTAT CGCCTCTGCG
CTACTCTCGT ATTACGGGCA CTGGTGGCCA ATCGCTATCT ATTTCGCCTT TATGGCCGCG
ATTGGACTGG TGACCACTTT CTTTGCACCA GAGACTCGCG GACGGGATCT CAACTTACCC
GAGGATGCAA TTTAA
 
Protein sequence
MNTHSSTVSL TDNQQAAQER LETSEGRREF WRATVSCWLG TTMEYADFAL YGLAAGIIFG 
DVFFPEATPV MALLSSFAAY SVGFIARPIG ALLFGWIGDK HGRKIVMVIT IGLMGMSTML
IGLIPSYAQI GVWAPICLVI LRFSQGLGAG AELSGGTVML GEYAPVKRRG LVSSVIGLGS
NSGTLLASLV WLIVLQMDKD DLLSWGWRIP FLCSILIAAA ALLIRRHIRE TPVFERQKAL
LQAEREKVIR EEKAQQQHDS RSFWKRTRAF WTMVGLRIGE NGPSYLAQGF IIGYVAKVLM
VDKSVPTAAV LIASVLGFAI IPLAGWLSDR FGRRIIYRWF CLLLILYAFP AFMLLDSREP
WIVIPTIITG MGLASLGIFG VQAAWGVELF GVTNRYTKMA FAKELGSILS GGTAPLIASA
LLSYYGHWWP IAIYFAFMAA IGLVTTFFAP ETRGRDLNLP EDAI
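
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, not the database's own pipeline: the `translate` helper and the variable names are hypothetical. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation codons (table 11 differs mainly in the alternative start codons it permits, which does not matter here because this gene starts with ATG). Only the first and last rows of the gene sequence are used; the last row stays in frame because the 23 preceding rows hold 1380 bases, a multiple of 3.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in TCAG order
# (third base varying fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last rows of the gene sequence, copied from the record.
first_row = "ATGAACACGC ATTCATCCAC TGTTTCCCTG ACAGATAACC AACAGGCAGC TCAGGAGCGG"
last_row = "GAGGATGCAA TTTAA"

print(translate(first_row))  # MNTHSSTVSLTDNQQAAQER
print(translate(last_row))   # EDAI
```

The two outputs correspond to the first 20 and last 4 residues of the protein sequence. Passing the full gene sequence to the same function should yield the complete 464-residue protein under the same assumptions.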