Gene EcSMS35_1422 details


Gene Information

Locus tag: EcSMS35_1422
Symbol: ydjE
ID: 6144689
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1406334
End bp: 1407692
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 45%
IMG OID: 641616300
Product: major facilitator family transporter
Protein accession: YP_001743480
Protein GI: 170683543
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
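
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1406334
end_bp = 1407692
gene_length_bp = 1359
protein_length_aa = 452


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1359
print(expected_protein_length(gene_length_bp))  # 452
```

Both results agree with the Gene Length and Protein Length fields, which supports the assumed coordinate convention.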


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.452173
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACAAT ATGATCAAAT TGGCGCAAGA CTGGACCGCT TGCCTTTGGC CCGGTTTCAT 
TATCGTATAT TTGGTATTAT AAGCTTTAGT CTGTTATTAA CGGGTTTTTT GAGTTATTCA
GGCAATGTTG TTTTAGCAAA GCTGGTAAGC AATGGATGGT CAAATAATTT CCTCAATGCC
GCCTTTACCT CGGCATTAAT GTTTGGTTAT TTCATCGGCT CTCTTACCGG TGGGTTCATC
GGTGATTATT TTGGGCGGCG CAGGGCTTTT CGCATAAATC TTCTCATCGT CGGTATTGCG
GCAACAGGGG CCGCTTTTGT CCCTGATATG TACTGGCTCA TCTTCTTTCG CTTCCTGATG
GGCACAGGAA TGGGGGCGCT GATTATGGTT GGCTATGCCT CATTTACGGA GTTTATCCCC
GCGACGGTGC GTGGAAAATG GTCCGCGCGG CTCTCATTTG TTGGTAACTG GTCGCCCATG
CTGTCTGCGG CGATAGGCGT GGTGGTTATC GCTTTTTTTA GTTGGCGGAT AATGTTTCTG
CTGGGCGGTA TTGGCATACT GTTAGCCTGG TTTCTCTCAG GTAAATACTT TATTGAGTCG
CCACGATGGC TGGCAGGGAA AGGGCAAATC GCAGGTGCAG AAAGCCAACT TCGTGAAGTA
GAGCAGCAAA TTGAAAGAGA GAAGAGTATT CGTTTACCCC CGCTTACTTT GAACCAGAGC
AACAGCAAGG TTAAGGTAAT CAAGGGTACC TTCTGGCTCC TGTTTAAAGG GGAAATGTTA
CGACGTACAT TAGTCGCGAT TACTGTGTTA ATTGCAATGA ACATTTCGCT TTATACCATC
ACCGTATGGA TACCGACCAT ATTTGTTAAC TCCGGCATTG ATGTTGATAA ATCAATATTA
ATGACCGCTG TTATTATGAT TGGCGCTCCG GTAGGGATAT TTATTGCGGC ATTAATTATT
GATCATTTTC CTCGTCGATT ATTTGGCTCT GCCTTACTTA TTATTATTGC TGTGTTAGGC
TATATCTATT CAATTCAAAC TACAGAGTGG GCGATTTTAA TCTATGGCCT GGTGATGATC
TTCTTTTTAT ACATGTATGT GTGCTTCGCG TCGGCGGTTT ATATCCCGGA ACTTTGGCCA
ACGCATTTAC GCCTGCGTGG TTCGGGTTTC GTTAATGCCG TCGGACGGAT CGTCGCAGTC
TTCACGCCCT ATGGCGTTGC GGCATTATTA ACACATTATG GGTCGATTAC GGTGTTTATG
GTGCTTGGTG TCATGTTAGT GCTCTGTGCG CTGGTTCTCT CCATTTTTGG CATCGAAACG
CGGAAGGTGT CGCTGGAAGA GATTTCTGAG GTGAATTAA
 
Protein sequence
MEQYDQIGAR LDRLPLARFH YRIFGIISFS LLLTGFLSYS GNVVLAKLVS NGWSNNFLNA 
AFTSALMFGY FIGSLTGGFI GDYFGRRRAF RINLLIVGIA ATGAAFVPDM YWLIFFRFLM
GTGMGALIMV GYASFTEFIP ATVRGKWSAR LSFVGNWSPM LSAAIGVVVI AFFSWRIMFL
LGGIGILLAW FLSGKYFIES PRWLAGKGQI AGAESQLREV EQQIEREKSI RLPPLTLNQS
NSKVKVIKGT FWLLFKGEML RRTLVAITVL IAMNISLYTI TVWIPTIFVN SGIDVDKSIL
MTAVIMIGAP VGIFIAALII DHFPRRLFGS ALLIIIAVLG YIYSIQTTEW AILIYGLVMI
FFLYMYVCFA SAVYIPELWP THLRLRGSGF VNAVGRIVAV FTPYGVAALL THYGSITVFM
VLGVMLVLCA LVLSIFGIET RKVSLEEISE VN
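
The sequences above are printed in space-separated blocks of ten residues. The following is a minimal sketch of how one might strip that layout and translate the coding sequence. It relies on translation table 11 assigning amino acids to codons as the standard code does (the two differ in which start codons are permitted). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first three blocks and the last nine bases of the gene sequence are used, for brevity.

```python
# Codon table in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove display spaces and line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


first_blocks = clean("ATGGAACAAT ATGATCAAAT TGGCGCAAGA")
last_codons = clean("GTGAATTAA")

print(translate(first_blocks))  # MEQYDQIGAR
print(translate(last_codons))   # VN
```

The two printed fragments match the start and end of the protein sequence listed above. Running `gc_fraction` on the full cleaned gene sequence would give a value to compare against the GC content field of the record.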