Gene EcSMS35_4901 details


Gene Information

Locus tag: EcSMS35_4901
Symbol:
ID: 6147166
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 5021694
End bp: 5023055
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 52%
IMG OID: 641619704
Product: major facilitator transporter
Protein accession: YP_001746811
Protein GI: 170683547
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
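
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 5021694
END_BP = 5023055
GENE_LENGTH_BP = 1362
PROTEIN_LENGTH_AA = 453


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp):
    """Residue count, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(gene_length(START_BP, END_BP))            # 1362
print(expected_protein_length(GENE_LENGTH_BP))  # 453
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields of the record.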


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.449685
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAAAAAG AAAATATCAC CCTCGATCCG CGTTCTTCAT TTACTCCATC TTCGTCGGCA 
GATATTCCCG TGCCACCAGA TGGATTAGTT CAACGCAGTA CCCGAATTAA ACGCATTCAA
ACCACCGCCA TGTTGTTATT ATTTTTTGCG GCGGTAATCA ATTATCTCGA CCGCAGTTCG
CTGTCGGTAG CAAATTTAAC GATTCGTGAA GAATTGGGAT TAAGTGCCAC CGAAATCGGC
GCTTTGCTCT CCGTGTTTTC ACTCGCTTAC GGGATTGCGC AACTTCCTTG CGGCCCACTA
TTGGATCGTA AAGGCCCGCG CCTGATGCTG GGACTGGGGA TGTTCTTCTG GTCACTGTTC
CAGGCCATGT CTGGCATGGT GCACAGCTTT ACGCAGTTCG TGTTGGTGCG TATCGGTATG
GGGATTGGTG AAGCGCCGAT GAACCCATGC GGTGTAAAAG TCATTAACGA CTGGTTCAAC
ATCAAAGAGC GCGGACGCCC GATGGGCTTC TTCAACGCAG CTTCTACCAT TGGCGTTGCC
GTAAGCCCAC CGATTCTGGC GGCGATGATG CTGGTGATGG GCTGGCGCGG GATGTTTATT
ACTATTGGTG TACTGGGGAT TTTTCTCGCC ATCGGCTGGT ATATGCTCTA TCGCAACCGC
GAGCACGTAG AACTGACTGC CGTTGAACAA GCTTATCTCA ATGCAGGTAG CGTCAATGCC
CGCCGAGATC CGCTCAGTTT TGCCGAATGG CGCAGCCTGT TCCGTAACCG CACAATGTGG
GGAATGATGC TCGGATTCAG TGGCATCAAC TACACTGCGT GGCTGTATCT GGCCTGGCTT
CCTGGTTACC TGCAAACAGC CTATAACCTG GATTTAAAAA GCACAGGGTT GATGGCGGCT
ATCCCTTTCC TGTTTGGGGC TGCCGGGATG CTGGTCAACG GTTACGTTAC TGACTGGCTG
GTCAAAGGGG GAATGGCTCC GATTAAAAGC CGTAAGATCT GCATTATTGC CGGGATGTTC
TGTTCTGCCG CCTTTACGCT GGTCGTACCG CAAGCGACAA CATCCATGAC GGCGGTTCTG
CTGATTGGTA TGGCACTGTT TTGTATTCAC TTTGCCGGAA CATCCTGCTG GGGATTGATC
CACGTCGCAG TAGCTTCTCG CATGACTGCG TCGGTGGGCA GTATCCAGAA CTTTGCCAGC
TTCATCTGCG CCTCTTTCGC GCCGATCATT ACTGGTTTTA TTGTTGATAC CACCCATTCA
TTCCGTCTGG CACTAATCAT CTGCGGTTGC GTCACTGCGG CGGGTGCACT GGCGTACATC
TTCCTGGTTC GTCAGCCGAT CAACGACCCA CGGAAAGATT AA
 
Protein sequence
MEKENITLDP RSSFTPSSSA DIPVPPDGLV QRSTRIKRIQ TTAMLLLFFA AVINYLDRSS 
LSVANLTIRE ELGLSATEIG ALLSVFSLAY GIAQLPCGPL LDRKGPRLML GLGMFFWSLF
QAMSGMVHSF TQFVLVRIGM GIGEAPMNPC GVKVINDWFN IKERGRPMGF FNAASTIGVA
VSPPILAAMM LVMGWRGMFI TIGVLGIFLA IGWYMLYRNR EHVELTAVEQ AYLNAGSVNA
RRDPLSFAEW RSLFRNRTMW GMMLGFSGIN YTAWLYLAWL PGYLQTAYNL DLKSTGLMAA
IPFLFGAAGM LVNGYVTDWL VKGGMAPIKS RKICIIAGMF CSAAFTLVVP QATTSMTAVL
LIGMALFCIH FAGTSCWGLI HVAVASRMTA SVGSIQNFAS FICASFAPII TGFIVDTTHS
FRLALIICGC VTAAGALAYI FLVRQPINDP RKD
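
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the method used to produce the record: it relies on translation table 11 sharing the amino acid assignments of the standard genetic code, and it assumes the first codon is a valid start codon (here GTG), which is read as methionine. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Amino acid assignments in TCAG order; translation table 11 uses the
# same assignments as the standard code ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna):
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon.

    Assumption: the first codon is a valid start codon and is therefore
    emitted as 'M' regardless of its usual amino acid.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGGAAAAAG AAAATATCAC CCTCGATCCG"))  # MEKENITLDP
```

The printed residues match the first ten residues of the protein sequence listed above. The whitespace handling lets the 10-base blocks be pasted in as shown in the record.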