Gene EcSMS35_2381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2381 
Symbol 
ID6143507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2424732 
End bp2426060 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content40% 
IMG OID641617254 
Productmajor facilitator transporter 
Protein accessionYP_001744426 
Protein GI170683353 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.411675 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.855412 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGTC CTCAGAAAGA TTTGTATGAA GCCATTTCTA TTGTGAAAAG GAAGATTATT 
CCGTTAGCAT TTATTCTGTA TTTTTTTAAC TATATGGATC GTGTTAACAT CGGATTTGCC
GCTTTACGCA TGAATGAATC CCTTGGTATT ACACCAGAGG ATTTTGCTAA TATATCTTCA
ATATTTTTTG TGTCCTATTT GATTTTTCAA ATTCCGAGCA GTATTGGATT GCAAAAGTTA
GGTGCAAGAA AATGGATTAG CGCCATTATT GTTGGTTGGG GAACAGTAAC AACGTTAATT
TTCTTTGCCA AAGACACCCA GCATATATTG CTCGCTCGGG TCTTTTTGGG AATATTTGAA
GCGGGGTTTT TCCCTGGCAT GGTTTACTAC CTGGCATGTT GGTTCCCGGC GCGTGAACGT
GGAAAAGTGA ATAGCTTTTT TATGTTATCA ATCGCAGTTG CATCTGTGCT GGCGGCACCA
ATGTCTGGTT GGATTATTGA ACATATGAAT ACCGGTAACT ATGAAGGATG GCGCTGGCTT
TTTGCTATTG AAGGCATTCC TACGGTGTTA TTGGGGGCGC TGACCTTTTA TTTACTGCCA
GATCGTCCGG AAAAAGCCCG CTGGCTTACA CCATCGCAAG TCACCGCATT GGTCGATAAA
CTTAATTCTG ATAACGAAAA AGCCGCAGCA TTAAATAAAA ATACAAATGC ATCTTTTCTG
TCAATCATTA AAAATCCAGT ATTACTACAA CTCTCATTTG CTTATATGTT TATTCAGGCA
GCGGCGTTGG CGGCAAATTA CTGGTTACCG GGGTTGGTTA AAGGATTTTC ATCGGATTTC
ACAGATACCG ATGTCGGTTT GATTATGAGT ATTCCGTTTA TTTTCGCGAT GTTCAGTATG
CCATTATGGG GATGGCATTC CGATAAAAAG AATGAAAGGA AATGGCATGC CGCACTACCG
ATGTTGGTTG CAGGTTGCGG TTTTTTAATG ATCGCACTTG TACCTTCGAT GACGCTGCGC
ATGATGGGAT TAACATTATA CGGTGTAGGC ATTCTCAGTT ATTACGGTCC GTACTGGGCT
TTACCCTCAG CTTTACTTTC ACCATCAGGC CTGGCAATCA GTATTGCGTT TATTAACTCT
TGTTCAAGTT TGGGTGGTTT TCTGATCAAT AAATCACTTG GCTATGTTTC TACACATTAT
GGAGCCACGG GGATATTTAT TGTTGAGGCG ATTCTTTGCT TTATCGCCGT AGCTATTTTG
TTATCAATGA AAATTGATAA TAAAAAAGAT AACGCCCAGA AAGACAATGT TATTTCACAT
GCTAAATAA
 
Protein sequence
MSSPQKDLYE AISIVKRKII PLAFILYFFN YMDRVNIGFA ALRMNESLGI TPEDFANISS 
IFFVSYLIFQ IPSSIGLQKL GARKWISAII VGWGTVTTLI FFAKDTQHIL LARVFLGIFE
AGFFPGMVYY LACWFPARER GKVNSFFMLS IAVASVLAAP MSGWIIEHMN TGNYEGWRWL
FAIEGIPTVL LGALTFYLLP DRPEKARWLT PSQVTALVDK LNSDNEKAAA LNKNTNASFL
SIIKNPVLLQ LSFAYMFIQA AALAANYWLP GLVKGFSSDF TDTDVGLIMS IPFIFAMFSM
PLWGWHSDKK NERKWHAALP MLVAGCGFLM IALVPSMTLR MMGLTLYGVG ILSYYGPYWA
LPSALLSPSG LAISIAFINS CSSLGGFLIN KSLGYVSTHY GATGIFIVEA ILCFIAVAIL
LSMKIDNKKD NAQKDNVISH AK