Gene EcSMS35_4262 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4262 
Symbol 
ID6146474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4360441 
End bp4361826 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content52% 
IMG OID641619083 
Productsugar transporter family protein 
Protein accessionYP_001746207 
Protein GI170680543 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID[TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCACA TCACAACGGA AGATCCAGCA ACTTTACGCC TGCCCTTTAA AGAGAAACTC 
TCTTACGGTA TTGGCGACCT GGCCTCTAAC ATCCTGCTGG ATATTGGTAC GCTTTATCTT
TTGAAGTTTT ATACCGACGT TCTGGGGCTG CCTGGCACCT ATGGTGGCAT TATCTTTTTG
ATTTCAAAAT TCTTTACTGC GTTTACCGAT ATGGGAACAG GCATCATGCT GGATTCCCGA
CGCAAGATCG GCCCAAAAGG CAAGTTCCGT CCTTTTATTC TGTATGCGTC ATTCCCGGTC
ACCCTGCTGG CGATCGCCAA CTTTATCGGT ACGCCGTTCG ATGTCACCGG TAAAACGGTG
ATGGCCACCA TCCTGTTTAT GCTTTACGGA CTGTTTTTCA GCATGATGAA CTGCTCGTAT
GGCGCGATGG TCCCCGCTAT TACTAAAAAC CCCAACGAAC GTGCATCGCT GGCAGCATGG
CGTCAGGGTG GCGCTACATT AGGCCTGCTG CTGTGTACGG TGGGATTCGT GCCGGTTATG
AATCTTATCG AAGGTAATCA GCAACTTGGC TATATCTTCG CCGCCACGCT GTTTTCACTG
TTCGGCCTGC TGTTTATGTG GATCTGCTAC TCGGGCGTGA AAGAGCGTTA TGTCGAAACC
CAACCTACCA ATCCGGCGCA AAAGCCTGGC TTGTTGCAGT CTTTCCGCGC GATTGCTGGT
AACCGCCCAC TGTTCATTCT GTGTATTGCC AACCTCTGCA CTTTAGGGGC GTTTAACGTC
AAGCTCGCCA TTCAGGTCTA TTACACCCAG TACGTGCTCA ACGATCCCAT CCTGTTGTCA
TATATGGGAT TTTTCAGCAT GGGCTGTATT TTCATCGGCG TGTTCCTGAT GCCCGGCGCA
GTCAGACGTT TTGGTAAGAA GAAGGTCTAT ATCGGCGGCC TGCTGATTTG GGTGCTTGGC
GATCTGCTCA ACTATTTCTT TGGCGGCGGT TCGGTCAGCT TCGTGGCGTT CTCCTGCCTG
GCATTCTTCG GCTCAGCGTT TGTTAACAGC CTGAACTGGG CGCTGGTTTC CGACACCGTC
GAGTACGGCG AGTGGCGCAC CGGTGTGCGT TCGGAAGGTA CGGTCTACAC CGGTTTCACC
TTCTTTCGCA AAGTATCTCA GGCGCTGGCA GGTTTCTTCC CAGGCTGGAT GCTGACGCAA
ATCGGCTATG TGCCAAACGT CGCCCAGGCT GACCACACTA TCGAAGGGTT GCGCCAACTG
ATCTTCATCT ACCCAAGCGC ACTGGCAGTA GTCACCATCG TGGCAATGGG TTGCTTCTAC
AGTCTGAACG AGAAGATGTA CGTCCGCATT GTTGAAGAGA TAGAAGCCCG TAAACGCACG
GCGTAA
 
Protein sequence
MSHITTEDPA TLRLPFKEKL SYGIGDLASN ILLDIGTLYL LKFYTDVLGL PGTYGGIIFL 
ISKFFTAFTD MGTGIMLDSR RKIGPKGKFR PFILYASFPV TLLAIANFIG TPFDVTGKTV
MATILFMLYG LFFSMMNCSY GAMVPAITKN PNERASLAAW RQGGATLGLL LCTVGFVPVM
NLIEGNQQLG YIFAATLFSL FGLLFMWICY SGVKERYVET QPTNPAQKPG LLQSFRAIAG
NRPLFILCIA NLCTLGAFNV KLAIQVYYTQ YVLNDPILLS YMGFFSMGCI FIGVFLMPGA
VRRFGKKKVY IGGLLIWVLG DLLNYFFGGG SVSFVAFSCL AFFGSAFVNS LNWALVSDTV
EYGEWRTGVR SEGTVYTGFT FFRKVSQALA GFFPGWMLTQ IGYVPNVAQA DHTIEGLRQL
IFIYPSALAV VTIVAMGCFY SLNEKMYVRI VEEIEARKRT A