Gene EcSMS35_3908 details


Gene Information

Locus tag: EcSMS35_3908
Symbol:
ID: 6145753
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3977925
End bp: 3979325
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 55%
IMG OID: 641618734
Product: sugar glycoside-pentoside-hexuronide family protein
Protein accession: YP_001745873
Protein GI: 170680906
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2211] Na+/melibiose symporter and related transporters
TIGRFAM ID: [TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter
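
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The record does not state either convention, but both are consistent with its numbers. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3977925
END_BP = 3979325
GENE_LENGTH_BP = 1401
PROTEIN_LENGTH_AA = 466


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of residues, assuming the CDS ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold under the assumptions stated above.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```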


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.845629
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCTA CACCGATTAC TACCGCTGAT ATCGCTAAAG GTAAAATTGA CGATGCGTTA 
TCTGTACGGG AAAAAATAGG CTACGGCCTG GGTGACGCAG GCGGCACCGT AATAACTTGC
CTGATCATGA ACTTTCTCAC CTTTTTCTAC ACCGACGTTT TTGGCTTAAC TCCGGCGCTG
GTCGGCACGC TGTTTATTGC ACTGCGCGTG TTTGATGCCA TCTCCGACCC GGTGATGGGC
GTCATTGCCG ACCGGACGCA AAGTCGCTGG GGGCGCTTTC GTCCGTGGCA GTTGTGGATT
GCCATTCCCA TCGGCATTAT CGGCATCCTG ACGTTCACCG TGCCAGATGC CAGCATGGGA
GTAAAAATCG CCTGGGCGTT CGGCACTTAC CTGCTCCTTT CAGTCGGTTA TACCGCCATC
AACGTGCCGT ACTGCGCGCT GATCAACACC ATGACCACCC GCCACAATGA AGTGATCTCC
TGCCAGTCCT GGCGATTCGT TCTCTGCGGC GTGGCGGGAT TTCTGGTTTC GGTAGGCTTA
CCGTGGATGG TAGATCTCTT CGGTCAGGGT AATGCCGCGC GCGGCTATCA ACTGGGCGTC
GGCGTGCTGT GCGCCATCGC TGTGGTGATG TTCCTGTGCT GTTTCTTCTG GGTTCGTGAA
CGGGTGCCAC TCTCCACAAT GGGGAAATTT ACCCTGCGCG AACATCTTGC CGGGCTGCGG
AACAACGACC AACTGCTGCT GATGCTGGTC ATGTCTTTCC TGCTGATTAA CGTCTTTAAT
ATTCGCGGCG GTGGGTATAT GTACTTCATT ACCTACGTCT TACAAGGCAG TACGGGCTAC
ACGTCGCTGT TCTTCACTAT GGTCACCTTC GCCTCCATTA TCGGCTCGGT GATTGTCAGC
CCGTTAACGC GGCGTTTCGA TACCGTCAAA ATTTATTACT ACACCAACCT GCTCCTCGCT
GCGCTGGCGG TGTTGATGTG GTTCCTGCCC TCCAGCCCGG CTTATCAAAC GCTGTGGCTG
GCGGTGATCC TCGGTAATGG CGTGATTCTT GGCTTCACAT TGCCATTGCA CTTCTCATTG
ATGGCCTTTG CCGATGACTA CGGCGAGTGG AAAACCCGCG TACGTTCTTC CGGCATGAAC
TTCGCCTTCA ATCTGTTTTT CATCAAGCTG GCCTGGGCCT CCAGCGCCGG GATCATCAGC
CTGCTGTTTA TTTTTGTCGC CTACCAGCCA GGCGTGGAAA ACCAGACCGC CAGTTCGCTT
GGCGGGATCA CAGCAATGGA AACATTGCTG CCTGCGCTGT TCCACCTGCT GCTGGCGATG
GCGATCCGCT TTTGCAAACT CAATAACCCT ATGATGTCAC GCATTGCTAG CGACCTGCGT
CAGCGTCATG TACAGCCTTA A
 
Protein sequence
MTSTPITTAD IAKGKIDDAL SVREKIGYGL GDAGGTVITC LIMNFLTFFY TDVFGLTPAL 
VGTLFIALRV FDAISDPVMG VIADRTQSRW GRFRPWQLWI AIPIGIIGIL TFTVPDASMG
VKIAWAFGTY LLLSVGYTAI NVPYCALINT MTTRHNEVIS CQSWRFVLCG VAGFLVSVGL
PWMVDLFGQG NAARGYQLGV GVLCAIAVVM FLCCFFWVRE RVPLSTMGKF TLREHLAGLR
NNDQLLLMLV MSFLLINVFN IRGGGYMYFI TYVLQGSTGY TSLFFTMVTF ASIIGSVIVS
PLTRRFDTVK IYYYTNLLLA ALAVLMWFLP SSPAYQTLWL AVILGNGVIL GFTLPLHFSL
MAFADDYGEW KTRVRSSGMN FAFNLFFIKL AWASSAGIIS LLFIFVAYQP GVENQTASSL
GGITAMETLL PALFHLLLAM AIRFCKLNNP MMSRIASDLR QRHVQP
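
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is not the method the database used. It assumes the standard codon assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons), and it stops at the first stop codon. The `translate` function and the variable names are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
first_groups = "ATGACCTCTA CACCGATTAC TACCGCTGAT"
print(translate(first_groups))  # MTSTPITTAD
```

The printed residues match the first ten residues of the protein sequence listed above.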