Gene EcSMS35_0020 details

Gene Information

Locus tag: EcSMS35_0020
Symbol: xlyP
ID: 6145471
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 24218
End bp: 25591
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 48%
IMG OID: 641614921
Product: xylose-proton symporter
Protein accession: YP_001742137
Protein GI: 170680613
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2211] Na+/melibiose symporter and related transporters
TIGRFAM ID: [TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter
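
The coordinate and length fields in this record agree with each other, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the coding sequence ends with a single stop codon (both are assumptions; the record does not state its coordinate convention). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(24218, 25591)
print(length)                  # 1374
print(protein_length(length))  # 457
```

Both values match the Gene Length and Protein Length fields listed in the record.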


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 1.06497e-05
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTCCG TCATTGAAAC TACTCAATCA ACTTCATCCG ATTCCTTACC TTTAATACAG 
CGGATTAGCT ACGGCTCACT GGATGTTGCC GGTAATCTGC TCTACTGTTT TGGCTCTACC
TACATTCTTT ATTTCTACAC CGACGTAGCA GGGATTAGTC TGGCGGTTGC AGGCATAATC
CTACTGCTGG CGCGCATTGT AGACGGTATT GATGCGCCAG TGTGGGGAAT CATTATCGAT
AAAACCCATT CGCGTTATGG AAAATGCCGT CCATGGTTTT TATGGTTACC ATTACCTTTT
GCTGTATTTA GTGCGCTTTC TTTCTGGTCA CCCGATATCA GCATGACAGG GAAAGCTGTT
TATGCAGCAA TATCGTACAT GTTAGCCAGC ATTCTGTTCA CTGGGTTGAA TACACCATTA
AGCGCTATAT TACCGCTGAT GACGCTGTCA CCAAAGGAAC GCCTGGTATT AAACTCATAT
CGCATGACTG GTGGGCAAAT TGGCGTACTA TTAATGAATG CGACAGCATT GCCATTGGTC
GCTTTCCTTG GTAATGGTAA CGATAAAGCA GGCTTTATAT ATACTGCCAT AGTCTTTGCC
GTAATATCCT GTGCCTTAAC GTTGTTTGCC TTTAAAAATA TTCGCGAACT GGATACCGAT
AAAATACAGC AAGAACCACG ACTGCCGATG AAAAAGAGTT TTTCGGCAAT GAAAGGGAAC
TGGCCGTGGC TCCTGATGGT AGTGGCGAAC CTTATCTTCT GGATTGCCCT ACAGCAGCGC
AACACTACTA TTGTTTACTA TTTAACCTAC AATCTGGATC GCAAAGATCT GGTCCCGCTG
GTTAACAGCC TCGCCACCAT TCAGATCCTG TTTATCATTG CCATTCCGTT CTTCAGTCGC
TATCTCACCA AAACCTGGAT TTGGATTACC GGGCTGCTGG TTGCGATGTT AGGCGGGGGC
CTCATGTGGC TGGCTGCCGA CAGCATCCCT TTGATGATCG CCGCCTGGGT ACTCGCCAAT
ATCGGCAGCG GTATCGCCTG TTCTATGCCT TTCGCGATGC TCGGTTTCGC CGTTGACTTT
GGTCGCTGGA AAACCGGCAT TAAAGCTACC GGCATTCTGA TCGCCTTCGG CAGCACCTTC
TGCATCAAGA TGGGCAGTGG TATCGGTACA GCTTTCGCTG CCTTTATTAT GGACAGCTTT
GGTTATATCC CCAACCAACA ACAGACGGCT GCGGGGTTGG AAGGTATCAG CCTGGCATTT
ATCTGGGTCC CTGCGCTGCT CTTTGCTCTC GCGGCTGTAC CACTGCTCTT CTTTCGCCAG
TATGAAGCGA TGGAAGGTCG TATCCAACAC GATCTGCAAG CCCACAATCG TTGA
 
Protein sequence
MSSVIETTQS TSSDSLPLIQ RISYGSLDVA GNLLYCFGST YILYFYTDVA GISLAVAGII 
LLLARIVDGI DAPVWGIIID KTHSRYGKCR PWFLWLPLPF AVFSALSFWS PDISMTGKAV
YAAISYMLAS ILFTGLNTPL SAILPLMTLS PKERLVLNSY RMTGGQIGVL LMNATALPLV
AFLGNGNDKA GFIYTAIVFA VISCALTLFA FKNIRELDTD KIQQEPRLPM KKSFSAMKGN
WPWLLMVVAN LIFWIALQQR NTTIVYYLTY NLDRKDLVPL VNSLATIQIL FIIAIPFFSR
YLTKTWIWIT GLLVAMLGGG LMWLAADSIP LMIAAWVLAN IGSGIACSMP FAMLGFAVDF
GRWKTGIKAT GILIAFGSTF CIKMGSGIGT AFAAFIMDSF GYIPNQQQTA AGLEGISLAF
IWVPALLFAL AAVPLLFFRQ YEAMEGRIQH DLQAHNR
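
The two sequence blocks are printed in groups of 10 characters, so the whitespace has to be stripped before they can be analyzed. The sketch below shows one way to do that, compute GC content, and translate the gene sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in its set of permitted start codons, which this sketch does not model). The helper names `clean`, `gc_content`, and `translate` are hypothetical and do not come from the source database.

```python
import itertools

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAGTTCCG TCATTGAAAC TACTCAATCA ACTTCATCCG ATTCCTTACC TTTAATACAG"
print(len(clean(first_line)))  # 60
print(translate(first_line))   # MSSVIETTQSTSSDSLPLIQ
```

The translated fragment matches the first 20 residues of the protein sequence listed in this record. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.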