Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0020 |
Symbol | xlyP |
ID | 6145471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 24218 |
End bp | 25591 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641614921 |
Product | xylose-proton symporter |
Protein accession | YP_001742137 |
Protein GI | 170680613 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2211] Na+/melibiose symporter and related transporters |
TIGRFAM ID | [TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000106497 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTCCG TCATTGAAAC TACTCAATCA ACTTCATCCG ATTCCTTACC TTTAATACAG CGGATTAGCT ACGGCTCACT GGATGTTGCC GGTAATCTGC TCTACTGTTT TGGCTCTACC TACATTCTTT ATTTCTACAC CGACGTAGCA GGGATTAGTC TGGCGGTTGC AGGCATAATC CTACTGCTGG CGCGCATTGT AGACGGTATT GATGCGCCAG TGTGGGGAAT CATTATCGAT AAAACCCATT CGCGTTATGG AAAATGCCGT CCATGGTTTT TATGGTTACC ATTACCTTTT GCTGTATTTA GTGCGCTTTC TTTCTGGTCA CCCGATATCA GCATGACAGG GAAAGCTGTT TATGCAGCAA TATCGTACAT GTTAGCCAGC ATTCTGTTCA CTGGGTTGAA TACACCATTA AGCGCTATAT TACCGCTGAT GACGCTGTCA CCAAAGGAAC GCCTGGTATT AAACTCATAT CGCATGACTG GTGGGCAAAT TGGCGTACTA TTAATGAATG CGACAGCATT GCCATTGGTC GCTTTCCTTG GTAATGGTAA CGATAAAGCA GGCTTTATAT ATACTGCCAT AGTCTTTGCC GTAATATCCT GTGCCTTAAC GTTGTTTGCC TTTAAAAATA TTCGCGAACT GGATACCGAT AAAATACAGC AAGAACCACG ACTGCCGATG AAAAAGAGTT TTTCGGCAAT GAAAGGGAAC TGGCCGTGGC TCCTGATGGT AGTGGCGAAC CTTATCTTCT GGATTGCCCT ACAGCAGCGC AACACTACTA TTGTTTACTA TTTAACCTAC AATCTGGATC GCAAAGATCT GGTCCCGCTG GTTAACAGCC TCGCCACCAT TCAGATCCTG TTTATCATTG CCATTCCGTT CTTCAGTCGC TATCTCACCA AAACCTGGAT TTGGATTACC GGGCTGCTGG TTGCGATGTT AGGCGGGGGC CTCATGTGGC TGGCTGCCGA CAGCATCCCT TTGATGATCG CCGCCTGGGT ACTCGCCAAT ATCGGCAGCG GTATCGCCTG TTCTATGCCT TTCGCGATGC TCGGTTTCGC CGTTGACTTT GGTCGCTGGA AAACCGGCAT TAAAGCTACC GGCATTCTGA TCGCCTTCGG CAGCACCTTC TGCATCAAGA TGGGCAGTGG TATCGGTACA GCTTTCGCTG CCTTTATTAT GGACAGCTTT GGTTATATCC CCAACCAACA ACAGACGGCT GCGGGGTTGG AAGGTATCAG CCTGGCATTT ATCTGGGTCC CTGCGCTGCT CTTTGCTCTC GCGGCTGTAC CACTGCTCTT CTTTCGCCAG TATGAAGCGA TGGAAGGTCG TATCCAACAC GATCTGCAAG CCCACAATCG TTGA
|
Protein sequence | MSSVIETTQS TSSDSLPLIQ RISYGSLDVA GNLLYCFGST YILYFYTDVA GISLAVAGII LLLARIVDGI DAPVWGIIID KTHSRYGKCR PWFLWLPLPF AVFSALSFWS PDISMTGKAV YAAISYMLAS ILFTGLNTPL SAILPLMTLS PKERLVLNSY RMTGGQIGVL LMNATALPLV AFLGNGNDKA GFIYTAIVFA VISCALTLFA FKNIRELDTD KIQQEPRLPM KKSFSAMKGN WPWLLMVVAN LIFWIALQQR NTTIVYYLTY NLDRKDLVPL VNSLATIQIL FIIAIPFFSR YLTKTWIWIT GLLVAMLGGG LMWLAADSIP LMIAAWVLAN IGSGIACSMP FAMLGFAVDF GRWKTGIKAT GILIAFGSTF CIKMGSGIGT AFAAFIMDSF GYIPNQQQTA AGLEGISLAF IWVPALLFAL AAVPLLFFRQ YEAMEGRIQH DLQAHNR
|
| |