Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3891 |
Symbol | xylH |
ID | 6147286 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3959878 |
End bp | 3961059 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618717 |
Product | D-xylose ABC transporter, permease protein |
Protein accession | YP_001745856 |
Protein GI | 170683622 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4214] ABC-type xylose transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAAAA GCAATCCGTC TGAAGTGAAA TTAGCCGTAC CGACATCCGG TGGCTTTTCC GGGCTGAAAT CACTGAATTT GCAGGTCTTC GTGATGATTG CAGCTATCAT CGCAATCATG CTGTTCTTTA CCTGGACCAC CGATGGTGCC TACTTAAGCG CCCGTAACGT CTCCAACCTG TTACGCCAAA CCGCGATTAC CGGCATCCTC GCGGTAGGAA TGGTGTTCGT CATAATTTCT GCTGAAATCG ACCTTTCCGT CGGCTCAATG ATGGGGCTGT TAGGTGGCGT CGCGGCGATT TGTGACGTCT GGCTAGGCTG GCCTTTGCCA CTTACCATCA TTGTGACTTT GGTTCTGGGA CTGCTTCTCG GTGCCTGGAA CGGATGGTGG GTCGCGTACC GCAAAGTCCC TTCATTTATT GTCACCCTCG CGGGCATGTT GGCATTTCGC GGCATACTCA TTGGCATCAC CAACGGCACG ACTGTTTCCC CCACCAGCGC CGCGATGTCA CAAATCGGGC AAAGCTATCT ACCCGCCAGC ACCGGCTTCA TCATTGGCGC GCTTGGCTTA ATGGCTTTTG TTGGCTGGCA ATGGCGCGGA AGAATGCGCC GTCAGGCTTT GGGTTTGCAG TCTCCGGCTT CTACCGCAGT AGTCGGTCGC CAGGCTTTAA CCGCTATCAT CGTATTAGGC GCAATCTGGC TGTTGAATGA TTACCGTGGC GTTCCCACTC CTGTTCTGCT GCTGACGTTG CTGTTACTCG GCGGAATGTT TATGGCAACG CGGACGGCAT TTGGACGACG CATTTATGCC ATCGGCGGCA ATCTGGAAGC AGCACGGCTC TCCGGGATTA ACGTTGAACG CACCAAACTT GCCGTGTTCG CTATTAACGG ATTAATGGTA GCCATCGCCG GATTAATCCT CAGTTCTCGA CTTGGCGCTG GTTCACCTTC TGCGGGAAAT ATCGCCGAAC TGGACGCAAT TGCCGCATGT GTGATTGGCG GCACCAGTCT GGCAGGCGGC GTCGGGAGCG TGGCCGGAGC GGTAATGGGG GCATTCATCA TGGCTTCACT GGATAACGGC ATGAGTATGA TGGATGTACC GACCTTCTGG CAGTATATCG TTAAAGGTGC GATTCTGTTG CTGGCAGTAT GGATGGACTC CGCAACCAAA CGCCGTTCTT GA
|
Protein sequence | MSKSNPSEVK LAVPTSGGFS GLKSLNLQVF VMIAAIIAIM LFFTWTTDGA YLSARNVSNL LRQTAITGIL AVGMVFVIIS AEIDLSVGSM MGLLGGVAAI CDVWLGWPLP LTIIVTLVLG LLLGAWNGWW VAYRKVPSFI VTLAGMLAFR GILIGITNGT TVSPTSAAMS QIGQSYLPAS TGFIIGALGL MAFVGWQWRG RMRRQALGLQ SPASTAVVGR QALTAIIVLG AIWLLNDYRG VPTPVLLLTL LLLGGMFMAT RTAFGRRIYA IGGNLEAARL SGINVERTKL AVFAINGLMV AIAGLILSSR LGAGSPSAGN IAELDAIAAC VIGGTSLAGG VGSVAGAVMG AFIMASLDNG MSMMDVPTFW QYIVKGAILL LAVWMDSATK RRS
|
| |