Gene Information |
Locus tag | EcSMS35_3889 |
Symbol | xylF |
ID | 6145527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3957289 |
End bp | 3958281 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641618715 |
Product | D-xylose transporter subunit XylF |
Protein accession | YP_001745854 |
Protein GI | 170682540 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4213] ABC-type xylose transport system, periplasmic component |
TIGRFAM ID | [TIGR02634] D-xylose ABC transporter, substrate-binding protein |
|
|
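The length fields in the table above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed 993 bp) and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    length = gene_length_bp(3957289, 3958281)
    print(length, protein_length_aa(length))
```

With the values from the record, this reproduces the Gene Length and Protein Length rows.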
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAA AGAACATTCT ACTCACCCTT TGCACCTCAC TTCTGCTTAC CAACGTTGCG GCTCACGCCA AAGAAGTCAA AATAGGTATG GCGATTGATG ATCTCCGTCT TGAACGCTGG CAAAAAGATC GAGATATTTT TGTGAAAAAG GCAGAATCTC TCGGCGCGAA AGTATTTGTA CAGTCTGCAA ATGGCAATGA AGAAACACAA ATGTCGCAGA TTGAAAACAT GATTAACCGG GGCGTCGATG TTCTTGTTAT TATTCCGTAT AACGGTCAGG TATTAAGTAA CGTTGTAAAA GAAGCCAAAC AAGAAGGTAT TAAAGTATTA GCTTACGACC GTATGATTAA CGATGCGGAT ATCGATTTTT ATATTTCTTT CGATAACGAA AAAGTCGGCG AACTGCAGGC AAAAGCCCTG GTCGATATTG TTCCGCAAGG TAATTACTTC CTGATGGGCG GCTCGCCGGT AGATAACAAC GCCAAGCTGT TCCGCGCCGG GCAAATGAAA GTATTAAAAC CTTATGTTGA TTCCGGAAAA ATTAAAGTCG TTGGTGACCA ATGGGTTGAT GGCTGGTTAC CGGAAAACGC ATTGAAAATT ATGGAAAACG CGCTAACCGC CAATAATAAC AAAATTGATG CTGTAGTTGC CTCAAACGAT GCCACCGCAG GAGGAGCAAT TCAGGCATTA AGCGCGCAAG GTTTATCAGG GAAAGTAGCA ATTTCCGGCC AGGATGCGGA TCTTGCAGGT ATTAAACGTA TTGCTGCCGG TACGCAAACT ATGACGGTGT ATAAACCTAT TACATTGTTG GCAAATACTG CCGCAGAAAT TGCCGTTGAG TTGGGCAATG GTCAGGAGCC AAAAGCGGAT ACCACACTGA ATAATGGCCT GAAAGATGTT CCCTCCCGCC TGCTGACACC GATCGATGTG AATAAAAACA ACATCAAAGA TACGGTAGTT AAAGACGGAT TCCACAAAGA GAGCGAGCTG TAA
|
Protein sequence | MKIKNILLTL CTSLLLTNVA AHAKEVKIGM AIDDLRLERW QKDRDIFVKK AESLGAKVFV QSANGNEETQ MSQIENMINR GVDVLVIIPY NGQVLSNVVK EAKQEGIKVL AYDRMINDAD IDFYISFDNE KVGELQAKAL VDIVPQGNYF LMGGSPVDNN AKLFRAGQMK VLKPYVDSGK IKVVGDQWVD GWLPENALKI MENALTANNN KIDAVVASND ATAGGAIQAL SAQGLSGKVA ISGQDADLAG IKRIAAGTQT MTVYKPITLL ANTAAEIAVE LGNGQEPKAD TTLNNGLKDV PSRLLTPIDV NKNNIKDTVV KDGFHKESEL
|
| |
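The sequences above are printed in 10-character blocks separated by spaces. A small sketch of how the gene sequence could be normalized and sanity-checked before use, assuming translation table 11 stop codons (TAA, TAG, TGA); the helper names are hypothetical, and the GC function only shows how a figure like the listed GC content could be recomputed:

```python
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}  # stop codons of the bacterial genetic code


def normalize(blocked_seq: str) -> str:
    """Remove the spacing between display blocks and upper-case the sequence."""
    return "".join(blocked_seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a normalized nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS_TABLE_11


if __name__ == "__main__":
    # Toy input only; paste the full "Gene sequence" field to check the real record.
    toy = normalize("ATGAAAATAA AG TAA")
    print(len(toy), looks_like_cds(toy), round(gc_percent(toy), 1))
```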