Gene EcSMS35_3889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3889 
SymbolxylF 
ID6145527 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3957289 
End bp3958281 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content44% 
IMG OID641618715 
ProductD-xylose transporter subunit XylF 
Protein accessionYP_001745854 
Protein GI170682540 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID[TIGR02634] D-xylose ABC transporter, substrate-binding protein 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAA AGAACATTCT ACTCACCCTT TGCACCTCAC TTCTGCTTAC CAACGTTGCG 
GCTCACGCCA AAGAAGTCAA AATAGGTATG GCGATTGATG ATCTCCGTCT TGAACGCTGG
CAAAAAGATC GAGATATTTT TGTGAAAAAG GCAGAATCTC TCGGCGCGAA AGTATTTGTA
CAGTCTGCAA ATGGCAATGA AGAAACACAA ATGTCGCAGA TTGAAAACAT GATTAACCGG
GGCGTCGATG TTCTTGTTAT TATTCCGTAT AACGGTCAGG TATTAAGTAA CGTTGTAAAA
GAAGCCAAAC AAGAAGGTAT TAAAGTATTA GCTTACGACC GTATGATTAA CGATGCGGAT
ATCGATTTTT ATATTTCTTT CGATAACGAA AAAGTCGGCG AACTGCAGGC AAAAGCCCTG
GTCGATATTG TTCCGCAAGG TAATTACTTC CTGATGGGCG GCTCGCCGGT AGATAACAAC
GCCAAGCTGT TCCGCGCCGG GCAAATGAAA GTATTAAAAC CTTATGTTGA TTCCGGAAAA
ATTAAAGTCG TTGGTGACCA ATGGGTTGAT GGCTGGTTAC CGGAAAACGC ATTGAAAATT
ATGGAAAACG CGCTAACCGC CAATAATAAC AAAATTGATG CTGTAGTTGC CTCAAACGAT
GCCACCGCAG GAGGAGCAAT TCAGGCATTA AGCGCGCAAG GTTTATCAGG GAAAGTAGCA
ATTTCCGGCC AGGATGCGGA TCTTGCAGGT ATTAAACGTA TTGCTGCCGG TACGCAAACT
ATGACGGTGT ATAAACCTAT TACATTGTTG GCAAATACTG CCGCAGAAAT TGCCGTTGAG
TTGGGCAATG GTCAGGAGCC AAAAGCGGAT ACCACACTGA ATAATGGCCT GAAAGATGTT
CCCTCCCGCC TGCTGACACC GATCGATGTG AATAAAAACA ACATCAAAGA TACGGTAGTT
AAAGACGGAT TCCACAAAGA GAGCGAGCTG TAA
 
Protein sequence
MKIKNILLTL CTSLLLTNVA AHAKEVKIGM AIDDLRLERW QKDRDIFVKK AESLGAKVFV 
QSANGNEETQ MSQIENMINR GVDVLVIIPY NGQVLSNVVK EAKQEGIKVL AYDRMINDAD
IDFYISFDNE KVGELQAKAL VDIVPQGNYF LMGGSPVDNN AKLFRAGQMK VLKPYVDSGK
IKVVGDQWVD GWLPENALKI MENALTANNN KIDAVVASND ATAGGAIQAL SAQGLSGKVA
ISGQDADLAG IKRIAAGTQT MTVYKPITLL ANTAAEIAVE LGNGQEPKAD TTLNNGLKDV
PSRLLTPIDV NKNNIKDTVV KDGFHKESEL