Gene ECH74115_4942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4942 
SymbolxylF 
ID6971899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4580406 
End bp4581398 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content44% 
IMG OID643388625 
ProductD-xylose transporter subunit XylF 
Protein accessionYP_002273052 
Protein GI209400101 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID[TIGR02634] D-xylose ABC transporter, substrate-binding protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.0258127 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAA AGAACATTCT ACTCACCCTT TGCACCTCAC TCCTGCTTAC CAACGTTGCT 
GCACACGCCA AAGAAGTCAA AATAGGTATG GCGATTGATG ATCTCCGTCT TGAACGCTGG
CAAAAAGATC GAGATATCTT TGTGAAAAAG GCAGAATCTC TCGGCGCGAA AGTATTTGTA
CAGTCTGCAA ATGGCAATGA AGAAACACAA ATGTCGCAGA TTGAAAACAT GATAAACCGG
GGTGTCGATG TTCTTGTCAT TATTCCGTAT AACGGTCAGG TATTAAGTAA CGTTGTAAAA
GAAGCCAAAC AAGAAGGCAT TAAAGTATTA GCTTACGACC GTATGATTAA CGATGCGGAT
ATCGATTTTT ATATTTCTTT CGATAACGAA AAAGTCGGTG AACTGCAGGC AAAAGCCCTG
GTCGATATTG TTCCGCAAGG TAATTACTTC CTGATGGGCG GCTCGCCGGT AGATAACAAC
GCTAAGCTGT TCCGCGCCGG ACAAATGAAA GTGTTACAAC CTTACGTTGA TTCCGGAAAA
ATTAAAGTCG TTGGTGACCA ATGGGTTGAT GGCTGGTTAC CGGAAAACGC ATTGAAAATT
ATGGAAAACG CGCTAACCGC CAATAATAAC AAAATTGATG CTGTAGTTGC CTCAAACGAT
GCCACCGCAG GTGGGGCAAT TCAGGCATTA AGCGCGCAAG GTTTATCAGG GAAAGTAGCA
ATTTCTGGCC AGGATGCGGA TCTCGCAGGT ATTAAACGTA TTGCTGCCGG TACGCAAACT
ATGACGGTAT ATAAACCTAT TACATTGCTG GCAAATACTG CCGCAGAAAT TGCCGTTGAA
TTGGGCAATG GTCAGGAGCC AAAAGCAGAT ACTTCACTAA ATAATGGCCT GAAAGATGTC
CCCTCCCGCC TCCTGACACC GATCGATGTG AATAAAAACA ACATCAAAGA TACGGTAATT
AAAGACGGAT TCCACAAAGA GAGCGAGCTG TAA
 
Protein sequence
MKIKNILLTL CTSLLLTNVA AHAKEVKIGM AIDDLRLERW QKDRDIFVKK AESLGAKVFV 
QSANGNEETQ MSQIENMINR GVDVLVIIPY NGQVLSNVVK EAKQEGIKVL AYDRMINDAD
IDFYISFDNE KVGELQAKAL VDIVPQGNYF LMGGSPVDNN AKLFRAGQMK VLQPYVDSGK
IKVVGDQWVD GWLPENALKI MENALTANNN KIDAVVASND ATAGGAIQAL SAQGLSGKVA
ISGQDADLAG IKRIAAGTQT MTVYKPITLL ANTAAEIAVE LGNGQEPKAD TSLNNGLKDV
PSRLLTPIDV NKNNIKDTVI KDGFHKESEL