Gene ECH74115_4945 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4945 
SymbolxylR 
ID6970086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4584254 
End bp4585432 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content49% 
IMG OID643388628 
Productxylose operon regulatory protein 
Protein accessionYP_002273055 
Protein GI209398178 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators
[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.462654 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.028978 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTACTA AACGTCACCG CATCACATTA CTGTTCAATG CCAATAAAGC CTATGACCGG 
CAGGTAGTAG AAGGCGTAGG GGAATATTTA CAGGCGTCAC AATCGGAATG GGATATTTTC
ATTGAAGAAG ATTTCCGCGC CCGCATTGAT AAAATCAAGG ACTGGTTAGG AGATGGCGTC
ATTGCCGACT TCGACGACAA ACAGATCGAG CAAGCGCTGG CTGATGTCGA CGTCCCCATT
GTTGGGGTTG GCGGCTCGTA TCACCTTGCA GAAAGTTACC CACCCGTTCA TTACATTGCC
ACCGATAACT ATGCGCTGGT TGAAAGCGCA TTTTTGCATT TAAAAGAGAA AGGCGTTAAC
CGCTTTGCTT TTTATGGTCT TCCGGAATCA AGCGGCAAAC GTTGGGCCAC TGAACGCGAA
TATGCATTTC GTCAGCTTGT CGCCGAAGAA AAGTATCGCG GAGTGGTTTA TCAGGGGTTA
GAAACCGCAC CAGAGAACTG GCAACACGCG CAAAATCGGC TGGCAGACTG GCTACAAACA
CTGCCACCGC AAACCGGGAT TATTGCCGTT ACTGACGCCC GGGCGCGGCA TATTCTGCAA
GTATGTGAAC ATCTACACAT TCCCGTACCG GAAAAATTAT GCGTGATTGG CATCGATAAC
GAAGAACTGA CCCGCTATCT GTCGCGTGTC GCCCTTTCTT CGGTCGCTCA GGGCGCGCGG
CAAATGGGCT ATCAGGCGGC AAAACTGTTG CATCGATTAT TAGATAAAGA AGAAATGCCG
CTACAGCGGA TTTTGGTCCC TCCAGTTCGC GTCATTGAAC GGCGCTCAAC AGATTACCGT
TCGCTGACCG ATCCCGCCGT TATTCAGGCC ATGCATTACA TTCGTAATCA CGCCTGTAAA
GGGATTAAAG TGGATCAGGT ACTCGATGCG GTCGGGATCT CGCGCTCCAA TCTTGAGAAG
CGTTTTAAAG AAGAGGTGGG TGAAACCATC CATGCCATGA TTCATGCTGA GAAGCTGGAG
AAAGCGCGCA GTCTGCTGAT TTCAACCACC TTGTCGATCA ATGAGATATC GCAAATGTGC
GGTTATCCAT CGCTGCAATA TTTCTACTCT GTTTTTAAAA AAGCATATGA CACGACGCCA
AAAGAGTATC GCGATGTAAA TAGCGAGGTC ATGTTGTAG
 
Protein sequence
MFTKRHRITL LFNANKAYDR QVVEGVGEYL QASQSEWDIF IEEDFRARID KIKDWLGDGV 
IADFDDKQIE QALADVDVPI VGVGGSYHLA ESYPPVHYIA TDNYALVESA FLHLKEKGVN
RFAFYGLPES SGKRWATERE YAFRQLVAEE KYRGVVYQGL ETAPENWQHA QNRLADWLQT
LPPQTGIIAV TDARARHILQ VCEHLHIPVP EKLCVIGIDN EELTRYLSRV ALSSVAQGAR
QMGYQAAKLL HRLLDKEEMP LQRILVPPVR VIERRSTDYR SLTDPAVIQA MHYIRNHACK
GIKVDQVLDA VGISRSNLEK RFKEEVGETI HAMIHAEKLE KARSLLISTT LSINEISQMC
GYPSLQYFYS VFKKAYDTTP KEYRDVNSEV ML