Gene Information |
Locus tag | ECH74115_4945 |
Symbol | xylR |
ID | 6970086 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4584254 |
End bp | 4585432 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643388628 |
Product | xylose operon regulatory protein |
Protein accession | YP_002273055 |
Protein GI | 209398178 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators; [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
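The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch makes explicit. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is complete and ends in a stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(4584254, 4585432)
print(length)                  # 1179
print(protein_length(length))  # 392
```

Both results agree with the Gene Length and Protein Length rows of this record.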
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.462654 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.028978 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTACTA AACGTCACCG CATCACATTA CTGTTCAATG CCAATAAAGC CTATGACCGG CAGGTAGTAG AAGGCGTAGG GGAATATTTA CAGGCGTCAC AATCGGAATG GGATATTTTC ATTGAAGAAG ATTTCCGCGC CCGCATTGAT AAAATCAAGG ACTGGTTAGG AGATGGCGTC ATTGCCGACT TCGACGACAA ACAGATCGAG CAAGCGCTGG CTGATGTCGA CGTCCCCATT GTTGGGGTTG GCGGCTCGTA TCACCTTGCA GAAAGTTACC CACCCGTTCA TTACATTGCC ACCGATAACT ATGCGCTGGT TGAAAGCGCA TTTTTGCATT TAAAAGAGAA AGGCGTTAAC CGCTTTGCTT TTTATGGTCT TCCGGAATCA AGCGGCAAAC GTTGGGCCAC TGAACGCGAA TATGCATTTC GTCAGCTTGT CGCCGAAGAA AAGTATCGCG GAGTGGTTTA TCAGGGGTTA GAAACCGCAC CAGAGAACTG GCAACACGCG CAAAATCGGC TGGCAGACTG GCTACAAACA CTGCCACCGC AAACCGGGAT TATTGCCGTT ACTGACGCCC GGGCGCGGCA TATTCTGCAA GTATGTGAAC ATCTACACAT TCCCGTACCG GAAAAATTAT GCGTGATTGG CATCGATAAC GAAGAACTGA CCCGCTATCT GTCGCGTGTC GCCCTTTCTT CGGTCGCTCA GGGCGCGCGG CAAATGGGCT ATCAGGCGGC AAAACTGTTG CATCGATTAT TAGATAAAGA AGAAATGCCG CTACAGCGGA TTTTGGTCCC TCCAGTTCGC GTCATTGAAC GGCGCTCAAC AGATTACCGT TCGCTGACCG ATCCCGCCGT TATTCAGGCC ATGCATTACA TTCGTAATCA CGCCTGTAAA GGGATTAAAG TGGATCAGGT ACTCGATGCG GTCGGGATCT CGCGCTCCAA TCTTGAGAAG CGTTTTAAAG AAGAGGTGGG TGAAACCATC CATGCCATGA TTCATGCTGA GAAGCTGGAG AAAGCGCGCA GTCTGCTGAT TTCAACCACC TTGTCGATCA ATGAGATATC GCAAATGTGC GGTTATCCAT CGCTGCAATA TTTCTACTCT GTTTTTAAAA AAGCATATGA CACGACGCCA AAAGAGTATC GCGATGTAAA TAGCGAGGTC ATGTTGTAG
|
Protein sequence | MFTKRHRITL LFNANKAYDR QVVEGVGEYL QASQSEWDIF IEEDFRARID KIKDWLGDGV IADFDDKQIE QALADVDVPI VGVGGSYHLA ESYPPVHYIA TDNYALVESA FLHLKEKGVN RFAFYGLPES SGKRWATERE YAFRQLVAEE KYRGVVYQGL ETAPENWQHA QNRLADWLQT LPPQTGIIAV TDARARHILQ VCEHLHIPVP EKLCVIGIDN EELTRYLSRV ALSSVAQGAR QMGYQAAKLL HRLLDKEEMP LQRILVPPVR VIERRSTDYR SLTDPAVIQA MHYIRNHACK GIKVDQVLDA VGISRSNLEK RFKEEVGETI HAMIHAEKLE KARSLLISTT LSINEISQMC GYPSLQYFYS VFKKAYDTTP KEYRDVNSEV ML
|
| |
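The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The sketch below is one way to do that in plain Python, under stated assumptions: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (table 11 differs in its set of permitted start codons, which does not matter here because this gene begins with ATG). The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Standard amino acid assignments in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 21 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGTTTACTA AACGTCACCG C"))  # MFTKRHR
print(translate("AGCGAGGTC ATGTTGTAG"))      # SEVML
```

The two fragments reproduce the first seven and last five residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content rows to be checked in the same way.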