Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3772 |
Symbol | xylR |
ID | 5593642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 3764282 |
End bp | 3765460 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640922886 |
Product | xylose operon regulatory protein |
Protein accession | YP_001460364 |
Protein GI | 157163046 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 64 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTACTA AACGTCACCG CATCACATTA CTGTTCAATG CCAATAAAGC CTATGACCGG CAGGTAGTAG AAGGCGTAGG GGAATATTTA CAGGCGTCAC AATCGGAATG GGATATTTTC ATTGAAGAAG ATTTCCGCGC CCGCATTGAT AAAATCAAGG ACTGGTTAGG AGATGGCGTC ATTGCCGACT TCGACGACAA ACAGATCGAG CAAGCGCTGG CTGATGTCGA CGTCCCCATT GTTGGGGTTG GCGGCTCGTA TCACCTTGCA GAAAGTTACC CACCCGTTCA TTACATTGCC ACCGATAACT ATGCGCTGGT TGAAAGCGCA TTTTTGCATT TAAAAGAGAA AGGCGTTAAC CGCTTTGCTT TTTATGGTCT TCCGGAATCA AGCGGCAAAC GTTGGGCCAC TGAGCGCGAA TATGCATTTC GTCAGCTTGT CGCCGAAGAA AAGTATCGCG GAGTGGTTTA TCAGGGGTTA GAAACCGCGC CAGAGAACTG GCAACACGCG CAAAATCGGC TGGCAGACTG GCTACAAACG CTACCACCGC AAACCGGGAT TATTGCCGTT ACTGACGCCC GAGCGCGGCA TATTCTGCAA GTATGTGAAC ATCTACATAT TCCCGTACCG GAAAAATTAT GCGTGATTGG CATCGATAAC GAAGAACTGA CCCGCTATCT GTCGCGTGTC GCCCTTTCTT CGGTCGCTCA GGGCGCGCGG CAAATGGGCT ATCAGGCGGC AAAACTGTTG CATCGATTAT TAGATAAAGA AGAAATGCCG CTACAGCGAA TTTTGGTCCC ACCAGTTCGC GTCATTGAAC GGCGCTCAAC AGATTATCGC TCGCTGACCG ATCCCGCCGT TATTCAGGCC ATGCATTACA TTCGTAATCA CGCCTGTAAA GGGATTAAAG TGGATCAGGT ACTGGATGCG GTCGGGATCT CGCGCTCCAA TCTTGAGAAG CGTTTTAAAG AAGAGGTGGG TGAAACCATC CATGCCATGA TTCATGCCGA GAAGCTGGAG AAAGCGCGCA GTCTGCTGAT TTCAACCACC TTGTCGATCA ATGAGATATC GCAAATGTGC GGTTATCCAT CGCTGCAATA TTTCTACTCT GTTTTTAAAA AAGCATATGA CACGACGCCA AAAGAGTATC GCGATGTAAA TAGCGAGGTC ATGTTGTAG
|
Protein sequence | MFTKRHRITL LFNANKAYDR QVVEGVGEYL QASQSEWDIF IEEDFRARID KIKDWLGDGV IADFDDKQIE QALADVDVPI VGVGGSYHLA ESYPPVHYIA TDNYALVESA FLHLKEKGVN RFAFYGLPES SGKRWATERE YAFRQLVAEE KYRGVVYQGL ETAPENWQHA QNRLADWLQT LPPQTGIIAV TDARARHILQ VCEHLHIPVP EKLCVIGIDN EELTRYLSRV ALSSVAQGAR QMGYQAAKLL HRLLDKEEMP LQRILVPPVR VIERRSTDYR SLTDPAVIQA MHYIRNHACK GIKVDQVLDA VGISRSNLEK RFKEEVGETI HAMIHAEKLE KARSLLISTT LSINEISQMC GYPSLQYFYS VFKKAYDTTP KEYRDVNSEV ML
|
| |