Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3892 |
Symbol | xylR |
ID | 6142912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3961137 |
End bp | 3962315 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641618718 |
Product | xylose operon regulatory protein |
Protein accession | YP_001745857 |
Protein GI | 170682140 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTACTA AACGTCACCG CATCACATTA CTGTTCAATG CCAATAAAGC CTATGACCGG CAGGTAGTAG AAGGCGTAGG GGAATATTTA CAGGCGTCAC AATCGGAATG GGATATTTTC ATTGAAGAAG ATTTCCGCGC CCGCATTGAT AAAATCAAGG ACTGGTTAGG AGATGGCGTC ATTGCCGACT TCGACGACAA ACAAATCGAG CAAGCGCTGG CTGATGTCGA CGTCCCCATT GTTGGGGTTG GTGGTTCGTA TCACCTTGCC GAAAGTTACC CACCCGTTCA TTACATTGCC ACCGATAACT ATGCGCTGGT TGAAAGCGCA TTTTTGCATT TAAAAGAGAA AGGCGTTAAC CGCTTTGCTT TTTATGGTCT TCCGGAATCA AGCGGCAAAC GTTGGGCCAC TGAACGCGAA TATGCATTTC GTCAGCTTGT CGCCGAAGAA AAGTATCGCG GAGTGGTTTA TCAGGGGTTA GAAACCGCGC CAGAGAACTG GCAACACGCG CAAAATCGGC TGGCAGACTG GCTACAAACG CTGCCACCGC AAACCGGGAT TATTGCCGTT ACTGACGCCC GGGCACGGCA TATTCTGCAA GTATGTGAAC ATCTACACAT TCCCGTACCG GAAAAATTAT GCGTGATTGG CATCGATAAC GAAGAACTGA CCCGCTATCT GTCGCGTGTC GCCCTTTCTT CGGTCGCTCA GGGCGCACGG CAAATGGGCT ATCAGGCGGC AAAACTGTTG CATCGATTAT TAGATAAAGA AGAAATGCCG CTACAGCGGA TTTTGGTCCC ACCAGTTCGC GTCATTGAAC GGCGCTCAAC AGATTACCGC TCGCTGACCG ATCCCGCCGT TATTCAGGCC ATGCATTACA TTCGTAATCA CGCCTGTAAA GGGATTAAAG TGGATCAGGT ACTGGATGCG GTGGGCATAT CACGCTCCAA TCTTGAGAAG CGTTTTAAAG AAGAGGTGGG TGAAACCATC CATGCAATGA TTCATGCCGA GAAGTTGGAG AAAGCGCGCA GTCTGCTGAT TTCAACCACC TTGTCGATCA ATGAGATATC GCAAATGTGC GGTTATCCAT CGCTGCAATA TTTCTACTCT GTTTTTAAAA AAGCATATGA CACGACGCCA AAAGAGTATC GCGATGTAAA TAGCGAGGTC ATGTTGTAG
|
Protein sequence | MFTKRHRITL LFNANKAYDR QVVEGVGEYL QASQSEWDIF IEEDFRARID KIKDWLGDGV IADFDDKQIE QALADVDVPI VGVGGSYHLA ESYPPVHYIA TDNYALVESA FLHLKEKGVN RFAFYGLPES SGKRWATERE YAFRQLVAEE KYRGVVYQGL ETAPENWQHA QNRLADWLQT LPPQTGIIAV TDARARHILQ VCEHLHIPVP EKLCVIGIDN EELTRYLSRV ALSSVAQGAR QMGYQAAKLL HRLLDKEEMP LQRILVPPVR VIERRSTDYR SLTDPAVIQA MHYIRNHACK GIKVDQVLDA VGISRSNLEK RFKEEVGETI HAMIHAEKLE KARSLLISTT LSINEISQMC GYPSLQYFYS VFKKAYDTTP KEYRDVNSEV ML
|
| |