Gene EcSMS35_3892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3892 
SymbolxylR 
ID6142912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3961137 
End bp3962315 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content48% 
IMG OID641618718 
Productxylose operon regulatory protein 
Protein accessionYP_001745857 
Protein GI170682140 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators
[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTACTA AACGTCACCG CATCACATTA CTGTTCAATG CCAATAAAGC CTATGACCGG 
CAGGTAGTAG AAGGCGTAGG GGAATATTTA CAGGCGTCAC AATCGGAATG GGATATTTTC
ATTGAAGAAG ATTTCCGCGC CCGCATTGAT AAAATCAAGG ACTGGTTAGG AGATGGCGTC
ATTGCCGACT TCGACGACAA ACAAATCGAG CAAGCGCTGG CTGATGTCGA CGTCCCCATT
GTTGGGGTTG GTGGTTCGTA TCACCTTGCC GAAAGTTACC CACCCGTTCA TTACATTGCC
ACCGATAACT ATGCGCTGGT TGAAAGCGCA TTTTTGCATT TAAAAGAGAA AGGCGTTAAC
CGCTTTGCTT TTTATGGTCT TCCGGAATCA AGCGGCAAAC GTTGGGCCAC TGAACGCGAA
TATGCATTTC GTCAGCTTGT CGCCGAAGAA AAGTATCGCG GAGTGGTTTA TCAGGGGTTA
GAAACCGCGC CAGAGAACTG GCAACACGCG CAAAATCGGC TGGCAGACTG GCTACAAACG
CTGCCACCGC AAACCGGGAT TATTGCCGTT ACTGACGCCC GGGCACGGCA TATTCTGCAA
GTATGTGAAC ATCTACACAT TCCCGTACCG GAAAAATTAT GCGTGATTGG CATCGATAAC
GAAGAACTGA CCCGCTATCT GTCGCGTGTC GCCCTTTCTT CGGTCGCTCA GGGCGCACGG
CAAATGGGCT ATCAGGCGGC AAAACTGTTG CATCGATTAT TAGATAAAGA AGAAATGCCG
CTACAGCGGA TTTTGGTCCC ACCAGTTCGC GTCATTGAAC GGCGCTCAAC AGATTACCGC
TCGCTGACCG ATCCCGCCGT TATTCAGGCC ATGCATTACA TTCGTAATCA CGCCTGTAAA
GGGATTAAAG TGGATCAGGT ACTGGATGCG GTGGGCATAT CACGCTCCAA TCTTGAGAAG
CGTTTTAAAG AAGAGGTGGG TGAAACCATC CATGCAATGA TTCATGCCGA GAAGTTGGAG
AAAGCGCGCA GTCTGCTGAT TTCAACCACC TTGTCGATCA ATGAGATATC GCAAATGTGC
GGTTATCCAT CGCTGCAATA TTTCTACTCT GTTTTTAAAA AAGCATATGA CACGACGCCA
AAAGAGTATC GCGATGTAAA TAGCGAGGTC ATGTTGTAG
 
Protein sequence
MFTKRHRITL LFNANKAYDR QVVEGVGEYL QASQSEWDIF IEEDFRARID KIKDWLGDGV 
IADFDDKQIE QALADVDVPI VGVGGSYHLA ESYPPVHYIA TDNYALVESA FLHLKEKGVN
RFAFYGLPES SGKRWATERE YAFRQLVAEE KYRGVVYQGL ETAPENWQHA QNRLADWLQT
LPPQTGIIAV TDARARHILQ VCEHLHIPVP EKLCVIGIDN EELTRYLSRV ALSSVAQGAR
QMGYQAAKLL HRLLDKEEMP LQRILVPPVR VIERRSTDYR SLTDPAVIQA MHYIRNHACK
GIKVDQVLDA VGISRSNLEK RFKEEVGETI HAMIHAEKLE KARSLLISTT LSINEISQMC
GYPSLQYFYS VFKKAYDTTP KEYRDVNSEV ML