Gene ECH74115_1968 details


Gene Information

Locus tag: ECH74115_1968
Symbol: tyrR
ID: 6968942
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1861549
End bp: 1863090
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 52%
IMG OID: 643385894
Product: DNA-binding transcriptional regulator TyrR
Protein accession: YP_002270383
Protein GI: 209397678
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG3283] Transcriptional regulator of aromatic amino acids metabolism
TIGRFAM ID: [TIGR00229] PAS domain S-box
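
The coordinate and length fields above are linked by simple arithmetic. Assuming the start and end coordinates are 1-based and inclusive (an assumption about this record's convention, though it agrees with the reported values), the gene length is End - Start + 1, and a complete CDS encodes one residue fewer than it has codons because the last codon is a stop. A minimal Python sketch of that check, using hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature whose coordinates are inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1861549, 1863090)
print(length, protein_length(length))  # prints: 1542 513
```

Both results match the Gene Length and Protein Length fields.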


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.337458
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCTGG AAGTCTTTTG TGAAGACCGA CTCGGTCTGA CCCGCGAATT ACTCGATCTA 
CTCGTGCTAA GAGGCATTGA TTTACGCGGT ATTGAGATTG ATCCCATTGG GCGAATCTAC
CTCAATTTTG CTGAACTGGA GTTTGAGAGT TTCAGCAGTC TGATGGCCGA AATACGCCGT
ATTGCAGGTG TTACCGATGT GCGTACTGTC CCGTGGATGC CTTCCGAACG TGAGCATCTG
GCGTTGAGCG CGTTACTGGA GGCGTTGCCT GAACCTGTGC TCTCTGTCGA TATGAAAAGC
AAAGTGGATA TGGCGAACCC GGCGAGCTGT CAGCTTTTTG GGCAAAAATT GGATCGACTG
CGCAACCATA CCGCCGCACA ATTGATTAAC GGCTTTAATT TTTTACGTTG GCTGGAAAGC
GAACCGCAAG ATTCCCATAA CGAGCATGTT GTTATTAATG GACAGAACTT CCTGATGGAG
ATTACGCCTG TTTATCTTCA GGATGAAAAT GATCAACACG TCCTGACAGG TGCGGTGGTG
ATGTTGCGAT CAACGATTCG TATGGGCCGT CAGTTGCAAA ATGTCGCCGC CCAGGACGTC
AGCGCCTTCA GTCAAATTGT CGCTGTCAGC CCGAAAATGA AGCATGTTGT CGAACAGGCG
CAGAAACTGG CGATGCTAAG CGCGCCGCTG CTGATTACGG GTGACACAGG TACAGGTAAA
GATCTTTTTG CCTACGCCTG CCATCAGGCA AGCCCAAGAG CGAGCAAACC TTACCTGGCG
CTGAACTGTG CGTCTATACC GGAAGATGCG GTCGAGAGTG AGCTGTTTGG TCATGCTCCG
GAAGGGAAGA AAGGATTCTT TGAGCAGGCG AACGGTGGTT CGGTGCTGTT GGATGAAATA
GGTGAGATGT CACCACGGAT GCAGGCGAAA TTACTGCGTT TCCTTAATGA TGGCACTTTC
CGTCGGGTTG GCGAAGACCA TGAGGTGCAT GTCGATGTGC GGGTGATTTG CGCTACGCAG
AAGAATCTGG TCGAACTGGT GCAAAAAGGC ATGTTCCGTG AAGATCTCTA TTATCGTCTG
AACGTGTTGA CGCTCAACCT GCCGCCGCTA CGTGACTGTC CGCAGGACAT CATGCCGTTA
ACTGAGCTGT TCGTCGCCCG CTTTGCCGAC GAGCAGGGCG TGCCGCGTCC GAAACTGGCC
GCTGACCTGA ATACTGTACT TACGCGTTAT GCGTGGCCGG GAAATGTGCG GCAGTTAAAG
AACGCTATCT ATCGCGCACT GACACAACTG GACGGTTATG AGCTGCGTCC ACAGGATATT
TTGTTGCCGG ATTATGACGC CGCAACGGTA GCCGTGGGCG AAGATGCGAT GGAAGGTTCG
CTGGACGAAA TCACCAGCCG TTTTGAACGC TCGGTATTAA CCCAGCTTTA TCGCAATTAT
CCCAGCACGC GCAAACTGGC AAAACGTCTC GGCGTTTCAC ATACCGCGAT TGCCAATAAG
TTGCGGGAAT ATGGCTTAAG CCAGAAGAAG AACGAAGAGT AA
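
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before computing anything from it, such as GC content. The sketch below shows one way to do that in Python. The helper names are hypothetical, and the example input is only the first line of the sequence, so its GC fraction should not be expected to equal the whole-gene value reported in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as printed above.
first_line = "ATGCGTCTGG AAGTCTTTTG TGAAGACCGA CTCGGTCTGA CCCGCGAATT ACTCGATCTA"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")  # prints: 60 50%
```

Passing the full gene sequence through the same two functions gives the figure to compare against the GC content field.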
 
Protein sequence
MRLEVFCEDR LGLTRELLDL LVLRGIDLRG IEIDPIGRIY LNFAELEFES FSSLMAEIRR 
IAGVTDVRTV PWMPSEREHL ALSALLEALP EPVLSVDMKS KVDMANPASC QLFGQKLDRL
RNHTAAQLIN GFNFLRWLES EPQDSHNEHV VINGQNFLME ITPVYLQDEN DQHVLTGAVV
MLRSTIRMGR QLQNVAAQDV SAFSQIVAVS PKMKHVVEQA QKLAMLSAPL LITGDTGTGK
DLFAYACHQA SPRASKPYLA LNCASIPEDA VESELFGHAP EGKKGFFEQA NGGSVLLDEI
GEMSPRMQAK LLRFLNDGTF RRVGEDHEVH VDVRVICATQ KNLVELVQKG MFREDLYYRL
NVLTLNLPPL RDCPQDIMPL TELFVARFAD EQGVPRPKLA ADLNTVLTRY AWPGNVRQLK
NAIYRALTQL DGYELRPQDI LLPDYDAATV AVGEDAMEGS LDEITSRFER SVLTQLYRNY
PSTRKLAKRL GVSHTAIANK LREYGLSQKK NEE
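
The protein sequence is the translation of the gene sequence under translation table 11. That table assigns the same amino acid to each codon as the standard genetic code (it differs in which codons are accepted as starts), so a standard codon table is enough for a consistency check. The following is a minimal Python sketch, not the database's own method. The helper name `translate` is hypothetical, and the code assumes the input starts in frame and contains only A, C, G, and T.

```python
from itertools import product

# Codon table in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate an in-frame, block-formatted DNA string up to the first stop."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGCGTCTGG AAGTCTTTTG TGAAGACCGA"))  # prints: MRLEVFCEDR
print(translate("AACGAAGAGT AA"))                     # prints: NEE
```

The two outputs match the first block and the last three residues of the protein sequence. Translating the full 1542 bp sequence the same way should reproduce all 513 residues.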