Gene EcSMS35_1799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1799 
SymboltyrR 
ID6146845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1817659 
End bp1819200 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content52% 
IMG OID641616675 
ProductDNA-binding transcriptional regulator TyrR 
Protein accessionYP_001743853 
Protein GI170681764 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG3283] Transcriptional regulator of aromatic amino acids metabolism 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.322981 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCTGG AAGTCTTTTG TGAAGACCGA CTCGGTCTGA CCCGCGAATT ACTCGATCTA 
CTCGTGCTAA GAGGCATTGA TTTACGCGGT ATTGAGATTG ATCCCATTGG GCGAATCTAC
CTCAATTTTG CTGAACTGGA GTTTGAGAGT TTCAGCAGTC TGATGGCCGA AATACGCCGT
ATTGCAGGTG TTACCGATGT GCGTACTGTC CCGTGGATGC CTTCCGAACG TGAGCATCTG
GCGTTGAGCG CGTTACTGGA GGCGTTGCCT GAACCTGTGC TCTCTGTCGA TATGAAAAGC
AAAGTGGATA TGGCGAACCC GGCGAGCTGT CAGCTTTTTG GGCAAAAATT GGATCGCCTG
CGCAACCATA CCGCCGCGCA ATTGATTAAC GGCTTTAATT TTTTACGCTG GCTGGAAAGC
GAACCGCAAG ATTCGCATAA CGAGCATGTC GTTATTAATG GGCAGAATTT CCTGATGGAG
ATTACGCCTG TTTATCTTCA GGATGAAAAT GATCAACACG TCCTGACCGG TGCGGTGGTG
ATGTTGCGAT CAACGATTCG TATGGGCCGT CAGTTGCAAA ATGTCGCTGC CCAGGACGTC
AGCGCCTTCA GTCAAATTGT CGCTGTCAGC CCGAAAATGA AGCATGTTGT CGAACAGGCG
CAGAAACTGG CGATGCTAAG CGCGCCACTG CTGATTACGG GTGATACCGG CACCGGAAAA
GATCTCTTTG CCTACGCCTG TCATCAGGCA AGCCCCAGAG CGAGCAAACC TTACCTGGCG
CTGAACTGTG CGTCTATACC GGAAGATGCG GTCGAGAGTG AACTGTTTGG TCATGCTCCG
GAAGGGAAGA AAGGGTTCTT TGAGCAGGCG AACGGTGGTT CGGTGCTGTT GGATGAAATA
GGGGAAATGT CACCACGGAT GCAGGCGAAA TTACTGCGTT TCCTTAATGA TGGCACCTTC
CGTCGGGTTG GCGAAGACCA TGAGGTGCAT GTCGATGTGC GGGTGATTTG TGCTACGCAG
AAGAATCTGG TCGAACTGGT GCAAAAAGGC GTGTTCCGTG AAGATCTCTA TTATCGTCTG
AACGTGTTGA CGCTCAATCT GCCGCCGCTA CGTGACTGTC CGCAGGACAT CATGCCGTTA
ACCGAGCTGT TCGTCGCCCG CTTTGCCGAC GAGCAGGGCG TGCCGCGTCC GAAACTGGCC
GCTGATCTGA ATACTGTACT TACGCGTTAT GCGTGGCCGG GAAATGTGCG GCAGTTAAAG
AACGCTATCT ATCGTGCACT GACACAACTG GACGGTTATG AGCTGCGTCC ACAGGATATT
TTGTTGCCGG ATTATGACGC CGCAACGGTA GCCGTGGGCG AAGATGCGAT GGAAGGTTCG
CTGGACGAAA TCACCAGCCG TTTTGAACGC TCGGTATTAA CCCAGCTTTA TCGCAATTAT
CCCAGCACGC GCAAACTGGC AAAACGTCTC GGCGTTTCAC ATACCGCGAT TGCCAATAAG
TTGCGGGAAT ATGGTCTGAG TCAGAAGAAG AACGAAGAGT AA
 
Protein sequence
MRLEVFCEDR LGLTRELLDL LVLRGIDLRG IEIDPIGRIY LNFAELEFES FSSLMAEIRR 
IAGVTDVRTV PWMPSEREHL ALSALLEALP EPVLSVDMKS KVDMANPASC QLFGQKLDRL
RNHTAAQLIN GFNFLRWLES EPQDSHNEHV VINGQNFLME ITPVYLQDEN DQHVLTGAVV
MLRSTIRMGR QLQNVAAQDV SAFSQIVAVS PKMKHVVEQA QKLAMLSAPL LITGDTGTGK
DLFAYACHQA SPRASKPYLA LNCASIPEDA VESELFGHAP EGKKGFFEQA NGGSVLLDEI
GEMSPRMQAK LLRFLNDGTF RRVGEDHEVH VDVRVICATQ KNLVELVQKG VFREDLYYRL
NVLTLNLPPL RDCPQDIMPL TELFVARFAD EQGVPRPKLA ADLNTVLTRY AWPGNVRQLK
NAIYRALTQL DGYELRPQDI LLPDYDAATV AVGEDAMEGS LDEITSRFER SVLTQLYRNY
PSTRKLAKRL GVSHTAIANK LREYGLSQKK NEE