Gene EcHS_A1438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1438 
SymboltyrR 
ID5591070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1434171 
End bp1435712 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content52% 
IMG OID640920593 
ProductDNA-binding transcriptional regulator TyrR 
Protein accessionYP_001458152 
Protein GI157160834 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG3283] Transcriptional regulator of aromatic amino acids metabolism 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones61 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCTGG AAGTCTTTTG TGAAGACCGA CTCGGTCTGA CCCGCGAATT ACTCGATCTA 
CTCGTGCTAA GAGGCATTGA TTTACGCGGT ATTGAGATTG ATCCCATTGG GCGAATCTAC
CTCAATTTTG CTGAACTGGA GTTTGAGAGT TTCAGCAGTC TGATGGCCGA AATACGCCGT
ATTGCGGGTG TTACCGATGT GCGTACTGTC CCGTGGATGC CTTCCGAACG TGAGCATCTG
GCGTTGAGCG CGTTACTGGA GGCGTTGCCT GAACCTGTGC TCTCTGTCGA TATGAAAAGC
AAAGTGGATA TGGCGAACCC GGCGAGCTGT CAGCTTTTTG GGCAAAAATT GGATCGCCTG
CGCAACCATA CCGCCGCACA ATTGATTAAC GGCTTTAATT TTTTACGTTG GCTGGAAAGC
GAACCGCAAG ATTCGCATAA CGAGCATGTC GTTATTAATG GGCAGAATTT CCTGATGGAG
ATTACGCCTG TTTATCTTCA GGATGAAAAT GATCAACACG TCCTGACCGG TGCGGTGGTG
ATGTTGCGAT CAACGATTCG TATGGGCCGC CAGTTGCAAA ATGTCGCCGC CCAGGACGTC
AGCGCCTTCA GTCAAATTGT CGCCGTCAGC CCGAAAATGA AGCATGTTGT CGAACAGGCG
CAGAAACTGG CGATGCTAAG CGCGCCGCTG CTGATTACGG GTGACACAGG TACAGGTAAA
GATCTCTTTG CCTACGCCTG CCATCAGGCA AGCCCCAGAG CGGGCAAACC TTACCTGGCG
CTGAACTGTG CGTCTATACC GGAAGATGCG GTCGAGAGTG AACTGTTTGG TCATGCTCCG
GAAGGGAAGA AAGGATTCTT TGAGCAGGCG AACGGTGGTT CGGTGCTGTT GGATGAAATA
GGGGAAATGT CACCACGGAT GCAGGCGAAA TTACTGCGTT TCCTTAATGA TGGCACTTTC
CGTCGGGTTG GCGAAGACCA TGAGGTGCAT GTCGATGTGC GGGTGATTTG CGCTACGCAG
AAGAATCTGG TCGAACTGGT GCAAAAAGGC ATGTTCCGTG AAGATCTCTA TTATCGTCTG
AACGTGTTGA CGCTCAATCT GCCGCCGCTA CGTGACTGTC CGCAGGACAT CATGCCGTTA
ACTGAGCTGT TCGTCGCCCG CTTTGCCGAC GAGCAGGGCG TGCCGCGTCC GAAACTGGCC
GCTGACCTGA ATACTGTACT TACGCGTTAT GCGTGGCCGG GAAATGTGCG GCAGTTAAAG
AACGCTATCT ATCGCGCACT GACACAACTG GACGGTTATG AGCTGCGTCC ACAGGATATT
TTGTTGCCGG ATTATGACGC CGCAACGGTA GCCGTGGGCG AAGATGCGAT GGAAGGTTCG
CTGGACGAAA TCACCAGCCG TTTTGAACGC TCGGTATTAA CCCAGCTTTA TCGCAATTAT
CCCAGCACGC GCAAACTGGC AAAACGTCTC GGCGTTTCAC ATACCGCGAT TGCCAATAAG
TTGCGGGAAT ATGGTCTGAG TCAGAAGAAG AACGAAGAGT AA
 
Protein sequence
MRLEVFCEDR LGLTRELLDL LVLRGIDLRG IEIDPIGRIY LNFAELEFES FSSLMAEIRR 
IAGVTDVRTV PWMPSEREHL ALSALLEALP EPVLSVDMKS KVDMANPASC QLFGQKLDRL
RNHTAAQLIN GFNFLRWLES EPQDSHNEHV VINGQNFLME ITPVYLQDEN DQHVLTGAVV
MLRSTIRMGR QLQNVAAQDV SAFSQIVAVS PKMKHVVEQA QKLAMLSAPL LITGDTGTGK
DLFAYACHQA SPRAGKPYLA LNCASIPEDA VESELFGHAP EGKKGFFEQA NGGSVLLDEI
GEMSPRMQAK LLRFLNDGTF RRVGEDHEVH VDVRVICATQ KNLVELVQKG MFREDLYYRL
NVLTLNLPPL RDCPQDIMPL TELFVARFAD EQGVPRPKLA ADLNTVLTRY AWPGNVRQLK
NAIYRALTQL DGYELRPQDI LLPDYDAATV AVGEDAMEGS LDEITSRFER SVLTQLYRNY
PSTRKLAKRL GVSHTAIANK LREYGLSQKK NEE