Gene Information |
Locus tag | EcolC_2302 |
Symbol | |
ID | 6066954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 2538369 |
End bp | 2539910 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641601705 |
Product | DNA-binding transcriptional regulator TyrR |
Protein accession | YP_001725264 |
Protein GI | 170020310 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG3283] Transcriptional regulator of aromatic amino acids metabolism |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
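
The coordinate and length fields above are mutually consistent. A minimal sketch of the arithmetic follows, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both are assumptions about this record's conventions, not something the record states.

```python
# Values copied from the record above.
start_bp = 2538369
end_bp = 2539910

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1542
print(protein_length)   # 513
```

The strand field (`-`) does not change this arithmetic; it only indicates that the coding sequence reads along the reverse strand of the replicon.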
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0336241 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCTGG AAGTCTTTTG TGAAGACCGG CTCGGTCTGA CCCGCGAATT ACTCGATCTA CTCGTGCTAA GAGGCATTGA TTTACGCGGT ATTGAGATTG ATCCCATTGG GCGAATCTAC CTCAATTTTG CTGAACTGGA GTTTGAGAGT TTCAGCAGTC TGATGGCCGA AATACGCCGT ATTGCGGGTG TTACCGATGT GCGTACTGTC CCGTGGATGC CTTCCGAACG TGAGCATCTG GCGTTGAGCG CGTTACTGGA GGCGTTGCCT GAACCTGTGC TCTCTGTCGA TATGAAAAGC AAAGTGGATA TGGCGAACCC GGCGAGCTGT CAGCTTTTTG GGCAAAAATT GGATCGCCTG CGCAACCATA CCGCCGCACA ATTGATTAAC GGCTTTAATT TTTTACGTTG GCTGGAAAGC GAACCGCAAG ATTCGCATAA CGAGCATGTC GTTATTAATG GGCAGAATTT CCTGATGGAG ATTACGCCTG TTTATCTTCA GGATGAAAAT GATCAACACG TCCTGACCGG TGCGGTGGTG ATGTTGCGAT CAACGATTCG TATGGGCCGC CAGTTGCAAA ATGTCGCCGC CCAGGACGTC AGCGCCTTCA GTCAAATTGT CGCCGTCAGC CCGAAAATGA AGCATGTTGT CGAACAGGCG CAGAAACTGG CGATGCTAAG CGCGCCGCTG CTGATTACGG GTGACACAGG TACAGGTAAA GATCTCTTTG CCTACGCCTG CCATCAGGCA AGCCCCAGAG CGGGCAAACC TTACCTGGCG CTGAACTGTG CGTCTATACC GGAAGATGCG GTCGAGAGTG AACTGTTTGG TCATGCTCCG GAAGGGAAGA AAGGATTCTT TGAGCAGGCG AACGGTGGTT CGGTGCTGTT GGATGAAATA GGGGAAATGT CACCACGGAT GCAGGCGAAA TTACTGCGTT TCCTTAATGA TGGCACTTTC CGTCGGGTTG GCGAAGACCA TGAGGTGCAT GTCGATGTGC GGGTGATTTG CGCTACGCAG AAGAATCTGG TCGAACTGGT GCAAAAAGGC ATGTTCCGTG AAGATCTCTA TTATCGTCTG AACGTGTTGA CGCTCAATCT GCCGCCGCTA CGTGACTGTC CGCAGGACAT CATGCCGTTA ACTGAGCTGT TCGTCGCCCG CTTTGCCGAC GAGCAGGGCG TGCCGCGTCC GAAACTGGCC GCTGACCTGA ATACTGTACT TACGCGTTAT GCGTGGCCGG GAAATGTGCG GCAGTTAAAG AACGCTATCT ATCGCGCACT GACACAACTG GACGGTTATG AGCTGCGTCC ACAGGATATT TTGTTGCCGG ATTATGACGC CGCAACGGTA GCCGTGGGCG AAGATGCGAT GGAAGGTTCG CTGGACGAAA TCACCAGCCG TTTTGAACGC TCGGTATTAA CCCAGCTTTA TCGCAATTAT CCCAGCACGC GCAAACTGGC AAAACGTCTC GGCGTTTCAC ATACCGCGAT TGCCAATAAG TTGCGGGAAT ATGGTCTGAG TCAGAAGAAG AACGAAGAGT AA
|
Protein sequence | MRLEVFCEDR LGLTRELLDL LVLRGIDLRG IEIDPIGRIY LNFAELEFES FSSLMAEIRR IAGVTDVRTV PWMPSEREHL ALSALLEALP EPVLSVDMKS KVDMANPASC QLFGQKLDRL RNHTAAQLIN GFNFLRWLES EPQDSHNEHV VINGQNFLME ITPVYLQDEN DQHVLTGAVV MLRSTIRMGR QLQNVAAQDV SAFSQIVAVS PKMKHVVEQA QKLAMLSAPL LITGDTGTGK DLFAYACHQA SPRAGKPYLA LNCASIPEDA VESELFGHAP EGKKGFFEQA NGGSVLLDEI GEMSPRMQAK LLRFLNDGTF RRVGEDHEVH VDVRVICATQ KNLVELVQKG MFREDLYYRL NVLTLNLPPL RDCPQDIMPL TELFVARFAD EQGVPRPKLA ADLNTVLTRY AWPGNVRQLK NAIYRALTQL DGYELRPQDI LLPDYDAATV AVGEDAMEGS LDEITSRFER SVLTQLYRNY PSTRKLAKRL GVSHTAIANK LREYGLSQKK NEE
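
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted sequence could be cleaned, translated, and checked for GC content. The function names (`clean`, `gc_fraction`, `translate`) are hypothetical helpers introduced here for illustration, and the codon lookup uses the standard amino-acid assignments, which translation table 11 shares for elongation codons. Alternative start codons are not handled, which is harmless here because the gene begins with ATG.

```python
# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
# Translation table 11 uses these same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and final twelve bases of the gene sequence in this record.
prefix = "ATGCGTCTGG AAGTCTTTTG TGAAGACCGG"
suffix = "AACGAAGAGT AA"

print(translate(prefix))  # MRLEVFCEDR
print(translate(suffix))  # NEE
```

The two outputs match the first block and the last three residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.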