Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4106 |
Symbol | galR |
ID | 6972310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3803397 |
End bp | 3804428 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643387861 |
Product | DNA-binding transcriptional regulator GalR |
Protein accession | YP_002272301 |
Protein GI | 209398804 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.73393 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.000026097 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGACCA TAAAGGATGT AGCCCGACTG GCAGGCGTTT CAGTCGCCAC CGTTTCCCGC GTCATTAATA ATTCACCCAA AGCCAGCGAA GCTTCCCGGC TGGCTGTGCA TAGTGCAATG GAGTCTCTTA GCTATCACCC GAACGCCAAC GCCCGTGCGC TGGCGCAGCA GACCACTGAA ACGGTCGGTC TGGTCGTTGG TGATGTTTCC GATCCGTTTT TCGGTGCAAT GGTGAAAGCG GTCGAACAGG TGGCTTATCA CACCGGTAAT TTCTTATTGA TTGGCAATGG TTACCACAAC GAACAAAAAG AGCGTCAGGC CATTGAGCAA CTGATCCGCC ATCGCTGTGC TGCGTTGGTC GTCCATGCCA AAATGATCCC GGATGCCGAT TTAGCCTCAT TATTGAAACA AATGCCCGGT ATGGTGCTGA TCAACCGTAT CCTGCCTGGC TTTGAAAACC GTTGTATTGC TCTGGACGAT CGTTACGGTG CCTGGCTGGC TACGCGTCAT TTAATTCAGC AAGGTCATAC CCGCATTGGT TATCTGTGCT CTAACCACTC TATTTCTGAC GCCGAAGATC GTCTGCAAGG GTATTACGAT GCCCTTGCTG AAAGTGGTAT TGCGGCCAAT GACCGGCTGG TGACATTTGG CGAACCAGAC GAAAGCGGCG GCGAACAGGC AATGACCGAG CTTTTGGGAC GAGGAAGAAA TTTCACTGCG GTAGCCTGTT ATAACGATTC AATGGCGGCG GGTGCGATGG GCGTTCTCAA TGATAATGGT ATTGATGTAC CGGGTGAGAT TTCGTTAATT GGCTTTGATG ATGTGCTGGT GTCACGCTAT GTGCGTCCGC GCCTGACCAC CGTGCGTTAC CCAATCGTGA CGATGGCGAC CCAGGCTGCC GAACTGGCTT TGGCGCTGGC GGATAATCGC CCTCTCCCGG AAATCACTAA TGTCTTTAGT CCGACGCTGG TACGTCGTCA TTCAGTGTCA ACTCCGTCGC TGGAGGCAAG TCATCATGCA ACCAGCGACT AA
|
Protein sequence | MATIKDVARL AGVSVATVSR VINNSPKASE ASRLAVHSAM ESLSYHPNAN ARALAQQTTE TVGLVVGDVS DPFFGAMVKA VEQVAYHTGN FLLIGNGYHN EQKERQAIEQ LIRHRCAALV VHAKMIPDAD LASLLKQMPG MVLINRILPG FENRCIALDD RYGAWLATRH LIQQGHTRIG YLCSNHSISD AEDRLQGYYD ALAESGIAAN DRLVTFGEPD ESGGEQAMTE LLGRGRNFTA VACYNDSMAA GAMGVLNDNG IDVPGEISLI GFDDVLVSRY VRPRLTTVRY PIVTMATQAA ELALALADNR PLPEITNVFS PTLVRRHSVS TPSLEASHHA TSD
|
| |