Gene ECH74115_4106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4106 
SymbolgalR 
ID6972310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3803397 
End bp3804428 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content53% 
IMG OID643387861 
ProductDNA-binding transcriptional regulator GalR 
Protein accessionYP_002272301 
Protein GI209398804 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.73393 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.000026097 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGACCA TAAAGGATGT AGCCCGACTG GCAGGCGTTT CAGTCGCCAC CGTTTCCCGC 
GTCATTAATA ATTCACCCAA AGCCAGCGAA GCTTCCCGGC TGGCTGTGCA TAGTGCAATG
GAGTCTCTTA GCTATCACCC GAACGCCAAC GCCCGTGCGC TGGCGCAGCA GACCACTGAA
ACGGTCGGTC TGGTCGTTGG TGATGTTTCC GATCCGTTTT TCGGTGCAAT GGTGAAAGCG
GTCGAACAGG TGGCTTATCA CACCGGTAAT TTCTTATTGA TTGGCAATGG TTACCACAAC
GAACAAAAAG AGCGTCAGGC CATTGAGCAA CTGATCCGCC ATCGCTGTGC TGCGTTGGTC
GTCCATGCCA AAATGATCCC GGATGCCGAT TTAGCCTCAT TATTGAAACA AATGCCCGGT
ATGGTGCTGA TCAACCGTAT CCTGCCTGGC TTTGAAAACC GTTGTATTGC TCTGGACGAT
CGTTACGGTG CCTGGCTGGC TACGCGTCAT TTAATTCAGC AAGGTCATAC CCGCATTGGT
TATCTGTGCT CTAACCACTC TATTTCTGAC GCCGAAGATC GTCTGCAAGG GTATTACGAT
GCCCTTGCTG AAAGTGGTAT TGCGGCCAAT GACCGGCTGG TGACATTTGG CGAACCAGAC
GAAAGCGGCG GCGAACAGGC AATGACCGAG CTTTTGGGAC GAGGAAGAAA TTTCACTGCG
GTAGCCTGTT ATAACGATTC AATGGCGGCG GGTGCGATGG GCGTTCTCAA TGATAATGGT
ATTGATGTAC CGGGTGAGAT TTCGTTAATT GGCTTTGATG ATGTGCTGGT GTCACGCTAT
GTGCGTCCGC GCCTGACCAC CGTGCGTTAC CCAATCGTGA CGATGGCGAC CCAGGCTGCC
GAACTGGCTT TGGCGCTGGC GGATAATCGC CCTCTCCCGG AAATCACTAA TGTCTTTAGT
CCGACGCTGG TACGTCGTCA TTCAGTGTCA ACTCCGTCGC TGGAGGCAAG TCATCATGCA
ACCAGCGACT AA
 
Protein sequence
MATIKDVARL AGVSVATVSR VINNSPKASE ASRLAVHSAM ESLSYHPNAN ARALAQQTTE 
TVGLVVGDVS DPFFGAMVKA VEQVAYHTGN FLLIGNGYHN EQKERQAIEQ LIRHRCAALV
VHAKMIPDAD LASLLKQMPG MVLINRILPG FENRCIALDD RYGAWLATRH LIQQGHTRIG
YLCSNHSISD AEDRLQGYYD ALAESGIAAN DRLVTFGEPD ESGGEQAMTE LLGRGRNFTA
VACYNDSMAA GAMGVLNDNG IDVPGEISLI GFDDVLVSRY VRPRLTTVRY PIVTMATQAA
ELALALADNR PLPEITNVFS PTLVRRHSVS TPSLEASHHA TSD