Gene EcSMS35_2985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2985 
SymbolgalR 
ID6143007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3065232 
End bp3066263 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content52% 
IMG OID641617854 
ProductDNA-binding transcriptional regulator GalR 
Protein accessionYP_001745006 
Protein GI170682732 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0409403 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACCA TAAAGGATGT AGCCCGACTG GCAGGCGTTT CAGTCGCCAC CGTTTCCCGC 
GTCATTAATA ATTCACCCAA AGCCAGCGAA GCTTCACGGC TTGCTGTGCA TAGTGCAATG
GAGTCTCTTA GCTATCACCC GAACGCCAAC GCCCGTGCGC TGGCGCAGCA GACCACTGAA
ACGGTCGGTC TGGTCGTTGG TGATGTTTCC GATCCGTTTT TCGGTGCAAT GGTGAAAGCG
GTCGAACAGG TGGCTTATCA CACTGGTAAT TTTTTATTGA TTGGCAACGG TTACCACAAC
GAACAAAAAG AGCGTCAGGC TATTGAGCAA CTGATCCGCC ATCGCTGTGC TGCGTTGGTC
GTCCATGCCA AAATGATACC GGATGCCGAT TTAGCCTCAT TAATGAAACA AATGCCCGGT
ATGGTGCTGA TCAACCGTAT CCTGCCTGGC TTTGAAAACC GTTGTATTGC TCTGGACGAT
CGTTACGGTG CCTGGCTGGC AACGCGTCAT TTAATTCAGC AAGGTCATAC CCGCATTGGT
TATCTGTGTT CTAACCACTC TATTTCTGAC GCCGAAGATC GTCTGCAAGG GTATTACGAT
GCCCTTGCTG AAAGTGGTAT TGCGGCCAAT GACCGGCTGG TGACATTTGG CGAACCAGAC
GAAAGCGGCG GCGAACAGGC AATGACCGAG CTTTTGGGAC GAGGCAGAAA TTTCACTGCG
GTAGCCTGTT ATAACGATTC AATGGCGGCG GGTGCGATGG GCGTTCTCAA TGATAATGGT
ATTGATGTAC CGGGTGAGAT TTCGTTAATT GGCTTTGATG ATGTGCTGGT GTCACGCTAT
GTGCGTCCGC GCCTGACCAC CGTGCGTTAC CCAATCGTGA CGATGGCGAC GCAGGCTGCC
GAACTGGCTT TGGCACTGGC GGATAATCGC CCTCTCCCGG AAATCACTAA TGTCTTTAGT
CCGACGCTGG TACGTCGCCA TTCAGTGTCA ACTCCGTCGC TGGAGGCAAG TCATCATGCA
ACCAGCGACT AA
 
Protein sequence
MATIKDVARL AGVSVATVSR VINNSPKASE ASRLAVHSAM ESLSYHPNAN ARALAQQTTE 
TVGLVVGDVS DPFFGAMVKA VEQVAYHTGN FLLIGNGYHN EQKERQAIEQ LIRHRCAALV
VHAKMIPDAD LASLMKQMPG MVLINRILPG FENRCIALDD RYGAWLATRH LIQQGHTRIG
YLCSNHSISD AEDRLQGYYD ALAESGIAAN DRLVTFGEPD ESGGEQAMTE LLGRGRNFTA
VACYNDSMAA GAMGVLNDNG IDVPGEISLI GFDDVLVSRY VRPRLTTVRY PIVTMATQAA
ELALALADNR PLPEITNVFS PTLVRRHSVS TPSLEASHHA TSD