Gene Information |
Locus tag | EcSMS35_2985 |
Symbol | galR |
ID | 6143007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3065232 |
End bp | 3066263 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617854 |
Product | DNA-binding transcriptional regulator GalR |
Protein accession | YP_001745006 |
Protein GI | 170682732 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
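The coordinates and lengths listed above agree with one another. A minimal sketch of that check follows, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon (both assumptions fit the listed values). The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(3065232, 3066263)
print(length, protein_length(length))  # 1032 343
```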
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.0409403 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACCA TAAAGGATGT AGCCCGACTG GCAGGCGTTT CAGTCGCCAC CGTTTCCCGC GTCATTAATA ATTCACCCAA AGCCAGCGAA GCTTCACGGC TTGCTGTGCA TAGTGCAATG GAGTCTCTTA GCTATCACCC GAACGCCAAC GCCCGTGCGC TGGCGCAGCA GACCACTGAA ACGGTCGGTC TGGTCGTTGG TGATGTTTCC GATCCGTTTT TCGGTGCAAT GGTGAAAGCG GTCGAACAGG TGGCTTATCA CACTGGTAAT TTTTTATTGA TTGGCAACGG TTACCACAAC GAACAAAAAG AGCGTCAGGC TATTGAGCAA CTGATCCGCC ATCGCTGTGC TGCGTTGGTC GTCCATGCCA AAATGATACC GGATGCCGAT TTAGCCTCAT TAATGAAACA AATGCCCGGT ATGGTGCTGA TCAACCGTAT CCTGCCTGGC TTTGAAAACC GTTGTATTGC TCTGGACGAT CGTTACGGTG CCTGGCTGGC AACGCGTCAT TTAATTCAGC AAGGTCATAC CCGCATTGGT TATCTGTGTT CTAACCACTC TATTTCTGAC GCCGAAGATC GTCTGCAAGG GTATTACGAT GCCCTTGCTG AAAGTGGTAT TGCGGCCAAT GACCGGCTGG TGACATTTGG CGAACCAGAC GAAAGCGGCG GCGAACAGGC AATGACCGAG CTTTTGGGAC GAGGCAGAAA TTTCACTGCG GTAGCCTGTT ATAACGATTC AATGGCGGCG GGTGCGATGG GCGTTCTCAA TGATAATGGT ATTGATGTAC CGGGTGAGAT TTCGTTAATT GGCTTTGATG ATGTGCTGGT GTCACGCTAT GTGCGTCCGC GCCTGACCAC CGTGCGTTAC CCAATCGTGA CGATGGCGAC GCAGGCTGCC GAACTGGCTT TGGCACTGGC GGATAATCGC CCTCTCCCGG AAATCACTAA TGTCTTTAGT CCGACGCTGG TACGTCGCCA TTCAGTGTCA ACTCCGTCGC TGGAGGCAAG TCATCATGCA ACCAGCGACT AA
|
Protein sequence | MATIKDVARL AGVSVATVSR VINNSPKASE ASRLAVHSAM ESLSYHPNAN ARALAQQTTE TVGLVVGDVS DPFFGAMVKA VEQVAYHTGN FLLIGNGYHN EQKERQAIEQ LIRHRCAALV VHAKMIPDAD LASLMKQMPG MVLINRILPG FENRCIALDD RYGAWLATRH LIQQGHTRIG YLCSNHSISD AEDRLQGYYD ALAESGIAAN DRLVTFGEPD ESGGEQAMTE LLGRGRNFTA VACYNDSMAA GAMGVLNDNG IDVPGEISLI GFDDVLVSRY VRPRLTTVRY PIVTMATQAA ELALALADNR PLPEITNVFS PTLVRRHSVS TPSLEASHHA TSD
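The sequences above are printed in space-separated blocks of ten characters. As a rough sketch, the following shows one way to strip that grouping, compute GC content, and translate the gene sequence. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so the standard assignments are used here. The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the example input is only the first two blocks of the gene sequence.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCGACCA TAAAGGATGT"))  # MATIKD
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after `clean`) is a quick consistency check on the record.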