Gene Information |
Locus tag | EcSMS35_2298 |
Symbol | galS |
ID | 6144481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2326342 |
End bp | 2327382 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617172 |
Product | DNA-binding transcriptional regulator GalS |
Protein accession | YP_001744345 |
Protein GI | 170679819 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
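The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; neither convention is stated in the record, although both agree with the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2326342, 2327382)
print(length)                     # 1041
print(protein_length_aa(length))  # 346
```

Under these assumptions, the 1041 bp gene length and the 346 aa protein length in the record are consistent with each other and with the listed coordinates.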
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.00335365 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCACCA TTCGTGATGT AGCGCGTCAG GCTGGCGTCT CTGTGGCAAC GGTTTCCCGA GTGCTCAATA ACAGCACGCT GGTCAGTGCC GACACGCGTG AAGCAGTAAT GAAAGCCGTG AGTGAGCTGG ATTATCGACC AAATGCCAAT GCCCAGGCGC TGGCAACTCA GGTTAGCGAC ACCATTGGCG TGGTGGTGAT GGACGTTTCT GATGCGTTTT TCGGCGCGCT GGTAAAAGCG GTGGATTTGG TCGCCCAGCA GCATCAGAAA TACGTGCTAA TCGGCAATAG CTATCATGAA GCGGAAAAAG AGCGTCACGC CATTGAGGTG TTAATTCGCC AGCGTTGTAA TGCGTTGATT GTTCACTCAA AAGCATTGAG TGATGATGAA CTGGCACAAT TTATGGATAA CATTCCCGGT ATGGTGTTAA TCAACCGCGT TGTGCCGGGG TACGCCCATC GTTGCGTTTG CCTGGATAAT CTCAGCGGTG CCCGAATGGC GACGCGCATG TTGCTGAATA ACGGTCATCA ACGTATTGGT TATCTTTCTT CCAGCCACGG CATTGAAGAT GACGCCATGC GTAAAGCAGG CTGGATGAGT GCGTTGAAAG AGCAGGATAT TATTCCGCCG GAAAGCTGGA TTGGCACTGG CACGCCGGAC ATGCCGGGCG GTGAGGCGGC GATGGTTGAA CTGCTGGGGC GCAATCTACA ACTTACCGCT GTATTTGCTT ATAACGACAA TATGGCTGCT GGCGCACTGA CAGCATTAAA AGATAATGGC ATTGCGATTC CGTTACATCT CTCAATCATC GGTTTCGATG ATATTCCCAT CGCCCGTTAC ACCGACCCGC AATTAACGAC CGTGCGTTAT CCCATTGCTT CAATGGCTAA ATTAGCCACC GAACTGGCCT TGCAGGGGGC AGCAGGCAAT ATTGATCCTC GTGCCAGCCA CTGTTTTATG CCAACGTTAG TGCGTCGTCA TTCTGTCGCA ACGCGCCAGA ATGCGGCGGC GATCACTAAC TCAACAAATC AGGCGATGTA A
|
Protein sequence | MITIRDVARQ AGVSVATVSR VLNNSTLVSA DTREAVMKAV SELDYRPNAN AQALATQVSD TIGVVVMDVS DAFFGALVKA VDLVAQQHQK YVLIGNSYHE AEKERHAIEV LIRQRCNALI VHSKALSDDE LAQFMDNIPG MVLINRVVPG YAHRCVCLDN LSGARMATRM LLNNGHQRIG YLSSSHGIED DAMRKAGWMS ALKEQDIIPP ESWIGTGTPD MPGGEAAMVE LLGRNLQLTA VFAYNDNMAA GALTALKDNG IAIPLHLSII GFDDIPIARY TDPQLTTVRY PIASMAKLAT ELALQGAAGN IDPRASHCFM PTLVRRHSVA TRQNAAAITN STNQAM
|
| |
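The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own tooling: it strips the spacing, computes the GC fraction, and translates a coding sequence up to the first stop codon. The helper names are hypothetical. The codon table is the standard one, which assigns the same amino acids as translation table 11; table 11 differs only in its permitted start codons, which does not matter here because the gene starts with ATG.

```python
# Compact codon table: amino acids listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence shown above.
start = clean_sequence("ATGATCACCA TTCGTGATGT")
print(translate(start))  # MITIRD
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` and `translate` gives values that can be compared against the GC content and protein sequence fields of the record.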