Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2285 |
Symbol | galS |
ID | 5592658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2282855 |
End bp | 2283895 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640921413 |
Product | DNA-binding transcriptional regulator GalS |
Protein accession | YP_001458949 |
Protein GI | 157161631 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 66 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCACCA TTCGTGATGT AGCGCGTCAG GCTGGCGTCT CTGTGGCAAC GGTTTCCCGG GTGCTCAATA ACAGCACGCT GGTCAGTGCC GACACGCGTG AAGCAGTAAT GAAAGCCGTG AGTGAGCTGG ATTATCGGCC AAACGCCAAT GCCCAGGCGC TGGCAACTCA GGTTAGCGAC ACCATTGGCG TGGTGGTGAT GGACGTTTCT GATGCGTTTT TCGGCGCGCT GGTAAAAGCG GTGGATCTAG TCGCTCAGCA GCATCAGAAA TACGTGCTAA TCGGCAATAG CTATCATGAA GCGGAAAAAG AGCGTCACGC CATTGAGGTG TTAATTCGCC AGCGTTGTAA TGCGTTGATT GTTCACTCAA AAGCATTGAG TGACGATGAA CTGGCGCAAT TTATGGATAA CATTCCCGGT ATGGTGTTAA TCAACCGCGT TGTGCCGGGG TACGCCCATC GTTGCGTTTG CCTGGATAAT CTCAGCGGTG CCCGAATGGC GACGCGCATG TTGCTGAATA ACGGTCATCA ACGTATTGGT TATCTTTCTT CCAGTCACGG CATTGAAGAT GACGCCATGC GTAAAGCAGG CTGGATGAGT GCGTTGAAAG AGCAGGATAT TATTCCGCCG GAAAGCTGGA TTGGCACTGG TACGCCGGAC ATGCCGGGCG GTGAGGCGGC GATGGTTGAA CTGCTGGGGC GCAATCTACA ACTTACCGCT GTATTTGCTT ATAACGACAA TATGGCCGCT GGCGCACTGA CAGCATTAAA AGATAATGGC ATTGCGATTC CGTTACATCT CTCAATCATC GGTTTCGATG ATATTCCCAT CGCCCGTTAC ACCGACCCGC AATTAACGAC CGTGCGTTAT CCCATTGCTT CAATGGCTAA ATTAGCCACC GAACTGGCCT TGCAGGGGGC AGCAGGCAAT ATTGATCCTC GTGCCAGCCA CTGTTTTATG CCGACGTTAG TGCGTCGCCA TTCTGTCGCA ACGCGCCAGA ATGCGGCGGC GATCACTAAC TCAACAAATC AGGCGATGTA A
|
Protein sequence | MITIRDVARQ AGVSVATVSR VLNNSTLVSA DTREAVMKAV SELDYRPNAN AQALATQVSD TIGVVVMDVS DAFFGALVKA VDLVAQQHQK YVLIGNSYHE AEKERHAIEV LIRQRCNALI VHSKALSDDE LAQFMDNIPG MVLINRVVPG YAHRCVCLDN LSGARMATRM LLNNGHQRIG YLSSSHGIED DAMRKAGWMS ALKEQDIIPP ESWIGTGTPD MPGGEAAMVE LLGRNLQLTA VFAYNDNMAA GALTALKDNG IAIPLHLSII GFDDIPIARY TDPQLTTVRY PIASMAKLAT ELALQGAAGN IDPRASHCFM PTLVRRHSVA TRQNAAAITN STNQAM
|
| |