Gene EcSMS35_2298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2298 
SymbolgalS 
ID6144481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2326342 
End bp2327382 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content52% 
IMG OID641617172 
ProductDNA-binding transcriptional regulator GalS 
Protein accessionYP_001744345 
Protein GI170679819 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.00335365 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCACCA TTCGTGATGT AGCGCGTCAG GCTGGCGTCT CTGTGGCAAC GGTTTCCCGA 
GTGCTCAATA ACAGCACGCT GGTCAGTGCC GACACGCGTG AAGCAGTAAT GAAAGCCGTG
AGTGAGCTGG ATTATCGACC AAATGCCAAT GCCCAGGCGC TGGCAACTCA GGTTAGCGAC
ACCATTGGCG TGGTGGTGAT GGACGTTTCT GATGCGTTTT TCGGCGCGCT GGTAAAAGCG
GTGGATTTGG TCGCCCAGCA GCATCAGAAA TACGTGCTAA TCGGCAATAG CTATCATGAA
GCGGAAAAAG AGCGTCACGC CATTGAGGTG TTAATTCGCC AGCGTTGTAA TGCGTTGATT
GTTCACTCAA AAGCATTGAG TGATGATGAA CTGGCACAAT TTATGGATAA CATTCCCGGT
ATGGTGTTAA TCAACCGCGT TGTGCCGGGG TACGCCCATC GTTGCGTTTG CCTGGATAAT
CTCAGCGGTG CCCGAATGGC GACGCGCATG TTGCTGAATA ACGGTCATCA ACGTATTGGT
TATCTTTCTT CCAGCCACGG CATTGAAGAT GACGCCATGC GTAAAGCAGG CTGGATGAGT
GCGTTGAAAG AGCAGGATAT TATTCCGCCG GAAAGCTGGA TTGGCACTGG CACGCCGGAC
ATGCCGGGCG GTGAGGCGGC GATGGTTGAA CTGCTGGGGC GCAATCTACA ACTTACCGCT
GTATTTGCTT ATAACGACAA TATGGCTGCT GGCGCACTGA CAGCATTAAA AGATAATGGC
ATTGCGATTC CGTTACATCT CTCAATCATC GGTTTCGATG ATATTCCCAT CGCCCGTTAC
ACCGACCCGC AATTAACGAC CGTGCGTTAT CCCATTGCTT CAATGGCTAA ATTAGCCACC
GAACTGGCCT TGCAGGGGGC AGCAGGCAAT ATTGATCCTC GTGCCAGCCA CTGTTTTATG
CCAACGTTAG TGCGTCGTCA TTCTGTCGCA ACGCGCCAGA ATGCGGCGGC GATCACTAAC
TCAACAAATC AGGCGATGTA A
 
Protein sequence
MITIRDVARQ AGVSVATVSR VLNNSTLVSA DTREAVMKAV SELDYRPNAN AQALATQVSD 
TIGVVVMDVS DAFFGALVKA VDLVAQQHQK YVLIGNSYHE AEKERHAIEV LIRQRCNALI
VHSKALSDDE LAQFMDNIPG MVLINRVVPG YAHRCVCLDN LSGARMATRM LLNNGHQRIG
YLSSSHGIED DAMRKAGWMS ALKEQDIIPP ESWIGTGTPD MPGGEAAMVE LLGRNLQLTA
VFAYNDNMAA GALTALKDNG IAIPLHLSII GFDDIPIARY TDPQLTTVRY PIASMAKLAT
ELALQGAAGN IDPRASHCFM PTLVRRHSVA TRQNAAAITN STNQAM