Gene SeD_A2540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2540 
SymbolgalS 
ID6874324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2418638 
End bp2419660 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content56% 
IMG OID642785615 
ProductDNA-binding transcriptional regulator GalS 
Protein accessionYP_002216273 
Protein GI198244274 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.182428 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.113132 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCACCA TTCGTGATGT AGCGCGCCAG GCTGGCGTGT CTGTAGCGAC CGTTTCCCGC 
GTACTGAATA ACAGCGCGTT GGTTAGTCCC GACACCCGTG ACGCCGTTAT GCAGGCCGTC
ACCCTGCTGG GATATCGGCC AAATGCGAAT GCGCAAGCGC TGGCCACTCA GGTGAGCGAC
ACCATCGGCG TCGTGGTCAT GGATGTTTCC GATGCCTTTT TCGGCGCTCT GGTGAAAGCC
GTAGATCTGG TCGCGCAGCA GCACCAGAAA TATGTTCTCA TTGGCAACAG TTATCATGAG
GCGGAAAAAG AGCGCCATGC GATTGAAGTC TTGATCCGTC AGCGTTGTAA CGCATTGATT
GTTCACTCAA AAGCCTTAAC CGATCGCGAG CTGAGCGACT TTATGGATCA GATCCCCGGT
ATGGTGCTGA TTAACCGTAT CGTGCCGGGT TATGCGCATC GTTGTGTTTG TCTCGACAAT
GTGAGCGGCG CCAGAATGGC GACCCGAATG TTGCTGAATA ATGGACATCA ACGCATCGGC
TACCTGGCCT CCAGCCACCG TATTGAAGAT GACGCGATGC GCAGAGAAGG GTGGTTACAC
GCGCTGCAAG AGCAGGGGAT TGCTGCGTCG GAGAGCTGGA TAGGCACCGG CACGCCGGAC
ATGCAGGGCG GCGAGTCGGC AATGGTTGAG TTGCTGGGAC GCAATCTGCA ACTGACGGCG
GTATTTGCCT ATAACGATAA CATGGCGGCG GGCGCGCTGA CGGCGTTAAA AGATAACGGC
ATCGCCATTC CCTTGCATCT GTCTGTCATC GGTTTCGATG ATATCCCTAT TGCTCGTTAT
ACCGATCCTC AGTTGACTAC CGTGCGCTAT CCTATTGCTT CTATGGCGAA AATCGCGACC
GAACTGGCGT TACAGGGGGC CGCAGGCACG CTGGATATCA CGGCGACGCA CTGCTTCATG
CCGACCCTGG TGCGGCGCCA TTCGGTGGCG TGGCGACAGA ATGCGGTACT GATCACTAAC
TGA
 
Protein sequence
MITIRDVARQ AGVSVATVSR VLNNSALVSP DTRDAVMQAV TLLGYRPNAN AQALATQVSD 
TIGVVVMDVS DAFFGALVKA VDLVAQQHQK YVLIGNSYHE AEKERHAIEV LIRQRCNALI
VHSKALTDRE LSDFMDQIPG MVLINRIVPG YAHRCVCLDN VSGARMATRM LLNNGHQRIG
YLASSHRIED DAMRREGWLH ALQEQGIAAS ESWIGTGTPD MQGGESAMVE LLGRNLQLTA
VFAYNDNMAA GALTALKDNG IAIPLHLSVI GFDDIPIARY TDPQLTTVRY PIASMAKIAT
ELALQGAAGT LDITATHCFM PTLVRRHSVA WRQNAVLITN