Gene SeD_A3342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3342 
Symbol 
ID6871025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3218371 
End bp3219399 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content56% 
IMG OID642786347 
ProductDNA-binding transcriptional regulator GalR 
Protein accessionYP_002216986 
Protein GI198242171 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.0205886 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACCA TAAAAGATGT AGCCCGACTG GCCGGTGTTT CAGTCGCCAC CGTTTCTCGC 
GTTATTAACG ATTCGCCAAA AGCCAGCGAA GCGTCCCGGC TGGCGGTGAC CAGCGCAATG
GAGTCCCTGA GCTATCACCC TAACGCCAAC GCGCGCGCGC TGGCACAGCA GGCAACGGAA
ACCCTCGGTC TGGTGGTCGG CGACGTTTCC GATCCTTTTT TCGGCGCGAT GGTGAAAGCC
GTTGAACAGG TGGCGTATCA CACCGGCAAT TTTTTACTGA TTGGCAACGG GTATCATAAC
GAACAAAAAG AGCGTCAGGC TATTGAACAG TTGATTCGCC ATCGTTGCGC AGCGTTAGTG
GTACACGCCA AAATGATTCC GGATGCGGAC CTGGCCTCAT TAATGAAGCA AATCCCCGGC
ATGGTGCTGA TTAACCGCAT TTTACCGGGG TTAGAACACC GCTGTGTCGC GCTGGATGAC
CGTTACGGGG CATGGCTGGC GACCCGACAT CTGATCCAGC AAGGTCATAC GCGTATTGGG
TATATCTGTT CCAACCACAC CATCTCTGAT GCCGAAGATC GCCTGAGGGG CTATTACGAT
GCGCTGGCGG AAAGCCATAT CCCGGCTAAC GATCGGCTGG TGACGTTCGG CGAACCGGAT
GAAAGCGGCG GCGAGCAGGC GATGACTGAG TTATTAGGCC GCGGCAGACA TTTTACCGCG
GTGGCCTGCT ATAACGACTC GATGGCGGCC GGCGCGATGG GAGTATTAAA TGATAATGGC
GTGGGGGTGC CGGGCGAAGT ATCGCTCATC GGTTTTGATG ATGTACTGGT CTCACGCTAT
GTGCGTCCCC GACTGACCAC CATTCGGTAT CCGATCGTCA CCATGGCGAC ACAGGCGGCG
GAGCTGGCGT TAGCGTTGGC GGGGAAATGC CCTACGCCAG AAGTAACTCA TGTATTTAGT
CCGACACTGG TACGCCGACA TTCGGTATCC ACGCCGACGG ATACCGGGCA CCTGTCGACA
ACCGATTAA
 
Protein sequence
MATIKDVARL AGVSVATVSR VINDSPKASE ASRLAVTSAM ESLSYHPNAN ARALAQQATE 
TLGLVVGDVS DPFFGAMVKA VEQVAYHTGN FLLIGNGYHN EQKERQAIEQ LIRHRCAALV
VHAKMIPDAD LASLMKQIPG MVLINRILPG LEHRCVALDD RYGAWLATRH LIQQGHTRIG
YICSNHTISD AEDRLRGYYD ALAESHIPAN DRLVTFGEPD ESGGEQAMTE LLGRGRHFTA
VACYNDSMAA GAMGVLNDNG VGVPGEVSLI GFDDVLVSRY VRPRLTTIRY PIVTMATQAA
ELALALAGKC PTPEVTHVFS PTLVRRHSVS TPTDTGHLST TD