Gene SeHA_C3226 details


Gene Information

Locus tag: SeHA_C3226
Symbol:
ID: 6488180
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3153262
End bp: 3154290
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 56%
IMG OID: 642743365
Product: DNA-binding transcriptional regulator GalR
Protein accession: YP_002046982
Protein GI: 194448481
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
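
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 3153262
end_bp = 3154290

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 1029 0 342
```

Under these assumptions the computed values agree with the Gene Length (1029 bp) and Protein Length (342 aa) fields.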


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.000639669
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCGACCA TAAAAGATGT AGCCCGACTG GCCGGTGTTT CAGTCGCCAC CGTTTCTCGC 
GTTATTAACG ATTCGCCAAA AGCCAGCGAA GCGTCCCGGC TGGCGGTGAC CAGCGCAATG
GAGTCCCTGA GCTATCACCC TAACGCCAAC GCGCGCGCGC TGGCACAGCA GGCAACGGAA
ACCCTCGGTC TGGTGGTCGG CGACGTTTCC GATCCTTTTT TCGGCGCGAT GGTGAAAGCC
GTTGAACAGG TGGCGTATCA CACCGGCAAT TTTTTACTGA TTGGCAACGG GTATCATAAC
GAACAAAAAG AGCGTCAGGC TATTGAACAG TTGATTCGTC ATCGTTGCGC AGCGTTAGTG
GTACACGCCA AAATGATTCC GGATGCGGAC CTGGCCTCAT TAATGAAGCA AATCCCCGGC
ATGGTGCTGA TTAACCGCAT TTTACCGGGG TTAGAACACC GCTGTGTCGC GCTGGATGAC
CGTTACGGGG CATGGCTGGC GACCCGACAT CTGATCCAGC AAGGTCATAC GCGTATTGGG
TATATCTGTT CCAACCACAC CATCTCTGAT GCCGAAGATC GCCTGAGGGG CTATTACGAT
GCGCTGGCGG AAAGCCATAT CCCGGCTAAC GATCGGCTGG TGACGTTCGG CGAACCGGAT
GAAAGCGGCG GCGAGCAGGC GATGACTGAG TTATTAGGTC GCGGCAGAAA TTTTACCGCG
GTGGCCTGCT ATAACGACTC GATGGCGGCC GGCGCGATGG GAGTATTAAA TGATAATGGC
GTGGGGGTGC CGGGCGAAGT ATCGCTCATC GGTTTTGATG ATGTACTGGT CTCACGCTAT
GTGCGTCCCC GACTGACCAC CATTCGGTAT CCGATCGTCA CCATGGCGAC ACAGGCGGCG
GAGCTGGCCT TAGCGCTGGC AGGGAAATGC CCTACGCCAG AAGTAACTCA TGTATTTAGT
CCGACACTGG TACGCCGACA TTCGGTGTCC ACGCCGACGG ATACCGGGCA CCTGTCGACA
ACCGATTAA
 
Protein sequence
MATIKDVARL AGVSVATVSR VINDSPKASE ASRLAVTSAM ESLSYHPNAN ARALAQQATE 
TLGLVVGDVS DPFFGAMVKA VEQVAYHTGN FLLIGNGYHN EQKERQAIEQ LIRHRCAALV
VHAKMIPDAD LASLMKQIPG MVLINRILPG LEHRCVALDD RYGAWLATRH LIQQGHTRIG
YICSNHTISD AEDRLRGYYD ALAESHIPAN DRLVTFGEPD ESGGEQAMTE LLGRGRNFTA
VACYNDSMAA GAMGVLNDNG VGVPGEVSLI GFDDVLVSRY VRPRLTTIRY PIVTMATQAA
ELALALAGKC PTPEVTHVFS PTLVRRHSVS TPTDTGHLST TD
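
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The sketch below is one way to do that in plain Python, not the method the database itself uses. The helper names (`clean_sequence`, `translate`, `gc_percent`) are hypothetical. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code; table 11 differs in which start codons are permitted, and the sketch does not handle alternative start codons (a GTG or TTG start would be translated literally rather than as M).

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of bases that are G or C."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, as displayed in blocks of ten.
first_line = clean_sequence(
    "ATGGCGACCA TAAAAGATGT AGCCCGACTG GCCGGTGTTT CAGTCGCCAC CGTTTCTCGC"
)
print(translate(first_line))  # prints: MATIKDVARLAGVSVATVSR
```

Passing the full gene sequence through `clean_sequence` and `translate` should reproduce the protein sequence listed above, and `gc_percent` on the same cleaned string can be compared with the GC content field.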