Gene SeHA_C4004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4004 
Symbol 
ID6489294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3882771 
End bp3883652 
Gene Length882 bp 
Protein Length293 aa 
Translation table11 
GC content57% 
IMG OID642744105 
Productputative transcriptional regulator 
Protein accessionYP_002047710 
Protein GI194451815 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID[TIGR00744] ROK family protein (putative glucokinase) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value0.542016 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCAAT ATATCGGTAT TGATGTGGGA GGAACTCACG TCAAATATGG CGTGATTAAC 
AGTGACGGCG AAGAATTAAC CCATCATCAA TTCGATACGC CAGAGGACGC CTCCACGTTT
ACCCGCAAAT GGCAGGATGT GGTGGCGCGT TGCCAACAGG ACTATGACAT TGCGGCAATC
GGGGTTAGTT TCCCCGGCCA TATTAATCCC CATAACGGTC ATGCGGCAAA AGCGGGCGCG
CTGGCTTACC TGGATGACGT CAACCTGATG GAGTTGTTCA GCGGGCTGAC CGATCTGCCG
CTGGTCGTGG AGAACGACGC GAACTGTGCG GCGCTGGGCG AAATGTGGCG AGGTGCCGGG
CAGCATTATG ACAATCTGGT CTGTATTACC ATTGGAACCG GCATTGGCGG CGGTATTATC
GTCGGACGAG AACTGTATCG CGGCGCACAT TTCCATGCCG GTGAATTCGG CGTCATGCCG
GTCGGGAACA ATGGCGAAAG TATGCATAAA ATCGCGTCAA CCAGCGGATT AATGGCGTCG
TGCCGCCAGG CGCTGGCGCT GCCCGCCGAA GAGATGCCGC CTGCGGATGT GATCTTCGAA
CGAATGGCGA CCGATGTTCA TCTGCGTGAA GCGGTCAATG ACTGGGCGCG TTATCTGTCA
CGCGGCGTTT ACAGCGTGAT CTCTATGTTT GATCCGGGCG TGATGCTGAT CGGCGGAGGA
ATAAGCGAAC AGGAAAAGCT CTACCCGCTC CTGACGCGGC ATCTTGAAAC GTTTGAAATG
TGGGAGGCGC TCCAGGTGCC GATTCAGCCC TGCCAACTGG GAAATCAGGC GGGCAGGCTG
GGCGCCGTCT GGCTGGCGCA GCAAAAGCTC GCCCGAAGCT AA
 
Protein sequence
MQQYIGIDVG GTHVKYGVIN SDGEELTHHQ FDTPEDASTF TRKWQDVVAR CQQDYDIAAI 
GVSFPGHINP HNGHAAKAGA LAYLDDVNLM ELFSGLTDLP LVVENDANCA ALGEMWRGAG
QHYDNLVCIT IGTGIGGGII VGRELYRGAH FHAGEFGVMP VGNNGESMHK IASTSGLMAS
CRQALALPAE EMPPADVIFE RMATDVHLRE AVNDWARYLS RGVYSVISMF DPGVMLIGGG
ISEQEKLYPL LTRHLETFEM WEALQVPIQP CQLGNQAGRL GAVWLAQQKL ARS