Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C4004 |
Symbol | |
ID | 6489294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 3882771 |
End bp | 3883652 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642744105 |
Product | putative transcriptional regulator |
Protein accession | YP_002047710 |
Protein GI | 194451815 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | [TIGR00744] ROK family protein (putative glucokinase) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 0.542016 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCAAT ATATCGGTAT TGATGTGGGA GGAACTCACG TCAAATATGG CGTGATTAAC AGTGACGGCG AAGAATTAAC CCATCATCAA TTCGATACGC CAGAGGACGC CTCCACGTTT ACCCGCAAAT GGCAGGATGT GGTGGCGCGT TGCCAACAGG ACTATGACAT TGCGGCAATC GGGGTTAGTT TCCCCGGCCA TATTAATCCC CATAACGGTC ATGCGGCAAA AGCGGGCGCG CTGGCTTACC TGGATGACGT CAACCTGATG GAGTTGTTCA GCGGGCTGAC CGATCTGCCG CTGGTCGTGG AGAACGACGC GAACTGTGCG GCGCTGGGCG AAATGTGGCG AGGTGCCGGG CAGCATTATG ACAATCTGGT CTGTATTACC ATTGGAACCG GCATTGGCGG CGGTATTATC GTCGGACGAG AACTGTATCG CGGCGCACAT TTCCATGCCG GTGAATTCGG CGTCATGCCG GTCGGGAACA ATGGCGAAAG TATGCATAAA ATCGCGTCAA CCAGCGGATT AATGGCGTCG TGCCGCCAGG CGCTGGCGCT GCCCGCCGAA GAGATGCCGC CTGCGGATGT GATCTTCGAA CGAATGGCGA CCGATGTTCA TCTGCGTGAA GCGGTCAATG ACTGGGCGCG TTATCTGTCA CGCGGCGTTT ACAGCGTGAT CTCTATGTTT GATCCGGGCG TGATGCTGAT CGGCGGAGGA ATAAGCGAAC AGGAAAAGCT CTACCCGCTC CTGACGCGGC ATCTTGAAAC GTTTGAAATG TGGGAGGCGC TCCAGGTGCC GATTCAGCCC TGCCAACTGG GAAATCAGGC GGGCAGGCTG GGCGCCGTCT GGCTGGCGCA GCAAAAGCTC GCCCGAAGCT AA
|
Protein sequence | MQQYIGIDVG GTHVKYGVIN SDGEELTHHQ FDTPEDASTF TRKWQDVVAR CQQDYDIAAI GVSFPGHINP HNGHAAKAGA LAYLDDVNLM ELFSGLTDLP LVVENDANCA ALGEMWRGAG QHYDNLVCIT IGTGIGGGII VGRELYRGAH FHAGEFGVMP VGNNGESMHK IASTSGLMAS CRQALALPAE EMPPADVIFE RMATDVHLRE AVNDWARYLS RGVYSVISMF DPGVMLIGGG ISEQEKLYPL LTRHLETFEM WEALQVPIQP CQLGNQAGRL GAVWLAQQKL ARS
|
| |