Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C1322 |
Symbol | thiK |
ID | 6488673 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 1296113 |
End bp | 1296937 |
Gene Length | 825 bp |
Protein Length | 274 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642741557 |
Product | thiamine kinase |
Protein accession | YP_002045207 |
Protein GI | 194450936 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0510] Predicted choline kinase involved in LPS biosynthesis |
TIGRFAM ID | [TIGR02721] thiamine kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.209926 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.00000837318 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCGGTCCA ACAACAATAA TCCCTTAACG CGCGACGAGA TCCTGTCGCG CTATTTTCCC CAGTATCGTC CCGCCGTCGC CGCATCGCAG GGGCTAAGCG GCGGGAGCTG TATTATTGCC CACGATACTC ACCGTGTCGT GCTGCGCCGT CATCACGACC CCGACGCTCC ACCAGCCCAT TTTTTACGTC ATTACCGCGC CTTATCCCAA TTGCCAGCCA GTCTTGCGCC GCGAGCGCTT TTTTATACGC CAGGCTGGAT GGCGGTAGAA TATCTGCATG GTGTGGTAAA TTCCGCTCTG CCAGATGCCG ACGAACTGGC GGCCTTACTG TATCATTTGC ATCAACAGCC GCGTTTTGGC TGGCGTATTG CGCTATCGCC ACTATTGGCG CAGTACTGGT CGTGTTGCGA TCCGGCAAGG CGTACGCCGT TTTGGTTGCG GCGGCTCAAA CAGTTGCAAA AAAACGGTGA ACCTCGCCCG CTTCGGCTCG CGCCTTTGCA TATGGATGTC CATGGCGACA ATATAGTATT AACGTCCGCC GGGTTGAGAC TGATTGACTG GGAGTATGCC GGCGACGGCG ATATTGCGTT GGAGCTGGCG GCAGTATGGG TTGAGGATGA ACGCCAGCAC CGACAACTGG CAGACGCTTA TGCCGCGCGC GCGCGAATCG ACGCCCGGCA GCTTTGGCGA CAGATACGAT TATGGCACCC CTGGGTCATT ATGCTAAAAG CAGGGTGGTT CGAATACCGC TGGCGACAAA CCGGCGAGCA ACAATTTATC AGGCTGGCCG ATGAAACCTG GCGCCAGTTA CGTATGAAAG GATAA
|
Protein sequence | MRSNNNNPLT RDEILSRYFP QYRPAVAASQ GLSGGSCIIA HDTHRVVLRR HHDPDAPPAH FLRHYRALSQ LPASLAPRAL FYTPGWMAVE YLHGVVNSAL PDADELAALL YHLHQQPRFG WRIALSPLLA QYWSCCDPAR RTPFWLRRLK QLQKNGEPRP LRLAPLHMDV HGDNIVLTSA GLRLIDWEYA GDGDIALELA AVWVEDERQH RQLADAYAAR ARIDARQLWR QIRLWHPWVI MLKAGWFEYR WRQTGEQQFI RLADETWRQL RMKG
|
| |