Gene Information |
Locus tag | SeD_A0460 |
Symbol | thiL |
ID | 6872320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 475346 |
End bp | 476326 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642783686 |
Product | thiamine monophosphate kinase |
Protein accession | YP_002214373 |
Protein GI | 198243241 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 98 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATGTG GCGAGTTTTC CCTGATTGCC CGTTATTTTG ACCGTGTAAA AAGCTCTCGT CTTGATGTTG AAACCGGTAT TGGCGACGAT TGCGCGCTCC TGAATATTCC TGAAAAACAG ACCCTGGCGA TCAGTACCGA TACGCTGGTG GCGGGCATCC ATTTCTTACC CGATATCGAT CCTGCCGATC TGGCGTATAA AGCGCTGGCG GTGAATTTAA GCGATCTGGC GGCGATGGGC GCCGATCCGG CATGGTTAAC GCTGGCGCTC ACGCTTCCTG ACGTCGATGA GGCGTGGCTT GCCGCGTTCA GCGACAGCCT GTTTGAACAA CTGGATTACT ACGACATGCA GCTCATTGGC GGCGATACCA CGCGCGGCCC GCTGTCGATG ACGCTAGGTA TTCATGGCCT TGTGCCAGTC GGTCGGGCGT TGAAACGTTC TGGCGCAAAA CCGGGCGACT GGATTTATGT TACTGGCACG TTGGGCGATA GCGCTGCCGG GCTGGCGATT CTACGGGGTG ATTTTCGCGT GGGAAGCTGG GGGGATGCCG ACTATCTGGT CAAACGCCAT CTGCGCCCGA CGCCGCGTAT TTTACAAGGG CAGGCGCTAC GCGATCTCGC CAGTTCAGCG ATCGATCTTT CCGACGGTTT GATCTCCGAT CTTGGTCACA TTCTGCAAGC CAGCAACTGC GGCGCGCGAA TCGATTTGGA GGCGCTGCCT GACTCCGAAG AACTGTGGGG ACATGCCAAT GATCCCGAAC AAAAGCTTCG CTGGATGTTA TCCGGCGGCG AAGATTATGA ACTGTGCTTT ACCGTCCCGG AGCTGAACCG TGGCGCGCTG GATGTCGCGC TTGGTCATCT GGGCGTGCCG TTTACCTGTA TCGGGCAAAT GACGGCGGAT ATCGAAGGGA TCGCCTTTGT GCGTGACGGA GAACCTGTCA CTTTTGACTG GAAAGGATAT GACCATTTTG CCACGCCATA A
|
Protein sequence | MACGEFSLIA RYFDRVKSSR LDVETGIGDD CALLNIPEKQ TLAISTDTLV AGIHFLPDID PADLAYKALA VNLSDLAAMG ADPAWLTLAL TLPDVDEAWL AAFSDSLFEQ LDYYDMQLIG GDTTRGPLSM TLGIHGLVPV GRALKRSGAK PGDWIYVTGT LGDSAAGLAI LRGDFRVGSW GDADYLVKRH LRPTPRILQG QALRDLASSA IDLSDGLISD LGHILQASNC GARIDLEALP DSEELWGHAN DPEQKLRWML SGGEDYELCF TVPELNRGAL DVALGHLGVP FTCIGQMTAD IEGIAFVRDG EPVTFDWKGY DHFATP
|
| |
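The coordinate, length, and GC fields in this record are related by simple arithmetic, which the following minimal sketch illustrates. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; both assumptions are consistent with the values listed above (981 bp, 326 aa, sequence ending in TAA) but are not stated by the record itself. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Values taken from the Gene Information table above.
length_bp = gene_length(475346, 476326)
print(length_bp)                  # 981
print(protein_length(length_bp))  # 326
```

Passing the full Gene sequence string to `gc_fraction` and multiplying by 100 gives a value that can be compared with the GC content field.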