Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B0458 |
Symbol | thiL |
ID | 6793008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 453557 |
End bp | 454537 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642774745 |
Product | thiamine monophosphate kinase |
Protein accession | YP_002145401 |
Protein GI | 197249517 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.703911 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATGTG GCGAGTTTTC CCTGATTGCC CGTTATTTTG ACCGTGTAAG AAGCTCTCGT CTTGATGTTG AAACCGGTAT TGGCGACGAT TGCGCGCTCC TGAATATTCC TGAAAAGCAG ACCCTGGCGA TCAGTACCGA TACGCTGGTG GCGGGCATCC ATTTCTTACC CGATATCGAT CCTGCCGATC TGGCGTATAA AGCGCTGGCG GTGAATTTAA GCGATCTGGC GGCGATGGGC GCCGATCCGG CATGGTTAAC GCTGGCGCTC ACGCTTCCTG ACGTCGATGA GGCGTGGCTT GCCGCGTTCA GCGACAGCCT GTTTGAACAA CTGGATTACT ACGACATGCA GCTCATTGGC GGCGATACCA CGCGCGGCCC GCTGTCGATG ACGCTCGGTA TTCATGGCCT TGTGCCAGTC GGCCGGGCGC TGAAACGTTC TGGCGCAAAA CCGGGCGACT GGATTTATGT TACTGGCACG TTGGGCGATA GCGCTGCCGG GCTGGCGATT CTACGGGGCG ATTTTCGCGT GGGAAGCTGG GGGGATGCCG ACTATCTGGT CAAACGCCAT CTGCGCCCGA CGCCGCGTAT TTTACAAGGA CAGGCGCTAC GCGATCTCGC CAGTTCGGCG ATCGATCTTT CCGACGGTTT GATCTCCGAT CTTGGTCACA TTCTGCAAGC CAGCAACTGC GGCGCGCGAA TCGATTTGGA GGCGCTGCCT GACTCCGAAG AACTGTGGGG ACATGCCAAT GATCCCGAAC AAAAGCTTCG CTGGATGCTA TCCGGCGGCG AAGATTATGA ACTGTGCTTT ACCGTCCCGG AGCTGAACCG TGGCGCGCTG GATGTCGCGC TTGGTCATCT GGGCGTGCCG TTTACCTGTA TCGGGCAAAT GACGGCGGAT ATCGAAGGGA TCGCCTTTGT GCGTGACGGA GAACCTGTCA CTTTTGACTG GAAAGGATAT GACCATTTTG CCACGCCATA A
|
Protein sequence | MACGEFSLIA RYFDRVRSSR LDVETGIGDD CALLNIPEKQ TLAISTDTLV AGIHFLPDID PADLAYKALA VNLSDLAAMG ADPAWLTLAL TLPDVDEAWL AAFSDSLFEQ LDYYDMQLIG GDTTRGPLSM TLGIHGLVPV GRALKRSGAK PGDWIYVTGT LGDSAAGLAI LRGDFRVGSW GDADYLVKRH LRPTPRILQG QALRDLASSA IDLSDGLISD LGHILQASNC GARIDLEALP DSEELWGHAN DPEQKLRWML SGGEDYELCF TVPELNRGAL DVALGHLGVP FTCIGQMTAD IEGIAFVRDG EPVTFDWKGY DHFATP
|
| |