Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A0485 |
Symbol | thiI |
ID | 6519189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | + |
Start bp | 489407 |
End bp | 490855 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642745634 |
Product | thiamine biosynthesis protein ThiI |
Protein accession | YP_002113458 |
Protein GI | 194735073 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTTA TCATTAAATT GTTCCCGGAA ATCACTATCA AAAGCCAATC TGTGCGTTTG CGCTTTATAA AAATTTTAAC CGGGAACATC CGTAACGTTT TAAAGCACTA CGATGAGACC CTCGCGGTTG TCCGTCACTG GGATAACATT GAAGTTCGCG CCAAAGATGA AAACCAGCGT CTGGCGATTC GCGACGCGCT GACCCGCATT CCGGGGATTC ACCATATTCT TGAAGTCGAA GATGTGCCGT TCACCGATAT GCACGACATT TTCGAGAAAG CGTTGGCGCA GTATCGCGAG CAGCTTGAAG GCAAAACCTT CTGCGTGCGC GTAAAACGTC GCGGTAAGCA TGAGTTTAGC TCCATTGAGG TGGAACGCTA TGTCGGCGGC GGATTAAATC AGCATATTGA ATCGGCGCGC GTGAAGCTCA CTAACCCGGA TGTGACGGTG CATCTGGAAG TGGAAGATGA TCGCCTGCTG CTGATCAAAG GGCGTTATGA AGGTATTGGC GGTTTCCCGA TTGGCACCCA GGAAGATGTG CTGTCGCTGA TCTCCGGCGG TTTTGACTCC GGCGTCTCCA GCTATATGCT GATGCGTCGC GGCTGCCGCG TACACTACTG CTTCTTTAAC CTCGGCGGCG CGGCGCATGA AATCGGCGTT CGCCAGGTGG CGCATTACCT GTGGAACCGC TTTGGCAGCT CCCATCGCGT GCGTTTTGTG GCGATTAACT TCGAACCGGT GGTCGGCGAG ATTCTGGAGA AAGTTGACGA CGGCCAGATG GGCGTGGTGC TCAAACGTAT GATGGTACGC GCGGCGTCGA AAGTGGCGGA ACGTTACGGC GTACAGGCGC TGGTGACCGG CGAAGCGCTG GGCCAGGTGT CCAGCCAGAC GCTAACCAAC TTGCGCTTGA TCGATAACGT GTCTGACACG CTGATCCTGC GCCCGCTGAT CTCTTACGAT AAAGAGCACA TTATCAACCT GGCGCGCCAG ATTGGTACGG AAGATTTTGC CCGTACGATG CCGGAATACT GTGGCGTGAT TTCAAAAAGT CCGACGGTGA AAGCCATTAA AGCGAAAATT GAAGCCGAAG AAGAAAATTT CGACTTCAGT ATTCTCGATA AGGTGGTAGA AGAAGCGAAC AACGTCGATA TTCGTGAAAT CGCCCAGCAG ACCCAGCAGG AGGTGGTGGA AGTAGAAACC GTGAGCGGTT TTGGCGCCAA CGATGTGATT CTGGATATCC GTTCTGTCGA TGAGCAGGAT GACAAGCCGC TGAAAGTGGA AGGCGTCGAC GTCGTTTCGC TGCCTTTCTA CAAGCTGAGC ACTAAATTTG GCGACCTCGA TCAGAGCAAA ACCTGGCTGC TATGGTGCGA ACGCGGCGTA ATGAGTCGCC TGCAGGCGCT CTATCTGCGC GAGCAGGGGT TTGCCAATGT GAAGGTGTAT CGTCCGTAA
|
Protein sequence | MKFIIKLFPE ITIKSQSVRL RFIKILTGNI RNVLKHYDET LAVVRHWDNI EVRAKDENQR LAIRDALTRI PGIHHILEVE DVPFTDMHDI FEKALAQYRE QLEGKTFCVR VKRRGKHEFS SIEVERYVGG GLNQHIESAR VKLTNPDVTV HLEVEDDRLL LIKGRYEGIG GFPIGTQEDV LSLISGGFDS GVSSYMLMRR GCRVHYCFFN LGGAAHEIGV RQVAHYLWNR FGSSHRVRFV AINFEPVVGE ILEKVDDGQM GVVLKRMMVR AASKVAERYG VQALVTGEAL GQVSSQTLTN LRLIDNVSDT LILRPLISYD KEHIINLARQ IGTEDFARTM PEYCGVISKS PTVKAIKAKI EAEEENFDFS ILDKVVEEAN NVDIREIAQQ TQQEVVEVET VSGFGANDVI LDIRSVDEQD DKPLKVEGVD VVSLPFYKLS TKFGDLDQSK TWLLWCERGV MSRLQALYLR EQGFANVKVY RP
|
| |