Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E0345 |
Symbol | thiI |
ID | 6270057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 335598 |
End bp | 337046 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641724583 |
Product | thiamine biosynthesis protein ThiI |
Protein accession | YP_001879133 |
Protein GI | 187731644 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTTTA TCATTAAATT GTTCCCGGAA ATCACCATCA AAAGCCAATC TGTGCGCTTG CGCTTTATAA AAATCCTTAC CGGGAACATT CGTAACGTTT TAAAGCACTA TGATGAGACG CTCGCTGTCG TCCGCCACTG GGATAACATC GAAGTTCGCG CAAAAGATGA AAACCAGCGT CTGGCTATTC GCGACGCCCT GACCCGTATT CCGGGTATCC ACCATATTCT CGAAGTCGAA GACGTGCCGT TTACCGACAT GCACGATATT TTTGAGAAAG CGTTGGTTCA GTATCGCGAT CAGCTGGAAG GCAAAACCTT CTGCGTACGC GTGAAGCGCC GTGGCAAACA TGATTTTAGC TCGATTGATG TGGAACGTTA CGTCGGCGGC GGTTTAAATC AGCATATTGA GTCCGCGCGC GTGAAGCTGA CCAATCCGGA AGTGACTGTC CATCTGGAAG TGGAAGACGA TCGTCTCCTG CTGATTAAAG GCCGCTACGA AGGTATTGGC GGTTTCCCGA TCGGCACCCA GGAAGATGTG CTGTCGCTCA TTTCCGGTGG TTTCGACTCC GGTGTTTCCA GTTATATGTT GATGCGTCGC GGCTGCCGCG TGCATTACTG TTTCTTTAAC CTTGGCGGCG CGGCGCATGA AATTGGCGTG CGTCAGGTGG CGCATTATCT GTGGAACCGT TTTGGCAGCT CCCACCGCGT GCGTTTTGTC GCTATTAATT TCGAACCGGT CGTCGGGGAA ATTCTCGAGA AAATCGACGA CGGTCAGATG GGCGTTATCC TCAAACGTAT GATGGTGCGT GCCGCGTCTA AAGTGGCTGA ACGTTACGGC GTACAGGCGC TGGTCACCGG CGAAGCGCTC GGCCAGGTGT CCAGTCAGAC GCTGACCAAC CTGCGCCTGA TTGATAACGT CTCCGATACG CTGATCCTGC GTCCGCTGAT TTCTTACGAC AAAGAGCACA TCATCAACCT GGCTCGCCAG ATTGGCACCG AAGACTTTGC TCGCACGATG CCGGAATATT GCGGTGTTAT CTCCAAAAGC CCGACGGTGA AAGCGGTTAA GTCGAAGATT GAAGCGGAAG AAGAGAAATT CGACTTCAGC ATTCTCGATA AAGTGGTTGA AGAAGCGAAT AACGTTGATA TCCGCGAAAT CGCCCAGCAG ACCGAGCAGG AAGTGGTGGA AGTGGAAACC GTCAATGGCT TCGGCCCGAA CGACGTGATC CTCGATATCC GTTCTATCGA TGAACAGGAA GATAAGCCAC TGAAAGTCGA AGGGATTGAT GTAGTTTCTC TGCCGTTCTA TAAACTGAGC ACCAAGTTTG GCGATCTCGA CCAGAACAGA ACCTGGCTAC TGTGGTGTGA GCGCGGGGTG ATGAGCCGTC TGCAGGCGCT CTATCTGCGC GAGCAGGGCT TTAACAATGT GAAGGTATAT CGCCTGTAA
|
Protein sequence | MKFIIKLFPE ITIKSQSVRL RFIKILTGNI RNVLKHYDET LAVVRHWDNI EVRAKDENQR LAIRDALTRI PGIHHILEVE DVPFTDMHDI FEKALVQYRD QLEGKTFCVR VKRRGKHDFS SIDVERYVGG GLNQHIESAR VKLTNPEVTV HLEVEDDRLL LIKGRYEGIG GFPIGTQEDV LSLISGGFDS GVSSYMLMRR GCRVHYCFFN LGGAAHEIGV RQVAHYLWNR FGSSHRVRFV AINFEPVVGE ILEKIDDGQM GVILKRMMVR AASKVAERYG VQALVTGEAL GQVSSQTLTN LRLIDNVSDT LILRPLISYD KEHIINLARQ IGTEDFARTM PEYCGVISKS PTVKAVKSKI EAEEEKFDFS ILDKVVEEAN NVDIREIAQQ TEQEVVEVET VNGFGPNDVI LDIRSIDEQE DKPLKVEGID VVSLPFYKLS TKFGDLDQNR TWLLWCERGV MSRLQALYLR EQGFNNVKVY RL
|
| |