Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0507 |
Symbol | thiI |
ID | 6967468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 511188 |
End bp | 512636 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643384555 |
Product | thiamine biosynthesis protein ThiI |
Protein accession | YP_002269069 |
Protein GI | 209397021 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.667098 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTTA TCATTAAATT GTTCCCGGAA ATCACCATCA AAAGCCAATC TGTGCGCTTG CGCTTTATAA AAATCCTTAC CGGGAACATT CGTAACGTTT TAAAGCACTA TGATGAGACG CTCGCTGTCG TCCGCCACTG GGATAACATC GAAGTTCGCG CGAAAGATGA AAACCAGCGT CTGACTATTC GCGACGCCCT GACCCGTATT CCTGGTATCC ACCATATTCT CGAAGTCGAA GACGTGCCGT TTACCGACAT GCACGATATT TTCGAGAAAG CGTTGGTTCA GTATCGCGAT CAGCTGGATG GCAAAACCTT CTGCGTACGC GTGAAGCGCC GTGGCAAACA TGATTTTAGC TCGATTGATG TGGAACGTTA CGTCGGCGGC GGTTTAAATC AGCATATTGA ATCCGCGCGC GTGAAGCTGA CCAATCCGGA TGTGACTGTC CATCTGGAAG TGGAAGACGA TCGTCTCCTG CTGATTAAAG GCCGCTACGA AGGTATTGGC GGTTTCCCGA TCGGCACCCA GGAAGATGTG CTGTCGCTCA TTTCCGGTGG TTTCGACTCC GGTGTTTCCA GTTATATGTT GATGCGTCGC GGCTGCCGCG TGCATTACTG CTTCTTTAAC CTCGGCGGCG CAGCGCATGA AATTGGCGTG CGTCAGGTGG CACATTATCT GTGGAACCGC TTTGGCAGCT CCCACCGCGT GCGTTTTGTC GCTATTAATT TCGAGCCGGT AGTCGGGGAA ATTCTCGAGA AAATCGACGA CGGTCAGATG GGCGTTATCC TCAAACGTAT GATGGTGCGT GCCGCGTCTA AAGTGGCTGA ACGTTACGGC GTACAGGCGC TGGTCACCGG CGAAGCGCTC GGCCAGGTGT CCAGTCAGAC GCTGACCAAC CTGCGCCTGA TTGATAACGT CTCTGATACG CTGATCCTGC GTCCGCTGAT CTCTTACGAC AAAGAGCACA TCATCAACCT GGCTCGCCAG ATTGGCACCG AAGACTTTGC CCGTACTATG CCGGAATACT GCGGTGTGAT CTCCAAAAGC CCGACGGTGA AAGCGGTTAA GTCGAAGATT GAAGCGGAAG AAGAGAAATT CGACTTCAGC ATTCTCGATA AAGTGGTTGA AGAAGCGAAT AACGTTGATA TCCGCGAAAT CGCCCAGCAG ACCGGGCAGG AAGTGGTGGA AGTGGAAACC GTCAATGACT TCGGCCCGAA CGACGTGATT CTCGATATCC GTTCTGTCGA TGAACAGGAA GATAAGCCAC TGAAAGTCGA AGGGATTGAT GTGGTTTCAC TGCCGTTCTA CAAGTTGAGC ACCAAATTTG GCGATCTCGA CCAGAACAGA ACCTGGCTAC TGTGGTGTGA GCGCGGGGTA ATGAGCCGCC TGCAGGCGCT CTATCTACGC GAACAGGGCT TTAAGAATGT GAAGGTGTAT CGTCCGTAA
|
Protein sequence | MKFIIKLFPE ITIKSQSVRL RFIKILTGNI RNVLKHYDET LAVVRHWDNI EVRAKDENQR LTIRDALTRI PGIHHILEVE DVPFTDMHDI FEKALVQYRD QLDGKTFCVR VKRRGKHDFS SIDVERYVGG GLNQHIESAR VKLTNPDVTV HLEVEDDRLL LIKGRYEGIG GFPIGTQEDV LSLISGGFDS GVSSYMLMRR GCRVHYCFFN LGGAAHEIGV RQVAHYLWNR FGSSHRVRFV AINFEPVVGE ILEKIDDGQM GVILKRMMVR AASKVAERYG VQALVTGEAL GQVSSQTLTN LRLIDNVSDT LILRPLISYD KEHIINLARQ IGTEDFARTM PEYCGVISKS PTVKAVKSKI EAEEEKFDFS ILDKVVEEAN NVDIREIAQQ TGQEVVEVET VNDFGPNDVI LDIRSVDEQE DKPLKVEGID VVSLPFYKLS TKFGDLDQNR TWLLWCERGV MSRLQALYLR EQGFKNVKVY RP
|
| |