Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0495 |
Symbol | thiI |
ID | 5595068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 505341 |
End bp | 506789 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640919678 |
Product | thiamine biosynthesis protein ThiI |
Protein accession | YP_001457263 |
Protein GI | 157159945 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 57 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTTTA TCATTAAATT GTTCCCGGAA ATCACCATCA AAAGCCAATC TGTGCGCTTG CGCTTTATAA AAATCCTTAC CGGGAACATT CGTAACGTTT TAAAGCACTA TGATGAGACG CTCGCTGTCG TCCGCCACTG GGATAACATC GAAGTTCGCG CAAAAGATGA AAACCAGCGT CTGGCTATTC GCGACGCCCT GACCCGTATT CCGGGTATCC ACCATATTCT CGAAGTCGAA GACGTGCCGT TTACCGACAT GCACGATATT TTTGAGAAAG CGTTGGTTCA GTATCGCGAT CAGCTGGAAG GCAAAACCTT CTGCGTACGC GTGAAGCGTC GTGGCAAACA TGATTTTAGC TCGATTGATG TGGAACGTTA CGTCGGCGGC GGTTTAAATC AGCATATTGA GTCCGCGCGC GTGAAGCTGA CCAATCCGGA AGTGACTGTC CATCTGGAAG TGGAAGACGA TCGTCTCCTG CTGATTAAAG GCCGCTACGA AGGTATTGGC GGTTTCCCGA TCGGCACCCA GGAAGACGTG CTGTCGCTCA TTTCCGGTGG TTTCGACTCC GGTGTTTCCA GTTATATGTT GATGCGTCGC GGCTGCCGCG TGCATTACTG TTTCTTTAAC CTTGGCGGCG CGGCGCATGA AATTGGCGTG CGTCAGGTGG CGCATTATCT GTGGAACCGT TTTGGCAGCT CCCACCGCGT GCGTTTTGTC GCTATTAATT TCGAACCGGT CGTTGGGGAA ATTCTCGAGA AAATCGACGA CGGTCAGATG GGCGTTATCC TCAAACGTAT GATGGTGCGT GCCGCGTCTA AAGTGGCTGA ACGTTACGGC GTACAGGCGC TGGTCACCGG CGAAGCGCTC GGCCAGGTGT CCAGTCAGAC GCTGACCAAC CTGCGCCTGA TTGATAACGT CTCTGACACG CTGATCCTGC GTCCGCTGAT TTCTTACGAC AAAGAGCACA TCATCAACCT GGCCCGCCAG ATTGGCACTG AAGACTTTGC TCGCACGATG CCAGAATATT GTGGCGTGAT CTCCAAAAGC CCGACGGTGA AAGCGGTTAA ATCGAAGATT GAAGCGGAAG AAGAGAAGTT CGACTTCAGT ATTCTCGATA AAGTGGTTGA GGAAGCGAAT AACGTTGATA TCCGCGAAAT CGCCCAGCAG ACCGAGCAGG AAGTGGTGGA AGTGGAAACC GTCAATGGCT TTGGCCCGAA CGACGTGATC CTCGATATCC GTTCTGTCGA TGAGCAGGAA GATAAGCCGC TGAAGGTTGA AGGTATCGAC GTGGTTTCTC TGCCGTTCTA TAAACTGAGC ACCAAATTTG GCGATCTCGA CCAGAACAGA ACCTGGCTAC TGTGGTGTGA GCGCGGGGTG ATGAGCCGTC TGCAGGCGCT CTATCTGCGC GAGCAGGGCT TTAACAATGT GAAGGTGTAT CGCCCATAA
|
Protein sequence | MKFIIKLFPE ITIKSQSVRL RFIKILTGNI RNVLKHYDET LAVVRHWDNI EVRAKDENQR LAIRDALTRI PGIHHILEVE DVPFTDMHDI FEKALVQYRD QLEGKTFCVR VKRRGKHDFS SIDVERYVGG GLNQHIESAR VKLTNPEVTV HLEVEDDRLL LIKGRYEGIG GFPIGTQEDV LSLISGGFDS GVSSYMLMRR GCRVHYCFFN LGGAAHEIGV RQVAHYLWNR FGSSHRVRFV AINFEPVVGE ILEKIDDGQM GVILKRMMVR AASKVAERYG VQALVTGEAL GQVSSQTLTN LRLIDNVSDT LILRPLISYD KEHIINLARQ IGTEDFARTM PEYCGVISKS PTVKAVKSKI EAEEEKFDFS ILDKVVEEAN NVDIREIAQQ TEQEVVEVET VNGFGPNDVI LDIRSVDEQE DKPLKVEGID VVSLPFYKLS TKFGDLDQNR TWLLWCERGV MSRLQALYLR EQGFNNVKVY RP
|
| |