Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0459 |
Symbol | thiI |
ID | 6146108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 467580 |
End bp | 469028 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615353 |
Product | thiamine biosynthesis protein ThiI |
Protein accession | YP_001742560 |
Protein GI | 170679960 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.882891 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTTA TCATTAAATT GTTCCCGGAA ATCACCATCA AAAGCCAATC TGTGCGCTTG CGCTTTATAA AAATCCTTAC CGGGAACATT CGTAACGTTT TAAAGCACTA TGATGAGACG CTCGCTGTCG TCCGCCACTG GGATAACATC GAAGTTCGCG CAAAAGATGA AAACCAGCGT CTGGCCATTC GCGACGCCTT GACCCGTATT CCGGGTATCC ACCATATTCT CGAAGTCGAA GACGTGCCGT TTACCGACAT GCACGATATT TTCGAGAAAG CGTTGGTTCA GTATCGCGAT CAGTTGGAAG GCAAAACCTT CTGCGTGCGC GTGAAGCGCC GTGGCAAACA TGATTTTAGC TCGATTGATG TGGAGCGTTA CGTCGGCGGC GGTTTAAATC AGCATATTGA ATCCGCGCGC GTGAAGTTGA CCAATCCGGA TGTGACTGTC CATCTGGAAG TGGAAGACGA TCGTCTTCTG CTGATTAAAG GCCGCTACGA AGGTATTGGC GGTTTCCCGA TTGGTACCCA GGAAGATGTG CTGTCGCTCA TTTCTGGTGG TTTCGACTCC GGCGTTTCCA GTTATATGTT GATGCGTCGC GGATGTCGTG TGCATTACTG CTTCTTTAAC CTTGGCGGCG CGGCGCATGA AATTGGCGTG CGTCAGGTAG CGCATTATCT GTGGAACCGC TTTGGCAGCT CCCACCGCGT GCGTTTTGTC GCTATTAATT TCGAACCGGT CGTCGGGGAA ATTCTCGAGA AAATCGACGA CGGTCAGATG GGCGTTATCC TCAAACGTAT GATGGTGCGT GCCGCGTCCA AAGTGGCAGA ACGTTACGGC GTTCAGGCGC TGGTTACCGG TGAAGCGCTC GGCCAGGTGT CCAGCCAGAC GCTGACCAAC CTGCGCCTGA TTGATAACGT CTCCGATACG TTGATCCTGC GTCCGCTGAT CTCTTACGAC AAAGAGCACA TCATCAACCT GGCTCGCCAG ATTGGCACCG AAGACTTTGC CCGCACGATG CCGGAATATT GCGGCGTGAT TTCCAAAAGC CCGACGGTGA AAGCGGTTAA ATCGAAGATT GAAGCGGAAG AAGAGAAGTT TGACTTCAGC ATTCTCGATA AAGTGGTTGA GGAAGCGAAT AACGTTGATA TCCGCGAAAT CGCCCAGCAG ACCGAGCAGG AAGTGGTGGA AGTGGAAACC GTCAATGGCT TCGGCCCGAA CGACGTGATC CTCGATATCC GTTCTATCGA TGAACAGGAA GATAAGCCAC TGAAAGTCGA AGGGATTGAT GTGGTTTCTC TGCCGTTCTA TAAACTGAGC ACCAAATTTG GCGATCTCGA CCAGAGCAAA ACCTGGCTGT TGTGGTGTGA GCGCGGGGTG ATGAGCCGCC TGCAGGCGCT CTATCTGCGC GAGCAGGGCT TTAACAATGT GAAGGTGTAT CGCCCGTAA
|
Protein sequence | MKFIIKLFPE ITIKSQSVRL RFIKILTGNI RNVLKHYDET LAVVRHWDNI EVRAKDENQR LAIRDALTRI PGIHHILEVE DVPFTDMHDI FEKALVQYRD QLEGKTFCVR VKRRGKHDFS SIDVERYVGG GLNQHIESAR VKLTNPDVTV HLEVEDDRLL LIKGRYEGIG GFPIGTQEDV LSLISGGFDS GVSSYMLMRR GCRVHYCFFN LGGAAHEIGV RQVAHYLWNR FGSSHRVRFV AINFEPVVGE ILEKIDDGQM GVILKRMMVR AASKVAERYG VQALVTGEAL GQVSSQTLTN LRLIDNVSDT LILRPLISYD KEHIINLARQ IGTEDFARTM PEYCGVISKS PTVKAVKSKI EAEEEKFDFS ILDKVVEEAN NVDIREIAQQ TEQEVVEVET VNGFGPNDVI LDIRSIDEQE DKPLKVEGID VVSLPFYKLS TKFGDLDQSK TWLLWCERGV MSRLQALYLR EQGFNNVKVY RP
|
| |