Gene Information |
Locus tag | EcolC_3210 |
Symbol | |
ID | 6065677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3516650 |
End bp | 3518098 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641602625 |
Product | thiamine biosynthesis protein ThiI |
Protein accession | YP_001726159 |
Protein GI | 170021205 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0000778782 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGTTTA TCATTAAATT GTTCCCGGAA ATCACCATCA AAAGCCAATC TGTGCGCTTG CGCTTTATAA AAATCCTTAC CGGGAACATT CGTAACGTTT TAAAGCACTA TGATGAGACG CTCGCTGTCG TCCGCCACTG GGATAACATC GAAGTTCGCG CAAAAGATGA AAACCAGCGT CTGGCTATTC GCGACGCCCT GACCCGTATT CCGGGTATCC ACCATATTCT CGAAGTCGAA GACGTGCCGT TTACCGACAT GCACGATATT TTCGAGAAAG CGTTGGTTCA GTATCGCGAT CAGCTGGAAG GCAAAACCTT CTGCGTACGC GTGAAGCGCC GTGGCAAACA TGATTTTAGC TCGATTGATG TGGAACGTTA CGTCGGCGGC GGTTTAAATC AGCATATTGA GTCCGCGCGC GTGAAGCTGA CCAATCCGGA AGTGACTGTC CATCTGGAAG TGGAAGACGA TCGTCTCCTG CTGATTAAAG GCCGCTACGA AGGTATTGGC GGTTTCCCGA TCGGCACCCA GGAAGATGTG CTGTCGCTCA TTTCCGGTGG TTTCGACTCC GGTGTTTCCA GTTATATGTT GATGCGTCGC GGCTGCCGCG TGCATTACTG CTTCTTTAAC CTCGGCGGCG CGGCGCATGA AATTGGCGTG CGTCAGGTGG CGCATTATCT GTGGAATCGT TTTGGCAGCT CCCACCGCGT GCGTTTTGTC GCTATTAATT TCGAACCGGT CGTCGGGGAA ATTCTCGAGA AAATCGACGA CGGTCAGATG GGCGTTATCC TCAAACGTAT GATGGTGCGT GCCGCGTCTA AAGTGGCTGA ACGTTACGGC GTACAGGCGC TGGTCACCGG CGAAGCGCTC GGCCAGGTGT CCAGTCAGAC GCTGACCAAC CTGCGCCTGA TTGATAACGT CTCTGATACG CTGATCCTGC GTCCGCTGAT TTCTTACGAC AAAGAGCACA TCATCAACCT GGCCCGCCAG ATTGGCACTG AAGACTTTGC TCGCACGATG CCGGAATATT GTGGCGTGAT CTCCAAAAGC CCGACGGTGA AAGCGGTTAA ATCGAAGATT GAAGCGGAAG AAGAGAAGTT CGACTTCAGC ATTCTCGATA AAGTGGTTGA GGAGGCGAAT AACGTTGATA TCCGCGAAAT CGCCCAGCAG ACCGAGCAGG AAGTGGTGGA AGTGGAAACC GTCAATGGCT TCGGCCCGAA CGACGTGATC CTCGATATCC GTTCTATCGA TGAACAGGAA GATAAGCCAC TGAAAGTCGA AGGGATTGAT GTGGTTTCTC TGCCGTTCTA TAAACTGAGC ACCAAATTTG GCGATCTCGA CCAGAACAAA ACTTGGCTGC TGTGGTGTGA GCGCGGGGTG ATGAGCCGTC TGCAGGCGCT CTATCTGCGC GAGCAGGGCT TTAACAATGT GAAGGTATAT CGCCCGTAA
|
Protein sequence | MKFIIKLFPE ITIKSQSVRL RFIKILTGNI RNVLKHYDET LAVVRHWDNI EVRAKDENQR LAIRDALTRI PGIHHILEVE DVPFTDMHDI FEKALVQYRD QLEGKTFCVR VKRRGKHDFS SIDVERYVGG GLNQHIESAR VKLTNPEVTV HLEVEDDRLL LIKGRYEGIG GFPIGTQEDV LSLISGGFDS GVSSYMLMRR GCRVHYCFFN LGGAAHEIGV RQVAHYLWNR FGSSHRVRFV AINFEPVVGE ILEKIDDGQM GVILKRMMVR AASKVAERYG VQALVTGEAL GQVSSQTLTN LRLIDNVSDT LILRPLISYD KEHIINLARQ IGTEDFARTM PEYCGVISKS PTVKAVKSKI EAEEEKFDFS ILDKVVEEAN NVDIREIAQQ TEQEVVEVET VNGFGPNDVI LDIRSIDEQE DKPLKVEGID VVSLPFYKLS TKFGDLDQNK TWLLWCERGV MSRLQALYLR EQGFNNVKVY RP
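The length fields in this record can be cross-checked against the coordinates and the sequences. The sketch below is illustrative only and is not part of the database record: the helper names (`clean`, `span_length`, `gc_percent`, `translate`) are hypothetical, and it assumes the displayed gene sequence is already the coding strand (it begins with ATG even though the gene lies on the minus strand). The amino acid assignments are those of NCBI translation table 11; alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
# Minimal sketch for sanity-checking the record's fields.
# All function names are hypothetical helpers, not part of any database API.

BASES = "TCAG"
# Amino acid assignments for translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def span_length(start_bp, end_bp):
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Coordinates from the record: 3516650..3518098 on NC_010468
gene_length = span_length(3516650, 3518098)
print(gene_length)            # 1449
print(gene_length // 3 - 1)   # 482 (codons minus the stop codon)

# First three blocks and last nine bases of the gene sequence shown above
print(translate("ATGAAGTTTA TCATTAAATT GTTCCCGGAA"))  # MKFIIKLFPE
print(translate("CGCCCGTAA"))                          # RP
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the protein sequence and the GC content listed in the record; the output of that run is not asserted here.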