Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C0527 |
Symbol | thiI |
ID | 6488070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 525003 |
End bp | 526451 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642740793 |
Product | thiamine biosynthesis protein ThiI |
Protein accession | YP_002044460 |
Protein GI | 194448678 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTTA TCATTAAATT GTTCCCGGAA ATCACTATCA AAAGCCAATC TGTGCGTTTG CGCTTTATAA AAATTTTAAC CGGGAACATC CGTAACGTTT TAAAGCACTA CGATGAGACC CTCGCGGTTG TCCGTCACTG GGATAACATT GAAGTTCGCG CCAAAGATGA AAACCAGCGT CTGGCGATTC GCGACGCGCT GACCCGCATT CCGGGGATCC ACCATATTCT TGAAGTCGAA GATGTGCCGT TCACCGATAT GCACGACATT TTCGAGAAAG CGTTGGCGCA GTATCGCGAG CAGCTTGAAG GCAAAACCTT CTGCGTGCGC GTAAAACGTC GCGGTAAGCA TGAGTTTAGC TCCATTGAGG TGGAGCGCTA TGTCGGCGGC GGATTAAATC AGCATATTGA ATCGGCGCGC GTGAAGCTCA CTAACCCGGA TGTGACGGTG CATCTGGAAG TGGAAGATGA TCGCCTGCTG CTGATCAAAG GGCGTTATGA AGGTATTGGC GGTTTCCCGA TTGGCACCCA GGAAGATGTG CTGTCGCTGA TCTCCGGCGG TTTTGACTCC GGCGTCTCCA GCTATATGCT GATGCGTCGC GGCTGTCGCG TACACTACTG CTTCTTTAAC CTTGGCGGCG CGGCGCATGA AATCGGCGTT CGCCAGGTGG CGCATTACCT GTGGAACCGC TTTGGCAGCT CCCATCGCGT GCGTTTTGTG GCGATTAACT TTGAACCAGT GGTCGGCGAG ATTCTGGAGA AAGTCGACGA CGGCCAGATG GGCGTGGTGC TCAAACGTAT GATGGTACGC GCGGCGTCGA AAGTGGCGGA ACGTTACGGC GTACAGGCGC TGGTGACCGG CGAAGCGCTG GGCCAGGTGT CCAGCCAGAC GCTAACCAAC TTGCGCTTGA TCGATAACGT GTCTGACACG CTGATCCTGC GCCCGCTGAT CTCTTACGAT AAAGAGCACA TTATCAACCT GGCGCGCCAG ATTGGTACGG AAGATTTTGC CCGTACGATG CCGGAATACT GTGGCGTGAT TTCAAAAAGT CCGACGGTGA AAGCCATTAA AGCGAAAATT GAAGCCGAAG AAGAAAACTT CGACTTCAGT ATTCTCGATA AGGTGGTAGA AGAAGCGAAC AACGTCGATA TTCGTGAAAT CGCCCAGCAG ACCCAGCAGG AGGTGGTGGA AGTAGAAACC GTGAGCGGTT TCGGCCCGAA CGACGTGATT CTGGATATCC GTTCTGTCGA TGAGCAGGAT GACAAGCCGC TGAAAGTGGA AGGCGTCGAC GTCGTTTCGC TGCCTTTCTA CAAGCTGAGC ACTAAATTTG GCGACCTCGA TCAGAGCAAA ACCTGGCTGC TATGGTGCGA ACGCGGCGTA ATGAGTCGCC TGCAGGCGCT CTATCTGCGC GAGCAGGGGT TTGCCAATGT GAAGGTGTAT CGCCCGTAA
|
Protein sequence | MKFIIKLFPE ITIKSQSVRL RFIKILTGNI RNVLKHYDET LAVVRHWDNI EVRAKDENQR LAIRDALTRI PGIHHILEVE DVPFTDMHDI FEKALAQYRE QLEGKTFCVR VKRRGKHEFS SIEVERYVGG GLNQHIESAR VKLTNPDVTV HLEVEDDRLL LIKGRYEGIG GFPIGTQEDV LSLISGGFDS GVSSYMLMRR GCRVHYCFFN LGGAAHEIGV RQVAHYLWNR FGSSHRVRFV AINFEPVVGE ILEKVDDGQM GVVLKRMMVR AASKVAERYG VQALVTGEAL GQVSSQTLTN LRLIDNVSDT LILRPLISYD KEHIINLARQ IGTEDFARTM PEYCGVISKS PTVKAIKAKI EAEEENFDFS ILDKVVEEAN NVDIREIAQQ TQQEVVEVET VSGFGPNDVI LDIRSVDEQD DKPLKVEGVD VVSLPFYKLS TKFGDLDQSK TWLLWCERGV MSRLQALYLR EQGFANVKVY RP
|
| |