Gene YpsIP31758_3108 details

Gene Information

Locus tag: YpsIP31758_3108
Symbol: thiI
ID: 5385126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 3498956
End bp: 3500407
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 47%
IMG OID: 640866115
Product: thiamine biosynthesis protein ThiI
Protein accession: YP_001402068
Protein GI: 153948038
COG category: [H] Coenzyme transport and metabolism
              [P] Inorganic ion transport and metabolism
COG ID: [COG0301] Thiamine biosynthesis ATP pyrophosphatase
        [COG0607] Rhodanese-related sulfurtransferase
TIGRFAM ID: [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI
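
The coordinate and length fields in this table can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3498956
end_bp = 3500407
gene_length_bp = 1452
protein_length_aa = 483

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons = span_bp // 3
expected_protein_length = codons - 1

print("span (bp):", span_bp)
print("span matches Gene Length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop matches Protein Length:",
      expected_protein_length == protein_length_aa)
```

Under these assumptions the span is 1452 bp, which is 484 codons, or 483 amino acids plus one stop codon.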


Plasmid Coverage information

Num covering plasmid clones: 45
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTTTA TCATTAAATT GTTCCCAGAA ATCACCATCA AGAGTCAATC TGTGCGATTG 
CGCTTCATTA AGATCCTGAC TACCAATATC CGCAACGTAC TGAAACACCT TGAGGACGAT
ACCCTCGCCA TTGTTCGTCA TTGGGATCAT ATCGAGCTTC GTACCAAAGA TGACAATCTT
GGCCCGGAGA TTTGTGATGC ACTGACCCGC ATACCGGGTA TTCACCATAT CCTTGAAGTA
GAAGATCGTA GCTACAGCGA TATGCACAAT ATTTTTGAAC AAACGCTGGA AGCCTACCGG
GAGACACTGG TTGGGAAGAC TTTCTGTGTT CGGGTAAAAC GTCGTGGCAA GCATGAATTC
TCTTCTGGTG ATGTTGAACG ATATGTCGGT GGGGGTCTGA ATCAACATAT TGAAAGTGCT
AAAGTAAACC TGACCCGCCC ACAGGTGACG GTTAATTTGG AAGTTGATCA AGACAAACTG
ATCTTGGTTA AGGCACGTCA TGAAGGGCTG GGGGGCTTCC CTATCGGCAC TCAGGAGGAT
GTTCTCTCCC TGATTTCAGG CGGTTTTGAT TCAGGTGTAT CGAGCTATAT GCTGATGCGC
CGTGGCTGTC GTGTCCATTA TTGTTTCTTT AATCTCGGTG GCTCAGCCCA TGAGATAGGT
GTTAAGCAGG TGGCACATTA TCTGTGGAAT CGCTTTGGCA GCTCCCATCG AGTACGGTTT
ATCGCCATTG ATTTTGAACC GGTTGTTGGC GAGATCCTGG AAAAAGTTGA AGATGGCCAG
ATGGGCGTAG TATTGAAGCG CATGATGGTG CGCGCAGCTT CTCAGGTTGC TGAACGCTAT
GGTGTTCAGG CTTTAGTTAC CGGTGAGGCG CTGGGGCAGG TATCCAGCCA GACACTGACT
AACCTGCGTT TAATTGATAA TGCCTCGGAT ACCCTAATTT TGCGCCCGCT TATCTCCCAC
GATAAAGAGC ACATCATTAA TTTGGCACGG CAAATTGGTA CGGAAGATTT TGCTAAAACC
ATGCCGGAAT ATTGTGGCGT TATTTCAAAA AGCCCAACGG TAAAAGCGGT CAAAGCTAAA
ATCGAAGAAG AGGAGTCTCA CTTTGATTTC TCTATTCTGG ATCGGGTTGT GAGCGAAGCT
AAAAATGTTG ATATCCGTGA AATCGCACAG CAAAGCCGTG AACAAGTCGT TGAAGTTGAA
ACCGTTGCTG AATTGGCCGA TACCGATGTG TTGCTAGATA TTCGTGCGCC TGATGAACAG
GAAGAGAAGC CACTGAAACT GGATCAAGTT GAAGTGCGAT CACTGCCGTT CTATAAGCTC
AGCTCGCAAT TTGCCGATCT GGACCAGAGT AAAACCTATT TGCTGTATTG TGATCGCGGT
GTGATGAGCC GTTTGCAGGC CCTTTATTTG CGTGAGCAGG GGTATACCAA CGTGAAAGTC
TATCGCCCTT AG
 
Protein sequence
MKFIIKLFPE ITIKSQSVRL RFIKILTTNI RNVLKHLEDD TLAIVRHWDH IELRTKDDNL 
GPEICDALTR IPGIHHILEV EDRSYSDMHN IFEQTLEAYR ETLVGKTFCV RVKRRGKHEF
SSGDVERYVG GGLNQHIESA KVNLTRPQVT VNLEVDQDKL ILVKARHEGL GGFPIGTQED
VLSLISGGFD SGVSSYMLMR RGCRVHYCFF NLGGSAHEIG VKQVAHYLWN RFGSSHRVRF
IAIDFEPVVG EILEKVEDGQ MGVVLKRMMV RAASQVAERY GVQALVTGEA LGQVSSQTLT
NLRLIDNASD TLILRPLISH DKEHIINLAR QIGTEDFAKT MPEYCGVISK SPTVKAVKAK
IEEEESHFDF SILDRVVSEA KNVDIREIAQ QSREQVVEVE TVAELADTDV LLDIRAPDEQ
EEKPLKLDQV EVRSLPFYKL SSQFADLDQS KTYLLYCDRG VMSRLQALYL REQGYTNVKV
YRP
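
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how one might strip that formatting, compute GC content, and translate the gene for comparison with the listed protein follows. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is written out by hand rather than taken from a library; translation table 11 assigns internal codons the same amino acids as the standard code, and the alternative start codons it allows are not handled here since this gene starts with ATG.

```python
# Codon table laid out in the conventional T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_content(dna):
    """Return the fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the listing above.
first_blocks = "ATGAAGTTTA TCATTAAATT GTTCCCAGAA"
print(translate(first_blocks))  # MKFIIKLFPE, the first block of the protein sequence
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence gives a quick consistency check of the two listings; `gc_content` on the full gene sequence can be compared with the GC content field in the same way.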