Gene SeHA_C0527 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C0527 
SymbolthiI 
ID6488070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp525003 
End bp526451 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content53% 
IMG OID642740793 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_002044460 
Protein GI194448678 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTTA TCATTAAATT GTTCCCGGAA ATCACTATCA AAAGCCAATC TGTGCGTTTG 
CGCTTTATAA AAATTTTAAC CGGGAACATC CGTAACGTTT TAAAGCACTA CGATGAGACC
CTCGCGGTTG TCCGTCACTG GGATAACATT GAAGTTCGCG CCAAAGATGA AAACCAGCGT
CTGGCGATTC GCGACGCGCT GACCCGCATT CCGGGGATCC ACCATATTCT TGAAGTCGAA
GATGTGCCGT TCACCGATAT GCACGACATT TTCGAGAAAG CGTTGGCGCA GTATCGCGAG
CAGCTTGAAG GCAAAACCTT CTGCGTGCGC GTAAAACGTC GCGGTAAGCA TGAGTTTAGC
TCCATTGAGG TGGAGCGCTA TGTCGGCGGC GGATTAAATC AGCATATTGA ATCGGCGCGC
GTGAAGCTCA CTAACCCGGA TGTGACGGTG CATCTGGAAG TGGAAGATGA TCGCCTGCTG
CTGATCAAAG GGCGTTATGA AGGTATTGGC GGTTTCCCGA TTGGCACCCA GGAAGATGTG
CTGTCGCTGA TCTCCGGCGG TTTTGACTCC GGCGTCTCCA GCTATATGCT GATGCGTCGC
GGCTGTCGCG TACACTACTG CTTCTTTAAC CTTGGCGGCG CGGCGCATGA AATCGGCGTT
CGCCAGGTGG CGCATTACCT GTGGAACCGC TTTGGCAGCT CCCATCGCGT GCGTTTTGTG
GCGATTAACT TTGAACCAGT GGTCGGCGAG ATTCTGGAGA AAGTCGACGA CGGCCAGATG
GGCGTGGTGC TCAAACGTAT GATGGTACGC GCGGCGTCGA AAGTGGCGGA ACGTTACGGC
GTACAGGCGC TGGTGACCGG CGAAGCGCTG GGCCAGGTGT CCAGCCAGAC GCTAACCAAC
TTGCGCTTGA TCGATAACGT GTCTGACACG CTGATCCTGC GCCCGCTGAT CTCTTACGAT
AAAGAGCACA TTATCAACCT GGCGCGCCAG ATTGGTACGG AAGATTTTGC CCGTACGATG
CCGGAATACT GTGGCGTGAT TTCAAAAAGT CCGACGGTGA AAGCCATTAA AGCGAAAATT
GAAGCCGAAG AAGAAAACTT CGACTTCAGT ATTCTCGATA AGGTGGTAGA AGAAGCGAAC
AACGTCGATA TTCGTGAAAT CGCCCAGCAG ACCCAGCAGG AGGTGGTGGA AGTAGAAACC
GTGAGCGGTT TCGGCCCGAA CGACGTGATT CTGGATATCC GTTCTGTCGA TGAGCAGGAT
GACAAGCCGC TGAAAGTGGA AGGCGTCGAC GTCGTTTCGC TGCCTTTCTA CAAGCTGAGC
ACTAAATTTG GCGACCTCGA TCAGAGCAAA ACCTGGCTGC TATGGTGCGA ACGCGGCGTA
ATGAGTCGCC TGCAGGCGCT CTATCTGCGC GAGCAGGGGT TTGCCAATGT GAAGGTGTAT
CGCCCGTAA
 
Protein sequence
MKFIIKLFPE ITIKSQSVRL RFIKILTGNI RNVLKHYDET LAVVRHWDNI EVRAKDENQR 
LAIRDALTRI PGIHHILEVE DVPFTDMHDI FEKALAQYRE QLEGKTFCVR VKRRGKHEFS
SIEVERYVGG GLNQHIESAR VKLTNPDVTV HLEVEDDRLL LIKGRYEGIG GFPIGTQEDV
LSLISGGFDS GVSSYMLMRR GCRVHYCFFN LGGAAHEIGV RQVAHYLWNR FGSSHRVRFV
AINFEPVVGE ILEKVDDGQM GVVLKRMMVR AASKVAERYG VQALVTGEAL GQVSSQTLTN
LRLIDNVSDT LILRPLISYD KEHIINLARQ IGTEDFARTM PEYCGVISKS PTVKAIKAKI
EAEEENFDFS ILDKVVEEAN NVDIREIAQQ TQQEVVEVET VSGFGPNDVI LDIRSVDEQD
DKPLKVEGVD VVSLPFYKLS TKFGDLDQSK TWLLWCERGV MSRLQALYLR EQGFANVKVY
RP