Gene SeSA_A0485 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A0485 
SymbolthiI 
ID6519189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp489407 
End bp490855 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content53% 
IMG OID642745634 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_002113458 
Protein GI194735073 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTTA TCATTAAATT GTTCCCGGAA ATCACTATCA AAAGCCAATC TGTGCGTTTG 
CGCTTTATAA AAATTTTAAC CGGGAACATC CGTAACGTTT TAAAGCACTA CGATGAGACC
CTCGCGGTTG TCCGTCACTG GGATAACATT GAAGTTCGCG CCAAAGATGA AAACCAGCGT
CTGGCGATTC GCGACGCGCT GACCCGCATT CCGGGGATTC ACCATATTCT TGAAGTCGAA
GATGTGCCGT TCACCGATAT GCACGACATT TTCGAGAAAG CGTTGGCGCA GTATCGCGAG
CAGCTTGAAG GCAAAACCTT CTGCGTGCGC GTAAAACGTC GCGGTAAGCA TGAGTTTAGC
TCCATTGAGG TGGAACGCTA TGTCGGCGGC GGATTAAATC AGCATATTGA ATCGGCGCGC
GTGAAGCTCA CTAACCCGGA TGTGACGGTG CATCTGGAAG TGGAAGATGA TCGCCTGCTG
CTGATCAAAG GGCGTTATGA AGGTATTGGC GGTTTCCCGA TTGGCACCCA GGAAGATGTG
CTGTCGCTGA TCTCCGGCGG TTTTGACTCC GGCGTCTCCA GCTATATGCT GATGCGTCGC
GGCTGCCGCG TACACTACTG CTTCTTTAAC CTCGGCGGCG CGGCGCATGA AATCGGCGTT
CGCCAGGTGG CGCATTACCT GTGGAACCGC TTTGGCAGCT CCCATCGCGT GCGTTTTGTG
GCGATTAACT TCGAACCGGT GGTCGGCGAG ATTCTGGAGA AAGTTGACGA CGGCCAGATG
GGCGTGGTGC TCAAACGTAT GATGGTACGC GCGGCGTCGA AAGTGGCGGA ACGTTACGGC
GTACAGGCGC TGGTGACCGG CGAAGCGCTG GGCCAGGTGT CCAGCCAGAC GCTAACCAAC
TTGCGCTTGA TCGATAACGT GTCTGACACG CTGATCCTGC GCCCGCTGAT CTCTTACGAT
AAAGAGCACA TTATCAACCT GGCGCGCCAG ATTGGTACGG AAGATTTTGC CCGTACGATG
CCGGAATACT GTGGCGTGAT TTCAAAAAGT CCGACGGTGA AAGCCATTAA AGCGAAAATT
GAAGCCGAAG AAGAAAATTT CGACTTCAGT ATTCTCGATA AGGTGGTAGA AGAAGCGAAC
AACGTCGATA TTCGTGAAAT CGCCCAGCAG ACCCAGCAGG AGGTGGTGGA AGTAGAAACC
GTGAGCGGTT TTGGCGCCAA CGATGTGATT CTGGATATCC GTTCTGTCGA TGAGCAGGAT
GACAAGCCGC TGAAAGTGGA AGGCGTCGAC GTCGTTTCGC TGCCTTTCTA CAAGCTGAGC
ACTAAATTTG GCGACCTCGA TCAGAGCAAA ACCTGGCTGC TATGGTGCGA ACGCGGCGTA
ATGAGTCGCC TGCAGGCGCT CTATCTGCGC GAGCAGGGGT TTGCCAATGT GAAGGTGTAT
CGTCCGTAA
 
Protein sequence
MKFIIKLFPE ITIKSQSVRL RFIKILTGNI RNVLKHYDET LAVVRHWDNI EVRAKDENQR 
LAIRDALTRI PGIHHILEVE DVPFTDMHDI FEKALAQYRE QLEGKTFCVR VKRRGKHEFS
SIEVERYVGG GLNQHIESAR VKLTNPDVTV HLEVEDDRLL LIKGRYEGIG GFPIGTQEDV
LSLISGGFDS GVSSYMLMRR GCRVHYCFFN LGGAAHEIGV RQVAHYLWNR FGSSHRVRFV
AINFEPVVGE ILEKVDDGQM GVVLKRMMVR AASKVAERYG VQALVTGEAL GQVSSQTLTN
LRLIDNVSDT LILRPLISYD KEHIINLARQ IGTEDFARTM PEYCGVISKS PTVKAIKAKI
EAEEENFDFS ILDKVVEEAN NVDIREIAQQ TQQEVVEVET VSGFGANDVI LDIRSVDEQD
DKPLKVEGVD VVSLPFYKLS TKFGDLDQSK TWLLWCERGV MSRLQALYLR EQGFANVKVY
RP