Gene SNSL254_A0472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0472 
SymbolthiI 
ID6484159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp482275 
End bp483723 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content53% 
IMG OID642735893 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_002039667 
Protein GI194446549 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTTA TCATTAAATT GTTCCCGGAA ATCACTATCA AAAGCCAATC TGTGCGTTTG 
CGCTTTATAA AAATTTTAAC CGGGAACATC CGTAACGTTT TAAAGCACTA CGATGAGACC
CTCGCGGTTG TCCGTCACTG GGATAACATT GAAGTTCGCG CCAAAGATGA AAACCAGCGT
CTGGCGATTC GCGACGCGCT GACCCGCATT CCGGGGATCC ACCATATTCT TGAAGTCGAA
GATGTGCCGT TCACCGATAT GCACGACATT TTCGAGAAAG CGTTGGCGCA GTATCGCGAG
CAGCTTGAAG GCAAAACCTT CTGCGTGCGC GTAAAACGTC GCGGTAAGCA TGAGTTTAGC
TCCATTGAGG TGGAGCGCTA TGTCGGCGGC GGATTAAATC AGCATATTGA ATCGGCGCGC
GTGAAGCTCA CTAACCCGGA TGTGACGGTG CATCTGGAAG TGGAAGATGA TCGCCTGCTG
CTGATCAAAG GGCGTTATGA AGGTATTGGC GGTTTCCCGA TTGGCACCCA GGAAGATGTG
CTGTCGCTGA TCTCCGGCGG TTTTGACTCC GGCGTCTCCA GCTATATGCT GATGCGTCGC
GGCTGTCGCG TACACTACTG CTTCTTTAAC CTTGGCGGCG CGGCGCATGA AATCGGCGTT
CGCCAGGTGG CGCATTACCT GTGGAACCGC TTTGGCAGCT CCCATCGCGT GCGTTTTGTG
GCGATTAACT TTGAACCAGT GGTCGGCGAG ATTCTGGAGA AAGTCGACGA CGGCCAGATG
GGCGTGGTGC TCAAACGTAT GATGGTACGC GCGGCGTCGA AAGTGGCGGA ACGTTACGGC
GTACAGGCGC TGGTGACCGG CGAAGCGCTG GGCCAGGTGT CCAGCCAGAC GCTAACCAAC
TTGCGCTTGA TCGATAACGT GTCTGACACG CTGATCCTGC GCCCGCTGAT CTCTTACGAT
AAAGAGCACA TTATCAACCT GGCGCGCCAG ATTGGTACGG AAGATTTTGC CCGTACGATG
CCGGAATACT GTGGCGTGAT TTCAAAAAGT CCGACGGTGA AAGCCATTAA AGCGAAAATT
GAAGCCGAAG AAGAAAACTT CGACTTCAGT ATTCTCGATA AGGTGGTAGA AGAAGCGAAC
AACGTCGATA TTCGTGAAAT CGCCCAGCAG ACCCAGCAGG AGGTGGTGGA AGTAGAAACC
GTGAGCGGTT TCGGCCCGAA CGACGTGATT CTGGATATCC GTTCTGTCGA TGAGCAGGAT
GACAAGCCGC TGAAAGTGGA AGGCGTCGAC GTCGTTTCGC TGCCTTTCTA CAAGCTGAGC
ACTAAATTTG GCGACCTCGA TCAGAGCAAA ACCTGGCTGC TATGGTGCGA ACGCGGCGTA
ATGAGTCGCC TGCAGGCGCT CTATCTGCGC GAGCAGGGGT TTGCCAATGT GAAGGTGTAT
CGCCCGTAA
 
Protein sequence
MKFIIKLFPE ITIKSQSVRL RFIKILTGNI RNVLKHYDET LAVVRHWDNI EVRAKDENQR 
LAIRDALTRI PGIHHILEVE DVPFTDMHDI FEKALAQYRE QLEGKTFCVR VKRRGKHEFS
SIEVERYVGG GLNQHIESAR VKLTNPDVTV HLEVEDDRLL LIKGRYEGIG GFPIGTQEDV
LSLISGGFDS GVSSYMLMRR GCRVHYCFFN LGGAAHEIGV RQVAHYLWNR FGSSHRVRFV
AINFEPVVGE ILEKVDDGQM GVVLKRMMVR AASKVAERYG VQALVTGEAL GQVSSQTLTN
LRLIDNVSDT LILRPLISYD KEHIINLARQ IGTEDFARTM PEYCGVISKS PTVKAIKAKI
EAEEENFDFS ILDKVVEEAN NVDIREIAQQ TQQEVVEVET VSGFGPNDVI LDIRSVDEQD
DKPLKVEGVD VVSLPFYKLS TKFGDLDQSK TWLLWCERGV MSRLQALYLR EQGFANVKVY
RP