Gene SeD_A0466 details


Gene Information

Locus tag: SeD_A0466
Symbol: thiI
ID: 6871042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 481192
End bp: 482640
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 53%
IMG OID: 642783692
Product: thiamine biosynthesis protein ThiI
Protein accession: YP_002214379
Protein GI: 198242480
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0301] Thiamine biosynthesis ATP pyrophosphatase
TIGRFAM ID: [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI
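
The coordinates and lengths listed in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions: that Start bp and End bp are 1-based, inclusive positions, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names span_length and coding_length_aa are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 481192
end_bp = 482640
gene_length_bp = 1449
protein_length_aa = 482


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))      # 1449, matches Gene Length
print(coding_length_aa(gene_length_bp))   # 482, matches Protein Length
```

Under these assumptions the record is internally consistent: the interval spans 1449 bp, which is 483 codons, or 482 residues plus one stop codon.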


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 97
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTTA TCATTAAATT GTTCCCGGAA ATCACTATCA AAAGCCAATC TGTGCGTTTG 
CGCTTTATAA AAATTTTAAC CGGGAACATC CGTAACGTTT TAAAGCACTA CGATGAGACC
CTCGCGGTTG TCCGTCACTG GGATAACATT GAAGTTCGCG CCAAAGATGA AAACCAGCGT
CTGGCGATTC GCGACGCGCT GACCCGCATC CCGGGGATTC ACCATATTCT TGAAGTCGAA
GATGTGCCGT TCACCGATAT GCACGACATT TTCGAGAAAG CGTTGGCGCA GTATCGCGAA
CAGCTTGAAG GCAAAACCTT CTGCGTGCGG GTAAAACGTC GCGGTAAGCA TGAGTTTAGC
TCCATTGAAG TGGAGCGCTA TGTTGGCGGC GGATTAAATC AGCATATTGA ATCGGCGCGC
GTGAAGCTCA CTAACCCGGA TGTGACGGTG CATCTGGAAG TGGAAGATGA TCGCCTGCTG
CTGATCAAAG GGCGTTATGA AGGTATTGGC GGTTTCCCGA TTGGCACCCA GGAAGATGTG
CTGTCGCTGA TCTCCGGCGG TTTTGACTCC GGCGTCTCCA GCTATATGCT GATGCGTCGC
GGCTGTCGCG TACACTACTG CTTCTTTAAC CTTGGCGGCG CGGCGCATGA AATCGGCGTT
CGCCAGGTGG CGCATTACCT GTGGAACCGC TTTGGCAGCT CCCATCGCGT GCGTTTTGTG
GCGATTAACT TCGAACCGGT GGTCGGCGAG ATTCTGGAGA AAGTTGACGA CGGCCAGATG
GGCGTGGTGC TCAAACGTAT GATGGTACGC GCGGCGTCGA AAGTGGCGGA ACGTTACGGC
GTACAGGCGC TGGTGACCGG CGAAGCGCTG GGCCAGGTAT CCAGCCAGAC GCTAACCAAC
TTGCGCTTGA TCGATAACGT GTCTGACACG CTGATCCTGC GCCCGCTGAT CTCTTACGAT
AAAGAGCACA TTATCAACCT GGCGCGCCAG ATTGGTACGG AAGATTTTGC CCGTACGATG
CCGGAATACT GTGGCGTGAT TTCAAAAAGT CCGACGGTGA AAGCCATTAA AGCGAAAATT
GAAGCCGAAG AAGAAAATTT CGACTTCAGT ATTCTCGATA AGGTGGTAGA AGAAGCGAAC
AACGTCGATA TTCGTGAAAT CGCCCAGCAG ACCCAGCAGG AGGTGGTGGA AGTAGAAACC
GTGAGCGGTT TCGGCCCGAA CGACGTGATT CTGGATATCC GTTCTGTGGA TGAGCAGGAT
GACAAGCCGC TGAAAGTGGA AGGTGTGGAC GTCGTTTCGC TGCCGTTCTA CAAGCTGAGC
ACTAAATTTG GCGACCTCGA TCAGAGCAAA ACCTGGCTGC TGTGGTGCGA ACGCGGGGTA
ATGAGCCGCC TGCAGGCGCT CTATCTGCGC GAGCAGGGCT TTGAAAACGT GAAAGTGTAT
CGTCCGTAA
 
Protein sequence
MKFIIKLFPE ITIKSQSVRL RFIKILTGNI RNVLKHYDET LAVVRHWDNI EVRAKDENQR 
LAIRDALTRI PGIHHILEVE DVPFTDMHDI FEKALAQYRE QLEGKTFCVR VKRRGKHEFS
SIEVERYVGG GLNQHIESAR VKLTNPDVTV HLEVEDDRLL LIKGRYEGIG GFPIGTQEDV
LSLISGGFDS GVSSYMLMRR GCRVHYCFFN LGGAAHEIGV RQVAHYLWNR FGSSHRVRFV
AINFEPVVGE ILEKVDDGQM GVVLKRMMVR AASKVAERYG VQALVTGEAL GQVSSQTLTN
LRLIDNVSDT LILRPLISYD KEHIINLARQ IGTEDFARTM PEYCGVISKS PTVKAIKAKI
EAEEENFDFS ILDKVVEEAN NVDIREIAQQ TQQEVVEVET VSGFGPNDVI LDIRSVDEQD
DKPLKVEGVD VVSLPFYKLS TKFGDLDQSK TWLLWCERGV MSRLQALYLR EQGFENVKVY
RP