Gene Shel_16500 details

Gene Information

Locus tag: Shel_16500
Symbol: thiH
ID: 8395540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Slackia heliotrinireducens DSM 20476
Kingdom: Bacteria
Replicon accession: NC_013165
Strand:
Start bp: 1855460
End bp: 1856914
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 61%
IMG OID: 644986404
Product: thiamine biosynthesis protein ThiH
Protein accession: YP_003144018
Protein GI: 257064346
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
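
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following Python sketch is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1855460
end_bp = 1856914
gene_length_bp = 1455
protein_length_aa = 484

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)            # True
print(gene_length_bp % 3 == 0)              # True
print(coding_codons == protein_length_aa)   # True
```

Under these assumptions the span (1455 bp), the codon count (485) and the protein length (484 aa) agree with one another.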


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGAAC ACGTCTACAA CCCGAGTTCG CCGCATGCCG ACGAATTCAT CAATCACCAG 
GAGATTCTCG ATACGCTTCA GTACGCCCAG GAGCACAAGG ACGACCTTGA GCTGTGCCGT
AGCATCCTGA AGAAGGCACA CCCCAACCTG GCGCCGAAGA AGGAGCATTG CACCTGCATC
ACGCATCGCG AGGCGGCTGT GCTGTTGGCC TGCGAGGACC CCGAAATCAA CGAGGAAATC
AAGACGCTGG CGCGTCAGAT CAAGCTTGCC TACTATGGCA ATCGCATCGT GCTGTTCGCG
CCGCTGTACC TTTCCAACTA CTGCGTGAAC GGTTGCCTGT ACTGCCCATA CCACGCCAAG
AACCGCGAGA TCCCCCGCCG CAAGCTGACT CAGGACGAGA TCAGGGCGGA AGTCATCGCA
CTGCAGGACA TGGGGCACAA GCGCCTGGCC ATCGAAGCGG GCGAGGATCC CAAGCACAAT
CCCATCGAGT ACATCCTGGA GTCGATGCAG ACCATCTACT CCATCAAGCA CAAGAACGGC
GCCATCCGCC GTGTGAACGT CAACATCGCG GCCACGACCG TCGAAGAGTA CCGCATGCTC
AAAGAGGCCG AGATCGGCAC GTACATCCTG TTCCAGGAAA CCTATAACCG CGCCCGTTAC
GAGGAGCTGC ATCCCACGGG ACCGAAGTCC GATTACGAAT GGCACACCGA GGCGCATGAC
CGTGCCCAGG AGGCCGGCAT CGACGACGTG GGTCTGGGCG TTCTGTTCGG TCTGGAAGGC
TACGCCTACG AGTTCTGCGG ACTCATCATG CATGCCGAGC ACCTCGAGGC GCGTTTCGGC
GTGGGTCCGC ACACCATCAG CGTGCCCCGC GTGAAGCCCG CCATGGACAT TGACCCCGAC
GTGTTCGACA ACGGTATTCC CGACGAGATG TTCGAGAAGA TCATCGCCCT CATCCGCATC
ACCGTGCCTT ACACCGGCAT GATCATCAGC ACCCGCGAGT CGGAGGCCGT TCGTTCTGCC
GCGCTGCAGT ACGGCATCTC GCAGATTTCG GGCGGTTCGC GCACCAGCGT GGGCGGCTAC
ACCGAGGAGG AGCGTCCCCA CGACACCGAG CAGTTCGACG TGTCCGACCA GCGTACGCTC
GACGAGGTCA TTGCCTGGCT TATGGATTGC GGCCACATCC CCAGCTTCTG CACGGCATGC
TACCGCGCAG GGCGCACGGG CGACCGCTTC ATGAGCTTCT GCAAATCGGG CGAGATTCTG
AACTACTGCC ATCCGAACGC GCTTATGACG CTGTCCGAGT ACCTGGTCGA CTACGCGACC
CCGGCAACGG CCGAGCGCGG CTGGGAGATG ATTCGCGAGG AGCTTACCAA GATCCCCGAC
GCCCGCAGGC GCGAGCTGTG CGCGGCCCAC ATCGAGGAGA TCCGCACCGG CAACGCCCGC
GACTTCAGGT TCTAG
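
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. As a minimal sketch of how the GC content reported in the record could be recomputed, the hypothetical helpers below (`clean_sequence` and `gc_percent`, both names invented here) strip the layout and count G and C bases. Only the first row of the sequence is used as sample input; pasting the full sequence in its place would give the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record (sample input only).
first_row = "ATGACGGAAC ACGTCTACAA CCCGAGTTCG CCGCATGCCG ACGAATTCAT CAATCACCAG"
seq = clean_sequence(first_row)

print(len(seq))                    # number of bases after removing layout
print(round(gc_percent(seq), 1))   # GC percentage of this row only
```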
 
Protein sequence
MTEHVYNPSS PHADEFINHQ EILDTLQYAQ EHKDDLELCR SILKKAHPNL APKKEHCTCI 
THREAAVLLA CEDPEINEEI KTLARQIKLA YYGNRIVLFA PLYLSNYCVN GCLYCPYHAK
NREIPRRKLT QDEIRAEVIA LQDMGHKRLA IEAGEDPKHN PIEYILESMQ TIYSIKHKNG
AIRRVNVNIA ATTVEEYRML KEAEIGTYIL FQETYNRARY EELHPTGPKS DYEWHTEAHD
RAQEAGIDDV GLGVLFGLEG YAYEFCGLIM HAEHLEARFG VGPHTISVPR VKPAMDIDPD
VFDNGIPDEM FEKIIALIRI TVPYTGMIIS TRESEAVRSA ALQYGISQIS GGSRTSVGGY
TEEERPHDTE QFDVSDQRTL DEVIAWLMDC GHIPSFCTAC YRAGRTGDRF MSFCKSGEIL
NYCHPNALMT LSEYLVDYAT PATAERGWEM IREELTKIPD ARRRELCAAH IEEIRTGNAR
DFRF
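
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is one way to check that on short stretches. It is not the database's own method, and the names `CODON_TABLE` and `translate` are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene begins with ATG, that difference does not matter here.

```python
# Compact codon table: codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 15 bases of the gene sequence from the record.
print(translate("ATGACGGAAC ACGTCTACAA CCCGAGTTCG"))  # MTEHVYNPSS
print(translate("GACTTCAGGT TCTAG"))                  # DFRF*
```

The two outputs match the first ten residues (MTEHVYNPSS) and the last four residues (DFRF) of the protein sequence, with the trailing `*` marking the TAG stop codon.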