Gene YpsIP31758_3857 details

Gene Information

Locus tag: YpsIP31758_3857
Symbol: thiH
ID: 5385738
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 4343533
End bp: 4344663
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 50%
IMG OID: 640866882
Product: thiamine biosynthesis protein ThiH
Protein accession: YP_001402808
Protein GI: 153948169
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
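
The listed gene and protein lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4343533
end_bp = 4344663

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints "1131 376"
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.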


Plasmid Coverage information

Num covering plasmid clones: 69
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGAAG ACTTCAACCA GCGCTGGCAG CAGTTGGACT GGGACGATAT CTCCTTAACC 
ATCAACAGTA AAAAACCCGC AGACGTTGAA CGGGCGCTGA ATGCCATCAA GCCCACCCGC
GAAGATTTGA TGGCACTCAT TTCCCCTGCC GCACTGGCTT ACCTGGAGCC AATGGCACAG
AAGGCCCAAC AACTGACACG CCAACGTTTT GGAAATACAG TCAGTTTTTA TGTGCCGCTT
TATCTCTCCA ATTTGTGTGC TAATGATTGC ACTTACTGCG GCTTCTCGAT GAGTAATCGC
ATCAAACGCA AGACATTGGA TGAAGCAGAG ATTATCCGAG AGTGTGAAGC TATCAAAGCG
CTGGGTTTTG AGCACTTGCT GCTGGTCACT GGGGAGCACC AAACAAAAGT AGGAATGGAC
TATTTTCGCC GCCACCTCCC TACGATCCGC AGTAGATTCA GTTCGCTAAT GATGGAGGTT
CAGCCATTGG CAGAAGATGA ATATACTGAA TTAAAGGCGC TGGGATTAGA TGGCGTGATG
GTTTATCAGG AAACCTATCA CCCAGCGACG TATCAGCAGC ACCATTTGCG GGGTCATAAG
CAGGATTTTC ACTGGCGGTT GGCAACCCCA GATCGTTTGG GCCGTGCGGG GATCGACAAG
ATCGGATTGG GTGCCTTGAT TGGCTTGTCC AATAGTTGGC GTACCGACTG TTATATGCTG
GCGGAGCATC TGTTTTATTT GCAACAAACT TACTGGCAAA CCCGTTATTC GATCTCTTTC
CCTCGCTTGC GTCCGTGCGC TGGGGGGATC GAACCCGCAT CCATCATGAG TGAGCCACAA
CTGCTGCAAC TGATCTGCGC TTTTCGCTTA TTTGCACCTG ATGTGGAACT GTCGTTATCT
ACTCGAGAAT CGCCTTTCTT CCGCGATAAT GTCATTCCGG TTGCTATCAA TAATGTCAGT
GCCGGGTCAA AAACCCAACC GGGGGGTTAT GCCGATGATC ATCCCGAACT GGAACAATTT
GCGCCCCATG ATAACCGCTC CCCGGAACAG GTCGCACAAG CGTTAACAAA AGCAGGCTTA
CAGCCCGTGT GGAAAGATTG GGATAGCCAT TTAGGCCGTT CATTGCGATA A
 
Protein sequence
MSEDFNQRWQ QLDWDDISLT INSKKPADVE RALNAIKPTR EDLMALISPA ALAYLEPMAQ 
KAQQLTRQRF GNTVSFYVPL YLSNLCANDC TYCGFSMSNR IKRKTLDEAE IIRECEAIKA
LGFEHLLLVT GEHQTKVGMD YFRRHLPTIR SRFSSLMMEV QPLAEDEYTE LKALGLDGVM
VYQETYHPAT YQQHHLRGHK QDFHWRLATP DRLGRAGIDK IGLGALIGLS NSWRTDCYML
AEHLFYLQQT YWQTRYSISF PRLRPCAGGI EPASIMSEPQ LLQLICAFRL FAPDVELSLS
TRESPFFRDN VIPVAINNVS AGSKTQPGGY ADDHPELEQF APHDNRSPEQ VAQALTKAGL
QPVWKDWDSH LGRSLR
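
The sequences are printed in space-separated blocks of ten. As a minimal sketch of how they might be checked, the code below strips the spacing from the first line of the gene sequence, computes its GC fraction, and translates it. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs only in which codons may act as start codons).

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at a stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied as displayed.
first_line = "ATGTCTGAAG ACTTCAACCA GCGCTGGCAG CAGTTGGACT GGGACGATAT CTCCTTAACC"
dna = clean(first_line)

print(len(dna))          # 60
print(translate(dna))    # MSEDFNQRWQQLDWDDISLT
print(gc_fraction(dna))  # GC fraction of these 60 bases only
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence through the same functions would give the whole-gene GC fraction and the complete protein.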