Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpsIP31758_3857 |
Symbol | thiH |
ID | 5385738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 4343533 |
End bp | 4344663 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640866882 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001402808 |
Protein GI | 153948169 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 69 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGAAG ACTTCAACCA GCGCTGGCAG CAGTTGGACT GGGACGATAT CTCCTTAACC ATCAACAGTA AAAAACCCGC AGACGTTGAA CGGGCGCTGA ATGCCATCAA GCCCACCCGC GAAGATTTGA TGGCACTCAT TTCCCCTGCC GCACTGGCTT ACCTGGAGCC AATGGCACAG AAGGCCCAAC AACTGACACG CCAACGTTTT GGAAATACAG TCAGTTTTTA TGTGCCGCTT TATCTCTCCA ATTTGTGTGC TAATGATTGC ACTTACTGCG GCTTCTCGAT GAGTAATCGC ATCAAACGCA AGACATTGGA TGAAGCAGAG ATTATCCGAG AGTGTGAAGC TATCAAAGCG CTGGGTTTTG AGCACTTGCT GCTGGTCACT GGGGAGCACC AAACAAAAGT AGGAATGGAC TATTTTCGCC GCCACCTCCC TACGATCCGC AGTAGATTCA GTTCGCTAAT GATGGAGGTT CAGCCATTGG CAGAAGATGA ATATACTGAA TTAAAGGCGC TGGGATTAGA TGGCGTGATG GTTTATCAGG AAACCTATCA CCCAGCGACG TATCAGCAGC ACCATTTGCG GGGTCATAAG CAGGATTTTC ACTGGCGGTT GGCAACCCCA GATCGTTTGG GCCGTGCGGG GATCGACAAG ATCGGATTGG GTGCCTTGAT TGGCTTGTCC AATAGTTGGC GTACCGACTG TTATATGCTG GCGGAGCATC TGTTTTATTT GCAACAAACT TACTGGCAAA CCCGTTATTC GATCTCTTTC CCTCGCTTGC GTCCGTGCGC TGGGGGGATC GAACCCGCAT CCATCATGAG TGAGCCACAA CTGCTGCAAC TGATCTGCGC TTTTCGCTTA TTTGCACCTG ATGTGGAACT GTCGTTATCT ACTCGAGAAT CGCCTTTCTT CCGCGATAAT GTCATTCCGG TTGCTATCAA TAATGTCAGT GCCGGGTCAA AAACCCAACC GGGGGGTTAT GCCGATGATC ATCCCGAACT GGAACAATTT GCGCCCCATG ATAACCGCTC CCCGGAACAG GTCGCACAAG CGTTAACAAA AGCAGGCTTA CAGCCCGTGT GGAAAGATTG GGATAGCCAT TTAGGCCGTT CATTGCGATA A
|
Protein sequence | MSEDFNQRWQ QLDWDDISLT INSKKPADVE RALNAIKPTR EDLMALISPA ALAYLEPMAQ KAQQLTRQRF GNTVSFYVPL YLSNLCANDC TYCGFSMSNR IKRKTLDEAE IIRECEAIKA LGFEHLLLVT GEHQTKVGMD YFRRHLPTIR SRFSSLMMEV QPLAEDEYTE LKALGLDGVM VYQETYHPAT YQQHHLRGHK QDFHWRLATP DRLGRAGIDK IGLGALIGLS NSWRTDCYML AEHLFYLQQT YWQTRYSISF PRLRPCAGGI EPASIMSEPQ LLQLICAFRL FAPDVELSLS TRESPFFRDN VIPVAINNVS AGSKTQPGGY ADDHPELEQF APHDNRSPEQ VAQALTKAGL QPVWKDWDSH LGRSLR
|
| |