Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ppro_1492 |
Symbol | thiH |
ID | 4572167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pelobacter propionicus DSM 2379 |
Kingdom | Bacteria |
Replicon accession | NC_008609 |
Strand | + |
Start bp | 1600358 |
End bp | 1601779 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639755537 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_901166 |
Protein GI | 118579916 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTCCC TGCCATCCAT GAAACTGAGC GCAGGCGCCG CCGACTTCAT CGACGAGGCG TATCTGTTCG GCCTTCTGGG AGGCAAGCGT CCCGAAGCGG CACGGGTTCG CGACATAATC GCCAAGAGCC TTGAAAAGCA GGCACTGGGT GTGGAGGAAA CCGCGCAGCT TCTGCTGGCG GATACGCCTG AGCTGGTCGA GGAGATCTTT GCCGCGGCCC GCGAACTGAA AGAAAAGGTG TACGGAAACA GGATCGTCCT GTTCGCCCCG CTCTACATCG GGAATAAATG TGTGAACGAC TGCGCCTACT GCGCCTTCAA GCGGAGCAAT GCCAGCGCAA TCCGGCGCAC GCTGACCGAA GGCGAGATCC GCGGGCAGGT CGAAGCCCTG GAAAGCAAGG GGCACAAACG GCTGATCCTG GTGTTCGGCG AACACCAGGC CTATGATGCC GAGTTCATCG CCCAGAGCGT CAGGACCGTC TACGACACCA GGGGAAAGGG TGAGATCAGG CGGGTGAACA TCAATGCCGC CCCCCTTGAC CGGGAGGGGT ATGCCATTGT CAAGGCCGCC GGGATCGGCA CCTACCAGAT CTTTCACGAA ACATACCATC ACGAAACCTA TGCCCGCCTG CATCCGCGGG GCACCCGCAA GGCAAACTAC CTCTACCGAC TGGACGGACT CAACCGGGCC TTTGAGGCCG GCTGCGACGA TGTGGGGCTG GGGGTGCTCT TTGGTCTTGT GGATTGGAGA TTCGAGGTGC TGGGCCTGGT CAGCCACGCC CTTCACCTGC AGCGGCGCTA CAATGTGGGT CCCCACACCA TCAGTTTTCC GCGCCTGCGG CCTGCCGCGG GTGTTGCGCT GGACAGGGAG CTCTTGGTTG GCGACCAGGA ATTCAAGCGG ATCATCGCCA TACTCCGTCT GGCTGTGCCC TATACCGGGC TCATCCTGAC CGCCCGGGAA AATGCCCAGG TACGCCGGGA ATGCATCTCC TTAGGCGTTT CCCAGATCGA CGCCGGCAGC CGCATCGAGC TGGGCGGCTA TACCGAGGCC GGGGATGCTC AGGTGATGGA GCGGGAACAG TTCTCACTTG GCGACGTGCG TTCCCTGGAC GAGGTCATGG CCGAACTGAT GGGAGCTGGC TATATTCCCA GTTTCTGCAC CTCCTGCTAC CGGGTCGGCA GAACCGGTGA ACATTTCATG GAATTCAGCA TCCCCGGCTT CATAAAGGAA TTCTGCACGC CCAATGCCCT GCTGACGCTG CAGGAATACT TGCTGGACTA TGCGTCACCG TCCACCAGGG AAATCGGAGA CAGGCTGATA GCGGAAGAAC TGGCCAAGCT GGTCGATGGC CATGGTAAAC AGGGCGTGGT GCAGAGGCTC AAAGAAATTA GGGAGGGCGC ACAACGCGAC CACTGTTTCT GA
|
Protein sequence | MSSLPSMKLS AGAADFIDEA YLFGLLGGKR PEAARVRDII AKSLEKQALG VEETAQLLLA DTPELVEEIF AAARELKEKV YGNRIVLFAP LYIGNKCVND CAYCAFKRSN ASAIRRTLTE GEIRGQVEAL ESKGHKRLIL VFGEHQAYDA EFIAQSVRTV YDTRGKGEIR RVNINAAPLD REGYAIVKAA GIGTYQIFHE TYHHETYARL HPRGTRKANY LYRLDGLNRA FEAGCDDVGL GVLFGLVDWR FEVLGLVSHA LHLQRRYNVG PHTISFPRLR PAAGVALDRE LLVGDQEFKR IIAILRLAVP YTGLILTARE NAQVRRECIS LGVSQIDAGS RIELGGYTEA GDAQVMEREQ FSLGDVRSLD EVMAELMGAG YIPSFCTSCY RVGRTGEHFM EFSIPGFIKE FCTPNALLTL QEYLLDYASP STREIGDRLI AEELAKLVDG HGKQGVVQRL KEIREGAQRD HCF
|
| |