Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pcar_0340 |
Symbol | thiH |
ID | 3723473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pelobacter carbinolicus DSM 2380 |
Kingdom | Bacteria |
Replicon accession | NC_007498 |
Strand | + |
Start bp | 425109 |
End bp | 426233 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637749924 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_355770 |
Protein GI | 77917955 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 66 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATTTTC TCGACGAATT CAACAGCTAC GATCGCGGCG AGCTTGCGCA ACGGATTATG TCATGCCGGG CTGCCGATGT GGAACGAGCG CTGACGGCGG AACATCTGCG CAGTGCCGAT TTTATGGCGC TGCTGTCGCC GGCGGCGCAC GGTTACCTGG AGTCGATGGC ACAAAAAGCC CACCGTCTGA CGCAGCAGCG TTTCGGCAAA ACCATCCAGC TCTTTGCGCC GCTGTACATC TCCAACGAAT GCAGCAACGG CTGCCTGTAC TGCGGCTTCA ACGCCGCCAA CAAGGTCGCG CGGCGCACCT TGAGCCTGGA CGAAGTCGAA GCCGAGGCCC GCATCCTGCG CCAGCGCGGT TTCCGTCATG TGCAGATTCT TACCGGTGAA GCTCCCCGGG CTGTGGATAA CGATATGCTG GCAGCGGTGG TCCGCCGGAT TCGGCCATTG TTTTCCGCCA TCAGCATCGA AGTCTATCCC ATGGAAGAAG CCGGCTACCG ACAGATGGTC GATGCCGGCG TCGACAACCT GACCGTTTAT CAGGAGACGT ACGATCGCGA TCTGTACGAC AAGCTGCATC CCTTCGGTCG TAAAAAGGAT TTTGACTGGC GTCTGACCAC TCCCGACCGC GGCGGTGCGG CGGGACTGCG TTCCATCGGT ATCGGTGCCT TGTTGGGGCT GAGCGACTGG CGCGTCGAAG GTGTGCTGGT CGGGTTGCAC GCGCGACACC TGGCGCGTAC CTGGTGGCGC AGTCGGGTGA ATGTATCCTT TCCGCGCATG CGGCCCGCCG GGGGCGGGTT CAATCCGCTG GCGCCGGTAT CCGACAGTGC CCTGGTGCAA CTGATCTGCG CGCTGCGACT GTTGATACCC GATGCCGGGC TGGTGCTGTC GACCCGCGAA AGCTCCAGTT TGCGCGATCA TCTGCTGCCT TTGGGTATCA CCCAGCTGAG TGCCGGCTCC TGTACTGCCC CGGGCGGGTA TGGCGACGAG GGGCACGGTA GCGAGCAGTT TGCCATTGAC GACGACCGCG ACGCCGAACA GGTTTGCGCC ATGCTGCGCG CCCAGGGATA TGAGCCGGTA TGGAAGGATT GGGATCGCAC CTTTATGGAT CGGCAGGCCG TTTGA
|
Protein sequence | MNFLDEFNSY DRGELAQRIM SCRAADVERA LTAEHLRSAD FMALLSPAAH GYLESMAQKA HRLTQQRFGK TIQLFAPLYI SNECSNGCLY CGFNAANKVA RRTLSLDEVE AEARILRQRG FRHVQILTGE APRAVDNDML AAVVRRIRPL FSAISIEVYP MEEAGYRQMV DAGVDNLTVY QETYDRDLYD KLHPFGRKKD FDWRLTTPDR GGAAGLRSIG IGALLGLSDW RVEGVLVGLH ARHLARTWWR SRVNVSFPRM RPAGGGFNPL APVSDSALVQ LICALRLLIP DAGLVLSTRE SSSLRDHLLP LGITQLSAGS CTAPGGYGDE GHGSEQFAID DDRDAEQVCA MLRAQGYEPV WKDWDRTFMD RQAV
|
| |