Gene Hoch_0059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_0059 
Symbol 
ID8542429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp89421 
End bp90782 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content71% 
IMG OID646384846 
Product7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofH subunit 
Protein accessionYP_003264593 
Protein GI262193384 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily
[TIGR03551] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofH subunit 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.17333 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGAAG CCCCGGGCGG GACTCCGCCG GCGAAATCGG ACAACATGAC GAATTCGTCC 
AGCGCGGAAT CTGTCCGGGC GAGCGAGCTG CGCGCCGAGG CCGCGCTGGC CCGGGCGCCG
CGCGCCGAAC GCGCGATCTT GGCCGCGTGC ATGGGCGATG GTTGCCCGAG CTGGCAGCAG
GCGGCCGAGC TGTTCGCGGT CCGCGACGCC GGGCTCGATG CCCTGGTCGC GGTCGCCGAC
GCCGTGCGCC AGCGCCAGGT CGGCGAGGCG GTGAGCTACG TCGTCAACCG CAACGTCAAC
TTCACCAACG TGTGCGTCAA GAAATGCCAG TTCTGCGCCT TCTCGCGCGA GCTGCGCTCG
GAGCAGGGCT ACTACCTCGC GCGCGACCAG GTCATCGCGC GCGTGCGCGA AGCCCACGCC
CTGGGCGCCA CTGAGGTGTG CCTGCAGGCC GGGCTGGCGC CCAGCGCCGA GGGCCGCCAG
TACATCGAGC TATGCCGGGC CGTGAAGCGG GCGGTGCCCG CGATCCACGT GCACGGCTTC
TCGCCCGAAG AGGTCCGCTA CGGCGCTCTG CGCGCACACA TGAGCGTGCG CGCGTATCTC
GAAGAGCTGC TCGACGCTGG GCTGGGCTCG ATGCCGGGCA CATCGGCCGA AATCCTCGAC
GACGAGCTGC GCGCGCGCAT CTCGCCCGGA CGCATCACCA CGGCGCAGTG GATCGATGTG
GTCACCACCG CGCACGCGCT GGGGATTCCG ACCACCTCGA CCATCATGTA CGGCCACGTC
GAAGACGCCG GCCACCGCGC CCGTCATCTG GCCCTGCTGC GCGATATTCA GGCCGATACC
GGCGGTTTCA CGGAATTCGT ACCGCTGTCC TTCATCCCCA CGCACGCGCC CATGTACGCG
CTGTCGACCG TGCCCGGTGT GCGCCCAGGG CCGCGGCCAG ACGACGTCGT GCGCATGCAC
GCGATCGCGC GTCTGATGCT CGGTGCCAGC GTGCGCAACA TCCAGGTCTC GTGGGTCAAA
GAGGGCATGG ACATGGCCGC GCGTCTGCTC GGCTGCGGCG GCAACGACCT CGGCGGCACG
CTGATCAACG AGAGCATCTC GACCGCGGCC GGCGCCGGCC ACGGTCAGCG CCAGAGCCCG
CGCGCGCTGC GCCAGTGCAT CCGGGCTGCC GGGCGCGTGC CCGTGCAGCG CGACACCGGC
TATCGCGTGC TGCGCCGCTT TGGCGACGCC GAGAGCGACG CCGAGAGCGG CGCGGACACG
CTCGACGCGG CCGGCGGCGA CGAGCGCTTT GGCTCGTTCG AGGCGCTCAT CGGCGATCGC
CGCCATCGCT ACCGCGGCAT CGGCGCGCCC GGCGGCGACT GA
 
Protein sequence
MSEAPGGTPP AKSDNMTNSS SAESVRASEL RAEAALARAP RAERAILAAC MGDGCPSWQQ 
AAELFAVRDA GLDALVAVAD AVRQRQVGEA VSYVVNRNVN FTNVCVKKCQ FCAFSRELRS
EQGYYLARDQ VIARVREAHA LGATEVCLQA GLAPSAEGRQ YIELCRAVKR AVPAIHVHGF
SPEEVRYGAL RAHMSVRAYL EELLDAGLGS MPGTSAEILD DELRARISPG RITTAQWIDV
VTTAHALGIP TTSTIMYGHV EDAGHRARHL ALLRDIQADT GGFTEFVPLS FIPTHAPMYA
LSTVPGVRPG PRPDDVVRMH AIARLMLGAS VRNIQVSWVK EGMDMAARLL GCGGNDLGGT
LINESISTAA GAGHGQRQSP RALRQCIRAA GRVPVQRDTG YRVLRRFGDA ESDAESGADT
LDAAGGDERF GSFEALIGDR RHRYRGIGAP GGD