Gene Hoch_2331 details

Gene Information

Locus tag: Hoch_2331
Symbol:
ID: 8544717
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 3238746
End bp: 3239735
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 67%
IMG OID: 646387035
Product: aldo/keto reductase
Protein accession: YP_003266766
Protein GI: 262195557
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
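
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record.
START_BP = 3238746
END_BP = 3239735


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 990 329
```

Under these assumptions the result matches the listed Gene Length (990 bp) and Protein Length (329 aa).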


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.196419
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.2019
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCGAC GCAAGCTCGG GGCAGAGGGT CCCGAAGTCT CCGCGCAAGG CCTGGGATGC 
ATGGGCATGA GCGCGTTTTA CGGAAACGGC GACGACGCTG AATCGATCGC CGTCATGCAC
CGAGCGCTCG AGCTGGGCGT GAATTTTTTC GATACTGCGG ACATGTATGG CCCTCACACC
AACGAGAAGT TGGTGGGCCG CGCGATCGCC GATCGCCGCG ACCAGGTCTT CCTGGCCACC
AAGTTCGGCA TCGTCTTCGA CCCCGAGCGT CCGCGCGAGC GCAGCATCGA CGGCTCGCCC
GCGTACCTGC GCAACGCCTG CGACGCCAGC TTGCAACGGC TCGGTGTCGA CCACATCGAC
CTCTACTATC AGCACCGGGT GGACGCGAAT GTCCCCATCG AGGAGACGGT CGGCGCCATG
GCCGAGCTGG TCAAGGCCGG CAAGGTTCGC TACCTCGGCC TATCCGAGGC CGGCCCCAAG
ACCCTGCGGC GCGCTTGCGA GGTCCACCCC ATCACCGCCT TGCAGACCGA ATACTCGCTG
TGGAGCCGCG ATCCCGAGGA CGAGATCCTG GCCACCTGCC GCGAGCTCGG CGTCGGCTTC
GTCGCCTACA GCCCGCTCGG ACGCGGCTTC CTCACCGGCC AGATCACCTC GCCCAGCGAC
CTCGCCGAAG ATGACTGGCG CCGTCACAGC CCGCGCTTTC AGGGCGAGAA CTTCGCAAAG
AATCTCGCCA TGGTGTCCAA AATCCAGGAG ATCGCCGCGG AGAAGGGCTG CACCGCGGCG
CAGCTCGCGC TGGCCTGGGT GATGGCCCAG GGCGACGACA TCGTGCCCAT CCCGGGGACC
AAGCGCAAAC ACTACCTCGA GGACAACGCC GGCGCGTGCG AGCTCGCGCT GAGCGACGAG
GACAAGGCGC GCATCGAAGC CGTGGCCCCG CCCGGCGCCG CGGCCGGCAC GCGCTACCCC
GAGGCCTTGA TGAAGGGCGT CAGCACCTGA
 
Protein sequence
MKRRKLGAEG PEVSAQGLGC MGMSAFYGNG DDAESIAVMH RALELGVNFF DTADMYGPHT 
NEKLVGRAIA DRRDQVFLAT KFGIVFDPER PRERSIDGSP AYLRNACDAS LQRLGVDHID
LYYQHRVDAN VPIEETVGAM AELVKAGKVR YLGLSEAGPK TLRRACEVHP ITALQTEYSL
WSRDPEDEIL ATCRELGVGF VAYSPLGRGF LTGQITSPSD LAEDDWRRHS PRFQGENFAK
NLAMVSKIQE IAAEKGCTAA QLALAWVMAQ GDDIVPIPGT KRKHYLEDNA GACELALSDE
DKARIEAVAP PGAAAGTRYP EALMKGVST
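
The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a short script. The following is a minimal sketch, not the method used to build this record: it relies on the amino acid assignments of NCBI translation table 11 (which are the same as those of the standard code), ignores the alternative start codons that table 11 allows (this gene starts with ATG), and uses hypothetical helper names. Whitespace in the 10-base blocks is stripped before processing.

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence.
print(translate("ATGAAGCGAC GCAAGCTCGG GGCAGAGGGT"))  # prints: MKRRKLGAEG
```

Pasting the full gene sequence into `translate` and `gc_percent` would allow the result to be compared with the protein sequence and the GC content field listed in this record.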