Gene Hoch_3082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3082 
Symbol 
ID8545470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4253738 
End bp4254769 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content68% 
IMG OID646387753 
Productaldo/keto reductase 
Protein accessionYP_003267481 
Protein GI262196272 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0228208 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.204358 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTACA CCAAACTCGG AAACACGGGC CTCACCGTGT CGCGCATCTG TCTCGGCTGC 
ATGAGCTACG GCGAGGCCGG AGACGGCAGC TCGAGCCTCA AACACGCGTG GACGCTCGAC
GAGGACACCA GCCGCGGGTT CTTCCGGCGC GCGCTCGAGG CCGGCATCAA CTTCTTCGAC
ACGGCCAACG GCTACTCCGA GGGCAGCTCC GAAGAGTTTC TCGGACGGGC GATGCAGGCG
CTGGCGCGAC GCGATGAGGT GGTGATCGCG ACCAAGGCGT TTTTGCCGTG GCGCGAGAGT
CCAAATACCG GCGGTCTGTC GCGCAAGGCG CTGTTTCAGG CGATCGACGA CAGTCTGCGC
CGGCTGGGCA TGGACTACGT GGACCTGTAC CAGATCCACC GCTGGGATTA CGAGACGCCG
ATCGAGGAGA CCATGGAGGC GCTCCACGAC ATCGTGAAGG CCGGCAAGGC GCGCTACATC
GGCGCCTCGT CGATGCGGGC GTGGGAGTTT TTCAAGGCCC AGAGCACGGC CGAGCGCCAC
GGCTGGACCA AGTTCGTGGC CATGCAGAAC CACCTCAATC TGCTGTATCG CGAGGAGGAG
CGCGAGATGA TGCCGCTGTG CGAGGACCTC GGGGTCGGCG TGATTCCCTG GAGTCCGCTG
GCGCGCGGGC GCCTGGCGCG GCCGTGGGAC ACCCACACCG AGCGCTCGCA GAGCGACCGT
TTTGGCAAAC GCATCTACGC GGCCACCGAG GACAACGATC GCGAGATCGT CGAGCGCGTG
GGCGCGGTGG CCGGCGAGCG CGGGGTCTCG CGCGCGCAGG TCGCGCTGGC CTGGCTGCTG
GGCACGCCGG CGGTGGCGGC GCCCATCATC GGCGCCTCCA AGCTCGCGCA CCTCGAGGAC
GCGATCGCGG CGGTCGATGT CGAGCTGAGC GATGCGGAGC GCGAGCAGCT CGAGGCGCCG
TATCGCCCGC ATCCGGTGGT CGGCCTGGCC GGTCCGCTGC CGCCGCCGAA GAGCGTGAGC
GTGCTGGACT GA
 
Protein sequence
MKYTKLGNTG LTVSRICLGC MSYGEAGDGS SSLKHAWTLD EDTSRGFFRR ALEAGINFFD 
TANGYSEGSS EEFLGRAMQA LARRDEVVIA TKAFLPWRES PNTGGLSRKA LFQAIDDSLR
RLGMDYVDLY QIHRWDYETP IEETMEALHD IVKAGKARYI GASSMRAWEF FKAQSTAERH
GWTKFVAMQN HLNLLYREEE REMMPLCEDL GVGVIPWSPL ARGRLARPWD THTERSQSDR
FGKRIYAATE DNDREIVERV GAVAGERGVS RAQVALAWLL GTPAVAAPII GASKLAHLED
AIAAVDVELS DAEREQLEAP YRPHPVVGLA GPLPPPKSVS VLD