Gene Hoch_3497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3497 
Symbol 
ID8545886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4824718 
End bp4825755 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content72% 
IMG OID646388165 
ProductAlcohol dehydrogenase zinc-binding domain protein 
Protein accessionYP_003267892 
Protein GI262196683 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.157273 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.487323 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAACG TCGTCATCCA CCGCGCCGGC GGCTACGAGC AGCTCCGCAT CGAGGAGCGG 
GTCGAGCCCG TCCCCGGCCC CGGCGAGGCG CTGGTCGAGA GCCACTTCGC CGGCGTCAAT
TTCGCGGACT GCGTGGTCCG CATGGGCCTC TACGCCTCGG CCAAGAAGTA CGTGGGCTGG
CCGATCACGC CCGGCTTCGA GTTCGCGGGC GTGGTCCGCG CGGTCGGCCC CGGGGTCTCG
GACGAGCTCG TCGGCAGCAC CTGCATGGGC GTGACCCGCT TCGGCGGCTA CGCCACGCAC
CTCACCGTGC CCGCCAGCCA GCTCTTTCCC GTGCCCGTGG GCCTGTCGCT GGCCCAGGCC
GCGGCCTTTC CCACCGTGCA CCTCACCGCC TGGTTCGCGC TGTGCGAGCT CTTGCGCCTG
CGCCCCGGCA TGCGCGTGCT GGTGCACTCG GCCGCGGGCG GCGTCGGCAG CGCCGCGGTG
CAGATCGCGC GCCTGCACGG CTGCGAGGTC ATCGGCGTGG TCGGCGCCGC GCACAAGCTC
GAGCCGCTGC GCGCGCTCGG CGCCCAGCAC GCCATCGACA AACGCGCGCA AGCGCTGTGG
CCGGCGGTCG CCCGCATCGC GCCCGCCGGC TGCGACGTCA TCCTCGACGC CAACGGGGTC
GAGACCCTGC GCCACAGCTT CCGCCACCTG GCGCCCATGG GTCGCCTGGT CATCTACGGC
TTCCACACCA TGCTGCCGCA GAACCAGCGC GGCCGCGTCA ACTACCTCAG CCTGGCGCGC
GACTTTCTGC GCACGCCGCG CTTCGATCCC TTCCGCATGA CCACCGAGAA CAAGTCGGTG
CTGGCCTTCA ACCTGTCGTT CCTGTTCGAC GAGGGCTCGC TGCTGGCCGA GGCCATGAAC
CAGCTCGCCG GCTGGCTCGA CGACGGCACC CTGGCCCCGC CCCAGGTCAC TGAGTACGCC
TTCGACGACG TCGCCCAGGC CCAGCACGCG CTGGAATCGG GACGCACGGT CGGCAAACTG
GTGCTCCGCA CCGGCTGA
 
Protein sequence
MRNVVIHRAG GYEQLRIEER VEPVPGPGEA LVESHFAGVN FADCVVRMGL YASAKKYVGW 
PITPGFEFAG VVRAVGPGVS DELVGSTCMG VTRFGGYATH LTVPASQLFP VPVGLSLAQA
AAFPTVHLTA WFALCELLRL RPGMRVLVHS AAGGVGSAAV QIARLHGCEV IGVVGAAHKL
EPLRALGAQH AIDKRAQALW PAVARIAPAG CDVILDANGV ETLRHSFRHL APMGRLVIYG
FHTMLPQNQR GRVNYLSLAR DFLRTPRFDP FRMTTENKSV LAFNLSFLFD EGSLLAEAMN
QLAGWLDDGT LAPPQVTEYA FDDVAQAQHA LESGRTVGKL VLRTG