Gene Hoch_5231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5231 
Symbol 
ID8547643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7189507 
End bp7191033 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content72% 
IMG OID646389905 
ProductAldehyde Dehydrogenase 
Protein accessionYP_003269609 
Protein GI262198400 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.752485 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.244998 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGAGA CCAAGCAGGG CGCGACGCGC GCGCAACTCG AGAGCCGCGA TCCCCGCACC 
CACGAAGTGC TCGGGACGGT TCCCATTCAC AGCGAGGACG ACGTCCGCGC CGCGGTGGCG
CGCGCGCGGC AGGCGGCCGC CCAGTGGGGG GCCCTGGATG TGAGCGCGCG GGCGAGCGCG
CTCGACGGTT TTCGGCGCGC GCTGGCGGCC CAGGCCGAGG AGCTGGCCGA TCTCATCCAC
CGCGAGAACG GCAAACCGCG TTTCGACGCG CTGATGGAGG TGTTCCTCGC GCTCGCGCAT
CTGGCGCACA CCGCCGAGCG CGCCGGCAAA GCGCTGGCGC CGCGGCGCGT GAGCCCGGGC
CTGTTCGCCA ACATCCGCGC CGCCATCCAC TACCATCCGC TCGGCGTCAT CGGCGTCATC
GGACCGTGGA ACTATCCGAT GTTCACGCCC ATGGGCTCGA TCGGCAGCGC GCTAGCCGCG
GGCAACGCGG TGGTGCTCAA GCCCTCGGAG CTCACGCCCC TGGTCGGCGT ACGCCTGGCC
GAAATAGCCG CCAGCTCGCT GGGTAACGCG GATCTCGTAC AGGTGGTCAC CGGCGCCGGC
GAGACCGGGG CAGCGCTGGC GCGCTCGGGC GTGGACAAGC TGTCGTTCAC CGGCTCGACC
GCCACCGGGC GCAAGGTCAT GGCCGCCGCG GCCGAGACGC TCACGCCGGT GCTGCTCGAG
CTGGGCGGCA AAGACGCCAT GATCGTAGCC GCCGACGCCG ACATCGAAGA GGCCGCGCAG
GCGGCTGTGT GGGGCGCGTT TAGCAACGCC GGCCAGACCT GCATCTCGAT CGAGCGCGCC
TACGTGGCCG CGCCGGTCTA CGATGCCTTC GTCGACCGGG TGGTCGAAAT CGCTCGCGAG
GTACGCGCCG GCGAGGACAT CGGGCCGATG ACCAACGCCG CGCAGAGCGA TATCATCGCC
GGCCAGCTCC GCGAGGCGGT GGCCGCGGGC GCGCGTCCCC TGGTCGGCGG CCCCGAGGCC
ATGGCCGACG GCTTCGTCTC GCCCACGGTG CTGGTCGATG TCAGCGACGA TATGAGCATC
ATGCGCGAGG AGACCTTTGG CCCGGTGCTG CCGATCGCGC GCGTGGCCGA CGCCGAAGAG
GGCGTGCGCC GCGCCAACGC CTCGATGTAC GGGCTGGGCG GCGCGGTCTT TGGCAAGCAG
GGCGTGCGCA CGCTGGCCTC GCGCCTGCGC GCCGGTGCCA CCGCGGTCAA CGCCGTCTTG
GCCTTCGCCG GCGTGCCCTC GCTGCCCTTT GGCGGCGTCG GCGACTCGGG TTTCGGCCGC
ATCCACGGCG ACGAGGGGCT GCGCTCGTTC TCGCGCACCC ACGCGGTCGC CGAGGCCCGC
TTCGGCCTGC CCAAGTCGTT TGACCTGATG CGCTTCCACC AGCCCGAGAA CACCTTCGAG
CGCATGCTCG GGCTCATCGA GCAGCTCTAC GGTGGCGGCG CCGTGGACAC CGCCAGCTCG
CTGCTGCGTC GCCTGCGGCC CTGGTAG
 
Protein sequence
MAETKQGATR AQLESRDPRT HEVLGTVPIH SEDDVRAAVA RARQAAAQWG ALDVSARASA 
LDGFRRALAA QAEELADLIH RENGKPRFDA LMEVFLALAH LAHTAERAGK ALAPRRVSPG
LFANIRAAIH YHPLGVIGVI GPWNYPMFTP MGSIGSALAA GNAVVLKPSE LTPLVGVRLA
EIAASSLGNA DLVQVVTGAG ETGAALARSG VDKLSFTGST ATGRKVMAAA AETLTPVLLE
LGGKDAMIVA ADADIEEAAQ AAVWGAFSNA GQTCISIERA YVAAPVYDAF VDRVVEIARE
VRAGEDIGPM TNAAQSDIIA GQLREAVAAG ARPLVGGPEA MADGFVSPTV LVDVSDDMSI
MREETFGPVL PIARVADAEE GVRRANASMY GLGGAVFGKQ GVRTLASRLR AGATAVNAVL
AFAGVPSLPF GGVGDSGFGR IHGDEGLRSF SRTHAVAEAR FGLPKSFDLM RFHQPENTFE
RMLGLIEQLY GGGAVDTASS LLRRLRPW