Gene Hoch_4944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4944 
Symbol 
ID8547352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6818853 
End bp6820406 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content72% 
IMG OID646389618 
ProductAldehyde Dehydrogenase 
Protein accessionYP_003269326 
Protein GI262198117 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR02278] phenylacetic acid degradation protein paaN 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.141691 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAGC TGGCCAGTTA CGTCAACGAG CGCTGGGTGG AGGGCACGGG CTCGGCCCAG 
CCGCTGCACA ATCCGGCTAC CGAAGAGATC CTTGCCGAGA CCTCGACCGA GGGCGTCGAT
TTCGCGGCCG CGATGACGCA CGCGCGCGAG CGCGGCGGCC CGGCCCTGCG CGCGCTCAGC
TTTGCCCAGC GCGGCGAGAT CCTGCGCGCC ATGGCAAAGA CCATCCACGA CAACCGCGAG
GAGCTGATCG CGCTGGCCAT CGAGAACGGC GGCAACACCC GCGGCGACGC CAAGTTCGAC
ATCGACGGCG CCAGCGCCAC CCTGGCCGCG TACGGCGAGC TCGGCGCCGA GATCGGCGAC
ACCCAGGTGA TGGTCGACGG CGACCCGGTG CAGATCGGGC GCACGGCGCG CTATCAGGGC
ATGCACCTCT GCGTGCCCCG GCGCGGCGTG GCCGTACACA TCAACGCGTT CAACTTCCCC
GCCTGGGGCA TGTGCGAAAA GGCCGCGACC GCGCTGCTCG CCGGCATGCC GGTGGTCAGC
AAGCCGGCCT CGACCTCGGC CATGGTCGCG CATCGCACCA TGGAGCTGTT CGTCGCGGCC
AAGCTCTTGC CCGAGGGCGC GCTGTCGTTC ATCGCCGGTC AGCCCGGCGA CCTGCTCGCG
CATCTCCAGG GCCAGGACGT GTTGGCCTTC ACCGGCTCGA GCGGGACCGC GCGCACGCTG
CGCGGGCTGG GCAGCGTCAT CGACAACTCG GTGCACGTCA ACGTCGAGGC CGACAGCCTC
AACGCCGCCG TGCTCGCGCC GGATGTCGAC CCATCGTCCG AAACCTTCCA GCTCTTCCTC
GCCGACATAA GCCGCGATAT CACACAAAAG GCCGGGCAGA AGTGCACGGC CATCCGCCGC
ATCTTCGTCG CCGAGGCCCT GGCCGAGCGC GCGGCCGAGG CCCTGGTCGA GCGCCTGGCC
GGCACGGTGG TCGGCGATCC GGCCGACAAG AGCGTGCGTA TGGGGCCGCT GGCCTCGGCC
GCGCAGCAGC GCGACGTGCG CGCCGGGATC GAGCGCCTGG CCGGGCAGAC CGAGGCCCTG
TTCGGCGGCG ACGGCGCCTG CGAGCCGGTC GGCGTACCCG CGGGCAAGGG CTACTTCGTC
GGCCCGGTGT TGCGCCGCGC CAGCGACGCG CGCGCGGCCA CGGCGGTGCA CGATCACGAG
GTCTTCGGCC CGGTGGCCAC GCTCCTGCCC TTTGCCGGCG GCGCCGAGGA GGCGGCCGAG
CTGGTCGCGC TGGGCGGCGG CGGGCTGGTG GCCTCGGCGT ACACCGACGA GCGCGACTAC
GCGCGCGACA TCATCCTCGG CCTGGCGCCC TACAACGGCC GCGTGTACCT CGGCAGCAAC
AAGATGGCGG CGCAGTCCAT GGGCCCGGGC ACGGTGTTGC CGCAGCTCGT GCACGGCGGC
CCGGGGCGCG CCGGCGGCGG CGAGGAGCTG GGCGGCCGCC GCGGCATGGC GCTGTATCAG
CAGCGCACCG CGGTGCAGGG CGACAAGGGC ATGCTCAAGA CCTTCGAGCG CTGA
 
Protein sequence
MKKLASYVNE RWVEGTGSAQ PLHNPATEEI LAETSTEGVD FAAAMTHARE RGGPALRALS 
FAQRGEILRA MAKTIHDNRE ELIALAIENG GNTRGDAKFD IDGASATLAA YGELGAEIGD
TQVMVDGDPV QIGRTARYQG MHLCVPRRGV AVHINAFNFP AWGMCEKAAT ALLAGMPVVS
KPASTSAMVA HRTMELFVAA KLLPEGALSF IAGQPGDLLA HLQGQDVLAF TGSSGTARTL
RGLGSVIDNS VHVNVEADSL NAAVLAPDVD PSSETFQLFL ADISRDITQK AGQKCTAIRR
IFVAEALAER AAEALVERLA GTVVGDPADK SVRMGPLASA AQQRDVRAGI ERLAGQTEAL
FGGDGACEPV GVPAGKGYFV GPVLRRASDA RAATAVHDHE VFGPVATLLP FAGGAEEAAE
LVALGGGGLV ASAYTDERDY ARDIILGLAP YNGRVYLGSN KMAAQSMGPG TVLPQLVHGG
PGRAGGGEEL GGRRGMALYQ QRTAVQGDKG MLKTFER