Gene Hoch_5040 details


Gene Information

Locus tag: Hoch_5040
Symbol:
ID: 8547450
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 6954537
End bp: 6955508
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 70%
IMG OID: 646389715
Product: Alcohol dehydrogenase zinc-binding domain protein
Protein accession: YP_003269421
Protein GI: 262198212
COG category: [C] Energy production and conversion
              [R] General function prediction only
COG ID: [COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases
TIGRFAM ID:
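
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(6954537, 6955508)
print(length, protein_length_aa(length))  # 972 323
```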


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.13939
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.514395
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCCA TACGCGTTCA CGACACAGGC GGCCCCGAGG TCTTGCGCAG CGAAGCCATC 
GAGTTGCCGG AGCCCGGCCC TGGGCAGGTG CGCATCGCGG TCGCGGCGGC CGGGGTGAAC
TTCATCGACA CCTACCACCG CACAGGCCTG TATCCGCGCG AGCGCCCGTT CACCCTCGGG
CTCGAGGGCG CGGGCACCAT CGAGGCCAGC GGCGAGGGCG TGAGCGACGA TCTGCAGCCG
GGCGCGCGGG TCGCGTGGAC CAACGTCCCC GGCTCCTACG CCAGCCACGT GATCGCCGAC
GCCGAGCGCC TGGTCCCGGT CCCCGAGGGC GTCACCAGTG AAGACGCGGC CGCCGTCATG
TTGCAGGGCA TGACCGCCCA TTACCTCGTG CACGACACCT ACCCGGTCGG CGCTGGCACC
GTGTGCCTGT TGCACGCCGC CGCTGGAGGC GTGGGATTGC TCGTGTGCCA GATGGCCAAG
CAGCGTGGCG CGCGCGTGAT CGGCACCGCC TCGAGCGCGA ACAAAGCCGA GCGCGCCAGG
CAAGCGGGCG CCGCGGACAT GATCCTCTAC ACCAGCCAGG ATTTTCGCGC CGAGGTCGAG
CGCCTCACCG GCGGTGAGGG CGTACACGTC GCCTACGACT CCGTGGGCAA GAGCACCTTC
GAGGGCAGCC TCGCGTGTTT GCGGCGGCGC GGCATGTTGG TGCTCTTTGG CCAGTCGAGC
GGCCCCATTG GCAGCTTCGA CCCCTTGATG TTGAGCCGCG GTGGCTCCTT GTACATGACC
CGGCCGACCC TCTTCGACTA CACCGCCACC CGCGCCGAGC TCTTGGCGCG GGCCGGCGCG
GTCTTGGGAG CCGTGGCCAG CGGCGCCTTG CGGGTGAGCA TCGACCGCAC CCTACCCATG
GACGACGCCG CCGAGGCGCA TCGTCTGCTC GAGGGCCGCA AAACCAGCGG CAAGCTCTTG
CTGACGCCCT GA
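
The GC content reported for this gene (70%) can be checked against the gene sequence. This is a minimal sketch, assuming GC content is the percentage of G and C bases over all bases and that the space-separated 10-base blocks are purely cosmetic; `gc_content` is an illustrative name.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: pass the full gene sequence; only its first two blocks are shown here.
print(round(gc_content("ATGAAAGCCA TACGCGTTCA"), 1))
```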
 
Protein sequence
MKAIRVHDTG GPEVLRSEAI ELPEPGPGQV RIAVAAAGVN FIDTYHRTGL YPRERPFTLG 
LEGAGTIEAS GEGVSDDLQP GARVAWTNVP GSYASHVIAD AERLVPVPEG VTSEDAAAVM
LQGMTAHYLV HDTYPVGAGT VCLLHAAAGG VGLLVCQMAK QRGARVIGTA SSANKAERAR
QAGAADMILY TSQDFRAEVE RLTGGEGVHV AYDSVGKSTF EGSLACLRRR GMLVLFGQSS
GPIGSFDPLM LSRGGSLYMT RPTLFDYTAT RAELLARAGA VLGAVASGAL RVSIDRTLPM
DDAAEAHRLL EGRKTSGKLL LTP
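
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this, assuming the standard codon-to-amino-acid assignments (translation table 11 uses the same assignments and differs in which codons may act as starts) and an ATG start; `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a CDS up to, but not including, the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGAAAGCCA TACGCGTTCA CGACACAGGC"))  # MKAIRVHDTG
```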