Gene Hoch_2121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2121 
Symbol 
ID8544507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2941537 
End bp2942385 
Gene Length849 bp 
Protein Length282 aa 
Translation table11 
GC content70% 
IMG OID646386828 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_003266559 
Protein GI262195350 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0891264 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.29276 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACA CTCCCACTGC GGCATCTCAG CACATCTTCG CGCCCGGCCT GTTCCGCGGT 
CAGCGCGTCA TCGTCACCGG CGGCGGCTCG GGCATCGGCC TGGCGAGCGC GCGCGCGTTC
GCGCGTCTGG GAGCGCGCGT GGCCATCTGC GGCCGCGACG AGGACAAGCT CGCAGCCGCG
CGCGACGAGC TGCAAGCCGT GGCCGACGAA GCTCGCGCCG CCGCCGCCGA GGCCGACAGC
GAAGGCGAGA TGGTCTACGC CGCACCCTGC GATATCCGCT CGGCCGAGAC GGTCGAGGGC
TTCGTCGGCG CGGTGCGCGA GCGCTTTGGC GGCATCGACG TGCTGTTCAA CAACGCCGGC
GGTCAGTTCG CATCACCGGC CACGGGCATC TCGCCCAAGG GCTTCGCCGC GGTGGTTCGC
AATAACCTCG AGGGCACCTA TTACATGACC CACGCGGTCG CCACGCAGGC CATGATCCCG
CAGCGCGGCG GTTGTATCGT CAACATGAGC GCCAACGTGT ATCGTGGCTT CCCCGGCATG
GTGCACACCG GAGCCGCGCG CGCCGGGGTC GAGAACATGA CCATGACGCT GGCGGTCGAG
TGGGCCAGCT ACGGCATCCG CATCAACGCG GTGGCCCCGG GCATCATCCT GTCGTCGGGG
ACCGATCAGT ACCCGCCCGC GATCCTGTCG CGCGCGCTGT CGCAGGTGCC GATCGCGCGC
GGCGGCACGG TCGAGGAGGT GGCCGCGGCC GTGGTATTCC TGGCCTCGCC CGCAGCCCAG
TACATCTCCG GGGTGTCGCT GCGCATCGAC GGCGGCATCA GCCTCAGCGG CGAGATGTTT
CCCCGCTGA
 
Protein sequence
MNDTPTAASQ HIFAPGLFRG QRVIVTGGGS GIGLASARAF ARLGARVAIC GRDEDKLAAA 
RDELQAVADE ARAAAAEADS EGEMVYAAPC DIRSAETVEG FVGAVRERFG GIDVLFNNAG
GQFASPATGI SPKGFAAVVR NNLEGTYYMT HAVATQAMIP QRGGCIVNMS ANVYRGFPGM
VHTGAARAGV ENMTMTLAVE WASYGIRINA VAPGIILSSG TDQYPPAILS RALSQVPIAR
GGTVEEVAAA VVFLASPAAQ YISGVSLRID GGISLSGEMF PR