Gene Hoch_3011 details


Gene Information

Locus tag: Hoch_3011
Symbol:
ID: 8545399
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4167791
End bp: 4169425
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 69%
IMG OID: 646387683
Product: All-trans-retinol 13,14-reductase
Protein accession: YP_003267411
Protein GI: 262196202
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1233] Phytoene dehydrogenase and related proteins
TIGRFAM ID:
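
The Gene Length and Protein Length values follow from the Start bp and End bp coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein. The function name `feature_lengths` is hypothetical and only illustrates the check.

```python
# Coordinates copied from the record above.
START_BP = 4167791
END_BP = 4169425


def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that does not contribute a residue to the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = feature_lengths(START_BP, END_BP)
print(gene_len, protein_len)  # 1635 544
```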


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATCG GACGTTCGTA CAAACAACAC GGCATCGAGG GCAGCTACGA CGCCATCGTC 
ATCGGCTCCG GCATCGGCGG ACTCACCGCG GCCGCGCTCC TGGCCAAGCA CGGCGGCAAA
AAGGTATTGG TGCTCGAGCG CCACTACACC GCCGGCGGTT TCACCCACAG CTTCTCGCGG
CCCGGCTACG AGTGGGACGT GGGCGTGCAC TACATCGGCC AGACGCAGCC GCGTACGCTG
CTGCGCAGAG CCTTCGACGA GCTCAGCGAC GGCCAGCTCG CGTGGGCCGA CATCGGCGAC
GGCTACGACA CCATCGTCGT CGGCGACGAG CGCTATCCCT TCCCGGCCGG TCGCGAGCGC
TGGCGCCAGG CCATGCACGG CTGGTTTCCC CGCGAGCACG CGGCCATCGA CCGCTACATC
GAGCTCATCC GCGAGAGCGC GCGCGACTCG CAGCTCTTCT TCGCCGAAAA AGTGCTGCCG
CGGCCGCTCT CGGGCACGGT CGGCGGGGTG CTGCGGCGCA AGGCCATGCG CTTGGCGCGC
CGCACCACGC GCGAGGTGCT GAGCGAGCTC ACCAGCGATG AGCGCCTGAT CGGCGTGCTC
ACCGGCCAGT ACGGCGACTA CGGCCTGCCG CCGGCAGAGT CGAGCTTCTT CATGCACGCG
CTGGTGGTCA ATCACTACAT GGGCGGGGGC GCGTATCCCG TGGGCGGCGC GTCGCGCATC
GCCGAGACCA TCATCCCCGT GATCGAGCGC GCGGGCGGCG CGGTGCTGGT CTCGGCCGAG
GTGACCAAGG TGCTGATCGA GAAGAACCGC GCGCACGGCG TGGAGCTGGC CGATGGCACC
TGCGTGTACG CGCCGCAGGT GATCAGCGAC GCCGGCGTCA TGAACACCTT CGCTCGCCTG
GTGCCGGCCG AGGACGCCGA CCGCCACGGC TTTGACCGCC AGCTCGCCGA GCTCGAGCCC
TCGGTGGCGC ACGCCTCGCT GTACCTGGGG TTTCGGCAGA CGGCCGCCGA GCTGGGCCTG
CGCAAGAACA ACCTGTGGGT CTATCCCGAC TACAATCACG ACCGCAACGT CGAGCGCTTT
TTGCACGACA AAGCGGCGCC CCTGCCGGTG GCGTATCTGT CCTTTCCCTC GGCCAAGGAC
CCGGATTTCG AGAATCGCCA CCCCGGCCGG GCCACGGTCG AGGTGGTCAC GCTGGCGCCG
TATCGCTGGT TTGCCGAGTG GCAGGACACG CGCTGGAAGA AGCGCGGCGC CGAGTACGAG
CAGCTCAAGA AGGACCTGAG CGAGCGCATG CTGGCGCCGC TGCTGGCCCA GTATCCGCAA
CTCGCGGGCC AGATCGACCA CTGCGAGCTG TCGTCGCCGC TCACCACGCG CCACTTTGCG
CACTTCAGCC GCGGCGAGAT CTACGGCGTC TCGCACACGC CGCAGCGCTT CGAGCAGCGC
TGGCTGCGCC CGGCGACCCC GGTCAAGGGC CTGTACCTCA CCGGGGCCGA TGTCTGCTCG
GCCGGGGTCG GCGGCGCGCT GTTCGGTGGC GTGCTCACCG CGTCGTCGAT ACTGCGCAAG
AACCTGATCA ACGTGATCGC GCGCCGCCCG CCCGCGGCCG AGGCGTCTGC CGAACCCGTG
CGCGCGGCCG CATAG
 
Protein sequence
MKIGRSYKQH GIEGSYDAIV IGSGIGGLTA AALLAKHGGK KVLVLERHYT AGGFTHSFSR 
PGYEWDVGVH YIGQTQPRTL LRRAFDELSD GQLAWADIGD GYDTIVVGDE RYPFPAGRER
WRQAMHGWFP REHAAIDRYI ELIRESARDS QLFFAEKVLP RPLSGTVGGV LRRKAMRLAR
RTTREVLSEL TSDERLIGVL TGQYGDYGLP PAESSFFMHA LVVNHYMGGG AYPVGGASRI
AETIIPVIER AGGAVLVSAE VTKVLIEKNR AHGVELADGT CVYAPQVISD AGVMNTFARL
VPAEDADRHG FDRQLAELEP SVAHASLYLG FRQTAAELGL RKNNLWVYPD YNHDRNVERF
LHDKAAPLPV AYLSFPSAKD PDFENRHPGR ATVEVVTLAP YRWFAEWQDT RWKKRGAEYE
QLKKDLSERM LAPLLAQYPQ LAGQIDHCEL SSPLTTRHFA HFSRGEIYGV SHTPQRFEQR
WLRPATPVKG LYLTGADVCS AGVGGALFGG VLTASSILRK NLINVIARRP PAAEASAEPV
RAAA
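
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is given 5' to 3' on the coding strand, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAATCG GACGTTCGTA CAAACAACAC"))  # MKIGRSYKQH
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_percent` on the same input can be compared with the GC content field in the Gene Information table.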