Gene Hoch_5459 details


Gene Information

Locus tag: Hoch_5459
Symbol:
ID: 8547872
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 7492446
End bp: 7493990
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 68%
IMG OID: 646390132
Product: Carotenoid oxygenase
Protein accession: YP_003269835
Protein GI: 262198626
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes
TIGRFAM ID:
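
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
start_bp = 7492446
end_bp = 7493990
gene_length_bp = 1545
protein_length_aa = 514


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Compare the computed values with the values listed in the record.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions both comparisons hold: 7493990 - 7492446 + 1 = 1545 bp, and 1545 / 3 - 1 = 514 aa.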


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.176097
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.349671
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
ATGAAAGAGC TGTACTCCAC TTCACACGCC GCGGCCGCCA ACGCTCACGC CGCAGCCGAC 
ACCTCGCCGT CGGCCGCGCC TGCGCATACA GCCGACGGCG AGGGGGTACA GGCGCCGCCC
TGGGCCAACG CCTTCCGCGA GCTCACACGC GAGCACGATT TTCAGCCGCT TCGGGTCGAG
GGCGTACTAC CGCCTGATCT CGAGGGCAGC TTCTACCAGA ACGGCCCGGT GCTGTTCTCG
TCCCACGGAT ACCGCTACAC CCACTGGTTC GACGGCGACG GCGGCGTCTC GGCGGTTCGC
CTGCAGGCCG GCCGCGCCCA CGGCGCCGCT CGGGTCACGG CCACGGCCGG CCTGATCGCC
GAAGCGCGCG CGGGAAAACG CCTCTACGGC GGCTACAGCT CACCGCAGCC GGGCGCGGTG
AAACGCCTCC TCGGCATTCT CAAGAACACC GCCAACACCT CCATGCTGGT GTGGAATCGG
CGCCTGTTCG CGCTCATGGA GGCGGGACTG CCCACCGAGA TCGCGCCCGA GGACCTGCGC
ACCCTGGGCG AGCGCGACCT CGGCGCGATC ACGCACGTGT TCTCAGCCCA CCCGCACTGG
TGCGCGCGCC GCAACACCTA CTACGGCTTT GGCGTGCGCC CGGGCCGCCA GCAGCAGCTC
GACATCTTCG AGCTAACCCA CACCGGCGTG GCGCGCCCCC TGTGTTCGGT CCCGCTGTCC
GAACACACGC TGATCCACGA CTTCGCGATC ACGGGCCGCT ACCTGGTGTT CTTCGCCCCG
CCCTTCGAAC TGCGCGCCTG GCGCATGCTC GCCGGCGAGG GCGGCTACGC CGACAATTTG
CAGTGGAAGC CCGAGTACGG CACCGAGATC ATCGTGGTGC CCATCGACCT GCCGCACGCG
GTGCAGCGCT TTCGCGTCGA TCCCTTCTTT CACTGGCACG TGGCCAACGG CTTCGACGAC
GGCGACGACA TCGTGGTCGA CTTCGTCCGC TACGACGACT TTGAAAACAA CGCGTTCCTC
GCCGACTTGC CCGCGGGCAA CGACACCCGC AACCTGGGCA GCAGGCTGGT CCGCGCCCGC
ATCTCGCTCG CCAACACGCG CATGCGCCGC GAGGAGCGCT GGAGCCGCTC GGTCGAGTTT
CCCCAGATCC GCCAGGATTA TTTCGGACGA CCGTACCGCT ACTGTTACCT GGCCGCCTAC
GAGGACGGCG CCCCCGACAG CGGCCTGCAA AACGTGCTCG CCAAAGTCGA CATGCACAGC
GGCGAGGTCC GCGAGTACAC CTGCGCGCCC GGCCGCTACC TGACAGAGGC CGTGTTCGTG
CCGCGCGCCA CCGGGGCCGA CGCTGGCGAA TCACCCGAGG ACGACGGCTA TCTGCTCACC
ATGGTCTACG ACGCCAACAG CCACACCAGC CACCTGGCCG TCTTTGATGC CGGCGATATC
GAGGCCGGCC CGCGCGCGCG CACGCACTTC GATCACCACA TCCCGCCGCG CTTTCACGGC
GCGTGGATGC CGGTGAGCCA GTATCCGCAC ATGCGCGGCC GCTGA
 
Protein sequence
MKELYSTSHA AAANAHAAAD TSPSAAPAHT ADGEGVQAPP WANAFRELTR EHDFQPLRVE 
GVLPPDLEGS FYQNGPVLFS SHGYRYTHWF DGDGGVSAVR LQAGRAHGAA RVTATAGLIA
EARAGKRLYG GYSSPQPGAV KRLLGILKNT ANTSMLVWNR RLFALMEAGL PTEIAPEDLR
TLGERDLGAI THVFSAHPHW CARRNTYYGF GVRPGRQQQL DIFELTHTGV ARPLCSVPLS
EHTLIHDFAI TGRYLVFFAP PFELRAWRML AGEGGYADNL QWKPEYGTEI IVVPIDLPHA
VQRFRVDPFF HWHVANGFDD GDDIVVDFVR YDDFENNAFL ADLPAGNDTR NLGSRLVRAR
ISLANTRMRR EERWSRSVEF PQIRQDYFGR PYRYCYLAAY EDGAPDSGLQ NVLAKVDMHS
GEVREYTCAP GRYLTEAVFV PRATGADAGE SPEDDGYLLT MVYDANSHTS HLAVFDAGDI
EAGPRARTHF DHHIPPRFHG AWMPVSQYPH MRGR
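
Both sequences above are printed in blocks of 10 characters, so they need to be joined before use. The following is a minimal sketch, not a reference implementation, that strips the spacing and translates DNA into protein. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ only in permitted start codons). The helper names `join_blocks` and `translate` are hypothetical; only the first and last lines of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Standard codon assignments, enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record above.
# The last line begins at base 1501, so it is in frame with the start codon.
first_dna_line = "ATGAAAGAGC TGTACTCCAC TTCACACGCC GCGGCCGCCA ACGCTCACGC CGCAGCCGAC"
last_dna_line = "GCGTGGATGC CGGTGAGCCA GTATCCGCAC ATGCGCGGCC GCTGA"

first_peptide = translate(join_blocks(first_dna_line))
last_peptide = translate(join_blocks(last_dna_line))
print(first_peptide)
print(last_peptide)
```

The two peptides correspond to the first 20 and last 14 residues of the protein sequence listed above (`MKELYSTSHA AAANAHAAAD` and `AWMPVSQYPH MRGR`).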