Gene Hoch_6552 details

Gene Information

Locus tag: Hoch_6552
Symbol:
ID: 8548969
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 8996282
End bp: 8997451
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 69%
IMG OID: 646391214
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_003270913
Protein GI: 262199704
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR01015] homogentisate 1,2-dioxygenase
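
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 8996282
end_bp = 8997451

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1170 bp
print(protein_length, "aa")  # 389 aa
```

The calculation does not depend on the strand, which is not given in the record.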


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0724494
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.194858
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGATCGAAC GGATCGTGGC CGGCGAGGTC CCGGCCAAGC ACCACATCGC CATGCGTCAG 
GGCGATGGTT CGCTGTACGT CGAGGAGTGC TTCACGCGCC GCGGCTTCGA CGGCCCGTAC
ACGATCCTGT ATCACCAGAA CCGGCCGCAC ACGCACCGGG TCGCGGCCGA TCTCGGGCGC
GGCTTCGCGG CGCCCGTGCG CGCCCAGAGC GACCTCGAGC GGCCCCTGGC CAAGCGCCAC
TACCGCTCGC AGACGCTCAG CTCGGCCGGC GACATGCCGG TCAACTGCCG CACGCCGCTG
CTGTTCAACC GCGACGTGGT GCTGTCGATC GTGCGCCCCG ATCGCGACGA CGACGTGTAC
TTCAGCAACG GCGATGGCGA CGACCTGTAC TACATCCACG AGGGCGGCGG CACCCTGCGC
ACGCCGCTCG GCGACCTGGC CTTCTCTGCC CGCGACTACG TGTTCGTGCC CAAGGGCATG
TTGCACCGCT TCGTGCTCGA CGCCGGCAGC CAGTACTGGC TGTCGATCGA GTGCCTGGGC
GGCATGGGCC TGCTGGCGCA GTGGCGCAAT GATGCCGGCC AGTTGACCAT GGACGCGCCC
TACTGCCACC GCGATTTTCG CGCGCCGAGC TTTCGCGGAC CGGTGGACGA AAACATCCGC
GGCTGCGCGG TCAAGCGCGA GGGCCGCTTC TTCGGCTTTC GCCTCGATCA CTCGCCGCTC
GACGTGGTCG GGTGGGACGG CGCCTGCTAT CCGTTCGTGT TTCCCATCCT GGCGTTTCAA
CCGCGCGCCG GGCTGGTGCA CCTGCCGCCC ACCTGGCACG GTACCTTCGC CGCCCGCGGC
GCGCTGATCT GCAGCTTCGT GCCGCGGATG CTCGACTTCC ACCCCGACGC GATTCCCTGC
CCCTATCCCC ATCACTCGGT GGATTGCGAC GAGTTTTTGT TCTACTGCCA CGGCAACTTC
AGCTCGCGCC GCGGCGTCGG CGCCGGCAGC ATCTCGCACC ACCCCAGCGC CCTGCCGCAC
GGCCCGCACC CGGGCGCCTA CGAGGCCAGC CTGGGCGAGC GGCGCACCGA GGAGCTGGCC
GTGATGGTCG ACACCTTCGA GTCGCTGCAC CCGACCGAGG CGGCGCTGGC CATCGAAGAC
CCCGAGTACC ACGACAGCTT CTTGCCCTGA
 
Protein sequence
MIERIVAGEV PAKHHIAMRQ GDGSLYVEEC FTRRGFDGPY TILYHQNRPH THRVAADLGR 
GFAAPVRAQS DLERPLAKRH YRSQTLSSAG DMPVNCRTPL LFNRDVVLSI VRPDRDDDVY
FSNGDGDDLY YIHEGGGTLR TPLGDLAFSA RDYVFVPKGM LHRFVLDAGS QYWLSIECLG
GMGLLAQWRN DAGQLTMDAP YCHRDFRAPS FRGPVDENIR GCAVKREGRF FGFRLDHSPL
DVVGWDGACY PFVFPILAFQ PRAGLVHLPP TWHGTFAARG ALICSFVPRM LDFHPDAIPC
PYPHHSVDCD EFLFYCHGNF SSRRGVGAGS ISHHPSALPH GPHPGAYEAS LGERRTEELA
VMVDTFESLH PTEAALAIED PEYHDSFLP
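
The gene and protein sequences above can be cross-checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the method used to produce this record. It assumes that translation table 11 shares the amino-acid assignments of the standard genetic code, and it makes the simplifying assumption that the first codon is always read as methionine (the gene here starts with GTG). The function names `clean`, `translate`, and `gc_percent` are hypothetical helpers introduced for illustration.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses the
# same amino-acid assignments as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon.

    Simplifying assumption: the first codon is always emitted as 'M'.
    """
    seq = clean(seq)
    protein = []
    for index, pos in enumerate(range(0, len(seq) - 2, 3)):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append("M" if index == 0 else aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in a sequence (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above.
first_line = "GTGATCGAAC GGATCGTGGC CGGCGAGGTC CCGGCCAAGC ACCACATCGC CATGCGTCAG"
print(translate(first_line))  # MIERIVAGEVPAKHHIAMRQ
```

The translated first line matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the listed GC content.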