Gene Hoch_5224 details

Gene Information

Locus tag: Hoch_5224
Symbol:
ID: 8547636
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 7179703
End bp: 7181349
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 70%
IMG OID: 646389898
Product: benzoyl-CoA-dihydrodiol lyase
Protein accession: YP_003269602
Protein GI: 262198393
COG category: [I] Lipid transport and metabolism
COG ID: [COG1024] Enoyl-CoA hydratase/carnithine racemase
TIGRFAM ID: [TIGR03222] benzoyl-CoA-dihydrodiol lyase
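
The length fields above can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Coordinates copied from the record above.
length = gene_length_bp(7179703, 7181349)
print(length)                     # 1647
print(protein_length_aa(length))  # 548
```

Both results agree with the Gene Length and Protein Length fields of the record.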


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCCGA TCCGCTTCGA AACCCATCCC AGCGAGTACA AGCATTGGCA ACTGGACGTC 
GACGGTCCGG TTGCCAAGCT CACCATGGCC GTCGACGCCG AGCATCCGCT GCGCCCCGGC
TACGAGCTAA AGCTCAACTC CTACGACCTC TCGGTCGACA TCGAGCTGGC CGACGCCGTG
CAGCGCATTC GCTTCGAGCA CCCCGAGGTG CGCACCGTGG TCATCTCGGC CGACCTCGAT
CGCGTGTTCT GCTCAGGCGC GAACATCTAC ATGCTCGGCG CCTCGGACCA CAGCTTCAAG
GTCAACTTCT GCAAGTACAC CAACGAGACC CGGCTGTACA TGGAGGAGGC CTCGCGCAAC
AGCGGGCTGC GCTTCCTGGC CGCGTGCAAG GGCACCACGG CCGGCGGCGG CTACGAGCTG
GCCCTGGCCT GCGACCACAT CACCCTGGTC GACGACGGCT CGTCCGCGGT GTCGTTTCCC
GAGACCCCGC TGCTCGGCGT ATTGCCCGGC ACCGGCGGCC TCACCCGCCT GGTCGACAAA
CGCAAAGTGC GCCGCGATCG CGCCGACGTA TTCTGCACCC TGGCCGAGGG CATCAAGGGC
CGGCGCGCGG TCGAGTGGGG CCTGGTCGAC CAGCTCCTGC CGCGCTCCAA GTTCGACGAG
GGCGTACGCG CGCGCGCCGA AGAGCTGGCC AAGGAGGTCG CCGAGGTCGC CCACGGCCCG
GCCGTGAGCC TGCCGCCGCT GGCGCCGAGC ATCGAGGACG ATCTCATCAG CTACCGCCAT
GTGAGCGTCG AGTTCGAGCG CTCGCAGCGC ACGGCCACGC TGACCCTGCG CGCGCCCGCC
GAGGCGCCGC CTACCAGCAT CGAGGACAGC GCCGCCCAGG GCGCCGATCT GTGGAGCCTG
CGCCTGTTCC GCGAGCTCGA CCACGCGCTC TGCCACCTGC GCTTCAACGA GCCCGAGATC
GGCCTGGTGC TGGTCCGCAG CGTCGGCGAT CCCGCCCAGG TGCTGGCCGC CGACGCCGCG
CTCGACGCCC TGCAGGAGCA CGGCTTCACC CGCGAAGTGC GGCTGTTTCA GGCGCGCGTG
CTGCGCCGCC TCGACACCAC CGCGCGCTCG TTCTTCGCCG TCATCGACAG CGACTCGAAC
TGCTTCGCCG GCTCGCTGCT CGAGGTCGCC TTGGCCGCCG ACCGCGTGTA CATGCTCGAG
GACGACGACG AGGAGGTCGG CGTGCATACC TCGGTCGCCA ACAGCGGCAT CATGCCCATG
GCCAACGGCC TCAGCCGCCT GGCCGTGCGC TTCTACGGCG ATGCCGACCA GGTGGACGAA
GTCCTCGGCC GCGGCAAAGA CGGCCTCATC CCCACCGCGG ACGCCGAGGA GCTGGGACTG
GCCACCATCG CCGCCGACGA CATCGACTTC GAGGACGAGC TGCGCATCGC GTGCGAGGAG
CGCGCATCGC TGTCGCCCGA CGCGCTCACC GGCATGGAGG CCTCGCTGCG CTTCCCCGGC
CCCGAGACCC TCGAGACCAA GATCTTCGGA CGCCTGTCCG CGTGGCAGAA CTGGATCTTC
ACCCGCCCCA ACGCCACCGG CGAGCGCGGC GCCCTCACCC TCTACGGACA ACCCGAGCGC
CCCAGCTTCC GCTGGGCGAG GACTTAA
 
Protein sequence
MAPIRFETHP SEYKHWQLDV DGPVAKLTMA VDAEHPLRPG YELKLNSYDL SVDIELADAV 
QRIRFEHPEV RTVVISADLD RVFCSGANIY MLGASDHSFK VNFCKYTNET RLYMEEASRN
SGLRFLAACK GTTAGGGYEL ALACDHITLV DDGSSAVSFP ETPLLGVLPG TGGLTRLVDK
RKVRRDRADV FCTLAEGIKG RRAVEWGLVD QLLPRSKFDE GVRARAEELA KEVAEVAHGP
AVSLPPLAPS IEDDLISYRH VSVEFERSQR TATLTLRAPA EAPPTSIEDS AAQGADLWSL
RLFRELDHAL CHLRFNEPEI GLVLVRSVGD PAQVLAADAA LDALQEHGFT REVRLFQARV
LRRLDTTARS FFAVIDSDSN CFAGSLLEVA LAADRVYMLE DDDEEVGVHT SVANSGIMPM
ANGLSRLAVR FYGDADQVDE VLGRGKDGLI PTADAEELGL ATIAADDIDF EDELRIACEE
RASLSPDALT GMEASLRFPG PETLETKIFG RLSAWQNWIF TRPNATGERG ALTLYGQPER
PSFRWART
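
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip that layout and to check the translation of the first and last lines of the gene sequence. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ in permitted start codons). The helper names `clean`, `gc_fraction` and `translate` are illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocks: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocks.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 0; '*' denotes a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the record.
first = clean("ATGGCGCCGA TCCGCTTCGA AACCCATCCC AGCGAGTACA AGCATTGGCA ACTGGACGTC")
last = clean("CCCAGCTTCC GCTGGGCGAG GACTTAA")

print(translate(first))  # MAPIRFETHPSEYKHWQLDV
print(translate(last))   # PSFRWART*
```

The two translations match the first 20 and last 8 residues of the protein sequence, with the trailing `*` marking the TAA stop codon. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field.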