Gene Hoch_6555 details


Gene Information

Locus tag: Hoch_6555
Symbol:
ID: 8548972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 8999395
End bp: 9000372
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 67%
IMG OID: 646391217
Product: fumarylacetoacetate (FAA) hydrolase
Protein accession: YP_003270916
Protein GI: 262199707
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)
TIGRFAM ID:
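
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, so treat both as assumptions. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 8999395
end_bp = 9000372
gene_length_bp = 978
protein_length_aa = 325

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one stop codon, which encodes no residue.
codons = span_bp // 3
residues = codons - 1

print(span_bp == gene_length_bp)      # coordinate span matches Gene Length
print(span_bp % 3 == 0)               # length is a whole number of codons
print(residues == protein_length_aa)  # residue count matches Protein Length
```

Under these assumptions all three checks agree with the listed values (978 bp and 325 aa).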


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.121187
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.192536
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTAG CGACCCTTCG AGACGGAACC CGAGACGGCA GCCTGGTGGT GGTGAGCCGC 
GACAACGCCC GCTATGCGAG CGCGCGCGAC ATCGCGCCGA CGATGCAGGC GGCGCTCGAT
GATTGGGACG CGCTCGCGCC CAAGCTGATC GAACTGTACG AGCGGCTGTG CGGCGGCGAG
CTCAAGGGCG AGCCCGTGGA CACCAGCAAG CTGCACGCGC CCTTGCCGCG CGCGTACGAG
TGGGTGGACG GCTCGGCCTA TATCAACCAC ATCATCCTGG TGCGCAAGGC GCGCAACGCC
GAGCCGCCGG CCACGCTGGA GACCGACCCG CTGGTCTACC AGGGCGGCTC GGGCGTGTTG
CTCGGACCGA CGGACGATAT CCCGCTCATT GATCCCGGAT ATGGTCTCGA CTTCGAGGCC
GAGATCTGCG CCGTGCTCGG CGACACGCCG CAGGGCACTG GCCAGGACCA GGCGGGCGCG
CACATCCGGC TGCTGATGCT GTGCAATGAC ATCACCCTGC GCAACCTGAT CCCGCCCGAG
CTGGCCAAGG GCTTTGGCTT CTTCGTGTCC AAGCCGGCGA GCGCGTTCTC GCCCTTTGCG
GTCACGCCCG ACGAGCTCGG CGACGCGTTC CAGGGCGGCC GCGTGCACCT GCCGCTCAGC
TCCACGCTCA ACGGCGAGCG CGTCGGCAAC CCCGACGCCG GCCCCGAGAT GCATTTTTCC
TTTTTCGACC TGGTGGCCCA TATCACGCGG ACGCGCGCGT TCACAGCCGG CACCATCCTC
GGCAGCGGTA CGGTGTCGAA TTCCGACCGC AGCAAGGGCA TCTCGTGCCT GGCCGAGCGC
CGCATGATCG AGATCATCGA CGACGGCGCC GCCAAGACCG AGTTCTTGAA GACCGGCGAC
CGCGTGGCCA TCGAGATGTT CGACGGCGCT GGGGCCTCGA TTTTCGGACG CATCGAGCAG
CAGGTGGTGG CGCGATGA
 
Protein sequence
MKLATLRDGT RDGSLVVVSR DNARYASARD IAPTMQAALD DWDALAPKLI ELYERLCGGE 
LKGEPVDTSK LHAPLPRAYE WVDGSAYINH IILVRKARNA EPPATLETDP LVYQGGSGVL
LGPTDDIPLI DPGYGLDFEA EICAVLGDTP QGTGQDQAGA HIRLLMLCND ITLRNLIPPE
LAKGFGFFVS KPASAFSPFA VTPDELGDAF QGGRVHLPLS STLNGERVGN PDAGPEMHFS
FFDLVAHITR TRAFTAGTIL GSGTVSNSDR SKGISCLAER RMIEIIDDGA AKTEFLKTGD
RVAIEMFDGA GASIFGRIEQ QVVAR
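
The gene and protein listings above can be related with a short translation sketch. It assumes the nucleotide listing is the coding strand read 5' to 3', and it uses the codon-to-residue assignments of NCBI translation table 11, which match the standard genetic code (table 11 differs in its permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database tool.

```python
BASES = "TCAG"
# Residues for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)

def clean(seq_text):
    """Remove the spaces and line breaks used to block the listing."""
    return "".join(seq_text.split()).upper()

def gc_fraction(seq_text):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq_text)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq_text):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq_text)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)

# First line of the gene sequence listing (60 bp).
first_line = "ATGAAGTTAG CGACCCTTCG AGACGGAACC CGAGACGGCA GCCTGGTGGT GGTGAGCCGC"
print(translate(first_line))  # MKLATLRDGTRDGSLVVVSR
```

The output matches the first 20 residues of the protein listing. Passing the full gene listing to `gc_fraction` gives a value that can be compared with the GC content field. This sketch does not handle ambiguous bases, and it does not force an alternative start codon such as GTG or TTG to methionine, which is not needed here because the gene starts with ATG.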