Gene Hoch_2061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2061 
Symbol 
ID8544443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2847631 
End bp2849118 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content72% 
IMG OID646386764 
Productsulfatase 
Protein accessionYP_003266499 
Protein GI262195290 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAATCC TGTACATCGA CATCGACTGC CTGCGCCCCG ATCATCTGGG CTGCTACGGC 
TACCACCGCG ACACCAGCCC GAACATCGAC GCCATCGCCG CGCGCGGGCT GCGCTTTGAC
AACGTCTACA TCTCCGACGC GCCCTGTCTG CCCAGCCGCA CGGCCCTGTT CAGCGGCCGC
TTCGGCGTGC ACACCGGCGT GGTCAATCAC GGCGGCCTGG GCGCCGACAT GTACCCGGAC
GGCGAGAGCC GCGGCTTTGG CTCGCGCCTG GGCCGCACCT GCTACATGCG CGCGCTGCGC
GATGCCGGCT TCTACACCGC CACCGTGAGC CCGTTTGGCG AGCGCCACGC CGCCTGGCAC
TTCTACGCCG GCTTTCACGA GGTGCACAAT CCCGGGCGCC GCGGGCTCGA GCGCGCCGAC
GAGATCGAGC CCATCGCCCA GGACTGGCTG CGTCGCCGCG GCCGCGAGGA CGACTGGTTT
CTGCATGTGA ATCTGTGGGA TCCGCACATG CCGTACCGCG CGCCGGCCTC ATTCGGCGAG
CCCTTTGCCG ACGCGCCGCT GCCGGCGTGG CTCAGCGAGG AGGTGCGCGC CGCCCATTAC
GAGGGCTGCG GGCCGCACTC GGCGCGCGAG GTCATCGGCT TCAGCGACCA GGTCCCGCCC
GGCATCGCCT GGGACTACCC GCGCCAGCCG CTGCGCATCG ACTCGATGGA CGCCGTGCGC
CGGGTGTTCG ACGGCTACGA CACCGGCGTG CGCTACGCCG ACGAGGCGGT CGGTCGGCTG
CTCGCGACCC TGGACGAGCT GGGCGTGCGC GAGGACACCG CCATCATCGT GAGCGCCGAC
CACGGCGAGG CGCTGGGCGA GCAGAACATC TACGGCGACC ACCAGACCGC CGATCACATC
ACCACGCGCG TGCCCTGGAT TCTCGACTGG CCCGGCGTCA CCGAGGCCGC GGCCGGGCAG
GCGCGCTCGG CCCTGCATTA TCAGGTCGAC ACCGCGGCCA CCGTGATCGA GCTGGCCGGC
GGCACGGTGC CGGGCGGCTG GGACGGCCGG AGCTTTGCCG ACCGACTGTC CGAAGGCGCC
GACGAGGGGC GGCCGTGCTT GGTCATGTCG CAGGCGGCCT GGAGCTGTCA GCGGGCCGTG
CGCCTGGTCG ACGACGGCCG CGATTATGTG TTCGTGCGCA GCTATCACGA CGGCTATCAC
TGCTTCGACG AGCTGCAGCT CTTTGATATC GGCGCTGATT ATCACGCGCA GCACAACCTC
GCGGCGGAGC GCCCGGCGCT GGTGCAGCGC GCGCTCGCGC AGCTCGAGAG CTGGCACGGC
GAGATGATGC GCACGGCCAC GCACCCGGCC GATCCCATGT GGAACGTGCT GCGCGAGGGC
GGGCCCAAGC ACACCCGCGG CGAGCTGCCC GGCTACCTCG CGCGGCTGCG CGCCACCGGC
CGTGAGCGCT GGGCCGAGCG CCTCGAAGCC CGCCACGGCG CCGGCTGA
 
Protein sequence
MRILYIDIDC LRPDHLGCYG YHRDTSPNID AIAARGLRFD NVYISDAPCL PSRTALFSGR 
FGVHTGVVNH GGLGADMYPD GESRGFGSRL GRTCYMRALR DAGFYTATVS PFGERHAAWH
FYAGFHEVHN PGRRGLERAD EIEPIAQDWL RRRGREDDWF LHVNLWDPHM PYRAPASFGE
PFADAPLPAW LSEEVRAAHY EGCGPHSARE VIGFSDQVPP GIAWDYPRQP LRIDSMDAVR
RVFDGYDTGV RYADEAVGRL LATLDELGVR EDTAIIVSAD HGEALGEQNI YGDHQTADHI
TTRVPWILDW PGVTEAAAGQ ARSALHYQVD TAATVIELAG GTVPGGWDGR SFADRLSEGA
DEGRPCLVMS QAAWSCQRAV RLVDDGRDYV FVRSYHDGYH CFDELQLFDI GADYHAQHNL
AAERPALVQR ALAQLESWHG EMMRTATHPA DPMWNVLREG GPKHTRGELP GYLARLRATG
RERWAERLEA RHGAG