Gene Information |
Locus tag | Hoch_2061 |
Symbol | |
ID | 8544443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 2847631 |
End bp | 2849118 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646386764 |
Product | sulfatase |
Protein accession | YP_003266499 |
Protein GI | 262195290 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
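The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that adds no residue; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table; names are illustrative.
start_bp = 2847631
end_bp = 2849118
gene_length_bp = 1488
protein_length_aa = 495

# Assumption: coordinates are 1-based and inclusive, so add 1.
computed_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
computed_protein_length = computed_length // 3 - 1

print(computed_length, computed_protein_length)  # 1488 495
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length.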
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAATCC TGTACATCGA CATCGACTGC CTGCGCCCCG ATCATCTGGG CTGCTACGGC TACCACCGCG ACACCAGCCC GAACATCGAC GCCATCGCCG CGCGCGGGCT GCGCTTTGAC AACGTCTACA TCTCCGACGC GCCCTGTCTG CCCAGCCGCA CGGCCCTGTT CAGCGGCCGC TTCGGCGTGC ACACCGGCGT GGTCAATCAC GGCGGCCTGG GCGCCGACAT GTACCCGGAC GGCGAGAGCC GCGGCTTTGG CTCGCGCCTG GGCCGCACCT GCTACATGCG CGCGCTGCGC GATGCCGGCT TCTACACCGC CACCGTGAGC CCGTTTGGCG AGCGCCACGC CGCCTGGCAC TTCTACGCCG GCTTTCACGA GGTGCACAAT CCCGGGCGCC GCGGGCTCGA GCGCGCCGAC GAGATCGAGC CCATCGCCCA GGACTGGCTG CGTCGCCGCG GCCGCGAGGA CGACTGGTTT CTGCATGTGA ATCTGTGGGA TCCGCACATG CCGTACCGCG CGCCGGCCTC ATTCGGCGAG CCCTTTGCCG ACGCGCCGCT GCCGGCGTGG CTCAGCGAGG AGGTGCGCGC CGCCCATTAC GAGGGCTGCG GGCCGCACTC GGCGCGCGAG GTCATCGGCT TCAGCGACCA GGTCCCGCCC GGCATCGCCT GGGACTACCC GCGCCAGCCG CTGCGCATCG ACTCGATGGA CGCCGTGCGC CGGGTGTTCG ACGGCTACGA CACCGGCGTG CGCTACGCCG ACGAGGCGGT CGGTCGGCTG CTCGCGACCC TGGACGAGCT GGGCGTGCGC GAGGACACCG CCATCATCGT GAGCGCCGAC CACGGCGAGG CGCTGGGCGA GCAGAACATC TACGGCGACC ACCAGACCGC CGATCACATC ACCACGCGCG TGCCCTGGAT TCTCGACTGG CCCGGCGTCA CCGAGGCCGC GGCCGGGCAG GCGCGCTCGG CCCTGCATTA TCAGGTCGAC ACCGCGGCCA CCGTGATCGA GCTGGCCGGC GGCACGGTGC CGGGCGGCTG GGACGGCCGG AGCTTTGCCG ACCGACTGTC CGAAGGCGCC GACGAGGGGC GGCCGTGCTT GGTCATGTCG CAGGCGGCCT GGAGCTGTCA GCGGGCCGTG CGCCTGGTCG ACGACGGCCG CGATTATGTG TTCGTGCGCA GCTATCACGA CGGCTATCAC TGCTTCGACG AGCTGCAGCT CTTTGATATC GGCGCTGATT ATCACGCGCA GCACAACCTC GCGGCGGAGC GCCCGGCGCT GGTGCAGCGC GCGCTCGCGC AGCTCGAGAG CTGGCACGGC GAGATGATGC GCACGGCCAC GCACCCGGCC GATCCCATGT GGAACGTGCT GCGCGAGGGC GGGCCCAAGC ACACCCGCGG CGAGCTGCCC GGCTACCTCG CGCGGCTGCG CGCCACCGGC CGTGAGCGCT GGGCCGAGCG CCTCGAAGCC CGCCACGGCG CCGGCTGA
|
Protein sequence | MRILYIDIDC LRPDHLGCYG YHRDTSPNID AIAARGLRFD NVYISDAPCL PSRTALFSGR FGVHTGVVNH GGLGADMYPD GESRGFGSRL GRTCYMRALR DAGFYTATVS PFGERHAAWH FYAGFHEVHN PGRRGLERAD EIEPIAQDWL RRRGREDDWF LHVNLWDPHM PYRAPASFGE PFADAPLPAW LSEEVRAAHY EGCGPHSARE VIGFSDQVPP GIAWDYPRQP LRIDSMDAVR RVFDGYDTGV RYADEAVGRL LATLDELGVR EDTAIIVSAD HGEALGEQNI YGDHQTADHI TTRVPWILDW PGVTEAAAGQ ARSALHYQVD TAATVIELAG GTVPGGWDGR SFADRLSEGA DEGRPCLVMS QAAWSCQRAV RLVDDGRDYV FVRSYHDGYH CFDELQLFDI GADYHAQHNL AAERPALVQR ALAQLESWHG EMMRTATHPA DPMWNVLREG GPKHTRGELP GYLARLRATG RERWAERLEA RHGAG
|
| |
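The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration and not the pipeline used to produce the record: the `translate` helper is hypothetical, and it relies on the fact that translation table 11 assigns amino acids to codons exactly as the standard code does (the tables differ only in permitted start codons, which does not matter here because the gene starts with ATG). Only the first three 10-base blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Any trailing partial codon is ignored.
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in the record.
first_blocks = "ATGCGAATCC TGTACATCGA CATCGACTGC"
print(translate(first_blocks))  # MRILYIDIDC
```

The output matches the first 10-residue block of the protein sequence in the record. Passing the full gene sequence to the same helper would be the natural way to check the remaining residues.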