Gene Information |
Locus tag | Hoch_1843 |
Symbol | |
ID | 8544225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 2537355 |
End bp | 2539835 |
Gene Length | 2481 bp |
Protein Length | 826 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 646386549 |
Product | sulfatase |
Protein accession | YP_003266284 |
Protein GI | 262195075 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.10365 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGCGGC AGCGCGGTCT GGCCGTGCTC CTGGCGTTGG CCTGGTCGGG GCTGCCCGGG TGCGGCGACT GCAATCGCTC CGACGAGCTG GCCGAGGTCC CGGTGGCGCC CGCCGACGCC GGCGCCGAGG CCGCTGACGC GCGCAAAGAC GCCGCCGAGC CGGCGCCCGA TCGCGCCGAG CTGCCGATCT TTCGCCTCGG GCCCAACATC CTCCTGGCCC ACGCGCAGCG CGGCGAGGCG CTGTTCCTCG ACGCCGGCTC GGCCGGCTTC GCCAAGTATC TGCGCTTCGG CATCCCGGCG CTGCGCTGGC GGCTGCAGCA GGAGCGCGAG GGCGTGCGCG TGGCCGTGCC CGAGCGCGCG GCCGCGATCG AGGTGCCGCT CACGGCGGCG CAGGCGGCCT CGGGCGCGAT TCGCCTGGGC GTCCACGCCA GCGCGCCGGG CCGGATCGCG CTCAAGGTCG ATGGTCGCAA AGCCGGCGAG GCCGAGCTCG CGCCGGGCTG GCAGGTGGTC GCGATCGACG CCGAAGAGGG CCGTCTCCAG GCCGGCGCCA ATATCGTCGT GGTCGAGACC CAGGGCGCCG AGCCGCCCGG CATCGCCTGG TTGCAGTTCG GCAGCGCGGC CGCGGTGCCG GCCGAGCGCG CGGCCTCGCC AGCGCCCGCG GTGTTCGATC ACGAGGCCGA TACCCTGGTC CTGGGCCGCG ATGCCGGCCT GGTCTATCAC CTGTTCGTGC CCGAAACTGG GCGCCTGCTC GCCGCGGTGG TACCGGCCGC GGATGCTGCG CCGGCCGCAG CTTCTGCGGG CGCAGATGGC GATGTCGTCC AGGTGGACGG CGACGAGCGC GCCGGTGGCT GTGTGGTGCG CGCGCGCGCC GAGGCCGGCA TCGGTTCGGT CGAGGCCGCG CTGAGCGGTC CGCGCGCGCT CATGGATCTC AGCGCCTTGG GCGGACGCGT GGTCCGGCTC GAGCTGCGCG CCGAGGGCTG CGAGCGCGTG CGTTTGCAGG AGCCGCGCAT GACCGTGGCC GGGGCGACGC GGGTGGGTTC GGACGGCGAG CGCCCGCGCT TCGTGGTGTT GTGGCTGATG AGCGGGCTGC GCGCCGACCG CGTGCGCCCC TTTGCGCCCT GGGCGCGTTC GGAGACGCCG GTGCTCGAGC GCCTGGCCCA GACCGGGCTG AGCGTGTCGC CGAGCTGGGC GCAGTCGCCG CAGACCCAGG CCGCGCGCGC GGCGCTGTGG ACCGGGCGCT ATCCCATCCG CCGCAGCGCG CTCGGCGGCC CGTCTGCGCC GACGCCGGCG GGAGCGCAGC CGGGCACGGC CGCGCCCGCC AAGAACAGGG CCGGCAAGGA CGCCGACAAG GCCGCCGGGA CGCGGCTGCG CGCGCGCGCG CCCGCGCTGG GCGTGGAGAT GCGCGAGGCC GGCTTTCGCA CCGTGGCGGT GACCGCGGCC ATGGACGCGC AGCCCGGCTT TGCCGACGGG TTCGAGCTGT GGGAGCGGGT CGCGCCGCCT GCCGCCCCGG GCGCGGCTGC AGGCGCGGCG GGGGAGACCG CGGCCGGTGA CGCCGTGCTC GCGCAGGCGC TCACCCGGCT CGAGGAGCGC TACCGCGAGG GCCCGGTGTT TCTGCTGCTC GAGACCGCGG ACGCGCGCCT GCCCTGGATC GGACATCGGC CCTGGCTGGA TCGCTACGAT CCCGGCGCCT ACGAGGGTCC CTTCGCCGAC GGCGCCACGC TCGCGGCCGT GGACGCGGAG CCCCGCGGCG ACGCCGCGCG TCTGGCCGAC CTGGTGCAGT GCGACGAGAC CCCGAGCGAG 
CGCGATCTCG CCCGCCTGCG CGCCATCTAC GACGCCACCG CCAGCTACCA GGACGCGCTG CTCGGTCAGC TCGTCGATCG CCTGGCGCAG TGGGGCATCT TCGAGCAGAC GCTGCTGGTG GTGGTGTCCG ATCACGGCCA GGAGACCTGG GAAGACGGCC GCTGCGGCCA CGGCGCATCG CTGCGCGAGA GCCTGCTGGC GGTGCCGCTG CTGCTGCATC ACCCCGGCCA GGTGCCGGGC GGGCAGGTGC ACGCCGGCGG CGCCGAGGCG GTCGATGTCC TGCCCTCGCT GCTGCGGCTG GCGGGCGTGC CCGTGCCCGA GCCGGTGCAG GGTCGGCCGC TGGTCGAGGT CGCGCGCGCG CCCGGCTACC CGCAGCCGAT GTTCGCGGCC GTCGAGGGCG CCGCGCACGC GGTGCGCGTG GCCGGTTGGA AGCTGGTGGT CGGCGCCGGC GGCGCGGCCC TCTTCGACCT GAGCGCGGAT CCCGGCGAGC AGCGCAGCGC GATCCAGGAG CGCCCGTTCG AGCGCCAGTT CGCGAGCGAT ATCCTGTCGC TGCACGTGCT CTATCGCGCG CGCTGGAATC AGCGCAGCTG GGGCGTTGCC AGCAATCTCA GCGCCGCCGG GTGGCTGCAC ATCGAAGCCC CCACGGACGC CGCGGCCGAC TCCGCGCCGC CGGCGCCCTG A
|
Protein sequence | MVRQRGLAVL LALAWSGLPG CGDCNRSDEL AEVPVAPADA GAEAADARKD AAEPAPDRAE LPIFRLGPNI LLAHAQRGEA LFLDAGSAGF AKYLRFGIPA LRWRLQQERE GVRVAVPERA AAIEVPLTAA QAASGAIRLG VHASAPGRIA LKVDGRKAGE AELAPGWQVV AIDAEEGRLQ AGANIVVVET QGAEPPGIAW LQFGSAAAVP AERAASPAPA VFDHEADTLV LGRDAGLVYH LFVPETGRLL AAVVPAADAA PAAASAGADG DVVQVDGDER AGGCVVRARA EAGIGSVEAA LSGPRALMDL SALGGRVVRL ELRAEGCERV RLQEPRMTVA GATRVGSDGE RPRFVVLWLM SGLRADRVRP FAPWARSETP VLERLAQTGL SVSPSWAQSP QTQAARAALW TGRYPIRRSA LGGPSAPTPA GAQPGTAAPA KNRAGKDADK AAGTRLRARA PALGVEMREA GFRTVAVTAA MDAQPGFADG FELWERVAPP AAPGAAAGAA GETAAGDAVL AQALTRLEER YREGPVFLLL ETADARLPWI GHRPWLDRYD PGAYEGPFAD GATLAAVDAE PRGDAARLAD LVQCDETPSE RDLARLRAIY DATASYQDAL LGQLVDRLAQ WGIFEQTLLV VVSDHGQETW EDGRCGHGAS LRESLLAVPL LLHHPGQVPG GQVHAGGAEA VDVLPSLLRL AGVPVPEPVQ GRPLVEVARA PGYPQPMFAA VEGAAHAVRV AGWKLVVGAG GAALFDLSAD PGEQRSAIQE RPFERQFASD ILSLHVLYRA RWNQRSWGVA SNLSAAGWLH IEAPTDAAAD SAPPAP
|
| |
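The coordinate and length fields in the Gene Information block can be cross-checked against each other. The following is a minimal sketch, not part of the source database: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes a single terminal stop codon (the gene sequence above ends in TGA). The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2537355
END_BP = 2539835
GENE_LENGTH_BP = 2481
PROTEIN_LENGTH_AA = 826


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    # Both checks pass for this record under the stated assumptions.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinate and length fields are consistent")
```

Under these assumptions, the 2481 bp span corresponds to 827 codons, of which 826 encode amino acids, matching the listed protein length.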