Gene Information |
Locus tag | Hoch_0544 |
Symbol | |
ID | 8542924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 732763 |
End bp | 734382 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646385338 |
Product | sulfatase |
Protein accession | YP_003265075 |
Protein GI | 262193866 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
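
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(732763, 734382)
print(length)                  # 1620
print(protein_length(length))  # 539
```

Both values match the Gene Length and Protein Length fields of this record.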
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGACCGCA GAACCCTCAC CGACCCCATC ACCACCGCGC TCGCCCTGGT GTTTCTGCAC CTGGCGATCG CGATCTTCTA CTGGCTGGCC GCGCCCGCGC CCGCGACCCT GCTCGCGCCC ACCTGGGAGA TCCTCGGCCT GGTCGGCGTG CTGCTGCTCG TCGGCTGGAT TGGGACAGTT GTCCAGATTC GCGGCCTCGA CAGCGTGCTC ACGGTGCTGG TGCTCGCCTA CTTCGTGCTC GGCCTGGCCC AGGGCTTCGC GCGCCGCGAG TTCGGCTACG ACGTCATCGT CGCCCTGCAC GCGCCCTACG TGCCCGAGCT GTTCCGCATG CTGTACAACG GCGAGACCCT GGGCATGTTC CTGCTCTACT GCGCCCTGCT CGCCGTGGTC CTCATCGCCC TGCTCGCCGG CATCTACGCC GCCGTGCGCA AGCTCGTGCA GCAGAGCGCG AGCACCCGCC GCAGCCGCCT GCTCATCGGC GGCGGCGTGG CCGTCTACGC CGCGATCACG ATCCCGCTCT TCGGCATCCA GGGCCCGCTC ACCAAGAACC TGATCGAGCA AGTCGACCTG GCCGTCAACC TCGACCAGCG CATCGACGAG ACCGCGCGCC GCATCGAGCT CGAGACCGCC TGGATCAAGC AGCGCAATCC CTTCGCCAAG CTCGACAAGC CGCCGCGCAT CCTGCTGTTC ATCATCGAAT CCTACGGCCG CGCGCTGTAC GACGACGAGG ACTTCGCAAA CTTCCAGAAG CTGGCCCAGG CCTCGGCCGA CAAGCTGGCC GAGGGCGGCT ACGCAGTGCG CTCGCGCTAC CTCACCGCGC CCGTGTTCGG CGGCTCCTCG TGGCTCTCGG ACACCGCGCT CTTGTGCGGC GTGCGCATCC CCGATCAGAA GCACTTCCGC GGCATGCACG CCTCGCACGC CAAGTGCCTG CCGCACATCC TCGACGACGC CGGCTATCAC ACCGTGGTCG CGGCGCCCAA CACCAAGACC CTCGACGGCA CCTTCGACGA GGGCCTCGGC TTCGAGACCA TCTACTACCG CGAAAACCTG GCCTACCAGG GACCGCGCTT CGGCTGGTCG TTCATGCCCG ACCAGTACGC CATCCAGCGC GTGCACGAGG ACGTGCTGGC CAAGGGCGGC GAGCGGCCCG AGTTCGTGAC CTACATCCTC ACCAGCAGCC ACCACCCGTG GAACAAGCTG CCGCCCCTCA TCGACCAGTG GGACGAAATC GGCGACGGCG CGATCTTCTC CAAGCGCCGG CCGCGGCGCT TCAGCAACTC CTTCGTCGGC GGCCGCCAGG TCAAGAAGGC GTTCATGGCC AGCATCGAGT ACTCGTTCGA GACCGTGATC GAGTACCTCC TGCGCCTCGA CGACCGCGAG ACCATCATCG TGATCACCGG CGATCATCAG CCGCGCACGC CCATCGCCGA CCTCAACACC GACCCCTGGG ACGTGCCCTT CCACATCATC AGCCGCAACC CCGCGCTGGT CGAGCGCTTC GCCAGCGCCG GCTACGAGGC CACGCTCGAG CCCCGGAGCG ACGCGCCCGC GCAGGGCTCT GAGGGCTTCC TGCTGCAGCT CTTCCGGGCG TTTTCGGACG CCGACACGAA CCGACGCTGA
Protein sequence | MDRRTLTDPI TTALALVFLH LAIAIFYWLA APAPATLLAP TWEILGLVGV LLLVGWIGTV VQIRGLDSVL TVLVLAYFVL GLAQGFARRE FGYDVIVALH APYVPELFRM LYNGETLGMF LLYCALLAVV LIALLAGIYA AVRKLVQQSA STRRSRLLIG GGVAVYAAIT IPLFGIQGPL TKNLIEQVDL AVNLDQRIDE TARRIELETA WIKQRNPFAK LDKPPRILLF IIESYGRALY DDEDFANFQK LAQASADKLA EGGYAVRSRY LTAPVFGGSS WLSDTALLCG VRIPDQKHFR GMHASHAKCL PHILDDAGYH TVVAAPNTKT LDGTFDEGLG FETIYYRENL AYQGPRFGWS FMPDQYAIQR VHEDVLAKGG ERPEFVTYIL TSSHHPWNKL PPLIDQWDEI GDGAIFSKRR PRRFSNSFVG GRQVKKAFMA SIEYSFETVI EYLLRLDDRE TIIVITGDHQ PRTPIADLNT DPWDVPFHII SRNPALVERF ASAGYEATLE PRSDAPAQGS EGFLLQLFRA FSDADTNRR
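
The protein sequence can be reproduced from the gene sequence, and the GC content recomputed, with a short script. This is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is shown in coding orientation (it starts with ATG even though the gene lies on the minus strand) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as starts. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); amino acid assignments are
# identical for the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the 10-base spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on ambiguity codes such as N
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 20 bases of the gene sequence, copied with the record's spacing.
print(translate("ATGGACCGCA GAACCCTCAC"))  # MDRRTL
```

Passing the full Gene sequence field to `translate` should give the Protein sequence field once the spacing is removed, and `gc_fraction` on the same input can be compared with the reported GC content.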