Gene Hoch_0544 details

Gene Information

Locus tag: Hoch_0544
Symbol:
ID: 8542924
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 732763
End bp: 734382
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 68%
IMG OID: 646385338
Product: sulfatase
Protein accession: YP_003265075
Protein GI: 262193866
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily
TIGRFAM ID:
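
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(732763, 734382)
print(length, protein_length_aa(length))  # 1620 539
```

Both results agree with the Gene Length and Protein Length fields of the record.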


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCGCA GAACCCTCAC CGACCCCATC ACCACCGCGC TCGCCCTGGT GTTTCTGCAC 
CTGGCGATCG CGATCTTCTA CTGGCTGGCC GCGCCCGCGC CCGCGACCCT GCTCGCGCCC
ACCTGGGAGA TCCTCGGCCT GGTCGGCGTG CTGCTGCTCG TCGGCTGGAT TGGGACAGTT
GTCCAGATTC GCGGCCTCGA CAGCGTGCTC ACGGTGCTGG TGCTCGCCTA CTTCGTGCTC
GGCCTGGCCC AGGGCTTCGC GCGCCGCGAG TTCGGCTACG ACGTCATCGT CGCCCTGCAC
GCGCCCTACG TGCCCGAGCT GTTCCGCATG CTGTACAACG GCGAGACCCT GGGCATGTTC
CTGCTCTACT GCGCCCTGCT CGCCGTGGTC CTCATCGCCC TGCTCGCCGG CATCTACGCC
GCCGTGCGCA AGCTCGTGCA GCAGAGCGCG AGCACCCGCC GCAGCCGCCT GCTCATCGGC
GGCGGCGTGG CCGTCTACGC CGCGATCACG ATCCCGCTCT TCGGCATCCA GGGCCCGCTC
ACCAAGAACC TGATCGAGCA AGTCGACCTG GCCGTCAACC TCGACCAGCG CATCGACGAG
ACCGCGCGCC GCATCGAGCT CGAGACCGCC TGGATCAAGC AGCGCAATCC CTTCGCCAAG
CTCGACAAGC CGCCGCGCAT CCTGCTGTTC ATCATCGAAT CCTACGGCCG CGCGCTGTAC
GACGACGAGG ACTTCGCAAA CTTCCAGAAG CTGGCCCAGG CCTCGGCCGA CAAGCTGGCC
GAGGGCGGCT ACGCAGTGCG CTCGCGCTAC CTCACCGCGC CCGTGTTCGG CGGCTCCTCG
TGGCTCTCGG ACACCGCGCT CTTGTGCGGC GTGCGCATCC CCGATCAGAA GCACTTCCGC
GGCATGCACG CCTCGCACGC CAAGTGCCTG CCGCACATCC TCGACGACGC CGGCTATCAC
ACCGTGGTCG CGGCGCCCAA CACCAAGACC CTCGACGGCA CCTTCGACGA GGGCCTCGGC
TTCGAGACCA TCTACTACCG CGAAAACCTG GCCTACCAGG GACCGCGCTT CGGCTGGTCG
TTCATGCCCG ACCAGTACGC CATCCAGCGC GTGCACGAGG ACGTGCTGGC CAAGGGCGGC
GAGCGGCCCG AGTTCGTGAC CTACATCCTC ACCAGCAGCC ACCACCCGTG GAACAAGCTG
CCGCCCCTCA TCGACCAGTG GGACGAAATC GGCGACGGCG CGATCTTCTC CAAGCGCCGG
CCGCGGCGCT TCAGCAACTC CTTCGTCGGC GGCCGCCAGG TCAAGAAGGC GTTCATGGCC
AGCATCGAGT ACTCGTTCGA GACCGTGATC GAGTACCTCC TGCGCCTCGA CGACCGCGAG
ACCATCATCG TGATCACCGG CGATCATCAG CCGCGCACGC CCATCGCCGA CCTCAACACC
GACCCCTGGG ACGTGCCCTT CCACATCATC AGCCGCAACC CCGCGCTGGT CGAGCGCTTC
GCCAGCGCCG GCTACGAGGC CACGCTCGAG CCCCGGAGCG ACGCGCCCGC GCAGGGCTCT
GAGGGCTTCC TGCTGCAGCT CTTCCGGGCG TTTTCGGACG CCGACACGAA CCGACGCTGA
 
Protein sequence
MDRRTLTDPI TTALALVFLH LAIAIFYWLA APAPATLLAP TWEILGLVGV LLLVGWIGTV 
VQIRGLDSVL TVLVLAYFVL GLAQGFARRE FGYDVIVALH APYVPELFRM LYNGETLGMF
LLYCALLAVV LIALLAGIYA AVRKLVQQSA STRRSRLLIG GGVAVYAAIT IPLFGIQGPL
TKNLIEQVDL AVNLDQRIDE TARRIELETA WIKQRNPFAK LDKPPRILLF IIESYGRALY
DDEDFANFQK LAQASADKLA EGGYAVRSRY LTAPVFGGSS WLSDTALLCG VRIPDQKHFR
GMHASHAKCL PHILDDAGYH TVVAAPNTKT LDGTFDEGLG FETIYYRENL AYQGPRFGWS
FMPDQYAIQR VHEDVLAKGG ERPEFVTYIL TSSHHPWNKL PPLIDQWDEI GDGAIFSKRR
PRRFSNSFVG GRQVKKAFMA SIEYSFETVI EYLLRLDDRE TIIVITGDHQ PRTPIADLNT
DPWDVPFHII SRNPALVERF ASAGYEATLE PRSDAPAQGS EGFLLQLFRA FSDADTNRR
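
The relation between the two sequence blocks and the GC content field can be explored with a short script. This is a minimal sketch, not the database's own pipeline: it uses the amino acid assignments of NCBI translation table 11 (which are the same as those of the standard code), ignores the alternative start codons that table 11 allows, and strips the spaces that separate the 10-letter groups. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the usual TCAG ordering; the amino acid assignments of
# translation table 11 match the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-letter groups and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGGACCGCA GAACCCTCAC CGACCCCATC"
print(translate(first_block))  # MDRRTLTDPI
```

Pasting the full gene sequence into `translate` should reproduce the protein sequence block, and `gc_fraction` on the same input can be compared against the GC content field.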