Gene Hoch_1843 details

Gene Information

Locus tag: Hoch_1843
Symbol:
ID: 8544225
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2537355
End bp: 2539835
Gene Length: 2481 bp
Protein Length: 826 aa
Translation table: 11
GC content: 76%
IMG OID: 646386549
Product: sulfatase
Protein accession: YP_003266284
Protein GI: 262195075
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
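
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based and inclusive, and that the gene span includes the stop codon. The variable names are made up for the example.

```python
# Values copied from the record above.
start_bp = 2537355
end_bp = 2539835

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the span includes the stop codon, which encodes no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 2481
print(protein_length)  # 826
```

Under those assumptions the computed values agree with the Gene Length (2481 bp) and Protein Length (826 aa) fields.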


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.10365
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGCGGC AGCGCGGTCT GGCCGTGCTC CTGGCGTTGG CCTGGTCGGG GCTGCCCGGG 
TGCGGCGACT GCAATCGCTC CGACGAGCTG GCCGAGGTCC CGGTGGCGCC CGCCGACGCC
GGCGCCGAGG CCGCTGACGC GCGCAAAGAC GCCGCCGAGC CGGCGCCCGA TCGCGCCGAG
CTGCCGATCT TTCGCCTCGG GCCCAACATC CTCCTGGCCC ACGCGCAGCG CGGCGAGGCG
CTGTTCCTCG ACGCCGGCTC GGCCGGCTTC GCCAAGTATC TGCGCTTCGG CATCCCGGCG
CTGCGCTGGC GGCTGCAGCA GGAGCGCGAG GGCGTGCGCG TGGCCGTGCC CGAGCGCGCG
GCCGCGATCG AGGTGCCGCT CACGGCGGCG CAGGCGGCCT CGGGCGCGAT TCGCCTGGGC
GTCCACGCCA GCGCGCCGGG CCGGATCGCG CTCAAGGTCG ATGGTCGCAA AGCCGGCGAG
GCCGAGCTCG CGCCGGGCTG GCAGGTGGTC GCGATCGACG CCGAAGAGGG CCGTCTCCAG
GCCGGCGCCA ATATCGTCGT GGTCGAGACC CAGGGCGCCG AGCCGCCCGG CATCGCCTGG
TTGCAGTTCG GCAGCGCGGC CGCGGTGCCG GCCGAGCGCG CGGCCTCGCC AGCGCCCGCG
GTGTTCGATC ACGAGGCCGA TACCCTGGTC CTGGGCCGCG ATGCCGGCCT GGTCTATCAC
CTGTTCGTGC CCGAAACTGG GCGCCTGCTC GCCGCGGTGG TACCGGCCGC GGATGCTGCG
CCGGCCGCAG CTTCTGCGGG CGCAGATGGC GATGTCGTCC AGGTGGACGG CGACGAGCGC
GCCGGTGGCT GTGTGGTGCG CGCGCGCGCC GAGGCCGGCA TCGGTTCGGT CGAGGCCGCG
CTGAGCGGTC CGCGCGCGCT CATGGATCTC AGCGCCTTGG GCGGACGCGT GGTCCGGCTC
GAGCTGCGCG CCGAGGGCTG CGAGCGCGTG CGTTTGCAGG AGCCGCGCAT GACCGTGGCC
GGGGCGACGC GGGTGGGTTC GGACGGCGAG CGCCCGCGCT TCGTGGTGTT GTGGCTGATG
AGCGGGCTGC GCGCCGACCG CGTGCGCCCC TTTGCGCCCT GGGCGCGTTC GGAGACGCCG
GTGCTCGAGC GCCTGGCCCA GACCGGGCTG AGCGTGTCGC CGAGCTGGGC GCAGTCGCCG
CAGACCCAGG CCGCGCGCGC GGCGCTGTGG ACCGGGCGCT ATCCCATCCG CCGCAGCGCG
CTCGGCGGCC CGTCTGCGCC GACGCCGGCG GGAGCGCAGC CGGGCACGGC CGCGCCCGCC
AAGAACAGGG CCGGCAAGGA CGCCGACAAG GCCGCCGGGA CGCGGCTGCG CGCGCGCGCG
CCCGCGCTGG GCGTGGAGAT GCGCGAGGCC GGCTTTCGCA CCGTGGCGGT GACCGCGGCC
ATGGACGCGC AGCCCGGCTT TGCCGACGGG TTCGAGCTGT GGGAGCGGGT CGCGCCGCCT
GCCGCCCCGG GCGCGGCTGC AGGCGCGGCG GGGGAGACCG CGGCCGGTGA CGCCGTGCTC
GCGCAGGCGC TCACCCGGCT CGAGGAGCGC TACCGCGAGG GCCCGGTGTT TCTGCTGCTC
GAGACCGCGG ACGCGCGCCT GCCCTGGATC GGACATCGGC CCTGGCTGGA TCGCTACGAT
CCCGGCGCCT ACGAGGGTCC CTTCGCCGAC GGCGCCACGC TCGCGGCCGT GGACGCGGAG
CCCCGCGGCG ACGCCGCGCG TCTGGCCGAC CTGGTGCAGT GCGACGAGAC CCCGAGCGAG
CGCGATCTCG CCCGCCTGCG CGCCATCTAC GACGCCACCG CCAGCTACCA GGACGCGCTG
CTCGGTCAGC TCGTCGATCG CCTGGCGCAG TGGGGCATCT TCGAGCAGAC GCTGCTGGTG
GTGGTGTCCG ATCACGGCCA GGAGACCTGG GAAGACGGCC GCTGCGGCCA CGGCGCATCG
CTGCGCGAGA GCCTGCTGGC GGTGCCGCTG CTGCTGCATC ACCCCGGCCA GGTGCCGGGC
GGGCAGGTGC ACGCCGGCGG CGCCGAGGCG GTCGATGTCC TGCCCTCGCT GCTGCGGCTG
GCGGGCGTGC CCGTGCCCGA GCCGGTGCAG GGTCGGCCGC TGGTCGAGGT CGCGCGCGCG
CCCGGCTACC CGCAGCCGAT GTTCGCGGCC GTCGAGGGCG CCGCGCACGC GGTGCGCGTG
GCCGGTTGGA AGCTGGTGGT CGGCGCCGGC GGCGCGGCCC TCTTCGACCT GAGCGCGGAT
CCCGGCGAGC AGCGCAGCGC GATCCAGGAG CGCCCGTTCG AGCGCCAGTT CGCGAGCGAT
ATCCTGTCGC TGCACGTGCT CTATCGCGCG CGCTGGAATC AGCGCAGCTG GGGCGTTGCC
AGCAATCTCA GCGCCGCCGG GTGGCTGCAC ATCGAAGCCC CCACGGACGC CGCGGCCGAC
TCCGCGCCGC CGGCGCCCTG A
 
Protein sequence
MVRQRGLAVL LALAWSGLPG CGDCNRSDEL AEVPVAPADA GAEAADARKD AAEPAPDRAE 
LPIFRLGPNI LLAHAQRGEA LFLDAGSAGF AKYLRFGIPA LRWRLQQERE GVRVAVPERA
AAIEVPLTAA QAASGAIRLG VHASAPGRIA LKVDGRKAGE AELAPGWQVV AIDAEEGRLQ
AGANIVVVET QGAEPPGIAW LQFGSAAAVP AERAASPAPA VFDHEADTLV LGRDAGLVYH
LFVPETGRLL AAVVPAADAA PAAASAGADG DVVQVDGDER AGGCVVRARA EAGIGSVEAA
LSGPRALMDL SALGGRVVRL ELRAEGCERV RLQEPRMTVA GATRVGSDGE RPRFVVLWLM
SGLRADRVRP FAPWARSETP VLERLAQTGL SVSPSWAQSP QTQAARAALW TGRYPIRRSA
LGGPSAPTPA GAQPGTAAPA KNRAGKDADK AAGTRLRARA PALGVEMREA GFRTVAVTAA
MDAQPGFADG FELWERVAPP AAPGAAAGAA GETAAGDAVL AQALTRLEER YREGPVFLLL
ETADARLPWI GHRPWLDRYD PGAYEGPFAD GATLAAVDAE PRGDAARLAD LVQCDETPSE
RDLARLRAIY DATASYQDAL LGQLVDRLAQ WGIFEQTLLV VVSDHGQETW EDGRCGHGAS
LRESLLAVPL LLHHPGQVPG GQVHAGGAEA VDVLPSLLRL AGVPVPEPVQ GRPLVEVARA
PGYPQPMFAA VEGAAHAVRV AGWKLVVGAG GAALFDLSAD PGEQRSAIQE RPFERQFASD
ILSLHVLYRA RWNQRSWGVA SNLSAAGWLH IEAPTDAAAD SAPPAP
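
The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any analysis. The following is a minimal sketch, not the database's own method, of how the gene sequence could be cleaned, its GC fraction computed, and its codons translated with the amino acid assignments of translation table 11. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The sketch ignores the alternative start codons that table 11 permits, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11, with codons
# ordered TTT, TTC, TTA, TTG, TCT, ... ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (sequence must be non-empty)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; stops appear as '*'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First row of the gene sequence as printed above.
first_row = "ATGGTGCGGC AGCGCGGTCT GGCCGTGCTC CTGGCGTTGG CCTGGTCGGG GCTGCCCGGG"
print(translate(first_row))  # MVRQRGLAVLLALAWSGLPG
```

The translated first row matches the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.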