Gene Hoch_2284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2284 
Symbol 
ID8544670 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp3179991 
End bp3182345 
Gene Length2355 bp 
Protein Length784 aa 
Translation table11 
GC content69% 
IMG OID646386989 
Productsulfatase 
Protein accessionYP_003266720 
Protein GI262195511 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACTG CTCGCACAGC GCTCGCAACG CATGGGCGCC TCCCGGCGCT TCTCTCGATC 
GCCTTGACCG CGGTGGTGAT GGCCCCGGCC TGTAGCAAGG ACAGCGGGGC GCCCGCCGAG
GGCAAGGCCA CCGGGGCCTC CGCGACCACG GCCGCGACCG ACGAGGCCAA AACCCCCGAG
GCGCCCAAGA AACCCTCGGT GCGCACCAAA CTCGAAGATT ACCGCCCGCT GCTCGACGAG
GCCGCGCGCG CCGAGCTCGA CGCCGGCGGC TTGTTCATCG ACCTCGGCAC CGCCGACCAG
CACAAGTTCA CGCGCGGCGG CTGGCGCACG GGTTGGGGCG AGAGCAAGCA GGAAGACGGC
CTGTCGCTGT CCGCCGGCGA CGATCGCCGC CTGCATCTCG ACGTCCTCAC CAGCGAGCCC
GCGGCCGAAG TCGCGGTCCG CGCCCGCTCG GCCGTCGACG GCCAGGCGGC CACGGTGTGG
GTCGACGGCG AGCGCCAGGG CGACATCTGG CTCAAGCCCG AGTGGACCGT GATGCGCGTG
CCCGTGGCCG AGGGCGCGCT CGCGGCCGGG CGCCACAACG TGCAGCTCAC CTTCACCAAG
AGCGGCACCC CGCGCGCCGA GATCGACTGG GTGTGGCTGG CCAACGAGGT CGACGCCGAG
CCGCCCTCGG CCAGCGTCCG CGTGCTGCCC CTGCGCATCG GCAGCCAGCC CAAGCGCGCG
CTCATCGCGC CGACCACGCG CACCTATTCC TTCTACATGC AGCCGCCGGC CGGCGCCTCG
CTGGTCTTCG ACTACGCCTC GGACGTCGGC GCCAGCTTCG AGGTCCGCGC CCAGGCTGAC
GGCGGCGCAG CCGAGTCGCT GTTCACGGCC ACCGGCGGTG AGAGCTGGGA AGAGGCCCAG
GTCGATCTCG GCGCCTTCGC CGACAAGGCC ATCCGCCTCG ACCTGGTCGC CACCGGCGCC
GAGGGCCGCT CCGGCTGGGG CGAGCCCGCG CTCATGGTCG CCCCGGGGAA TTCGGACAAG
CCGGCCCTCA AGACCGCGGC GCCGGGCACG CCCAAGAACG TGGTCGTCAT CCTCATCGAC
ACCGTGCGCG CCGACTCGTT TGCGGCCATC CGTCCGGATA ACAAGGTCGT GACCCCGGCC
TTCGACGCCT TCGCCGACAA GGCCACGGTG TTCACCAACG CCTACAACAA CGAGAACTGG
ACCAAGCCCT CGGTGGCTAG CCTGCTCTCG GGCCTGTACC CGAGCACGCA CGACACCAAG
AAAGACGAGA GCAGCCTGCC CAAAGAGGTC GAGATCCTGT CGCAGCGCCT GAGCAAGCAG
GGCTTCGCCA CCGCCGGGTT CGTGGCCAAC GGCTACGTGT CCGAAAAATT CGGCTTCGAG
AAGGGCTGGG ACGCCTTTAC CAACTACATC CGCGAGAACA AGAGCTCGGA GGCCGAGTAC
GTCTACGGCG ACGCGCTGGC CTGGCTGGGC GAGCGCGAAA AGGCCGCCGA CGGCAAGCCC
TTCTTCCTCT ACATCCAGAC CATCGATCCG CACGTCACCT ACAAGGTCGA GCGGCCCTTC
ACCCAGCACT ACTACGCCGA GGACTACGGC GGTCCGCTGG GGCCGACCAT CGACGCGCTC
GAGCTGCAGG ACCTGTCCAC CGGCAAAAAG CAGGCGAGCG ACAAGGACCT GGCCTGGCTG
CGCGCCATGT ACCGCGGCGA GGTCACCTAC CACGACGAGC ACATGGGCAA GTTCTTCGAG
CAGCTCCAGA CCATGGGCCG CATGGACGAC ACGCTCATCG TCATCACCAA CGACCACGGC
GAGGAGCTGG GCGACCACGG CAAGTTCGGC CACGGCCACA CCCTGTTCGA CGAGCTTCTG
CGCGCGCCGC TGCTGATGTA CTTCCCGGGC ATGTTCCCGG AAGGCGGACG CGTCGACGAA
ATCGTCGAGA CCGTGGACAT CGCGCCGACC ATCGTCGAGG TGCTCGGCCT CGAGCCCATG
AGCAACGCCG ACGGCACCTC GCTGCTGCCG CTGGTGCAGG GCAAACCCAT CCAGCGCCCG
ACCTACGCCA TCAGCGAGTT CCTCGACGGC CGCCGCGCCG TGCGCGTGGG CGACTGGAAG
TTCATGCGCA GCTCGAGCAC CTGGGCCAAC CTGCACAACG TCGCCGACGA CTTCCACGAG
GAGAACGACC GCAGCGAGGA CGCTCTGATC GCGCGCCGCA TGTGCGAGGT GCACCTGGGC
GAGGGCCTGG CCACGCCCGA CAAGAGCAAG CGCCAGCAGG ACATCACCAT CCGCCGCAAG
TTCAAGGCGG GCGAGGCCGA CATCGACCCG GCCATGCGCA AGCAGCTCGA GGCGCTGGGG
TACTTCGGTG AGTGA
 
Protein sequence
MTTARTALAT HGRLPALLSI ALTAVVMAPA CSKDSGAPAE GKATGASATT AATDEAKTPE 
APKKPSVRTK LEDYRPLLDE AARAELDAGG LFIDLGTADQ HKFTRGGWRT GWGESKQEDG
LSLSAGDDRR LHLDVLTSEP AAEVAVRARS AVDGQAATVW VDGERQGDIW LKPEWTVMRV
PVAEGALAAG RHNVQLTFTK SGTPRAEIDW VWLANEVDAE PPSASVRVLP LRIGSQPKRA
LIAPTTRTYS FYMQPPAGAS LVFDYASDVG ASFEVRAQAD GGAAESLFTA TGGESWEEAQ
VDLGAFADKA IRLDLVATGA EGRSGWGEPA LMVAPGNSDK PALKTAAPGT PKNVVVILID
TVRADSFAAI RPDNKVVTPA FDAFADKATV FTNAYNNENW TKPSVASLLS GLYPSTHDTK
KDESSLPKEV EILSQRLSKQ GFATAGFVAN GYVSEKFGFE KGWDAFTNYI RENKSSEAEY
VYGDALAWLG EREKAADGKP FFLYIQTIDP HVTYKVERPF TQHYYAEDYG GPLGPTIDAL
ELQDLSTGKK QASDKDLAWL RAMYRGEVTY HDEHMGKFFE QLQTMGRMDD TLIVITNDHG
EELGDHGKFG HGHTLFDELL RAPLLMYFPG MFPEGGRVDE IVETVDIAPT IVEVLGLEPM
SNADGTSLLP LVQGKPIQRP TYAISEFLDG RRAVRVGDWK FMRSSSTWAN LHNVADDFHE
ENDRSEDALI ARRMCEVHLG EGLATPDKSK RQQDITIRRK FKAGEADIDP AMRKQLEALG
YFGE