Gene Hoch_2507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2507 
Symbol 
ID8544894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp3452332 
End bp3454953 
Gene Length2622 bp 
Protein Length873 aa 
Translation table11 
GC content74% 
IMG OID646387207 
Productsulfatase 
Protein accessionYP_003266936 
Protein GI262195727 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0936334 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGGCG CGGGCGCGGG CGCCATCGAC GCCGTGTGGT CGTGGTCGGC GCTGGGCCAG 
TTCCTGCCCG GCGCCGGCGG CAAGCTGCGC GCGCTGCTGT TTCTGGCCGC CAGCTACGGC
CTGCTGTGCG GCCTGCTCGG CGCCGCGCTC GCCGCCGTGG TCGCCCTGTA CGCGCGCGCC
TCGTGCCTGG GCCGCGTGCT GCCGGAGGCG ATGCGTCGCC ACCGCGCCGC CCGCGAGCGC
GATCCGGGCA GCGCCGTGCT CGGCCTGGCG CTGGTGCTGG CCGGCGTGCC GACGCTGGCG
GCCGCGCTCG GTATCGCGTT TTACGCCGCC CAGCTCGGCC TGCAGGAGCG CAAGCACTTC
GGCCTGGTCA TCGCCGTGAC CATGGCCGCG ACCCTGGGCG CCGTGCTCGC GGCCGCGGCC
CTCACCCTGC TGCTGGCCCA CGCGCTCGAG CGCGCGCTCG CGGCCCTGGC CCGGCGCCCC
GGCCTGGGCG CGCTGGCCTC GCCGCGCGCG CCCGCGAGCG CGGGCCTGCT CATGCTGCTC
GTGGCCGCGA CCGGCGCCGT GCTCAGCACC TGGAGCACGC TGCGCCTGCT GCCCCTGCGG
CCGCTGTGGA TCGCGCTCGT GGCCGCGGCC CTGAGCCTGC CGGCGCTGCC CATCGGCGTC
CGCGCGCGCG CTCGCCTGTC GCGCCTGCGC CCGGCCCTGC GCCGCGCCCT CCCGCTGGCC
GCGATCGCGC TGGCCGCGCT GCTGGTGCTG GTCACCGGCG CCCCCGACGG CGTGCGCAAG
GCCACCGGCA GCTACAGCGG CCTGGGCGGC CCCATCGCGC GCGTGCTGCG CACAGCCGTG
GATGTCGACC GCGACGGCTA CAGCCCGCTG CTCGGCGGCG GCGACTGCGA CGACTGGGAC
GCCACCGTGC ACCCGGGCGC CGACGAGATC CCGGACGACG GCATCGACCA GAACTGCGTC
GGCGGCGACC CCAGCCTCGG CCGCGCGCAG GACGACACCG GCTTCGTGCC GGTGCCGAGC
AGCGTGCCCG GCAACTTCAA CGTCGTGTTC CTCACCATCG ACACCATCCG CGCCGATCAC
GTCGGCGCGT ACGGCTACCA GCGGCCGACC ACGCCCACGC TCGACGCCCT GGCCGCCGAC
GGCGCCCTGT TCGTCAACGG CTGGGCGCAC GCGCCGTCCA CGCGCTACTC GATCCCGGCC
CTGCTCACCG GCCGGCTGCC GCTCGAGGTC GCCTACGACA CCGCGGTGCG CGGCTGGCCC
GGGCTGCTGC CGGAAGCCGA CACCCTGGCC GAATACGCGC AGCGCGCCGG CCTCACCACC
GGCGCCATCC TCAACTACTG GTACTTCGAC CGCTACCGGC GCATGGACCA GGGCTTTGGC
CACTACGACA ACGAGAACCA GAAGCTGCAC CGCGCGGTGT CGGGCAAGGG CCCGGCCGAG
ACCAGCGGCT CATCGTCCAA GCAGCAGACC GACAAGGCGC TGCGTTTCAT CGACCAGCAC
GCGGCCGAGC GCTTCTTCTT GTGGGTGCAC TACTACGATC CCCACTACCA GTACGAAGCT
CACGCCGAGG TGCCGAGCTT CGGCAGCGAA GACGGCGGCA CGCCCGGCCA GCGCGACCTC
TACGACCAGG AGATCCGCTT CACCGATATG CATATCGGAC GCCTGGTCCA GGACCTCAAG
CGCCGCGGCC TGTACCAGCG CACGGTCATC GTGGTCACCG GCGACCACGG CGAGGGGTTT
GGCGAGCACG GCATCGATCT CCACGGCTAT CACCTGTACG CGGCCCAGAC CAAGGTACCC
TTTATCATCC GCGTGCCCGG CCTGGCGCCG ACGCGCGTGA GCATGCCGGT CGGACACGTG
GACGTCTTGC CGACCCTGGC CAACCTGCTG GGCCGGCCGG CGAGCCCGTC GATGATGGGC
CGCTCGCTGC TCGGGGTGAT GAGCGGCCAG GCCGACGGCG ACGCCGAGCG CTACGTGTTT
CAGCAGCTCT CCTACGAGAA CAACAACGAG ATGCGGGCCG CCGTGAGCCG GCACTGTCAC
GTGATCTACA ACGTCAGCCC GGATACGAGC TGGGAGATCT ACCGGCTCGC AGACGACCCC
GATGAGGAGC GCGACATCGT CGACGCGCCG GGCGACTGCG AGCCGGCACG ACGCGCGCTC
GAGGCCTGGT ACGATCGCGC CGAGCTGCCC GAGGGCGCGC TCGAGGCGCT GCTCGCGGAG
CGGCCGGACG TGGCCCAGCC GCTGGGCGTG CATTTCGGAC AGGAGGTCGA GCTGCTCGCG
GTCGAGCTGC CGCCGGGGCC GGTGCGCGCC GGCCAGCAGA TGCCCGTGAC CTTTACCTTC
GCCGCCCACG GCCCGCTGCC CAGCGGCTGG CGGGTGTTCG CCCACTTCGA AGGGCCGGGC
CGCTTCCTCG GCGACCACGA GCCGCCGCGG CCGTTCTCGT GGTGGCGCGA GGGCCAGTAC
ATCCGCTACA CGCGCGAGAT CACCGTGCCC CGGCAGGCGC GCCCCGGGGA CTACGAGCTG
TGGCTGGGGC TGTTTCGCAA GGCCGAGCGC ATGCCGGCGC GCAGCGACGG CGTGCCCGTG
GACGGCGATC GCGTCAAGGT CGCCACCGTG CGCGTGCGAT GA
 
Protein sequence
MAGAGAGAID AVWSWSALGQ FLPGAGGKLR ALLFLAASYG LLCGLLGAAL AAVVALYARA 
SCLGRVLPEA MRRHRAARER DPGSAVLGLA LVLAGVPTLA AALGIAFYAA QLGLQERKHF
GLVIAVTMAA TLGAVLAAAA LTLLLAHALE RALAALARRP GLGALASPRA PASAGLLMLL
VAATGAVLST WSTLRLLPLR PLWIALVAAA LSLPALPIGV RARARLSRLR PALRRALPLA
AIALAALLVL VTGAPDGVRK ATGSYSGLGG PIARVLRTAV DVDRDGYSPL LGGGDCDDWD
ATVHPGADEI PDDGIDQNCV GGDPSLGRAQ DDTGFVPVPS SVPGNFNVVF LTIDTIRADH
VGAYGYQRPT TPTLDALAAD GALFVNGWAH APSTRYSIPA LLTGRLPLEV AYDTAVRGWP
GLLPEADTLA EYAQRAGLTT GAILNYWYFD RYRRMDQGFG HYDNENQKLH RAVSGKGPAE
TSGSSSKQQT DKALRFIDQH AAERFFLWVH YYDPHYQYEA HAEVPSFGSE DGGTPGQRDL
YDQEIRFTDM HIGRLVQDLK RRGLYQRTVI VVTGDHGEGF GEHGIDLHGY HLYAAQTKVP
FIIRVPGLAP TRVSMPVGHV DVLPTLANLL GRPASPSMMG RSLLGVMSGQ ADGDAERYVF
QQLSYENNNE MRAAVSRHCH VIYNVSPDTS WEIYRLADDP DEERDIVDAP GDCEPARRAL
EAWYDRAELP EGALEALLAE RPDVAQPLGV HFGQEVELLA VELPPGPVRA GQQMPVTFTF
AAHGPLPSGW RVFAHFEGPG RFLGDHEPPR PFSWWREGQY IRYTREITVP RQARPGDYEL
WLGLFRKAER MPARSDGVPV DGDRVKVATV RVR