Gene Information |
Locus tag | Hoch_2507 |
Symbol | |
ID | 8544894 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 3452332 |
End bp | 3454953 |
Gene Length | 2622 bp |
Protein Length | 873 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 646387207 |
Product | sulfatase |
Protein accession | YP_003266936 |
Protein GI | 262195727 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
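
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; both are assumptions about the record's conventions, though they agree with the listed values. The variable and function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3452332
end_bp = 3454953
gene_length_bp = 2622
protein_length_aa = 873

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the final codon is the stop codon and is not counted in the protein.
codons, remainder = divmod(gene_length_bp, 3)


def record_is_consistent() -> bool:
    """Return True if coordinates, gene length and protein length agree."""
    return (
        span == gene_length_bp
        and remainder == 0
        and codons - 1 == protein_length_aa
    )


print(span, codons - 1, record_is_consistent())  # 2622 873 True
```

Under these assumptions the three fields agree: 2622 bp is 874 codons, one of which is the stop codon, leaving 873 amino acids.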
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0936334 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAGGCG CGGGCGCGGG CGCCATCGAC GCCGTGTGGT CGTGGTCGGC GCTGGGCCAG TTCCTGCCCG GCGCCGGCGG CAAGCTGCGC GCGCTGCTGT TTCTGGCCGC CAGCTACGGC CTGCTGTGCG GCCTGCTCGG CGCCGCGCTC GCCGCCGTGG TCGCCCTGTA CGCGCGCGCC TCGTGCCTGG GCCGCGTGCT GCCGGAGGCG ATGCGTCGCC ACCGCGCCGC CCGCGAGCGC GATCCGGGCA GCGCCGTGCT CGGCCTGGCG CTGGTGCTGG CCGGCGTGCC GACGCTGGCG GCCGCGCTCG GTATCGCGTT TTACGCCGCC CAGCTCGGCC TGCAGGAGCG CAAGCACTTC GGCCTGGTCA TCGCCGTGAC CATGGCCGCG ACCCTGGGCG CCGTGCTCGC GGCCGCGGCC CTCACCCTGC TGCTGGCCCA CGCGCTCGAG CGCGCGCTCG CGGCCCTGGC CCGGCGCCCC GGCCTGGGCG CGCTGGCCTC GCCGCGCGCG CCCGCGAGCG CGGGCCTGCT CATGCTGCTC GTGGCCGCGA CCGGCGCCGT GCTCAGCACC TGGAGCACGC TGCGCCTGCT GCCCCTGCGG CCGCTGTGGA TCGCGCTCGT GGCCGCGGCC CTGAGCCTGC CGGCGCTGCC CATCGGCGTC CGCGCGCGCG CTCGCCTGTC GCGCCTGCGC CCGGCCCTGC GCCGCGCCCT CCCGCTGGCC GCGATCGCGC TGGCCGCGCT GCTGGTGCTG GTCACCGGCG CCCCCGACGG CGTGCGCAAG GCCACCGGCA GCTACAGCGG CCTGGGCGGC CCCATCGCGC GCGTGCTGCG CACAGCCGTG GATGTCGACC GCGACGGCTA CAGCCCGCTG CTCGGCGGCG GCGACTGCGA CGACTGGGAC GCCACCGTGC ACCCGGGCGC CGACGAGATC CCGGACGACG GCATCGACCA GAACTGCGTC GGCGGCGACC CCAGCCTCGG CCGCGCGCAG GACGACACCG GCTTCGTGCC GGTGCCGAGC AGCGTGCCCG GCAACTTCAA CGTCGTGTTC CTCACCATCG ACACCATCCG CGCCGATCAC GTCGGCGCGT ACGGCTACCA GCGGCCGACC ACGCCCACGC TCGACGCCCT GGCCGCCGAC GGCGCCCTGT TCGTCAACGG CTGGGCGCAC GCGCCGTCCA CGCGCTACTC GATCCCGGCC CTGCTCACCG GCCGGCTGCC GCTCGAGGTC GCCTACGACA CCGCGGTGCG CGGCTGGCCC GGGCTGCTGC CGGAAGCCGA CACCCTGGCC GAATACGCGC AGCGCGCCGG CCTCACCACC GGCGCCATCC TCAACTACTG GTACTTCGAC CGCTACCGGC GCATGGACCA GGGCTTTGGC CACTACGACA ACGAGAACCA GAAGCTGCAC CGCGCGGTGT CGGGCAAGGG CCCGGCCGAG ACCAGCGGCT CATCGTCCAA GCAGCAGACC GACAAGGCGC TGCGTTTCAT CGACCAGCAC GCGGCCGAGC GCTTCTTCTT GTGGGTGCAC TACTACGATC CCCACTACCA GTACGAAGCT CACGCCGAGG TGCCGAGCTT CGGCAGCGAA GACGGCGGCA CGCCCGGCCA GCGCGACCTC TACGACCAGG AGATCCGCTT CACCGATATG CATATCGGAC GCCTGGTCCA GGACCTCAAG CGCCGCGGCC TGTACCAGCG CACGGTCATC GTGGTCACCG GCGACCACGG CGAGGGGTTT GGCGAGCACG GCATCGATCT CCACGGCTAT CACCTGTACG CGGCCCAGAC CAAGGTACCC 
TTTATCATCC GCGTGCCCGG CCTGGCGCCG ACGCGCGTGA GCATGCCGGT CGGACACGTG GACGTCTTGC CGACCCTGGC CAACCTGCTG GGCCGGCCGG CGAGCCCGTC GATGATGGGC CGCTCGCTGC TCGGGGTGAT GAGCGGCCAG GCCGACGGCG ACGCCGAGCG CTACGTGTTT CAGCAGCTCT CCTACGAGAA CAACAACGAG ATGCGGGCCG CCGTGAGCCG GCACTGTCAC GTGATCTACA ACGTCAGCCC GGATACGAGC TGGGAGATCT ACCGGCTCGC AGACGACCCC GATGAGGAGC GCGACATCGT CGACGCGCCG GGCGACTGCG AGCCGGCACG ACGCGCGCTC GAGGCCTGGT ACGATCGCGC CGAGCTGCCC GAGGGCGCGC TCGAGGCGCT GCTCGCGGAG CGGCCGGACG TGGCCCAGCC GCTGGGCGTG CATTTCGGAC AGGAGGTCGA GCTGCTCGCG GTCGAGCTGC CGCCGGGGCC GGTGCGCGCC GGCCAGCAGA TGCCCGTGAC CTTTACCTTC GCCGCCCACG GCCCGCTGCC CAGCGGCTGG CGGGTGTTCG CCCACTTCGA AGGGCCGGGC CGCTTCCTCG GCGACCACGA GCCGCCGCGG CCGTTCTCGT GGTGGCGCGA GGGCCAGTAC ATCCGCTACA CGCGCGAGAT CACCGTGCCC CGGCAGGCGC GCCCCGGGGA CTACGAGCTG TGGCTGGGGC TGTTTCGCAA GGCCGAGCGC ATGCCGGCGC GCAGCGACGG CGTGCCCGTG GACGGCGATC GCGTCAAGGT CGCCACCGTG CGCGTGCGAT GA
|
Protein sequence | MAGAGAGAID AVWSWSALGQ FLPGAGGKLR ALLFLAASYG LLCGLLGAAL AAVVALYARA SCLGRVLPEA MRRHRAARER DPGSAVLGLA LVLAGVPTLA AALGIAFYAA QLGLQERKHF GLVIAVTMAA TLGAVLAAAA LTLLLAHALE RALAALARRP GLGALASPRA PASAGLLMLL VAATGAVLST WSTLRLLPLR PLWIALVAAA LSLPALPIGV RARARLSRLR PALRRALPLA AIALAALLVL VTGAPDGVRK ATGSYSGLGG PIARVLRTAV DVDRDGYSPL LGGGDCDDWD ATVHPGADEI PDDGIDQNCV GGDPSLGRAQ DDTGFVPVPS SVPGNFNVVF LTIDTIRADH VGAYGYQRPT TPTLDALAAD GALFVNGWAH APSTRYSIPA LLTGRLPLEV AYDTAVRGWP GLLPEADTLA EYAQRAGLTT GAILNYWYFD RYRRMDQGFG HYDNENQKLH RAVSGKGPAE TSGSSSKQQT DKALRFIDQH AAERFFLWVH YYDPHYQYEA HAEVPSFGSE DGGTPGQRDL YDQEIRFTDM HIGRLVQDLK RRGLYQRTVI VVTGDHGEGF GEHGIDLHGY HLYAAQTKVP FIIRVPGLAP TRVSMPVGHV DVLPTLANLL GRPASPSMMG RSLLGVMSGQ ADGDAERYVF QQLSYENNNE MRAAVSRHCH VIYNVSPDTS WEIYRLADDP DEERDIVDAP GDCEPARRAL EAWYDRAELP EGALEALLAE RPDVAQPLGV HFGQEVELLA VELPPGPVRA GQQMPVTFTF AAHGPLPSGW RVFAHFEGPG RFLGDHEPPR PFSWWREGQY IRYTREITVP RQARPGDYEL WLGLFRKAER MPARSDGVPV DGDRVKVATV RVR
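
The GC content and protein sequence fields can, in principle, be recomputed from the gene sequence. The sketch below is one way to do that in plain Python, not the database's own method. It assumes the sequence is pasted as shown (space-separated blocks of ten bases, already in the coding direction, beginning with GTG and ending with TGA) and uses the codon assignments of NCBI translation table 11, in which alternative start codons such as GTG are read as methionine at the first position. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the usual TCAG ordering. Translation table 11 uses the same
# amino acid assignments as the standard code; it differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # A valid start codon in the first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
prefix = "GTGGCAGGCG CGGGCGCGGG CGCCATCGAC"
print(translate(prefix))  # MAGAGAGAID
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the remaining fields to be checked in the same way.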