Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_2284 |
Symbol | |
ID | 8544670 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 3179991 |
End bp | 3182345 |
Gene Length | 2355 bp |
Protein Length | 784 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646386989 |
Product | sulfatase |
Protein accession | YP_003266720 |
Protein GI | 262195511 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACTG CTCGCACAGC GCTCGCAACG CATGGGCGCC TCCCGGCGCT TCTCTCGATC GCCTTGACCG CGGTGGTGAT GGCCCCGGCC TGTAGCAAGG ACAGCGGGGC GCCCGCCGAG GGCAAGGCCA CCGGGGCCTC CGCGACCACG GCCGCGACCG ACGAGGCCAA AACCCCCGAG GCGCCCAAGA AACCCTCGGT GCGCACCAAA CTCGAAGATT ACCGCCCGCT GCTCGACGAG GCCGCGCGCG CCGAGCTCGA CGCCGGCGGC TTGTTCATCG ACCTCGGCAC CGCCGACCAG CACAAGTTCA CGCGCGGCGG CTGGCGCACG GGTTGGGGCG AGAGCAAGCA GGAAGACGGC CTGTCGCTGT CCGCCGGCGA CGATCGCCGC CTGCATCTCG ACGTCCTCAC CAGCGAGCCC GCGGCCGAAG TCGCGGTCCG CGCCCGCTCG GCCGTCGACG GCCAGGCGGC CACGGTGTGG GTCGACGGCG AGCGCCAGGG CGACATCTGG CTCAAGCCCG AGTGGACCGT GATGCGCGTG CCCGTGGCCG AGGGCGCGCT CGCGGCCGGG CGCCACAACG TGCAGCTCAC CTTCACCAAG AGCGGCACCC CGCGCGCCGA GATCGACTGG GTGTGGCTGG CCAACGAGGT CGACGCCGAG CCGCCCTCGG CCAGCGTCCG CGTGCTGCCC CTGCGCATCG GCAGCCAGCC CAAGCGCGCG CTCATCGCGC CGACCACGCG CACCTATTCC TTCTACATGC AGCCGCCGGC CGGCGCCTCG CTGGTCTTCG ACTACGCCTC GGACGTCGGC GCCAGCTTCG AGGTCCGCGC CCAGGCTGAC GGCGGCGCAG CCGAGTCGCT GTTCACGGCC ACCGGCGGTG AGAGCTGGGA AGAGGCCCAG GTCGATCTCG GCGCCTTCGC CGACAAGGCC ATCCGCCTCG ACCTGGTCGC CACCGGCGCC GAGGGCCGCT CCGGCTGGGG CGAGCCCGCG CTCATGGTCG CCCCGGGGAA TTCGGACAAG CCGGCCCTCA AGACCGCGGC GCCGGGCACG CCCAAGAACG TGGTCGTCAT CCTCATCGAC ACCGTGCGCG CCGACTCGTT TGCGGCCATC CGTCCGGATA ACAAGGTCGT GACCCCGGCC TTCGACGCCT TCGCCGACAA GGCCACGGTG TTCACCAACG CCTACAACAA CGAGAACTGG ACCAAGCCCT CGGTGGCTAG CCTGCTCTCG GGCCTGTACC CGAGCACGCA CGACACCAAG AAAGACGAGA GCAGCCTGCC CAAAGAGGTC GAGATCCTGT CGCAGCGCCT GAGCAAGCAG GGCTTCGCCA CCGCCGGGTT CGTGGCCAAC GGCTACGTGT CCGAAAAATT CGGCTTCGAG AAGGGCTGGG ACGCCTTTAC CAACTACATC CGCGAGAACA AGAGCTCGGA GGCCGAGTAC GTCTACGGCG ACGCGCTGGC CTGGCTGGGC GAGCGCGAAA AGGCCGCCGA CGGCAAGCCC TTCTTCCTCT ACATCCAGAC CATCGATCCG CACGTCACCT ACAAGGTCGA GCGGCCCTTC ACCCAGCACT ACTACGCCGA GGACTACGGC GGTCCGCTGG GGCCGACCAT CGACGCGCTC GAGCTGCAGG ACCTGTCCAC CGGCAAAAAG CAGGCGAGCG ACAAGGACCT GGCCTGGCTG CGCGCCATGT ACCGCGGCGA GGTCACCTAC CACGACGAGC ACATGGGCAA GTTCTTCGAG CAGCTCCAGA CCATGGGCCG CATGGACGAC ACGCTCATCG TCATCACCAA CGACCACGGC GAGGAGCTGG GCGACCACGG CAAGTTCGGC CACGGCCACA CCCTGTTCGA CGAGCTTCTG CGCGCGCCGC TGCTGATGTA CTTCCCGGGC ATGTTCCCGG AAGGCGGACG CGTCGACGAA ATCGTCGAGA CCGTGGACAT CGCGCCGACC ATCGTCGAGG TGCTCGGCCT CGAGCCCATG AGCAACGCCG ACGGCACCTC GCTGCTGCCG CTGGTGCAGG GCAAACCCAT CCAGCGCCCG ACCTACGCCA TCAGCGAGTT CCTCGACGGC CGCCGCGCCG TGCGCGTGGG CGACTGGAAG TTCATGCGCA GCTCGAGCAC CTGGGCCAAC CTGCACAACG TCGCCGACGA CTTCCACGAG GAGAACGACC GCAGCGAGGA CGCTCTGATC GCGCGCCGCA TGTGCGAGGT GCACCTGGGC GAGGGCCTGG CCACGCCCGA CAAGAGCAAG CGCCAGCAGG ACATCACCAT CCGCCGCAAG TTCAAGGCGG GCGAGGCCGA CATCGACCCG GCCATGCGCA AGCAGCTCGA GGCGCTGGGG TACTTCGGTG AGTGA
|
Protein sequence | MTTARTALAT HGRLPALLSI ALTAVVMAPA CSKDSGAPAE GKATGASATT AATDEAKTPE APKKPSVRTK LEDYRPLLDE AARAELDAGG LFIDLGTADQ HKFTRGGWRT GWGESKQEDG LSLSAGDDRR LHLDVLTSEP AAEVAVRARS AVDGQAATVW VDGERQGDIW LKPEWTVMRV PVAEGALAAG RHNVQLTFTK SGTPRAEIDW VWLANEVDAE PPSASVRVLP LRIGSQPKRA LIAPTTRTYS FYMQPPAGAS LVFDYASDVG ASFEVRAQAD GGAAESLFTA TGGESWEEAQ VDLGAFADKA IRLDLVATGA EGRSGWGEPA LMVAPGNSDK PALKTAAPGT PKNVVVILID TVRADSFAAI RPDNKVVTPA FDAFADKATV FTNAYNNENW TKPSVASLLS GLYPSTHDTK KDESSLPKEV EILSQRLSKQ GFATAGFVAN GYVSEKFGFE KGWDAFTNYI RENKSSEAEY VYGDALAWLG EREKAADGKP FFLYIQTIDP HVTYKVERPF TQHYYAEDYG GPLGPTIDAL ELQDLSTGKK QASDKDLAWL RAMYRGEVTY HDEHMGKFFE QLQTMGRMDD TLIVITNDHG EELGDHGKFG HGHTLFDELL RAPLLMYFPG MFPEGGRVDE IVETVDIAPT IVEVLGLEPM SNADGTSLLP LVQGKPIQRP TYAISEFLDG RRAVRVGDWK FMRSSSTWAN LHNVADDFHE ENDRSEDALI ARRMCEVHLG EGLATPDKSK RQQDITIRRK FKAGEADIDP AMRKQLEALG YFGE
|
| |