Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_5561 |
Symbol | |
ID | 8547975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 7632819 |
End bp | 7635212 |
Gene Length | 2394 bp |
Protein Length | 797 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646390234 |
Product | sulfatase |
Protein accession | YP_003269936 |
Protein GI | 262198727 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGTAATG CGACGCGCAA GGGCGCCCTG CTCGGCTCCC TGCTGGTTCT CTGTTCCCCC TGGTGGCTGG CGTGCAGCGA CAAAGCCGGC CCGACGACCT CGTCCCCGGC TCCCGAGGCC GCGGGTGCCG CCAGCGGCGG CGCGAACCCG ACCCAGGCCG GCGGCGAGGC CGAGGCCGAG GTTACCGGAC ATCCGTCCGA AGCCGAGCCG GCGATCGCGC TGGCCGACAG CCCGGATGTC GATCTGGTGG ACAATCGCTT CCTCTGGCAC CTGGCACCGC CGGCGCAGGC CTCGGGTCTG TTCGTACCGG TCGCCTCCGA GGGGCTGCGC AAGTACACGC AGCAGTACCG CAACCCCTGG GGGGATGTGG TCACCCGCGA CGGTCACAGC GGCCGGGTCC TGCAGGCGTC CTCGGCCGAG CTGCGCATCC CCTGGCGCCC GGCCGCGGAG GCCCGTGGTC CGGTGCGCTT GCGCGCGCGA CTGCACGGGC TCAGCGCCGG GCAGCGGCTG TCGGTGTCGA TCAACGGCGA GCGGGTGGTC AACGCCTCGC TCGACGATGG CTGGCAGGAC GCGGCCTTTG AGATCAAACG CGGCGCTCTG CGCGCGGGCG AGAATCAGGT GGTGCTGTTC TTCGGCAAGC GCGGTGGCCC CGAGCGCGCC TACGCGCTGG TGCACTCGCT GGCCTTCGAA GAGGCGGCCG CGTCGGACAG CGCCGGCGAC GACAGCGGTG GCGGCGACGG CGGTGACAGC GCGAGCTGGC CGCCGCTGTC GCCGGCCGCT CAGGTCAGGG TTGCCGACAC CGCGCGCCCG GCCTTGACCG GTTTCACGCG CATGAGCGCC TACCTCGAGG TGCCCGAGAC CGGCTGGCTC GAGCTGCACA CCGGTGCGCC CGGCGCGGGC ACGCGCTTTC GCGTCACCGC CCAGCCGGTC GAGGGCGCGG TCGTGGAGAT TCTCGACCAC ACGTCCGAGG ATGCCGGCTG GCGGCGCCAT CGCCGCGATC TCGCCGAGCT CGCCGGCAAG CTGGTGATCC TCGAGTTCGC GCTCTCGGGC GAGGGCGCCG AGCAGGCGGC CTGGGGCGAA CCTCGGCTGG CGCTGGCGGA AGCGCCCACG CGCGCGGCGC CGCCGCCGTA CGACAACGCC ATCCTGCTGG TGGTCGACGC GCTGCGCTCC GACCGCCTGC GCCTGTACGG GGACACCCGG GTGAAGACGC CCCATATCAG CGCCGACGGC AACGCCCGCG GCGTGGTCTT CCTGCACAAC CAGGCGGCCT CGCCCTCGTC GCCGCCCTCG CACGGCAGCA TCCAGACCGG CATGATCCCG CGCGTCCACG GCGTCACCGG CGACAACGGC AAGCTCGAGC CGGGCACGCC CATGATCAGC ACCCAGGCCG AGGCCGCCGG GATCGCGGCC GGCTACTACG GCAACAACCC CTTCGGCATG GGCCGGCTCG AGGCCCCGGG CGAGTGGACG GCCTTCCACC AGCCCAACAA GGAGGGCAAG GGCATCGACT GCACGGTGCT CATGGACGAG ATGCTCGGCT TTGCCAAGCA GCAAGCCGAT GCCGGCAAGC GCTTCTTCAT CTCCTCGCTG CCCTACGAGA CGCACACGCC GTATCGCTAC CACGAGGGCA TCACCGACAA CTATCACGAC GGCGACTGGG GTCCGCCGGT GGGCAAGAAC GTCGATGGCG TGCTGCTCAG CAAGCTGTCG AGCGCGTCGG TGACCTTGAA CGATGCCCAG TGGGCTCAGC TCCGCGCGCT CTACGACGGC GAGGCCGAGT ACATGGACGG CTGTTATCAG CAGCTTCTAG ACGGCCTCGA GGCACGCGGT CTGCGCGAGC GCACGCTGCT GGTGCTCACC TCGGACCACG GCGAGGGCAT GTACGAACAC GGACGCATGG GCCACGCTTT CGGACACTAC GCCGAGCTGG CTAATGTGCC GCTGGTGTTC CTCGGTGACG GCCTCACGCC GCAGGGTGCG GTGCTCGACG CGGTGTCCAG TCACCTCGAT ATCGCGCCGA CGATCCTGGC GCTGCTCGGG GTGACGCCGA GCGAGCGCAT CCAGGGGCGC GACCTGAGGG CGCAGATGCT GCGCGAGGGG CCGTGGACGC CGCGGGTGGT GTCGCTCGAG TACGGCCGCA GCTACGCGTT GCGCGCGCGG CGCTGGAAGT ACATCGTGGA TTATCAGCAG AACGAGAGCC TGTTCGACCT GGTCGAGGAT CCCAGCGAGC AGCGCGATCT CCTGGGCACG CAGGACATCC CCCTGCGCTA TCTGCGCGAC CTCGCCGGGG TGTTCCTGGC CCATCGCAAG GCCTGGCGCG CCGACTCCTT CGGCGAGCTC AACAACCACG CGCCGGGCTT TCTGCGCCAC GTCGCCGCGC CCGTCTTCGA TTGA
|
Protein sequence | MCNATRKGAL LGSLLVLCSP WWLACSDKAG PTTSSPAPEA AGAASGGANP TQAGGEAEAE VTGHPSEAEP AIALADSPDV DLVDNRFLWH LAPPAQASGL FVPVASEGLR KYTQQYRNPW GDVVTRDGHS GRVLQASSAE LRIPWRPAAE ARGPVRLRAR LHGLSAGQRL SVSINGERVV NASLDDGWQD AAFEIKRGAL RAGENQVVLF FGKRGGPERA YALVHSLAFE EAAASDSAGD DSGGGDGGDS ASWPPLSPAA QVRVADTARP ALTGFTRMSA YLEVPETGWL ELHTGAPGAG TRFRVTAQPV EGAVVEILDH TSEDAGWRRH RRDLAELAGK LVILEFALSG EGAEQAAWGE PRLALAEAPT RAAPPPYDNA ILLVVDALRS DRLRLYGDTR VKTPHISADG NARGVVFLHN QAASPSSPPS HGSIQTGMIP RVHGVTGDNG KLEPGTPMIS TQAEAAGIAA GYYGNNPFGM GRLEAPGEWT AFHQPNKEGK GIDCTVLMDE MLGFAKQQAD AGKRFFISSL PYETHTPYRY HEGITDNYHD GDWGPPVGKN VDGVLLSKLS SASVTLNDAQ WAQLRALYDG EAEYMDGCYQ QLLDGLEARG LRERTLLVLT SDHGEGMYEH GRMGHAFGHY AELANVPLVF LGDGLTPQGA VLDAVSSHLD IAPTILALLG VTPSERIQGR DLRAQMLREG PWTPRVVSLE YGRSYALRAR RWKYIVDYQQ NESLFDLVED PSEQRDLLGT QDIPLRYLRD LAGVFLAHRK AWRADSFGEL NNHAPGFLRH VAAPVFD
|
| |