Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_4030 |
Symbol | |
ID | 8546431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 5532558 |
End bp | 5534855 |
Gene Length | 2298 bp |
Protein Length | 765 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 646388707 |
Product | sulfatase |
Protein accession | YP_003268422 |
Protein GI | 262197213 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.868128 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.235672 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATCGA ACTCCGGAGC CCTGCCGCGC TCGCGGCCGC CCCGAGCCGC ACTCGCCAGC CCTCCCCAGC GGCCCTGGCG CGCGCTCATC GCCGCCCCCA TCCCGGCCTG CCTGCTCGGC GCGCTCGCCA TCGCCGCCGT GGAGATCGTC GCCCTGGGCC GCCCCAGCCT CGGCCTCATC GCCGTGTGCC TCAGCCTGCA TCTCGCGCTG GGCCTGGCCA TCGGCGCCGT GCTCAGCGCG CAGGAGGCGC TGCTGCGGCG CTGGCCGGGC CTGCGCCGGG GCGCCTCGTG GCTGTACGCG CTCGGCGCCC TGCCCGCCCT GGCCTTCGTC GCCGCGCACC TGTTCGACGG TGCCCAGGCC GCCAAGCTGC CGGGCGCCGC CCTCGGCCAC CTGTGGCTGC CGGTCCTCGG CGTGCTCGCC GCCGCCGCCG CCATCGCCCT CGCCCGCCGC CTGCGGCGCC GACTCGGCGC CGCGCCCGGC GCCCTCCTGC TGGCCCTGAG CGCGGCCCTG GTCGAATACC TCGGCCGCAC CCTCTACCCC AGCGAGTACG CCGACGTGCA CGCCTTCCTG GTCGCGCTCG CGTGCCTGCT GGCCGCGCTC AGCGTGCGCG TGGCCGCCGC CGGCAGCGCG CCGAGCGCCG GCGCTTCGGC CGCGCCGCCG AGCCAGCCAG CCGCCGCCTC GCGCCTCGTG ATCCCGGGCC TGGCCGGCGC CGTCGTGGCC GCCCTCATCG CCAGCCTGGT CTTCGGCCTG GCCGACAAAG ACGAGCGCCT GTGGGTCGCC ACCCACGGCA CCGACCTGCG CCACCTGGTG CGCGTCGCCC GCCTGGCCTT CGACCGCGAC CGCGACGGCG CCGCCGTGGT CCTCGGCGGC GGCGACTGCG ACGACGGCGA CCCCGCCATC TCGCCGACCG CGCCCGAGCG CCCCGGCAAC GGCATCGACG AGGACTGCGA CGGCCGCGAC CTCGAAGCGC TCGCGCTGCC CGCCAGCGAC CGCGACTGGG ACGCCGAGCT CGCCGCCTGG GCGCGCGCGC CCGCGCTCGA ACACACCCTG GCGCGCACCC GCGACATGAA CATCCTGGTC GTCGCCGTCG ACACCCTGCG CGCCGACGCG CTCGCGCTCC CGGGCGACGA CGACGCCGCC GACGGCGCCC CCGCGGACGG CGCCCCGGCG GACAGCGAGC AGGCCGTCGC CAGCGACACC CCGCACCTCG ACGCCCTGCT CGCCGACGCC GCCGTCTTCC GCCACGCCTA CGCCACCGGC GCGGGCACCG ACATCTCGGT CTCGAGCACC ATCACCGGCC ACATCTCGCC CTTCGAGGGC GCCGAGATCA CGCTCGCCGA GGCCCTGCGC GCGACCGGCC GCGCCACCGC CGCCGTGTTC CCCACCGAGG TGCTGCGCTA CGCCGGCCAG GTGCTGCTCG GCCGCGGCAT CGACCGTCTG CGCCGCTACG TCAACGACCG CGGCCAGCGC GACGTCGGCT CGTACACCAC CTCGGCCGAG ACCACGCGCA TGGCGCTCGC CACCATCGAC GCCATGGGCC CGCACAAGCG GCCCTTCTTC CTGTGGACGC ACTACTTCGA CGTGCACGAG CACGCCCAGG TGCTGGCCAG CGACCCCACC CTGGCCCGCT ACGGCGAGCG CTTCGAGCTC GGCGAGCTGG CCGGCAAGTA CCGCGCCCTG GTCGCGCTCA CCGACGCCGA GATCGGACGA CTGTTCGAAG AGCTGCGCGC GCGCGATCTC TGGGACCAGA CCATCATCGT GCTGCTGAGC GACCACGGCG AGAGCATGGG CGAGGACCCG CGCCTGCCGC TGCGCCACGG CCGCTTCGTG TACAACGCCC TCACCCACGT GCCCCTGGTC ATCCGCGTCC CCGGCGCGAC GCCGCGCGTG GTCGATGCCC CGGTCTCGGT CGTCGACCTC ATGCCCACGC TGCTCACCCT GGTCGGCGCC GAGCTCCCGC CCGGACTCGA CGGCCGCTCG CTGCTGCCGC TGTTTCTGGG CTCGGCGCCC GGCCTGCCGC CGCGGCCCGT GGCCATGAGC GAGAGCGAGC AGAGCGCCGT CATCGTGTGG CCCTACAAGC TGCTCCTGCG CCCGGCCGAC AACCTGGTCG AGCTCTACGA CCTGGCCAGC GATCCCGCCG AACAGCGCGA CCTGGCCGAG CAACAGCCCC AGCGCGTCTC GGCGCTGCGC GCCATCATGG CCCAGCTTCC GCAGGTCGAG ATCGACCGAA CCCGTCGCGG ACGGCGCGCG CGAGATGCGC GCTCGCGAGC ACCACGAGCG CGTACGCCAG CACCGTGA
|
Protein sequence | MTSNSGALPR SRPPRAALAS PPQRPWRALI AAPIPACLLG ALAIAAVEIV ALGRPSLGLI AVCLSLHLAL GLAIGAVLSA QEALLRRWPG LRRGASWLYA LGALPALAFV AAHLFDGAQA AKLPGAALGH LWLPVLGVLA AAAAIALARR LRRRLGAAPG ALLLALSAAL VEYLGRTLYP SEYADVHAFL VALACLLAAL SVRVAAAGSA PSAGASAAPP SQPAAASRLV IPGLAGAVVA ALIASLVFGL ADKDERLWVA THGTDLRHLV RVARLAFDRD RDGAAVVLGG GDCDDGDPAI SPTAPERPGN GIDEDCDGRD LEALALPASD RDWDAELAAW ARAPALEHTL ARTRDMNILV VAVDTLRADA LALPGDDDAA DGAPADGAPA DSEQAVASDT PHLDALLADA AVFRHAYATG AGTDISVSST ITGHISPFEG AEITLAEALR ATGRATAAVF PTEVLRYAGQ VLLGRGIDRL RRYVNDRGQR DVGSYTTSAE TTRMALATID AMGPHKRPFF LWTHYFDVHE HAQVLASDPT LARYGERFEL GELAGKYRAL VALTDAEIGR LFEELRARDL WDQTIIVLLS DHGESMGEDP RLPLRHGRFV YNALTHVPLV IRVPGATPRV VDAPVSVVDL MPTLLTLVGA ELPPGLDGRS LLPLFLGSAP GLPPRPVAMS ESEQSAVIVW PYKLLLRPAD NLVELYDLAS DPAEQRDLAE QQPQRVSALR AIMAQLPQVE IDRTRRGRRA RDARSRAPRA RTPAP
|
| |