Gene Information |
Locus tag | Dret_0911 |
Symbol | |
ID | 8418731 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 1081166 |
End bp | 1082695 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645037481 |
Product | choline-sulfatase |
Protein accession | YP_003197780 |
Protein GI | 258405038 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
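
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the terminal stop codon
    is counted in the gene length but not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Dret_0911.
length = gene_length_bp(1081166, 1082695)
print(length)                     # 1530
print(protein_length_aa(length))  # 509
```

Under these assumptions the computed values match the listed Gene Length (1530 bp) and Protein Length (509 aa).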
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000929958 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGGAG AGCAACCCAA CATTCTCATT ATCCAGGCCG ATCAGCTCGC CGCCCCGGCG TTGTCCTGCT ACGGCAATCC AGTGACCAAG ACCCCGCACC TCGACGCACT GGTCGAGGAC GGCGTGGTTT TTGACAACGC CTATTGCAAC AATCCGCTCT GCGCGCCGTC GCGTTTTTCC ATGATGGCCG GACAGCATTC CTCGGCCATC GGCGCCTACG ACAACGGGGC GGAGTTCCCG GCGGACATCC CGACCTTTGC CCACTACCTC CGAGCCGGCG GGTACCGGAC CACGCTGTGC GGCAAGATGC ACTTTGTCGG CGCGGACCAG CTCCACGGCT TCGAGGAGCG GCTGACCACT GACGTCTATC CCGCGGACCA CGGCTGGGCC CCGGATTGGC AGCGCCCCGA ACACCGCTTC GACTGGTGGT ACCACAACAT GGACAGCGTC TATGAGGCCG GGCCCTGTGA GCGGACCAAC CAGATCGATT TCGACGATGA GGTTGGGTTT CGGGCCCAGC GGCATATCTA CGATCTGGCC CGGGACAGCG ACGAGCGGCC TTTTTGTCTG ACCGTCTCCT TCACCGATCC CCATGATCCC TACGCCTGCC CCAAGGAATT TTGGGATCTC TACGAGGACG AGGAAATCAA CCTGCCTCAC GTGGAGCACA TTCCGTACGA GCAGTGCGAT CCGCACAGCC AGCGGCTGCG TCACGCCTAC AAAATGGGGC AGGGAGAAAT CGCGCCTGAG GACATCCGCA ACGCACGGCG GGCGTATTAC GGGCAGATCA GTTACGTCGA TGCCAAGATC GGTCGAATCA TGAAGGCTCT GGAGGATTGC GGTCTGCGTG AGAACACCAT TGTTGTGTTC ACCGCTGACC ACGGGGATAT GCTCGGAGAG CGGGGCCTGT GGTACAAGAT GTCCTTTCAC GAATGGTCGG CGCGGGTGCC GTTTATCGTC TCTGCGCCGC AACGATTCCA GCCTGGACGG GTGACCACGC CGGTGTCGCT GGTCGATCTT TTGCCCACCC TGCTCGATCT GAGTGGTGAT CCCCATCTTG AGGCCCCGGC GGATCGCTTG GATGGACAGA GTCTGGTGCC GCTTTTGCAA GGCGAGGTGG CGAGTCTGGA TCGGCCTGTC ATTTCTGAAT ATCTTGGTGA AGGCGCTGTG GCCCCGATGG TCATGGTCCG CTGGGGGCGT TACAAATATA TCGCCTGTCC GGCCGATCCG CCGCTGCTTT TTGACCTGCA AGAGGATCCC GATGAATTGG CCAATCTGGC CGGGCGCCCC GAGATGCAGG ACATCGAGGT GCAGTTGGAC CAAGTGGTTC GCAAGCACCA GGATCTCAAC CAACTCCATG AGGCGGTGGT GGCCAGTCAG CAACGCCGTC GGTTGGTCTT TGTCGCCCAT ATGACCGGGA CCCACACCCC GTGGGACTTC CAGCCGGTGT TCGACGCCAC GAACCGGTAC ATGCGCAATC ATCTGGATCT CAATGACGTC GAAGGACGCG CACGAATCGA ATCCACGTAG
|
Protein sequence | MRGEQPNILI IQADQLAAPA LSCYGNPVTK TPHLDALVED GVVFDNAYCN NPLCAPSRFS MMAGQHSSAI GAYDNGAEFP ADIPTFAHYL RAGGYRTTLC GKMHFVGADQ LHGFEERLTT DVYPADHGWA PDWQRPEHRF DWWYHNMDSV YEAGPCERTN QIDFDDEVGF RAQRHIYDLA RDSDERPFCL TVSFTDPHDP YACPKEFWDL YEDEEINLPH VEHIPYEQCD PHSQRLRHAY KMGQGEIAPE DIRNARRAYY GQISYVDAKI GRIMKALEDC GLRENTIVVF TADHGDMLGE RGLWYKMSFH EWSARVPFIV SAPQRFQPGR VTTPVSLVDL LPTLLDLSGD PHLEAPADRL DGQSLVPLLQ GEVASLDRPV ISEYLGEGAV APMVMVRWGR YKYIACPADP PLLFDLQEDP DELANLAGRP EMQDIEVQLD QVVRKHQDLN QLHEAVVASQ QRRRLVFVAH MTGTHTPWDF QPVFDATNRY MRNHLDLNDV EGRARIEST
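
The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch for turning such a field back into a contiguous string and computing basic statistics is given below; the helper names are hypothetical, and the sample input is a short made-up string, not the record's sequence.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# Illustrative input only; paste the Gene sequence field here to check the record.
sample = join_blocks("ATGC GGCC")
print(sample)               # ATGCGGCC
print(len(sample))          # 8
print(gc_fraction(sample))  # 0.75
```

Applied to the Gene sequence field, `len` should return the listed Gene Length and `gc_fraction` a value that can be compared with the listed GC content; how the database rounds that percentage is not stated in the record.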