Gene Dret_0911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0911 
Symbol 
ID8418731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1081166 
End bp1082695 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content61% 
IMG OID645037481 
Productcholine-sulfatase 
Protein accessionYP_003197780 
Protein GI258405038 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR03417] choline-sulfatase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000929958 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGGAG AGCAACCCAA CATTCTCATT ATCCAGGCCG ATCAGCTCGC CGCCCCGGCG 
TTGTCCTGCT ACGGCAATCC AGTGACCAAG ACCCCGCACC TCGACGCACT GGTCGAGGAC
GGCGTGGTTT TTGACAACGC CTATTGCAAC AATCCGCTCT GCGCGCCGTC GCGTTTTTCC
ATGATGGCCG GACAGCATTC CTCGGCCATC GGCGCCTACG ACAACGGGGC GGAGTTCCCG
GCGGACATCC CGACCTTTGC CCACTACCTC CGAGCCGGCG GGTACCGGAC CACGCTGTGC
GGCAAGATGC ACTTTGTCGG CGCGGACCAG CTCCACGGCT TCGAGGAGCG GCTGACCACT
GACGTCTATC CCGCGGACCA CGGCTGGGCC CCGGATTGGC AGCGCCCCGA ACACCGCTTC
GACTGGTGGT ACCACAACAT GGACAGCGTC TATGAGGCCG GGCCCTGTGA GCGGACCAAC
CAGATCGATT TCGACGATGA GGTTGGGTTT CGGGCCCAGC GGCATATCTA CGATCTGGCC
CGGGACAGCG ACGAGCGGCC TTTTTGTCTG ACCGTCTCCT TCACCGATCC CCATGATCCC
TACGCCTGCC CCAAGGAATT TTGGGATCTC TACGAGGACG AGGAAATCAA CCTGCCTCAC
GTGGAGCACA TTCCGTACGA GCAGTGCGAT CCGCACAGCC AGCGGCTGCG TCACGCCTAC
AAAATGGGGC AGGGAGAAAT CGCGCCTGAG GACATCCGCA ACGCACGGCG GGCGTATTAC
GGGCAGATCA GTTACGTCGA TGCCAAGATC GGTCGAATCA TGAAGGCTCT GGAGGATTGC
GGTCTGCGTG AGAACACCAT TGTTGTGTTC ACCGCTGACC ACGGGGATAT GCTCGGAGAG
CGGGGCCTGT GGTACAAGAT GTCCTTTCAC GAATGGTCGG CGCGGGTGCC GTTTATCGTC
TCTGCGCCGC AACGATTCCA GCCTGGACGG GTGACCACGC CGGTGTCGCT GGTCGATCTT
TTGCCCACCC TGCTCGATCT GAGTGGTGAT CCCCATCTTG AGGCCCCGGC GGATCGCTTG
GATGGACAGA GTCTGGTGCC GCTTTTGCAA GGCGAGGTGG CGAGTCTGGA TCGGCCTGTC
ATTTCTGAAT ATCTTGGTGA AGGCGCTGTG GCCCCGATGG TCATGGTCCG CTGGGGGCGT
TACAAATATA TCGCCTGTCC GGCCGATCCG CCGCTGCTTT TTGACCTGCA AGAGGATCCC
GATGAATTGG CCAATCTGGC CGGGCGCCCC GAGATGCAGG ACATCGAGGT GCAGTTGGAC
CAAGTGGTTC GCAAGCACCA GGATCTCAAC CAACTCCATG AGGCGGTGGT GGCCAGTCAG
CAACGCCGTC GGTTGGTCTT TGTCGCCCAT ATGACCGGGA CCCACACCCC GTGGGACTTC
CAGCCGGTGT TCGACGCCAC GAACCGGTAC ATGCGCAATC ATCTGGATCT CAATGACGTC
GAAGGACGCG CACGAATCGA ATCCACGTAG
 
Protein sequence
MRGEQPNILI IQADQLAAPA LSCYGNPVTK TPHLDALVED GVVFDNAYCN NPLCAPSRFS 
MMAGQHSSAI GAYDNGAEFP ADIPTFAHYL RAGGYRTTLC GKMHFVGADQ LHGFEERLTT
DVYPADHGWA PDWQRPEHRF DWWYHNMDSV YEAGPCERTN QIDFDDEVGF RAQRHIYDLA
RDSDERPFCL TVSFTDPHDP YACPKEFWDL YEDEEINLPH VEHIPYEQCD PHSQRLRHAY
KMGQGEIAPE DIRNARRAYY GQISYVDAKI GRIMKALEDC GLRENTIVVF TADHGDMLGE
RGLWYKMSFH EWSARVPFIV SAPQRFQPGR VTTPVSLVDL LPTLLDLSGD PHLEAPADRL
DGQSLVPLLQ GEVASLDRPV ISEYLGEGAV APMVMVRWGR YKYIACPADP PLLFDLQEDP
DELANLAGRP EMQDIEVQLD QVVRKHQDLN QLHEAVVASQ QRRRLVFVAH MTGTHTPWDF
QPVFDATNRY MRNHLDLNDV EGRARIEST