Gene Cwoe_2406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_2406 
Symbol 
ID8732849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp2548113 
End bp2550440 
Gene Length2328 bp 
Protein Length775 aa 
Translation table11 
GC content68% 
IMG OID646503022 
Productsulfatase 
Protein accessionYP_003394204 
Protein GI284043864 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.214932 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00167669 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCCGGCTG AGTTCAGAGG TTCGATCAGA CTCGACATCC GCGACTCGGT GCCGGATTGG 
ACGCCGTACC TGGCCGAGAA GGCGCCGGCC GGCGCGCCGA ACGTGCTCGT GATCCTCTAC
GACGACACCG GGACGGCGGC GTGGTCGCCG TACGGCGGGC GGATCGAGAT GCCGACGATG
CAGCGCTTCG CCGACGAGGG GCTGACGTAC TCGCAGTGGC ACACGACCGC GCTCTGCGGG
CCGACCCGCT CGTGCTTCCT GACCGGCCGC AACCACCACC AGAACTCGTT CGCGACGATC
GCCGAGACGG CGACCGGCTT CCCCGGCAAC AACACGCACA TCCCGATGGA GAACGCGTTC
ATGGCCGAGG TGCTGCGCGA GCGAGGCTGG AGCACGTTCT GGGTCGGCAA GAACCACAAC
GTCCCGGTCG ACGAGTTCGA CCAGGGCTCG ACGAAGCGCA ACTGGCCGCT CGGCCGCGGC
TTCGACCGCT TCTACGGCTT CATCGGCGGC GAGACGAACC AGTGGTATCC CGACCTGACC
GAGGACAACC ACTACATCGA CCAGCCGTAC CGCCCCGAGG ACGGCTACCA CCTCTCGAAG
GACCTCGCTG ACCAGGCGAT CGCGATGATC CGCGACTCGC AGCAGAGCCA GCCGGAGAAG
CCGTGGCACA TGTTCTACTG CCCCGGCGCC AACCACGCCC CGCATCACGC GCCGCAGGAG
TTCATCGACA AGTACAGAGG CGTCTTCGAC GACGGCTACG AGGCGTACCG CGAGTGGGTC
CTGCCGCGGA TGATCGAGAA GGGGATCCTG CCGGAGGGGA CCGAGCTGAC GCCGCTCAAC
CCGCTGCCCG ACGACGTCGC CAACCCGGCC GACGCGGTGC GGCCGTGGGC GACGCTGTCC
AGCGAGGAGA GACGCCTGTT CGCGCGCATG GCCGAGGCCT ACGCCGGCTT CTCGGAGTAC
ACCGACCACG AGATCGGCCG GATCGTCGCC TACCTGGAGG AGACCGGCCA GCTCGACAAC
ACGCTCGTCT TCTACGCGGC CGACAACGGC GCGTCCGGCG AGGGCAGCCC GAACGGCTCG
GTCAACGAGG GCAAGTTCTT CAACGCCTGG CCCGACACGG TCGAGGACAA CCTGCCGATG
ATCGACAAGC TCGGCAGTCC CGACACGTAC AACCACTACC CGACCGGGTG GGCGGCGGCG
TTCTCGACGC CGTACAAGAT GTTCAAGCGC TACTCGTACC AGGGCGGCGT GTGCGACCCG
CTCGTGATCT CATGGCCGGC GGGGATCAGA GCGCGCGGCG AGGTGCGCGA CCAGTACCAC
CACTGCACCG ACATCGTTCC GACGATCCTC GAATGCTGCG GCGTCGAGAT GCCCGACACC
GTGCTCGGCT ACAGACAGAC GCCGCTCGCC GGCGTCTCGA TGCGCTACAG CTTCGACGAC
GCCGCCGCGC CGACGCAGAA GCCGCAGCAG TACTACGAGA TGCTCGGCAC GCGCGCGATG
TGGAAGGACG GCTGGAAGGC GGTCGCCGAG CACGCGCCGA TGCCGTCCGA CAGAGGCCAC
TTCGACAGAG ACCGCTGGCA GCTGTTCCAC ACCGACGTCG ACCGCTCCGA GTCGAGCGAC
CTCGCCGCCG AGCACCCGGA GCGGCTGAGA GCACTCGTCG ACCTGTGGTT CGAGGAGGCG
GAGAAGTACG ACGTCCTGCC GCTGTCCGAC CTCGGCATCC TCGACTACAT CAGATACGAG
TTCCAGGTGC CCGTGCCGAG AGGCGGCACG TACGTCTACG GCCCTGCCCA CGCCGGCCTG
CCCGAGCACT CGGCGGCGAG CACGCACGGC GTCTCGTACT CGCTGCTCGG CCAGATCGAG
GTCGAGGACC CCGGCGCGCA GGGCGTGATC TTCGCGCAGG GCTCGCGCTT CGGCGGGCAC
GCTCTGTTCC TGAAGGACCG CCGGCTCCAC TACGTCTACA ACTTCATCGG GATCAAGCCC
GAGCAGCACT ACGTCTCCGA CGTCGAGGTC GGAACCGGCG GCCAGGTCGT CGGCGTCGAG
TTCGTCAAGG AACGCGTCGG CGAGCACGGC GAGTCGCACG GGACCGTGAC GATGCGGCTC
GACGAGCAGG TCGTCGCGAC GGGTCCGCTG CGGACGCAGT CGGGCCACTT CTCGCTCGCC
GGCGAAGGGC TCTGCATCGG CCGCGACAGC GGCGACGCGG TCAGCGAGCA GTACAGACCC
GACTTCCCCT TCGAGGGCGG GCGGATCGTG AAGTTCGAGG TCGGCGTCGG CGACGACGGC
TACGTCGACC TGGAGCGGCG GCTGCACGCC GCCCTCGCGC GCGATTGA
 
Protein sequence
MPAEFRGSIR LDIRDSVPDW TPYLAEKAPA GAPNVLVILY DDTGTAAWSP YGGRIEMPTM 
QRFADEGLTY SQWHTTALCG PTRSCFLTGR NHHQNSFATI AETATGFPGN NTHIPMENAF
MAEVLRERGW STFWVGKNHN VPVDEFDQGS TKRNWPLGRG FDRFYGFIGG ETNQWYPDLT
EDNHYIDQPY RPEDGYHLSK DLADQAIAMI RDSQQSQPEK PWHMFYCPGA NHAPHHAPQE
FIDKYRGVFD DGYEAYREWV LPRMIEKGIL PEGTELTPLN PLPDDVANPA DAVRPWATLS
SEERRLFARM AEAYAGFSEY TDHEIGRIVA YLEETGQLDN TLVFYAADNG ASGEGSPNGS
VNEGKFFNAW PDTVEDNLPM IDKLGSPDTY NHYPTGWAAA FSTPYKMFKR YSYQGGVCDP
LVISWPAGIR ARGEVRDQYH HCTDIVPTIL ECCGVEMPDT VLGYRQTPLA GVSMRYSFDD
AAAPTQKPQQ YYEMLGTRAM WKDGWKAVAE HAPMPSDRGH FDRDRWQLFH TDVDRSESSD
LAAEHPERLR ALVDLWFEEA EKYDVLPLSD LGILDYIRYE FQVPVPRGGT YVYGPAHAGL
PEHSAASTHG VSYSLLGQIE VEDPGAQGVI FAQGSRFGGH ALFLKDRRLH YVYNFIGIKP
EQHYVSDVEV GTGGQVVGVE FVKERVGEHG ESHGTVTMRL DEQVVATGPL RTQSGHFSLA
GEGLCIGRDS GDAVSEQYRP DFPFEGGRIV KFEVGVGDDG YVDLERRLHA ALARD