Gene Information |
Locus tag | Cwoe_2406 |
Symbol | |
ID | 8732849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 2548113 |
End bp | 2550440 |
Gene Length | 2328 bp |
Protein Length | 775 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646503022 |
Product | sulfatase |
Protein accession | YP_003394204 |
Protein GI | 284043864 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
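The coordinate and length fields above are consistent with one another, and the relationship can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates (an assumption, though it is the convention that reproduces the listed Gene Length) and that the CDS ends in a single stop codon that is not counted in the Protein Length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Cwoe_2406.
length = gene_length(2548113, 2550440)
print(length)                  # 2328
print(protein_length(length))  # 775
```

Both results match the Gene Length (2328 bp) and Protein Length (775 aa) rows of the record.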
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.214932 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00167669 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCCGGCTG AGTTCAGAGG TTCGATCAGA CTCGACATCC GCGACTCGGT GCCGGATTGG ACGCCGTACC TGGCCGAGAA GGCGCCGGCC GGCGCGCCGA ACGTGCTCGT GATCCTCTAC GACGACACCG GGACGGCGGC GTGGTCGCCG TACGGCGGGC GGATCGAGAT GCCGACGATG CAGCGCTTCG CCGACGAGGG GCTGACGTAC TCGCAGTGGC ACACGACCGC GCTCTGCGGG CCGACCCGCT CGTGCTTCCT GACCGGCCGC AACCACCACC AGAACTCGTT CGCGACGATC GCCGAGACGG CGACCGGCTT CCCCGGCAAC AACACGCACA TCCCGATGGA GAACGCGTTC ATGGCCGAGG TGCTGCGCGA GCGAGGCTGG AGCACGTTCT GGGTCGGCAA GAACCACAAC GTCCCGGTCG ACGAGTTCGA CCAGGGCTCG ACGAAGCGCA ACTGGCCGCT CGGCCGCGGC TTCGACCGCT TCTACGGCTT CATCGGCGGC GAGACGAACC AGTGGTATCC CGACCTGACC GAGGACAACC ACTACATCGA CCAGCCGTAC CGCCCCGAGG ACGGCTACCA CCTCTCGAAG GACCTCGCTG ACCAGGCGAT CGCGATGATC CGCGACTCGC AGCAGAGCCA GCCGGAGAAG CCGTGGCACA TGTTCTACTG CCCCGGCGCC AACCACGCCC CGCATCACGC GCCGCAGGAG TTCATCGACA AGTACAGAGG CGTCTTCGAC GACGGCTACG AGGCGTACCG CGAGTGGGTC CTGCCGCGGA TGATCGAGAA GGGGATCCTG CCGGAGGGGA CCGAGCTGAC GCCGCTCAAC CCGCTGCCCG ACGACGTCGC CAACCCGGCC GACGCGGTGC GGCCGTGGGC GACGCTGTCC AGCGAGGAGA GACGCCTGTT CGCGCGCATG GCCGAGGCCT ACGCCGGCTT CTCGGAGTAC ACCGACCACG AGATCGGCCG GATCGTCGCC TACCTGGAGG AGACCGGCCA GCTCGACAAC ACGCTCGTCT TCTACGCGGC CGACAACGGC GCGTCCGGCG AGGGCAGCCC GAACGGCTCG GTCAACGAGG GCAAGTTCTT CAACGCCTGG CCCGACACGG TCGAGGACAA CCTGCCGATG ATCGACAAGC TCGGCAGTCC CGACACGTAC AACCACTACC CGACCGGGTG GGCGGCGGCG TTCTCGACGC CGTACAAGAT GTTCAAGCGC TACTCGTACC AGGGCGGCGT GTGCGACCCG CTCGTGATCT CATGGCCGGC GGGGATCAGA GCGCGCGGCG AGGTGCGCGA CCAGTACCAC CACTGCACCG ACATCGTTCC GACGATCCTC GAATGCTGCG GCGTCGAGAT GCCCGACACC GTGCTCGGCT ACAGACAGAC GCCGCTCGCC GGCGTCTCGA TGCGCTACAG CTTCGACGAC GCCGCCGCGC CGACGCAGAA GCCGCAGCAG TACTACGAGA TGCTCGGCAC GCGCGCGATG TGGAAGGACG GCTGGAAGGC GGTCGCCGAG CACGCGCCGA TGCCGTCCGA CAGAGGCCAC TTCGACAGAG ACCGCTGGCA GCTGTTCCAC ACCGACGTCG ACCGCTCCGA GTCGAGCGAC CTCGCCGCCG AGCACCCGGA GCGGCTGAGA GCACTCGTCG ACCTGTGGTT CGAGGAGGCG GAGAAGTACG ACGTCCTGCC GCTGTCCGAC CTCGGCATCC TCGACTACAT CAGATACGAG TTCCAGGTGC CCGTGCCGAG AGGCGGCACG TACGTCTACG GCCCTGCCCA CGCCGGCCTG 
CCCGAGCACT CGGCGGCGAG CACGCACGGC GTCTCGTACT CGCTGCTCGG CCAGATCGAG GTCGAGGACC CCGGCGCGCA GGGCGTGATC TTCGCGCAGG GCTCGCGCTT CGGCGGGCAC GCTCTGTTCC TGAAGGACCG CCGGCTCCAC TACGTCTACA ACTTCATCGG GATCAAGCCC GAGCAGCACT ACGTCTCCGA CGTCGAGGTC GGAACCGGCG GCCAGGTCGT CGGCGTCGAG TTCGTCAAGG AACGCGTCGG CGAGCACGGC GAGTCGCACG GGACCGTGAC GATGCGGCTC GACGAGCAGG TCGTCGCGAC GGGTCCGCTG CGGACGCAGT CGGGCCACTT CTCGCTCGCC GGCGAAGGGC TCTGCATCGG CCGCGACAGC GGCGACGCGG TCAGCGAGCA GTACAGACCC GACTTCCCCT TCGAGGGCGG GCGGATCGTG AAGTTCGAGG TCGGCGTCGG CGACGACGGC TACGTCGACC TGGAGCGGCG GCTGCACGCC GCCCTCGCGC GCGATTGA
|
Protein sequence | MPAEFRGSIR LDIRDSVPDW TPYLAEKAPA GAPNVLVILY DDTGTAAWSP YGGRIEMPTM QRFADEGLTY SQWHTTALCG PTRSCFLTGR NHHQNSFATI AETATGFPGN NTHIPMENAF MAEVLRERGW STFWVGKNHN VPVDEFDQGS TKRNWPLGRG FDRFYGFIGG ETNQWYPDLT EDNHYIDQPY RPEDGYHLSK DLADQAIAMI RDSQQSQPEK PWHMFYCPGA NHAPHHAPQE FIDKYRGVFD DGYEAYREWV LPRMIEKGIL PEGTELTPLN PLPDDVANPA DAVRPWATLS SEERRLFARM AEAYAGFSEY TDHEIGRIVA YLEETGQLDN TLVFYAADNG ASGEGSPNGS VNEGKFFNAW PDTVEDNLPM IDKLGSPDTY NHYPTGWAAA FSTPYKMFKR YSYQGGVCDP LVISWPAGIR ARGEVRDQYH HCTDIVPTIL ECCGVEMPDT VLGYRQTPLA GVSMRYSFDD AAAPTQKPQQ YYEMLGTRAM WKDGWKAVAE HAPMPSDRGH FDRDRWQLFH TDVDRSESSD LAAEHPERLR ALVDLWFEEA EKYDVLPLSD LGILDYIRYE FQVPVPRGGT YVYGPAHAGL PEHSAASTHG VSYSLLGQIE VEDPGAQGVI FAQGSRFGGH ALFLKDRRLH YVYNFIGIKP EQHYVSDVEV GTGGQVVGVE FVKERVGEHG ESHGTVTMRL DEQVVATGPL RTQSGHFSLA GEGLCIGRDS GDAVSEQYRP DFPFEGGRIV KFEVGVGDDG YVDLERRLHA ALARD
|
| |
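The gene and protein sequences above are related through translation table 11 (the bacterial, archaeal and plant plastid code). A minimal sketch of that translation follows, using only the Python standard library. It assumes the amino acid assignments of table 11, which are the same as those of the standard code; table 11 differs in which codons may act as start codons, and that does not matter here because the gene begins with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first 30 and last 18 bases of the record are used as input.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids
# (assignments shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_30 = "ATGCCGGCTG AGTTCAGAGG TTCGATCAGA"  # start of the gene sequence
last_18 = "GCCCTCGCGC GCGATTGA"                # end of the gene sequence

print(translate(first_30))  # MPAEFRGSIR
print(translate(last_18))   # ALARD
```

The two outputs match the first ten and last five residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would be the way to check the GC content row, which is not recomputed here.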