Gene Information |
Locus tag | Cwoe_1306 |
Symbol | |
ID | 8731745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 1379555 |
End bp | 1381051 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 646501924 |
Product | sulfatase |
Protein accession | YP_003393110 |
Protein GI | 284042770 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
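The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of any database API: the function names are illustrative, and it assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, excluding the terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1379555, 1381051)
print(length, protein_length(length))  # prints: 1497 498
```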
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGCCG GCCGTGCCTC CCTGGCCGCG TTCGCCGCGG CCGCACTGCT CTGCATGCTC GCCGCGCGCC CGGCCGGCGC GCAGGTCGCC CCGCCCGCGC AGCCGAACAT CGTCTTCGTG CTCGCCGACG ACCTGTCGTG GGATCTCGTC GAGCACATGC CGAACGTCCA GCGGCTGCGG CGCGACGGCG TCACCTTCTC CGACTACGTC GTGACCGACT CGCTCTGCTG CCCGTCGCGC GCGTCGATCC TGACGGGCAG ATATCCGCAC AACACCGGCA TCTACCGCAA CAGCGGCGCG GACGGCGGCT TCCTCGCGTT CCGCGCGCGG GGGCTGGAGC AGGCGACGTT CGCGACCGCG CTGCAGGCGG CCGGCTACCG CACCGCGCTG ATGGGCAAGT ACATGAACCA GTACAGCCCG GGGCGCGTGC GCGACGCGCT CGGCCGCCCG TACGTGCCGC CGGGCTGGAC CGACTGGCGC GTCGCGGGCA ACGGCTATCC CGGCTACGGC TACCGGCTGA CGAGCGGCGA GGGCGTCGAG CGGCGCGGCC ACGCGGCGCA GGACTACCTG ACCGACGTGC TGCGCCGCGA CGCGATCGGC TTCGTCTCCG GCGCGGTCGC GGCGGGGCAG CCGTTCCTGC TGCAGCTGTC GACGTTCGCG CCGCACACGC CGGCGACACC GGCGCCGCGC GACGAGGACC GCTTCGGCAA CGCGATGGCG CCGCGGACCG CGTCGTTCGA CGAGGCCGAC CTCTCGGACA AGCCGCGCTG GCTGCGCGGC CACCCGCCGC TGACCGCCGC GCAGCAGTGG CGGATCGACG ACCTCTTCCG CGAGCGCGTC CGCTCCGTGC AGGCGGTCGA CCGCGCGATC GGCCGGCTGC GCGAGCAGCT GCGCCGGCTC GGCGTCGCCC GCAGCACGTA CGTCGTCTTC AGCTCCGACA ACGGCTTCCA CATGGGTCAG CACCGCCTCA CGCCCGGCAA GCTGACGGCG TACGACGCCG ACGTGCGCGT GCCGCTGATC GTCGCAGGGC CGGGGGTGCC GGCGGGGGCG ACGGTGTCCG AGATCGCCGA GAACGTCGAC CTCTGCCCGA CCTTCTCGGA GCTGGGCGGG GCGGTCGCGC CGGCGGGCGT GGACGGGCGC AGCCTCGTGC CCCTGCTGCA TGGGAGCCCG GTCGCCGAAT GGCGCGAGGC GGCGCTGATC GAGCATCGCG GGACGGTGAC CTCGCCGGCG GACCCGGACT TCCCGGAGCG CGGCAGCGGC AACCCGCCGA CGTACGAGGC GCTGCGCACG CGCGACGCTC TCTACGTCGA GTACGCCGAC GGCGAGCGCG AGCTCTACGA TCGCCGGGTG GATCCCGACG AGCTGGACAA CATCGCCGCG GAGGCGCCGC CCGAGCGGCT CGCGCGGCTG TCGGCCGCGC TGCGGGCGAT GCGGACGTGC GCCGGCCCCG GCTGCCGTGG TGCCGCGCCG CTGGGGACCG GGGTGCTGAC CCCGTGA
Protein sequence | MKAGRASLAA FAAAALLCML AARPAGAQVA PPAQPNIVFV LADDLSWDLV EHMPNVQRLR RDGVTFSDYV VTDSLCCPSR ASILTGRYPH NTGIYRNSGA DGGFLAFRAR GLEQATFATA LQAAGYRTAL MGKYMNQYSP GRVRDALGRP YVPPGWTDWR VAGNGYPGYG YRLTSGEGVE RRGHAAQDYL TDVLRRDAIG FVSGAVAAGQ PFLLQLSTFA PHTPATPAPR DEDRFGNAMA PRTASFDEAD LSDKPRWLRG HPPLTAAQQW RIDDLFRERV RSVQAVDRAI GRLREQLRRL GVARSTYVVF SSDNGFHMGQ HRLTPGKLTA YDADVRVPLI VAGPGVPAGA TVSEIAENVD LCPTFSELGG AVAPAGVDGR SLVPLLHGSP VAEWREAALI EHRGTVTSPA DPDFPERGSG NPPTYEALRT RDALYVEYAD GERELYDRRV DPDELDNIAA EAPPERLARL SAALRAMRTC AGPGCRGAAP LGTGVLTP
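The sequences above are printed in blocks of 10 characters, so the spaces must be removed before any analysis. The sketch below is illustrative only: the helper names are hypothetical, it uses only the Python standard library, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). It shows how the GC content and the translation could be recomputed from the gene sequence; the example runs on the first 30 bases of the record.

```python
from itertools import product

# Amino acids in NCBI order for bases T, C, A, G (same assignments in tables 1 and 11).
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = clean_sequence("ATGAAGGCCG GCCGTGCCTC CCTGGCCGCG")
print(translate(prefix))  # prints: MKAGRASLAA
```

Passing the full gene sequence through `clean_sequence`, then `gc_fraction` and `translate`, gives values that can be compared against the GC content, protein length and protein sequence fields of the record.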