Gene Cwoe_1306 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_1306 
Symbol 
ID8731745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp1379555 
End bp1381051 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content75% 
IMG OID646501924 
Productsulfatase 
Protein accessionYP_003393110 
Protein GI284042770 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCCG GCCGTGCCTC CCTGGCCGCG TTCGCCGCGG CCGCACTGCT CTGCATGCTC 
GCCGCGCGCC CGGCCGGCGC GCAGGTCGCC CCGCCCGCGC AGCCGAACAT CGTCTTCGTG
CTCGCCGACG ACCTGTCGTG GGATCTCGTC GAGCACATGC CGAACGTCCA GCGGCTGCGG
CGCGACGGCG TCACCTTCTC CGACTACGTC GTGACCGACT CGCTCTGCTG CCCGTCGCGC
GCGTCGATCC TGACGGGCAG ATATCCGCAC AACACCGGCA TCTACCGCAA CAGCGGCGCG
GACGGCGGCT TCCTCGCGTT CCGCGCGCGG GGGCTGGAGC AGGCGACGTT CGCGACCGCG
CTGCAGGCGG CCGGCTACCG CACCGCGCTG ATGGGCAAGT ACATGAACCA GTACAGCCCG
GGGCGCGTGC GCGACGCGCT CGGCCGCCCG TACGTGCCGC CGGGCTGGAC CGACTGGCGC
GTCGCGGGCA ACGGCTATCC CGGCTACGGC TACCGGCTGA CGAGCGGCGA GGGCGTCGAG
CGGCGCGGCC ACGCGGCGCA GGACTACCTG ACCGACGTGC TGCGCCGCGA CGCGATCGGC
TTCGTCTCCG GCGCGGTCGC GGCGGGGCAG CCGTTCCTGC TGCAGCTGTC GACGTTCGCG
CCGCACACGC CGGCGACACC GGCGCCGCGC GACGAGGACC GCTTCGGCAA CGCGATGGCG
CCGCGGACCG CGTCGTTCGA CGAGGCCGAC CTCTCGGACA AGCCGCGCTG GCTGCGCGGC
CACCCGCCGC TGACCGCCGC GCAGCAGTGG CGGATCGACG ACCTCTTCCG CGAGCGCGTC
CGCTCCGTGC AGGCGGTCGA CCGCGCGATC GGCCGGCTGC GCGAGCAGCT GCGCCGGCTC
GGCGTCGCCC GCAGCACGTA CGTCGTCTTC AGCTCCGACA ACGGCTTCCA CATGGGTCAG
CACCGCCTCA CGCCCGGCAA GCTGACGGCG TACGACGCCG ACGTGCGCGT GCCGCTGATC
GTCGCAGGGC CGGGGGTGCC GGCGGGGGCG ACGGTGTCCG AGATCGCCGA GAACGTCGAC
CTCTGCCCGA CCTTCTCGGA GCTGGGCGGG GCGGTCGCGC CGGCGGGCGT GGACGGGCGC
AGCCTCGTGC CCCTGCTGCA TGGGAGCCCG GTCGCCGAAT GGCGCGAGGC GGCGCTGATC
GAGCATCGCG GGACGGTGAC CTCGCCGGCG GACCCGGACT TCCCGGAGCG CGGCAGCGGC
AACCCGCCGA CGTACGAGGC GCTGCGCACG CGCGACGCTC TCTACGTCGA GTACGCCGAC
GGCGAGCGCG AGCTCTACGA TCGCCGGGTG GATCCCGACG AGCTGGACAA CATCGCCGCG
GAGGCGCCGC CCGAGCGGCT CGCGCGGCTG TCGGCCGCGC TGCGGGCGAT GCGGACGTGC
GCCGGCCCCG GCTGCCGTGG TGCCGCGCCG CTGGGGACCG GGGTGCTGAC CCCGTGA
 
Protein sequence
MKAGRASLAA FAAAALLCML AARPAGAQVA PPAQPNIVFV LADDLSWDLV EHMPNVQRLR 
RDGVTFSDYV VTDSLCCPSR ASILTGRYPH NTGIYRNSGA DGGFLAFRAR GLEQATFATA
LQAAGYRTAL MGKYMNQYSP GRVRDALGRP YVPPGWTDWR VAGNGYPGYG YRLTSGEGVE
RRGHAAQDYL TDVLRRDAIG FVSGAVAAGQ PFLLQLSTFA PHTPATPAPR DEDRFGNAMA
PRTASFDEAD LSDKPRWLRG HPPLTAAQQW RIDDLFRERV RSVQAVDRAI GRLREQLRRL
GVARSTYVVF SSDNGFHMGQ HRLTPGKLTA YDADVRVPLI VAGPGVPAGA TVSEIAENVD
LCPTFSELGG AVAPAGVDGR SLVPLLHGSP VAEWREAALI EHRGTVTSPA DPDFPERGSG
NPPTYEALRT RDALYVEYAD GERELYDRRV DPDELDNIAA EAPPERLARL SAALRAMRTC
AGPGCRGAAP LGTGVLTP