Gene Cwoe_3065 details

Gene Information

Locus tag: Cwoe_3065
Symbol:
ID: 8733511
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 3268848
End bp: 3271208
Gene Length: 2361 bp
Protein Length: 786 aa
Translation table: 11
GC content: 70%
IMG OID: 646503680
Product: sulfatase
Protein accession: YP_003394859
Protein GI: 284044519
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
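
The start, end, gene length and protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 3268848
END_BP = 3271208
GENE_LENGTH_BP = 2361
PROTEIN_LENGTH_AA = 786


def span_length(start, end):
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons in a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the coordinates, the 2361 bp gene length and the 786 aa protein length are mutually consistent.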


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAGTC AGAGCAACGG AGTGAGCCTG CCGAAGCCCG ACCAGGTGTA CAGCGGCCCG 
CTCGCGTTCG ACGTCCGCGA GCTGCCGGAG AGACTGCCGG CGATCGAGGC CAAGCGCCCG
CCAAGAGGCG CGCCGAACGT GCTGATCATG CTGCTCGACG ACGTCGGCTT CGGTGCCTCC
AGCACGTTCG GCGGGCCGAT CCACACGCCG ACCGCGGAGC GGCTCGCGGG CAGCGGCCTG
CGCTACAGCC GCTTCCACAC GACCTCGCTC TGCTCGCCGA CGCGCGCGGC GCTGCTGTCC
GGGCGCAACC ACCACGCGGT CGGCATGGGC CACATCACCG AGACGGCGAC GCCGGCGCCC
GGCTACCGCT CGACCCGCCC CAACTCCGCG ACGCCGCTGG CGGAGATCCT GCGCCAGAAC
GGCTACAACA CCGCGCAGTT CGGCAAGTGC CACGAGGTTC CCGTGTGGGA GTCGGGACCG
ACCGGGCCGT TCGACCACTG GCCGGTCAAC AGCGGCTTCG AGCGCTTCTT CGGCTTCATC
GGCGGCGAGA CGAACCAGTT CACGCCGGCG CTCGTGGACG GCACCAGCAC GGTCGAGCCG
CCCGACGATC CCGACTATCA CCTGATGACC GACCTCGCCG ACCGCACGAT CGCGTACGTC
CGCGAGCAGC AGTCGCTGAC ACCCGACAAG CCGTTCTTCG CCTACCTCGC GCCCGGCGCG
ACCCACACGC CCCATCACGT CCCCAGAGCG TGGATCCACA AGTACAAGGG GCAGTTCGAC
GACGGCTGGG ACGCGCTGCG GGAACGCTCG ATCGCCCGCC AGCGAGAGCT GGGCGTCGTC
GCCGCCGACT GCGAGCTGAC GAGACGGCCC GACGGGATCG CCGCGTGGGC CGACGTGCCG
GCGGAATGGA GACCGATCCT CTCCCGCCAG ATGGAGGTCT ACGCCGCGTT CCTCGAGTAC
GCCGACACCG AGTGCGGCCG CGTGCTCGAC GCGCTGGAGG AGCTGGGGGT GCTCGACGAC
ACGCTCGTGC TGTACATCAT CGGCGACAAC GGGGCCTCGG CCGAGGGCGG GCTCAACGGC
GCGTACATGA TCACGACCGC GTCCAACGGC GGCGGCGAGT ACGAGACGCC GCAGTTCTGG
CAGGAGCACC TGTCCGAGGT CGGCGGCCCG AGAGCGTACA ACCACTATTC GGTGGCCTGG
GCGCATGCGC TGTGCGCGCC GTACCAGTGG ACCAAGCAGG TCGCCTCGCA CTACGGCGGG
ACGCGCAACG GCACGATCGT CCACTGGCCG GCGCGGATCA GAGCGAGCGG CGAGGTGCGC
ACGCAGTGGC ACCACGTGAT CGACGTCGCG CCGACGATCC TCGAGCTGGC CGGGATCGCC
GAGCCCGACA CCGTCAACGG CGTCACGCAG GTGCCGATGC AGGGCGTCTC GTTCGCCTAC
TCGTTCGACG ACGCCGACGC GGACGAGCGC CATCACACGC AGTACTTCGA GCTGATGGGC
AACCGCGGGA TCTACCACCG CGGCTGGACC GCCGTCACGA AGCACCGCAC GCCGTGGGAC
GTCGTCTCCA GACCGCACCC GTTCGACGCC GACGTGTGGG AGCTGTACGA CACGACGAGC
GACTGGAGCC AGGCGAGAGA CCTCTCCGCC GAGCAGCCGG CGAAGCTCGC CGAGCTGCAG
CAGCTGTTCC TGATCGAGGC GGTGCGGCAC AACGTGCTGC CGCTCGACGA TCGCGCGGCG
GAGCGGATGA ACCCCGAGAT CGCCGGCCGT CCGACGCTCG TCACCGGCGA CCGCCTGCGG
CTCTACCCGG GGATGAAGCG GCTCGGCGAG AACGTCGCGA TCAACGTCAA GAACCGCTCC
TACTCCGTCA CCGCCGAGGT CGAGGTCGAG ACGGACGGCA GCGCGAGCGG CGCGCTCGTC
GCGCAGGGCG GCCGGACCGG CGGCTGGGCG CTGCTCGTGA CCGACGGCGG CAGACTCGCC
TACCACTACA ACTACTGCGG CCTCAGACGG GCGACGATCG AGTCGAGCGC GCCGATCGCG
GCCGGGAGGC ACCAGCTGCG GGCCGAGTTC GGCTACGACG GCGGTGGGAT CGGCCGCGGC
GGCAGCGTCA CGCTGTTCGT CGACGGGAGC GCGGTCGGGG AGGGGAGGGT GGAGCGCACG
CACCCGCTCT ACTTCTCCTT CGACGAAGGG CTCGACGTCG GCCTCGACAG CGGGATGCCG
GTGTTCGAGG GCTACGCGGC GCCGAGAGGC CGCTTCGGCG GCGGATCGAT CCTGTGGGCG
CAGATCGACC TCGGCGGCGA CGATCACAAC CACCTCGTCC CGCCGGAGGC GCACCTGCAG
GCTGCTTTGA TCCACCAATG A
 
Protein sequence
MSSQSNGVSL PKPDQVYSGP LAFDVRELPE RLPAIEAKRP PRGAPNVLIM LLDDVGFGAS 
STFGGPIHTP TAERLAGSGL RYSRFHTTSL CSPTRAALLS GRNHHAVGMG HITETATPAP
GYRSTRPNSA TPLAEILRQN GYNTAQFGKC HEVPVWESGP TGPFDHWPVN SGFERFFGFI
GGETNQFTPA LVDGTSTVEP PDDPDYHLMT DLADRTIAYV REQQSLTPDK PFFAYLAPGA
THTPHHVPRA WIHKYKGQFD DGWDALRERS IARQRELGVV AADCELTRRP DGIAAWADVP
AEWRPILSRQ MEVYAAFLEY ADTECGRVLD ALEELGVLDD TLVLYIIGDN GASAEGGLNG
AYMITTASNG GGEYETPQFW QEHLSEVGGP RAYNHYSVAW AHALCAPYQW TKQVASHYGG
TRNGTIVHWP ARIRASGEVR TQWHHVIDVA PTILELAGIA EPDTVNGVTQ VPMQGVSFAY
SFDDADADER HHTQYFELMG NRGIYHRGWT AVTKHRTPWD VVSRPHPFDA DVWELYDTTS
DWSQARDLSA EQPAKLAELQ QLFLIEAVRH NVLPLDDRAA ERMNPEIAGR PTLVTGDRLR
LYPGMKRLGE NVAINVKNRS YSVTAEVEVE TDGSASGALV AQGGRTGGWA LLVTDGGRLA
YHYNYCGLRR ATIESSAPIA AGRHQLRAEF GYDGGGIGRG GSVTLFVDGS AVGEGRVERT
HPLYFSFDEG LDVGLDSGMP VFEGYAAPRG RFGGGSILWA QIDLGGDDHN HLVPPEAHLQ
AALIHQ
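
The sequences above are laid out in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch, assuming plain Python with no external libraries; `strip_blocks` and `gc_fraction` are hypothetical helper names introduced here for illustration. It shows how the blocked text could be joined and how a GC fraction, such as the GC content reported in the record, could be computed from the cleaned string.

```python
def strip_blocks(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGCAGTC AGAGCAACGG AGTGAGCCTG CCGAAGCCCG ACCAGGTGTA CAGCGGCCCG"
cleaned = strip_blocks(first_line)
print(len(cleaned), round(gc_fraction(cleaned), 2))
```

Passing the full gene sequence through the same two functions would give a value to compare against the listed GC content; the result for a single line need not match the whole-gene figure.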