Gene Cwoe_5750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5750 
Symbol 
ID8736226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp6155467 
End bp6156984 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content71% 
IMG OID646506377 
Productsulfatase 
Protein accessionYP_003397526 
Protein GI284047186 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.457372 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGACG ACGGACGGAT CTCACGCCGC GCCCTGCTGC GCGGCGGCGG CGCGTCGGCC 
GGCCTCGCCG CGCTGACCGG CGCCGGCGTG CTCGGCGCCG CCGCGACCGC GCAGGCGCAG
GACGGCGACA GAGCGAGAGA GCGCCCCAAC GTCGTGCTGA TCGTCGCTGA CCGCCTGCGG
GCCGACTACG TCGGCGCCTA CGACGACGTC TTCGACGACC GCCACGCCAA GACGCCCAAC
ATCGACGAGC TGGCCGACCA GGCGCTGCGC TTCAAGTACG CCGTCCCGGA CGGCATGCCG
GCGATCCCGA TGCGCCGCGG GATGCTGACC GGGATGCGCA GCTACCCGTT CCGCGACTGG
AGAGCGACGG CGGGGATGCC AGCGATCCCC GGCTTCAACA AGATCTACGA CTTCCAGCCG
ATGGTCACCG AGCTGGCCGC CGCCGCCGGG ATCACGACCG TCTACGTGAC CGACAACCCG
ACGTTCACCG GCCCGCGCTT CGGCCGGATC GTGCGGACCG GCCCGCTCGC GACCTCGGCC
GGCTTCGAGT CGACCGAGCG CGACTACCTG CTGCCGCTCG GCGGCACGGT CCAGCGCAGA
CGCCAGGAGC CGACGAGCCG CGTCCTGCGC GAGGGGATCG AGCAGCTCGA CCAGCTGAGA
GGCAGACAGC CGTTCTTCCT GACGCTCGAC GCGTTCGACC CCAACGACGC GTTCCGGCTG
CCGCGCCAGT TCGTCGAGGG CAGCGGCCCG CTGCACGACG ACGTCACGCT GCCGAAGGAC
CGCGTCTACC AGCAGACGTT CAGAGCCGAC GGCGACGTCA AGGGCGAGGT GCGCGACCGC
TACGCCGCCG AGGTCGAGTC GGTCGACGGC TGGGTCGGCA GATTCCTCGA CAAGCTCGAC
GACGCCGGCC TCGCCGACAA CACCGTCGTC GTCTTCGTCG GCGACTCCGG CATCGCGCTC
GGCGAGCAGG GCGTCTACGG CCACCCCGCC GGCGTCTGGC ACCGGCGCGC CTACCACGTG
CCGTTCCTGA TCCGCGACCC CGACGGCCGC TGGGCCGGCG ACACCAGCAA GTGGTTCGCC
TCCACGCACG ACATCCCGTC GACGCTGCTC TCCTACTGGG GCATCACGAG CCCGGGCAGA
ATGCAGGGCG AGGACCTGAC GACGCTGTTC GACGACTTCG ACCTGCCGGC GCGGCCGTAC
TACACGACCG CGATCGACAC GCACATCGTC ACGGGCAGCC GCACGTGGCT GCTGATCGGC
CGCTCCGACC AGGACCGCTG GCGCCTGTAC GAGGCCGAGG ACGAGGACGA GCCGGACGAG
ATCCGCACCG AGACCGTCAA GTCGCCGACC GTGCTCGAGG AGATGAGACG GTATGCGCTG
GCCGCCGCCG GCGGCACGCT GCCCGACTTC GGCGACACGG CCGCGATACG GCCGGCACAG
CCCGACTCGC TCGACAAGAA GGTCGCCGAC GACGGCACGC TCGACGAGGA CGAGGCGGAG
GCGAACGAGC TTCGATGA
 
Protein sequence
MADDGRISRR ALLRGGGASA GLAALTGAGV LGAAATAQAQ DGDRARERPN VVLIVADRLR 
ADYVGAYDDV FDDRHAKTPN IDELADQALR FKYAVPDGMP AIPMRRGMLT GMRSYPFRDW
RATAGMPAIP GFNKIYDFQP MVTELAAAAG ITTVYVTDNP TFTGPRFGRI VRTGPLATSA
GFESTERDYL LPLGGTVQRR RQEPTSRVLR EGIEQLDQLR GRQPFFLTLD AFDPNDAFRL
PRQFVEGSGP LHDDVTLPKD RVYQQTFRAD GDVKGEVRDR YAAEVESVDG WVGRFLDKLD
DAGLADNTVV VFVGDSGIAL GEQGVYGHPA GVWHRRAYHV PFLIRDPDGR WAGDTSKWFA
STHDIPSTLL SYWGITSPGR MQGEDLTTLF DDFDLPARPY YTTAIDTHIV TGSRTWLLIG
RSDQDRWRLY EAEDEDEPDE IRTETVKSPT VLEEMRRYAL AAAGGTLPDF GDTAAIRPAQ
PDSLDKKVAD DGTLDEDEAE ANELR