Gene Cwoe_5751 details


Gene Information

Locus tag: Cwoe_5751
Symbol:
ID: 8736227
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 6156984
End bp: 6158522
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 69%
IMG OID: 646506378
Product: sulfatase
Protein accession: YP_003397527
Protein GI: 284047187
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
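
The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon (the listed gene sequence ends in TGA).

```python
# Values copied from the Gene Information table.
START_BP = 6156984
END_BP = 6158522
GENE_LENGTH_BP = 1539
PROTEIN_LENGTH_AA = 512


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 6158522 - 6156984 + 1 gives 1539 bp, and 1539 / 3 - 1 gives 512 aa, matching the table.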


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGCAC GGCGATTCAC GCGGCGCGGT GCGCTGAAGG TCGGTGCGGC CGGAGCGGCC 
GCGGCTGGCC TCTCCGCCTG TGGCAACGAG TACGAGGAGA CGTCCGGACG CGCCGACGGC
GCGCCGAACG TCGTGCTGAT CATCACGGAC TCCACGCGTG CGGACTACAT CGGTGCGTAC
AACCCCAACT CGCTTGCCAG AACGCCCAAC CTCGACGAGC TGTTCAAACG CTCGCTGAAG
TTCGAGCTGG CGATCCCGGA GGCGATGCCG ACCGGGCTCG TGCGGCGCTC CGTGCTGACG
GGCATGCGCT CGTTCCCCAA CCGCGACTGG GTGCTCTCCC CGCCGATGCC GGCCGAGGTC
GGCTGGACCC GCCCGCTGCC GCACCAGCCG CTGCTGACCG AGGAGCTCGG CAAGGCCGGC
GTCGAGACCG CGTACGTGAC CGACAACCCG TTCATCGTCG GCCCGCGCTA CACCGACTTC
CGGCGCACGC TCGACATCGG CCGGCCCGAC TTCTCCCAGG GCGCCTACCG CGCCTTCAAC
ACGCCCTTCA GACGCCCCGC GCCGCGCAGC GCGATCGAGA AGTACCTGCT GCCCGCGCTG
TCGGACACCG TCGAGGTGCG CCGCCTGCAG GACTACGTCG GCTGGAACAG CATGTACCGC
GTCGGCGAGC GCAACTACTC CGCCGCGCGC GTCATGCGCG GCGGGATGGA CGCGCTCGAC
GACCTCAAGG ACAAGCAGCC GTTCTTCCTC GGCGTCGACT CGTTCGACCC GCACGAGCCG
TTCGACGCGC CGCCGCTCTA CGTGCGCGAG ATCGAGGGCC AGCCGAAGGG GATCCAGCGC
GACAGAGGGA TCATGCCGAT CCAGCCGTTC CTGACGCCCG CCTCCAGACT CGACAAGATC
GACGTCGACC CCGAGACGCT GCAGCTCATA CGCGAGCTGT ACGCGGCCGA GCTGACGTTC
GTCGACGTCT GGATCGGCAA GCTGCTGAAC AAGCTCGACG ACCTCGGGCT CTCCGACAAC
ACCGTCGTCT ACTTCCTCAG CGACCACGGC CTGACGCTCG GCGAGCACGG CATCATCGGC
AAGTCGACGC CGCGGCCGTA CCGCGAGATC CACCACATCC CGTACCTGAT CCACGACCCG
TCGGGCCGGA TGGCCGGCAA GACGAGCCGC TACTTCGCCT CCACGCACGA CGTCGCCCGC
ACGGTCATGT CGTTCATGGG CGTCCGCGCG CCCGGCCTGA TGAACGGCGA GGACCTGACC
GTCCTGTTCG ACGGCCGCGA GCCGCCGGCA CGGCCGTACT ACACCTCCTG CTACCAGGAG
ACGTGGATCT GCGGCGACTA CGACTGGCTG CTGATCTCGA CGCCGGACGG CTCCAGAAGA
GAGCTGTACG ACCTCGTCAA CGACCCCGGC AACACGAGAG ACGTCGCGAG CGATCCCGCC
AACCAGGCGA CGATCGACAC GCTCTGGACG GTCCTGATCA ACGAGGCGGG CGGCACGCTG
CCCGTCTTCC ACGACCACAT CGGCGTGGTG GGCGGCTGA
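
The gene sequence above is printed in groups of 10 bases, so the grouping whitespace has to be removed before the sequence can be measured. The following is a minimal sketch of how a GC fraction like the "GC content" field could be recomputed; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is included as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace used for 10-base grouping and line wrapping."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as listed; paste the remaining lines
# into this string to evaluate the whole gene.
FIRST_LINE = "ATGGCGGCAC GGCGATTCAC GCGGCGCGGT GCGCTGAAGG TCGGTGCGGC CGGAGCGGCC"

if __name__ == "__main__":
    seq = clean_sequence(FIRST_LINE)
    print(f"{len(seq)} bases, GC = {gc_fraction(seq):.0%}")
```

The value for this single line will differ from the whole-gene figure, since it covers only the first 60 bases.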
 
Protein sequence
MAARRFTRRG ALKVGAAGAA AAGLSACGNE YEETSGRADG APNVVLIITD STRADYIGAY 
NPNSLARTPN LDELFKRSLK FELAIPEAMP TGLVRRSVLT GMRSFPNRDW VLSPPMPAEV
GWTRPLPHQP LLTEELGKAG VETAYVTDNP FIVGPRYTDF RRTLDIGRPD FSQGAYRAFN
TPFRRPAPRS AIEKYLLPAL SDTVEVRRLQ DYVGWNSMYR VGERNYSAAR VMRGGMDALD
DLKDKQPFFL GVDSFDPHEP FDAPPLYVRE IEGQPKGIQR DRGIMPIQPF LTPASRLDKI
DVDPETLQLI RELYAAELTF VDVWIGKLLN KLDDLGLSDN TVVYFLSDHG LTLGEHGIIG
KSTPRPYREI HHIPYLIHDP SGRMAGKTSR YFASTHDVAR TVMSFMGVRA PGLMNGEDLT
VLFDGREPPA RPYYTSCYQE TWICGDYDWL LISTPDGSRR ELYDLVNDPG NTRDVASDPA
NQATIDTLWT VLINEAGGTL PVFHDHIGVV GG
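
The protein sequence corresponds to the gene sequence read codon by codon. As an illustration, the sketch below translates DNA using the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). The `translate` helper is hypothetical and deliberately minimal: it does not special-case alternative start codons, which does not matter here because this gene begins with ATG, and it raises `KeyError` on ambiguous bases such as N.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop 10-base grouping whitespace
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed in this record.
    print(translate("ATGGCGGCAC GGCGATTCAC GCGGCGCGGT"))  # MAARRFTRRG
```

The printed peptide matches the first 10 residues of the protein sequence in this record.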