Gene Cwoe_1305 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_1305 
Symbol 
ID8731744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp1378083 
End bp1379558 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content73% 
IMG OID646501923 
Productsulfatase 
Protein accessionYP_003393109 
Protein GI284042769 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCAGA GGATCGACCG CCGCACGTTG CTCGGGGCGG CCGGCGCGGG CGTCGCCGGC 
GGCGCGCTGC TGGCACATGT CCCGCCCGCC CGCGCCCGCG CGGCCGCGGC GGCGGCCGGT
CCCAACGTGC TCGTGCTCGT GCTCGACTCG CTCCGCTACG ACCACGTCGG AGCGAACGGG
AACGCGTGGA TCAGAACGCC CAACATCGAC GCGCTCGCAC GCGAGAGCGT GCGCTTCACG
CGCGCCTTCC CGGAGGCGAT GCCGACAGTG CCGGCGCGCC GTTCGCTCCT GACGAGCCGC
CGCGTCTACC CGTGGACGAC GTGGAGGCCG ACGCGCAACC TGCCCGACAG TCCCGGCTGG
ACCGGGCTCG GGTGGAGCGA GCAGACGTGG CTCAGAGCGC TGAGAGCGCA CGGCTACTGG
ACCGGCTACG TGACCGACAA CCCGTTCCTG GCGTTCGCCT CGGCGTGGAA GCCGCTGCGC
AGAGGCGTCG ACCGCTTCAT GCGGATCGGC GGTCAGGTCG GCGCGCTGCG GCCGGCGTCG
ACCGTCTCGC TCGCGTCCGC CCGCCACTGG CTTCCGCGCG ACATGCAGAC GGACGGCTAC
GTGAGCGGGA TGCGCCAGCA CCTCGCGAAC CTCGGCGGCG CGCGCGACGA GCGCGAGCAG
TGCTGCGCGC GCGTCTTCTC GCACGCGCTC GACGTGCTCG GCAGCGCCCA GCGCTCGAAG
AAGCCGTTCG CGCTCGTCGT CGACTGCTTC GACCCGCACG AGCCGTGGGC GCCGGTGCGC
AAGTACCGCG ACATGTACGG CGACGACGGC TACAGAGCGA ACGAGCCGGG CAACGTCCGC
TACCGGCCGG CGAGATACCT GACCGGCGAC GAGCTGCGCC GGCTGCCGCA GCTGTACGCG
GCGGCAGTCA CGCAGACCGA CGCCTGGCTC GGCCACTTCC TCGGCCGCTT CTACGACGCC
GGCCTCGCCG ACAGCACCGC GATCGTCCTG CTCAGCGATC ACGGGATCGT GCTCGGCGAC
CGCGGCTGGA CGGGCAAGCC GGCGCTGCAG CTGCACCCGG AGCTGATCCA GGTGCCGTTC
CTGCTGCGCG CGCCCGACCG CCGCGGCGCC GGCACGACGA GCGGCTACTT CGCCTCGCCG
CACGACGTCG GCCCGACGCT GCTGGCGATG ACCGGTGTGC CGCGGCCGGA GCGGATGGAC
GGGGCCGACC TCTCGCCGCT GCTGAGCGGT GGGGCGCCCG CGACCCCGCG GCCGTTCTGG
TTCGGCGGCT ACGCCAACCA CTGGTACTAC CGCGACGACC GCTGGGCGCT GATCGCGGAC
GGCAACAACA GAGGCCGCGG CCTGTACGAC CTGCGCAAGG ACCCCGGCGA GCGGGACAAC
GTCGCGTCCG AGCACCTGCG TCTGCTCGCG CAGATCCACG GCACGGTGCT GCGCGCTGCG
GGGTTGAAGC GGTTGCCCTA TTTTGGAGCG AGATGA
 
Protein sequence
MEQRIDRRTL LGAAGAGVAG GALLAHVPPA RARAAAAAAG PNVLVLVLDS LRYDHVGANG 
NAWIRTPNID ALARESVRFT RAFPEAMPTV PARRSLLTSR RVYPWTTWRP TRNLPDSPGW
TGLGWSEQTW LRALRAHGYW TGYVTDNPFL AFASAWKPLR RGVDRFMRIG GQVGALRPAS
TVSLASARHW LPRDMQTDGY VSGMRQHLAN LGGARDEREQ CCARVFSHAL DVLGSAQRSK
KPFALVVDCF DPHEPWAPVR KYRDMYGDDG YRANEPGNVR YRPARYLTGD ELRRLPQLYA
AAVTQTDAWL GHFLGRFYDA GLADSTAIVL LSDHGIVLGD RGWTGKPALQ LHPELIQVPF
LLRAPDRRGA GTTSGYFASP HDVGPTLLAM TGVPRPERMD GADLSPLLSG GAPATPRPFW
FGGYANHWYY RDDRWALIAD GNNRGRGLYD LRKDPGERDN VASEHLRLLA QIHGTVLRAA
GLKRLPYFGA R