Gene Cwoe_2363 details

Gene Information

Locus tag: Cwoe_2363
Symbol:
ID: 8732806
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 2500091
End bp: 2502442
Gene Length: 2352 bp
Protein Length: 783 aa
Translation table: 11
GC content: 71%
IMG OID: 646502980
Product: sulfatase
Protein accession: YP_003394162
Protein GI: 284043822
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
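
The coordinate and length fields above are consistent with one another, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the reported protein length excludes the stop codon. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2500091
end_bp = 2502442
reported_gene_length = 2352      # bp
reported_protein_length = 783    # aa

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the final codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, codons, protein_length)
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions the computed gene length (2352 bp) and protein length (783 aa) match the reported values.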


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0892214
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCCCTCA CCGAGTACAC CGCCGGGACG GTCTTTCCCG GCGTCATCGG GCGCACCGCC 
GACGAGTCGA CTCCTGCGTG GCCAGCCCCC GCCCGGGCCG AACGCGGGTC GCCGAACGTG
CTGACGGTCG TCCTCGACGA CACGGGCTTC GGGCAGCTCG GCTGCTACGG CAGCCCGATC
CGCACCCCCA ACCTCGACCG CGTCGCCGCG GCGGGGGTGC GCTTCAGCAA CATGCACACG
ACGGCGCTGT GCTCGCCGAG CCGCTCGTGC ATCCTCACGG GGCGCAACCA CCACAGCAAC
GGCATGGCGG CGATCACGGA GCTGGCGACC GGCTACCCGG GCTACAACGG GAACATCCCG
TTCGAGAACG GCTTCCTCTC CGAGACGCTC GCCGAGCACG GCTACAGCAC GTACATGATC
GGCAAGTGGC ACCTCATCCC CAGCAGCCAG GAGACGGGCG CCGGGCCGTA CGACCGCTGG
CCGCTCGGCC GCGGCTACCA GCGCTTCTAC GGCTTCCTCG GCGGCGACAC CAGCCAATGG
GAGCCGGACC TGACCTACGA CAACCACCAG GTCGAGCCGC CGCGCACCGC AGCCGAGGGG
TACCACCTGA CAGCGGATCT GGCCGACAAG TCGATCCAGT TCATCGCCGA CGCCAAGCAG
GTCGATCCGG ACAAGCCGTT CCACCTGCAC TTCTGCCCAG GCGCGACCCA CGCCCCGCAC
CACGTCCCCA AGGCGTGGGC CGACGGCTAC GCGGGAGCGT TCGACGACGG CTGGGACGCC
TACCGCGAGC AGGTGTTCGC CCGCCAGGTC GAGCTCGGCA TCGTGCCCGC CGACGCCGAG
CTGTCGCGCC ACGACCCGGA CGTGCCCGAG TGGGGCGCGC TGACCGACGA TCAGCGGCGG
CTCTACAGCC GGATGATGGA GGTGTTCGCC GGGTTCCTGG AGCACACCGA CCACCACATC
GGGCGGGTGC TCGACTTCCT CGAGTCGATC GGCGAGCTCG ACAACACCAT CGTCATGGTC
GTCTCCGACA ACGGCGCCAG CGCCGAGGGC GGCCTCACCG GCACGACCAA CGAGGCGCAG
TTCTTCAACA ACGCGCCGGA GCCGTTCGAG GACAGCCTCG CGGCGATCGA CGAGCTGGGC
GGCCCGCGCC ACTTCAACCA CTACCCATGG GGCTGGACGT GGGCGGGCAA CACCCCGTTC
CGCCGCTGGA AGCGCGAGAC CTACCGCGGC GGCACCAGCG ACCCGTTCAT CGTCGCGTGG
CCGCAGGGGA TCGCCGCGCG CGGCGAGGTC CGCACCCAGT ACGCGCACAT CATCGACATG
GTGCCGACGA TCCTCGAGCT GGTGGGCATC GACGCGCCCA AGACGATCCG CGGCGTGACG
CAGTCCCCGC TGCACGGCGT GAGCTTCGCG CACGCGCTCG CCGATCCCGG CGCCGACAGC
AGACGCCGGA CGCAGTACTT CGAGATGCTC GGCCACCGCG CGATCTACCA CGACGGCTGG
CGCGCCGTCT GCCCATGGCC AGGTCCGTCG TTCGCGGAGG CCGAGCAGGC GTTCGGCGAC
CCGATCTCCG AGCACCAGCT CGCCCAGCTC GACCAGAGCG GGTGGGAGCT TTACCACGTC
GCCGTCGACT TCGCCGAGAA CCACGACGTC GCCGCGGACA ACCGCGAGCG GCTGATCGCG
CTGATCGGCA CCTGGTACGC CGAGGCCGGA AAGTACGACG TGCTGCCGGT CGACGGCAGC
GCGGTCGCGC GGATGGTCTC GGAGAAGCCG TTGGTCGCGG CACCCCGCGA CACGTTCACG
TACTACCCGA ACACGCAGTC GGTGCCGTTC TTCGCCGCGC CGCGCGTGCT CAACCGCCCG
CACAGCATCA CCGCCGACGT CGAGATCCCC GACGGCGGCG CGGAGGGCGT GCTGCTGTGC
CAGGGCACGT CAGCCGGCGG GTACTCGTTC TACCTCCGAG ACGGCCGCCT GCACTACGTC
CACAACTGGG TCGCCCGCGA GCTGTTCCGC GTCTCCTCCC CCGACGCGAT CCCCGCCGGC
CGGCACGAGC TGCGCTTCGA GTTCGAGCCG ACAGGAGCGC CGGACATAGC AGCCGGGCGC
GGCGCGCCCG GCCTGTTCCA GCTCTACGTT GACGGGACGC TCGTGGCCGA GACAGAGGCC
CCCTACACGA CGGCCATCGT GTTCAACCCG GGCCAGCTGA CCTGCGGCGC CGACCCGGGC
TCGACCGTCG TCGCCGACTA CCCCGCGCCG TTCCGCTTCA GCGGGACGCT GCACACCGTG
AAGGTCGACC TCTCCGGCCA CCTGATCACC GACGAGGAGG CCGAGCTGCG CATGGCTCTG
GCGCGGCAGT AG
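
The GC content field in the record can in principle be recomputed from the gene sequence above. The sketch below is one hedged way to do that in Python: `gc_percent` is a hypothetical helper (not part of the source database) that strips the whitespace used for the 10-base blocks and counts G and C. It is demonstrated only on the first two blocks of the sequence, not on the full gene.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces, newlines) used for display blocks is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence, as displayed.
first_blocks = "TTGTCCCTCA CCGAGTACAC"
print(gc_percent(first_blocks))
```

To check the whole gene, the full sequence text can be passed as a single string; the record reports the result rounded to a whole percent.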
 
Protein sequence
MSLTEYTAGT VFPGVIGRTA DESTPAWPAP ARAERGSPNV LTVVLDDTGF GQLGCYGSPI 
RTPNLDRVAA AGVRFSNMHT TALCSPSRSC ILTGRNHHSN GMAAITELAT GYPGYNGNIP
FENGFLSETL AEHGYSTYMI GKWHLIPSSQ ETGAGPYDRW PLGRGYQRFY GFLGGDTSQW
EPDLTYDNHQ VEPPRTAAEG YHLTADLADK SIQFIADAKQ VDPDKPFHLH FCPGATHAPH
HVPKAWADGY AGAFDDGWDA YREQVFARQV ELGIVPADAE LSRHDPDVPE WGALTDDQRR
LYSRMMEVFA GFLEHTDHHI GRVLDFLESI GELDNTIVMV VSDNGASAEG GLTGTTNEAQ
FFNNAPEPFE DSLAAIDELG GPRHFNHYPW GWTWAGNTPF RRWKRETYRG GTSDPFIVAW
PQGIAARGEV RTQYAHIIDM VPTILELVGI DAPKTIRGVT QSPLHGVSFA HALADPGADS
RRRTQYFEML GHRAIYHDGW RAVCPWPGPS FAEAEQAFGD PISEHQLAQL DQSGWELYHV
AVDFAENHDV AADNRERLIA LIGTWYAEAG KYDVLPVDGS AVARMVSEKP LVAAPRDTFT
YYPNTQSVPF FAAPRVLNRP HSITADVEIP DGGAEGVLLC QGTSAGGYSF YLRDGRLHYV
HNWVARELFR VSSPDAIPAG RHELRFEFEP TGAPDIAAGR GAPGLFQLYV DGTLVAETEA
PYTTAIVFNP GQLTCGADPG STVVADYPAP FRFSGTLHTV KVDLSGHLIT DEEAELRMAL
ARQ
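
The gene sequence begins with TTG rather than ATG, yet the protein sequence begins with M. This is expected under translation table 11, where TTG can act as an initiation codon and is then read as methionine. The following Python sketch illustrates the idea. It is not the database's own pipeline: `translate` is a hypothetical helper, the codon table is built from the standard amino acid assignments (which table 11 shares), and `START_CODONS` is an assumed subset containing only the three most common bacterial start codons.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the table 11 initiation codons, for illustration.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a coding sequence, reading an initial start codon as M."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("TTGTCCCTCA CCGAGTACAC CGCCGGGACG"))  # MSLTEYTAGT
# Last 12 bases of the gene sequence above, ending in the TAG stop codon.
print(translate("GCGCGGCAGT AG"))  # ARQ
```

Both outputs agree with the first ten and last three residues of the protein sequence listed above.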