Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_4747 |
Symbol | |
ID | 8735213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 5064228 |
End bp | 5065781 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646505376 |
Product | sulfatase |
Protein accession | YP_003396535 |
Protein GI | 284046195 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.833748 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.42259 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCCCG AGATCGACCG ACGAGAACTG CTGCGCGCCG CCGGCATCGG AGCCGCCGGC GTCGGCCTGC TCGGGGTCGC GGGGTGCGGG AGCGGCGACG GGACGGCGAA TGCCGCGCGC GGCGAGACGG CGACGATCGC GGTCAAGCCG AAGCCGCCGC GGACGCCGGT GCGGGTCGGC AGAAAGCTGA AGAAGGGCGA CGTCCCGAAC CTGCTGCTCG TGATCATCGA CTCCGTCCGC GCCGACGCGC TCGGCTCCTA CGGCCGCCGC AACGCGCACA CGCCGAACCT CGACGCGCTC GCGCGCGAGT CGCTGCGCTT CACCGAGTGC TACCCGGAGT CGTTCCCGAC CGGCCCAGCG CGCGCGACGA TCTTCGGCGG CTCGCGCCTG TTCCCGTTTC GCGACTGGAA GGCGCCGGCC GATATGCCCG GCACGCCCGG CTGGCAGGCG GTGCCGGACG TGAACCTGAT CTCGACGCTC AGACGCGCCG GCTACTGGAC CGGCTTCGCG GTCGACACGC CGTGGGTGAT GGTCGGCTCC CAGCAGCCGT TCCTGCGCGA CTGGGACAGA TACGTCCCGG TCAAGGGCCA GACCGGCACG GTCACCGCCG ACCAGTCGAA GATCAGCGAC GCCGAGCTGG CGAAGTGGGT CGCGCCGAAG ATCATCGACT CCAGCTCGGG TCAGAAGATG CGCCAGTACC TGGCCAACCA GCTCGGGCGC AGAAACGAGG ACGAGTACCT CCCCGCGCGC GTCTTCACCG AGGGGATGCG GCTGCTGGAG GAAGGGTCGA AGTCGAGAAA GCCGTTCGCG ATCGCGATCG ACTGCTTCGA CCCGCACGAG CCGTGGGACC CGCCGGAGAG ATACCTCAAG CTGCACGGCG GCGACCTCGA CCGCGCCTGG AACCCCGGCA CGGTCCTCAA CGGCACCGCC AGATCGAACG GCCTCGCGCC GCGCGACGTC AAGCAGATGC AGGCGCTCTA CTACGCCGAG CTGACGATGG CCGACCGCTG GTTCGGCAAC TTCATGCAGC GCTTCCACGA GCTGGGCCTC GAGAGAGACA CGATCGTGAT GTTCCTCTCC GACCACGGCT TCCTGCTCGG CGAGCGCGGC TACGTCGCGA AGTTCGCATG GGAGCTGCAC CCGGAGCTGA CCCACGTGCC GATGCTGCTG CGCCGCCCCG ACGGGACGGG CGCGCGCAAG AAGACCGACT TCTACGCGCA GACCGAGGAC GTCGCCGCGA CGCTGCTCGG CGCGACCGGG ATCAAGCAGC CCGAGTGGAT GGACGGCATC GACCTGATGC CGCTGTCCGA GGGCAAGAAG CCGAAGAAGA GACGCGACTA CGTCACCGGC TCCTACAGCT CCGTCGTCTT CGCGCGCGAT CGCAACTGGT CCTACATCGC CGACAGCCAG GGCGGCAGAC CGGAGCTGTA CAACCTCCGG CGCGACCGCC GCGAGGTCCG CAACCTCGCG GGCAGCAACG CCGCGCAGGT CCGCAAGATG TACCGCGGCA TGCTCGTCAG AGACGCCGGC GGGCCGCTGC CGAAGTTCAC GTAG
|
Protein sequence | MNPEIDRREL LRAAGIGAAG VGLLGVAGCG SGDGTANAAR GETATIAVKP KPPRTPVRVG RKLKKGDVPN LLLVIIDSVR ADALGSYGRR NAHTPNLDAL ARESLRFTEC YPESFPTGPA RATIFGGSRL FPFRDWKAPA DMPGTPGWQA VPDVNLISTL RRAGYWTGFA VDTPWVMVGS QQPFLRDWDR YVPVKGQTGT VTADQSKISD AELAKWVAPK IIDSSSGQKM RQYLANQLGR RNEDEYLPAR VFTEGMRLLE EGSKSRKPFA IAIDCFDPHE PWDPPERYLK LHGGDLDRAW NPGTVLNGTA RSNGLAPRDV KQMQALYYAE LTMADRWFGN FMQRFHELGL ERDTIVMFLS DHGFLLGERG YVAKFAWELH PELTHVPMLL RRPDGTGARK KTDFYAQTED VAATLLGATG IKQPEWMDGI DLMPLSEGKK PKKRRDYVTG SYSSVVFARD RNWSYIADSQ GGRPELYNLR RDRREVRNLA GSNAAQVRKM YRGMLVRDAG GPLPKFT
|
| |