Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_5751 |
Symbol | |
ID | 8736227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 6156984 |
End bp | 6158522 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646506378 |
Product | sulfatase |
Protein accession | YP_003397527 |
Protein GI | 284047187 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGCAC GGCGATTCAC GCGGCGCGGT GCGCTGAAGG TCGGTGCGGC CGGAGCGGCC GCGGCTGGCC TCTCCGCCTG TGGCAACGAG TACGAGGAGA CGTCCGGACG CGCCGACGGC GCGCCGAACG TCGTGCTGAT CATCACGGAC TCCACGCGTG CGGACTACAT CGGTGCGTAC AACCCCAACT CGCTTGCCAG AACGCCCAAC CTCGACGAGC TGTTCAAACG CTCGCTGAAG TTCGAGCTGG CGATCCCGGA GGCGATGCCG ACCGGGCTCG TGCGGCGCTC CGTGCTGACG GGCATGCGCT CGTTCCCCAA CCGCGACTGG GTGCTCTCCC CGCCGATGCC GGCCGAGGTC GGCTGGACCC GCCCGCTGCC GCACCAGCCG CTGCTGACCG AGGAGCTCGG CAAGGCCGGC GTCGAGACCG CGTACGTGAC CGACAACCCG TTCATCGTCG GCCCGCGCTA CACCGACTTC CGGCGCACGC TCGACATCGG CCGGCCCGAC TTCTCCCAGG GCGCCTACCG CGCCTTCAAC ACGCCCTTCA GACGCCCCGC GCCGCGCAGC GCGATCGAGA AGTACCTGCT GCCCGCGCTG TCGGACACCG TCGAGGTGCG CCGCCTGCAG GACTACGTCG GCTGGAACAG CATGTACCGC GTCGGCGAGC GCAACTACTC CGCCGCGCGC GTCATGCGCG GCGGGATGGA CGCGCTCGAC GACCTCAAGG ACAAGCAGCC GTTCTTCCTC GGCGTCGACT CGTTCGACCC GCACGAGCCG TTCGACGCGC CGCCGCTCTA CGTGCGCGAG ATCGAGGGCC AGCCGAAGGG GATCCAGCGC GACAGAGGGA TCATGCCGAT CCAGCCGTTC CTGACGCCCG CCTCCAGACT CGACAAGATC GACGTCGACC CCGAGACGCT GCAGCTCATA CGCGAGCTGT ACGCGGCCGA GCTGACGTTC GTCGACGTCT GGATCGGCAA GCTGCTGAAC AAGCTCGACG ACCTCGGGCT CTCCGACAAC ACCGTCGTCT ACTTCCTCAG CGACCACGGC CTGACGCTCG GCGAGCACGG CATCATCGGC AAGTCGACGC CGCGGCCGTA CCGCGAGATC CACCACATCC CGTACCTGAT CCACGACCCG TCGGGCCGGA TGGCCGGCAA GACGAGCCGC TACTTCGCCT CCACGCACGA CGTCGCCCGC ACGGTCATGT CGTTCATGGG CGTCCGCGCG CCCGGCCTGA TGAACGGCGA GGACCTGACC GTCCTGTTCG ACGGCCGCGA GCCGCCGGCA CGGCCGTACT ACACCTCCTG CTACCAGGAG ACGTGGATCT GCGGCGACTA CGACTGGCTG CTGATCTCGA CGCCGGACGG CTCCAGAAGA GAGCTGTACG ACCTCGTCAA CGACCCCGGC AACACGAGAG ACGTCGCGAG CGATCCCGCC AACCAGGCGA CGATCGACAC GCTCTGGACG GTCCTGATCA ACGAGGCGGG CGGCACGCTG CCCGTCTTCC ACGACCACAT CGGCGTGGTG GGCGGCTGA
|
Protein sequence | MAARRFTRRG ALKVGAAGAA AAGLSACGNE YEETSGRADG APNVVLIITD STRADYIGAY NPNSLARTPN LDELFKRSLK FELAIPEAMP TGLVRRSVLT GMRSFPNRDW VLSPPMPAEV GWTRPLPHQP LLTEELGKAG VETAYVTDNP FIVGPRYTDF RRTLDIGRPD FSQGAYRAFN TPFRRPAPRS AIEKYLLPAL SDTVEVRRLQ DYVGWNSMYR VGERNYSAAR VMRGGMDALD DLKDKQPFFL GVDSFDPHEP FDAPPLYVRE IEGQPKGIQR DRGIMPIQPF LTPASRLDKI DVDPETLQLI RELYAAELTF VDVWIGKLLN KLDDLGLSDN TVVYFLSDHG LTLGEHGIIG KSTPRPYREI HHIPYLIHDP SGRMAGKTSR YFASTHDVAR TVMSFMGVRA PGLMNGEDLT VLFDGREPPA RPYYTSCYQE TWICGDYDWL LISTPDGSRR ELYDLVNDPG NTRDVASDPA NQATIDTLWT VLINEAGGTL PVFHDHIGVV GG
|
| |