Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_1305 |
Symbol | |
ID | 8731744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 1378083 |
End bp | 1379558 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646501923 |
Product | sulfatase |
Protein accession | YP_003393109 |
Protein GI | 284042769 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCAGA GGATCGACCG CCGCACGTTG CTCGGGGCGG CCGGCGCGGG CGTCGCCGGC GGCGCGCTGC TGGCACATGT CCCGCCCGCC CGCGCCCGCG CGGCCGCGGC GGCGGCCGGT CCCAACGTGC TCGTGCTCGT GCTCGACTCG CTCCGCTACG ACCACGTCGG AGCGAACGGG AACGCGTGGA TCAGAACGCC CAACATCGAC GCGCTCGCAC GCGAGAGCGT GCGCTTCACG CGCGCCTTCC CGGAGGCGAT GCCGACAGTG CCGGCGCGCC GTTCGCTCCT GACGAGCCGC CGCGTCTACC CGTGGACGAC GTGGAGGCCG ACGCGCAACC TGCCCGACAG TCCCGGCTGG ACCGGGCTCG GGTGGAGCGA GCAGACGTGG CTCAGAGCGC TGAGAGCGCA CGGCTACTGG ACCGGCTACG TGACCGACAA CCCGTTCCTG GCGTTCGCCT CGGCGTGGAA GCCGCTGCGC AGAGGCGTCG ACCGCTTCAT GCGGATCGGC GGTCAGGTCG GCGCGCTGCG GCCGGCGTCG ACCGTCTCGC TCGCGTCCGC CCGCCACTGG CTTCCGCGCG ACATGCAGAC GGACGGCTAC GTGAGCGGGA TGCGCCAGCA CCTCGCGAAC CTCGGCGGCG CGCGCGACGA GCGCGAGCAG TGCTGCGCGC GCGTCTTCTC GCACGCGCTC GACGTGCTCG GCAGCGCCCA GCGCTCGAAG AAGCCGTTCG CGCTCGTCGT CGACTGCTTC GACCCGCACG AGCCGTGGGC GCCGGTGCGC AAGTACCGCG ACATGTACGG CGACGACGGC TACAGAGCGA ACGAGCCGGG CAACGTCCGC TACCGGCCGG CGAGATACCT GACCGGCGAC GAGCTGCGCC GGCTGCCGCA GCTGTACGCG GCGGCAGTCA CGCAGACCGA CGCCTGGCTC GGCCACTTCC TCGGCCGCTT CTACGACGCC GGCCTCGCCG ACAGCACCGC GATCGTCCTG CTCAGCGATC ACGGGATCGT GCTCGGCGAC CGCGGCTGGA CGGGCAAGCC GGCGCTGCAG CTGCACCCGG AGCTGATCCA GGTGCCGTTC CTGCTGCGCG CGCCCGACCG CCGCGGCGCC GGCACGACGA GCGGCTACTT CGCCTCGCCG CACGACGTCG GCCCGACGCT GCTGGCGATG ACCGGTGTGC CGCGGCCGGA GCGGATGGAC GGGGCCGACC TCTCGCCGCT GCTGAGCGGT GGGGCGCCCG CGACCCCGCG GCCGTTCTGG TTCGGCGGCT ACGCCAACCA CTGGTACTAC CGCGACGACC GCTGGGCGCT GATCGCGGAC GGCAACAACA GAGGCCGCGG CCTGTACGAC CTGCGCAAGG ACCCCGGCGA GCGGGACAAC GTCGCGTCCG AGCACCTGCG TCTGCTCGCG CAGATCCACG GCACGGTGCT GCGCGCTGCG GGGTTGAAGC GGTTGCCCTA TTTTGGAGCG AGATGA
|
Protein sequence | MEQRIDRRTL LGAAGAGVAG GALLAHVPPA RARAAAAAAG PNVLVLVLDS LRYDHVGANG NAWIRTPNID ALARESVRFT RAFPEAMPTV PARRSLLTSR RVYPWTTWRP TRNLPDSPGW TGLGWSEQTW LRALRAHGYW TGYVTDNPFL AFASAWKPLR RGVDRFMRIG GQVGALRPAS TVSLASARHW LPRDMQTDGY VSGMRQHLAN LGGARDEREQ CCARVFSHAL DVLGSAQRSK KPFALVVDCF DPHEPWAPVR KYRDMYGDDG YRANEPGNVR YRPARYLTGD ELRRLPQLYA AAVTQTDAWL GHFLGRFYDA GLADSTAIVL LSDHGIVLGD RGWTGKPALQ LHPELIQVPF LLRAPDRRGA GTTSGYFASP HDVGPTLLAM TGVPRPERMD GADLSPLLSG GAPATPRPFW FGGYANHWYY RDDRWALIAD GNNRGRGLYD LRKDPGERDN VASEHLRLLA QIHGTVLRAA GLKRLPYFGA R
|
| |