Gene Information |
Locus tag | Bcenmc03_5213 |
Symbol | |
ID | 6128024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia MC0-3 |
Kingdom | Bacteria |
Replicon accession | NC_010515 |
Strand | + |
Start bp | 2311767 |
End bp | 2313317 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641652304 |
Product | choline-sulfatase |
Protein accession | YP_001778832 |
Protein GI | 170737572 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
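The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the numbers agree with that reading) and that the final codon of the CDS is a stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2311767, 2313317)
print(length, protein_length_aa(length))  # prints "1551 516"
```

With the values from this record, 2313317 - 2311767 + 1 gives 1551 bp, and 1551 / 3 - 1 gives 516 aa, matching the Gene Length and Protein Length fields.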
| |
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAACC CGACACCCAA CATCCTGATC CTGATGGCCG ACCAGCTCAC GCCGTTTGCG CTGCCGGCGT ACGGCAACCG CGTCGCCCGT ACGCCGACGC TCGACCGGCT CGCCGCCGAA GGCGTGGTGT TCGACGCCGC GTACTGCGCG AGCCCTTTGT GCGCGCCGTC GCGCTTCTCG CTGCTGACCG GCAAGCTGCC GTCGGGAATC GGCGCCTACG ATAACGCCGC CGAATTGCCG GCGCAAACGC TGACGTTCGC GCACTACCTG CGCGCGGGCG GCTACAGGAC GATGCTGTCC GGCAAGATGC ATTTCTGCGG ACCCGACCAG TTGCACGGCT TCGAGGAGCG CCTCACGACC GACATCTATC CGGCCGATTT CGGCTGGGTG CCGGACTGGG ACCAGCCGAC CGAGCGGCCG AGCTGGTATC ACAACATGAG CTCGGTGCTC GATGCCGGCC CGTGCGTGCG CACGAACCAG CTCGACTTCG ACGACGAAGT GACGTTCGCC GCGAAGCAGA AGCTGTACGA CGTCGCGCGC GAGCGCGCGG CCGGGCACGA TGCGCGGCCG TTCTGCATGG TCGTATCGCT GACCCATCCG CACGACCCGT ATGCGATCAC GCGCGAATAC TGGGATCTGT ACCGCGACGA AGACATCGAC ATGCCGGCCG TGCGGCTCGA TGCGGCCGAA AGCGATCCGC ATTCGCAGCG GCTGCGCTTC GTCTGCGAGA ACGACCGCAC GCCGCCGACC GATGCGCAGA TCCGCGCGGC CCGCCGCGCG TATTACGGTG CGACGTCCTA CGTCGACACG CAGTTCGGCA GCGTGCTGGC CGCGCTCGAG CAATGCGGGT TCGCCGACGA CACGATCGTG ATCGTCACGT CCGACCACGG CGACATGCTC GGCGAACGCG GGCTCTGGTA CAAGATGACG TTCTTCGAAG GCGGCTGCCG CGTGCCGCTG ATCGTGCATG CGCCGGGCCG CTTCGGCGCT GCGCGGGTGC GCGGGCCCGT GTCGCACGTC GACCTGCTGC CGACGCTCGT CGAGCTGGCC GGCGCGGCGC CCGCCGGCGG CTGGCCGGAC GCCGGATGGC CGGACCCGGT CGACGGCACG AGCCTCGTGC CGCACCTGCA CGGCACGCCC GCGCACGATG TCGCGCTCGG CGAATACCTC GCGGAAGGCG CGCTCGCGCC GGTCGTGATG ATCCGCCGCG GCGACTGGAA ATACGTGCAT TGCCCGGCCG ATCCCGATCA GCTCTACCAC CTGTCGGACG ACCCGCGCGA GCTGACGAAC CTGGCCGGGC AGCCGGAAGC CGCCGACGTG CTCGCCGCGT TTCGCGCGGA GGCCGCGCAG CGCTGGAACC TGCCCGAACT GGACCGGCAG GTGCGCGCGA GCCAGCGGCG CCGGCGCTTC CATTACGCGG CGACGACGCA GGGCCGCATC CACGCGTGGG ACTGGCAGCC GTTCACCGAC GCGAGCCAGC GCTACATGCG CAATCACATC GAACTCGACG CGCTCGAGGC GATGGCGCGT TTTCCGCGCG TCGGGCGCTG A
|
Protein sequence | MTNPTPNILI LMADQLTPFA LPAYGNRVAR TPTLDRLAAE GVVFDAAYCA SPLCAPSRFS LLTGKLPSGI GAYDNAAELP AQTLTFAHYL RAGGYRTMLS GKMHFCGPDQ LHGFEERLTT DIYPADFGWV PDWDQPTERP SWYHNMSSVL DAGPCVRTNQ LDFDDEVTFA AKQKLYDVAR ERAAGHDARP FCMVVSLTHP HDPYAITREY WDLYRDEDID MPAVRLDAAE SDPHSQRLRF VCENDRTPPT DAQIRAARRA YYGATSYVDT QFGSVLAALE QCGFADDTIV IVTSDHGDML GERGLWYKMT FFEGGCRVPL IVHAPGRFGA ARVRGPVSHV DLLPTLVELA GAAPAGGWPD AGWPDPVDGT SLVPHLHGTP AHDVALGEYL AEGALAPVVM IRRGDWKYVH CPADPDQLYH LSDDPRELTN LAGQPEAADV LAAFRAEAAQ RWNLPELDRQ VRASQRRRRF HYAATTQGRI HAWDWQPFTD ASQRYMRNHI ELDALEAMAR FPRVGR
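As a rough illustration of how the protein sequence relates to the gene sequence, the following sketch translates a nucleotide string and computes its GC fraction. It is a minimal example, not the pipeline that produced this record: the helper names are hypothetical, the codon table is built from the standard amino acid assignments (which translation table 11 shares with the standard code), and the alternative start codons that table 11 permits are not handled, since this gene begins with ATG. Whitespace is stripped first because the sequences above are printed in blocks of ten.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGACGAACC CGACACCCAA CATCCTGATC"))  # prints "MTNPTPNILI"
print(translate("TTTCCGCGCG TCGGGCGCTG A"))           # prints "FPRVGR"
```

The two printed fragments match the first ten and last six residues of the protein sequence above. Passing the full gene sequence to `gc_content` would give the fraction that the GC content field reports as a rounded percentage.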