Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BamMC406_5009 |
Symbol | |
ID | 6179725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia ambifaria MC40-6 |
Kingdom | Bacteria |
Replicon accession | NC_010552 |
Strand | - |
Start bp | 2166353 |
End bp | 2167903 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641684761 |
Product | choline-sulfatase |
Protein accession | YP_001811671 |
Protein GI | 172064020 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAACC CGACTCCCAA CATCCTGATC CTGATGGCCG ACCAGCTGAC GCCGTTCGCG CTGCCGGCCT ACGGCAACCG CGTCGCGCGC ACGCCGACGC TCGATCGTCT CGCCGCTCAA GGCGTGGTGT TCGACGCCGC GTATTGCGCG AGCCCGCTGT GCGCGCCGTC GCGCTTCTCG CTGCTGACCG GCAAGCTGCC GTCGGGTATC GGCGCCTACG ATAACGCCGC CGAATTGCCG GCGCAAACGC TGACGTTCGC GCATTACCTG CGCGCGGGCG GCTACCGCAC GATGCTGTCC GGCAAGATGC ATTTCTGCGG ACCCGACCAA CTGCACGGTT TCGAGGAGCG GCTCACGACC GACATCTATC CGGCCGACTT CGGCTGGGTG CCCGACTGGG ATCACCCGAC CGAGCGGCCG AGCTGGTATC ACAACATGAG CTCGGTGCTC GAGGCCGGCC CGTGCGTACG CACGAACCAG CTCGACTTCG ACGACGAGGT GACGTTCGCG GCCAGGCAGA AGCTGTACGA CGTCGCGCGC GAGCGCGCGG CCGGGCACGA CGCGCGGCCG TTCTGCATGG TCGTGTCGCT GACCCATCCG CACGATCCGT ATGCGATCAC GCGCGAATAC TGGGATCTGT ACAGCGACGA CGAGATCGAC ATGCCGGCCT TGCGGCTCGA TGCCGAGCAG AGCGATCCGC ATTCGCAGCG GCTGCGCTTC GTCTGCGAAA ACGACCGCAC GCCGCCGACC GACGCGCAGA TCCGCGCCGC GCGCCGCGCG TACTACGGCG CGACGTCGTA CGTCGACGCG CAGTTCGGCA GCGTGCTGGC CGCGCTCGAG CAATGCGGGT TCGCCGACGA CACGATCGTG ATCGTCACGT CCGACCACGG CGACATGCTC GGCGAGCGCG GGCTCTGGTA CAAGATGACG TTCTTCGAAG GCGGCTGCCG CGTGCCGCTG ATCGTCCACG CGCCGGGCCG CTTCGACGCC GCGCGCGTGC GCGGGCCCGT GTCGCACCTC GACCTGCTGC CGACGCTCGT CGACCTGACC GGTGCGGCGC CCGCCGGCGG CTGGCCGGAC CCCGGATGGC CGGACCCGGT CGACGGCGCG AGCCTCGTGC CGCATCTGCG AGGCGCACCA GCGCACGACG TCGCGCTGGG CGAATACCTC GCGGAAGGGG CGGTCGCGCC GGTCGTGATG ATCCGCCGCG GCGACTGGAA GTACGTGCAT TGCCCGGCCG ATCCCGACCA GCTCTACCAT CTCGCCGACG ATCCGCGCGA GCGCACGAAC CTGGCCGGCC TGCCCGAAGC CGCCGACGTG CTCGCTGCCT TCCGTGCGGA GGCCGCGCAG CGCTGGAACC TGCCGGAACT CGACGCGCAG GTGCGCGCGA GCCAGCGTCG CCGCCGCTTC CATTACGCGG CGACGACGCA GGGCCGCATC CAGGCGTGGG ACTGGCAGCC GTTCACCGAT GCGAGCCAGC GCTACATGCG CAATCACATC GAACTCGATA CGCTCGAAGC GATGGCGCGC TTTCCGCGCG TCGGGCGCTG A
|
Protein sequence | MTNPTPNILI LMADQLTPFA LPAYGNRVAR TPTLDRLAAQ GVVFDAAYCA SPLCAPSRFS LLTGKLPSGI GAYDNAAELP AQTLTFAHYL RAGGYRTMLS GKMHFCGPDQ LHGFEERLTT DIYPADFGWV PDWDHPTERP SWYHNMSSVL EAGPCVRTNQ LDFDDEVTFA ARQKLYDVAR ERAAGHDARP FCMVVSLTHP HDPYAITREY WDLYSDDEID MPALRLDAEQ SDPHSQRLRF VCENDRTPPT DAQIRAARRA YYGATSYVDA QFGSVLAALE QCGFADDTIV IVTSDHGDML GERGLWYKMT FFEGGCRVPL IVHAPGRFDA ARVRGPVSHL DLLPTLVDLT GAAPAGGWPD PGWPDPVDGA SLVPHLRGAP AHDVALGEYL AEGAVAPVVM IRRGDWKYVH CPADPDQLYH LADDPRERTN LAGLPEAADV LAAFRAEAAQ RWNLPELDAQ VRASQRRRRF HYAATTQGRI QAWDWQPFTD ASQRYMRNHI ELDTLEAMAR FPRVGR
|
| |