Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcenmc03_3976 |
Symbol | |
ID | 6126809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia MC0-3 |
Kingdom | Bacteria |
Replicon accession | NC_010515 |
Strand | - |
Start bp | 908128 |
End bp | 910098 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641651084 |
Product | sulfatase |
Protein accession | YP_001777617 |
Protein GI | 170736357 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0677618 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGT CCGCGTCGCC GTTGCATGCT TTTCGTTTCC GTGTCGTCTG CGCCGCGATT GCCGGCGCGG TGTCGCTCGC GTCGTGCGGC GGCGTCGACA GCGATGCGCC GCCGTCGCAG GCCGGCGTCA CGCCGACGCC TACCCCGACC CAGGCCGCGA AACGCCCGAA CATCCTGTAC ATCATGGCCG ACGATCTCGG CTATTCCGAC ATCCATGCGT TCGGCGGCGA GATCAACACG CCGAACCTCG ACGCGCTCGT CGCATCGGGC CGCATCCTGT CGAACCATCA CACGGGCACC GTCTGCGCGA TCACGCGCGC GATGCTGGTG TCCGGCACCG ATCACCATCT CGTCGGCGAA GGCACGATGG GCGTGCCGAC CGACGAGCGG CGCGGGCTGC CCGGCTACGA GGGCTACCTG AACGATCGCG CGCTGTCGTT CGCGCAACTG CTGAAGGACG CCGGCTATCA CACGTATATC GCGGGCAAGT GGCACATCGG CTCGGGCATC GTCGGCAGTA CGACGGGCAG CGGGCAGACG CCCGACCAGT GGGGCTTCGA GCGCAGTTAC GTGCTGCTCG GCGGCGCGGC GACGAACCAC TTCGCGCACG AGCCGGCCGG CTCGTCGAAC TACACGGAAG ACGGCCGCTA CGTGCAGCCG GGCCAGCCCG GGCAGCCGGG CGGCACGGGC GGCAGCCCGG CCGTGTTCTA TTCGACCGAT TTCTATACGC AGAAGCTGAT CTCGTACATC GATGCGAACA AGCAGGACGG CAAGCCGTTC TTCGCGTACG CGGCCTACAC GTCGCCGCAC TGGCCGCTGC AGGTGCCCGA TCCGTGGCTG CACAAATATG CGGGCGTGTA CGACGCCGGC TACGATGCGA TCCGCAACGC GCGGATCGCG CGGCAGAAGG CACTCGGCCT GATTCCGGCC GATTTCAAGC CGTTCGACGG GCTGCCCGAG ACGACGGTCG CCTCGCCCGC GACCGCGAAC AACGGCACGG CCAGCGCGAA GTACATCAAC GCGGTGCATT CGGCCGCCGA CGGCTACAGC GACTACGGCC CCGGCAAGGT CGACAAGCTG TGGTCGAGCC TGTCGCCGGC CGAACGCAAG GCGCAGGCGC GCTACATGGA GATCTACGCG GGGATGGTCG AGAACCTCGA CTACAACATC GGCCTGCTGA TCCAGCACCT GAAGGACATC GGCGAATACG ACAACACGTT CATCATGTTC CAGTCGGACA ACGGCGCGGA AGGCTGGCCG ATCGATTCCG GCGCCGACCC GACCGCCACC GACACCGCGA ACGCGCAGGA GCCGACCTAT TCGGCGCTCG GCACCGACAA CGGCAAGCAG AATGCGCAGC GCCTGCAGTA CGGGCTGCGC TGGGCCGAAG TGAGCGCGGC GCCGTTCCGG CTCACGAAGG GCTATTCGGG CGAAGGCGGC GTATCGACGC CGACGATCGT GCACCTGCCG GGCCAGTCGC AATCGTTGCC GACGCTGCGC GCGTTCACGC ACGTGACCGA CAACACGGCG ACGTTCCTCG CGGTCGCGGG TGTCACGCCG CCGTCGCAGC CGGCGCCGCC GCTCGTCAAC ACGCTGACGG GCGTCGACCA GAACAAGGGC AAGGTGGTCT ACAACAACCG CTACGTGTAT CCGGTCACGG GCCAGTCGCT GCTGCCGGTG CTCACCGGTT CGGCGACGGG CGAAGTGCAC ACGACGCCGT TCGGCGACGA AGCGTACGGC CGCGCGTATC TGCGCAGCGC CGACGGCCGC TGGAAGGCAT TGTGGACCGA GCCGCCGCTC GGCCCGCTCG ACGGTCACTG GCAGCTTTAC GACCTCGCGT CGGATCGCGG CGAGACGACC GACGTGTCCG CGCAGAACCC GTCGGTGATC GGCACGCTGG TCGACCAGTG GAAGACCTAC ATGGGCAACG TCGGCGGCGT CGAACCGCTG CGTCCGCGCG GCTACTACTG A
|
Protein sequence | MKKSASPLHA FRFRVVCAAI AGAVSLASCG GVDSDAPPSQ AGVTPTPTPT QAAKRPNILY IMADDLGYSD IHAFGGEINT PNLDALVASG RILSNHHTGT VCAITRAMLV SGTDHHLVGE GTMGVPTDER RGLPGYEGYL NDRALSFAQL LKDAGYHTYI AGKWHIGSGI VGSTTGSGQT PDQWGFERSY VLLGGAATNH FAHEPAGSSN YTEDGRYVQP GQPGQPGGTG GSPAVFYSTD FYTQKLISYI DANKQDGKPF FAYAAYTSPH WPLQVPDPWL HKYAGVYDAG YDAIRNARIA RQKALGLIPA DFKPFDGLPE TTVASPATAN NGTASAKYIN AVHSAADGYS DYGPGKVDKL WSSLSPAERK AQARYMEIYA GMVENLDYNI GLLIQHLKDI GEYDNTFIMF QSDNGAEGWP IDSGADPTAT DTANAQEPTY SALGTDNGKQ NAQRLQYGLR WAEVSAAPFR LTKGYSGEGG VSTPTIVHLP GQSQSLPTLR AFTHVTDNTA TFLAVAGVTP PSQPAPPLVN TLTGVDQNKG KVVYNNRYVY PVTGQSLLPV LTGSATGEVH TTPFGDEAYG RAYLRSADGR WKALWTEPPL GPLDGHWQLY DLASDRGETT DVSAQNPSVI GTLVDQWKTY MGNVGGVEPL RPRGYY
|
| |