Gene Information |
Locus tag | Bcen2424_3543 |
Symbol | |
ID | 4452812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia HI2424 |
Kingdom | Bacteria |
Replicon accession | NC_008543 |
Strand | + |
Start bp | 383275 |
End bp | 385245 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639695602 |
Product | sulfatase |
Protein accession | YP_837175 |
Protein GI | 116691642 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
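The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 383275
END_BP = 385245
GENE_LENGTH_BP = 1971
PROTEIN_LENGTH_AA = 656


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the fields agree: 385245 - 383275 + 1 = 1971 bp, and 1971 / 3 = 657 codons, of which 656 encode amino acids.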
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGT CCGCGTCGCC GTTGCATGCT TTTCGTTTCC GTGTCGTCTG CGCCGCGATT GCCGGCGCGG TGTCGCTCGC GTCGTGCGGC GGCGTCGACA GCGACGCGCC GCCGTCGCAG GCCGGCGTCA CGCCGACGCC CACCCCGACC CAGGCCGCGA AACGCCCGAA CATCCTGTAC ATCATGGCCG ACGATCTCGG CTATTCCGAC ATCCATGCGT TCGGCGGCGA GATCAACACG CCGAACCTCG ACGCGCTCGT CGCATCGGGC CGCATCCTGT CGAACCATCA CACGGGCACC GTCTGCGCGA TCACGCGCGC GATGCTGGTG TCCGGCACCG ATCACCATCT CGTCGGCGAA GGCACGATGG GCGTGCCGAC CGACGAGCGG CGCGGGCTGC CCGGCTACGA GGGCTACCTG AACGATCGCG CGCTGTCGTT CGCGCAACTG CTGAAGGACG CCGGCTATCA CACGTATATC GCGGGCAAGT GGCACATCGG CTCGGGCATC GTCGGCAGTG CGACGGGCAG CGGGCAGACG CCCGACCAGT GGGGCTTCGA GCGCAGCTAC GTGCTGCTCG GCGGCGCGGC GACGAACCAC TTCGCGCACG AGCCGGCCGG CTCGTCGAAC TACACGGAAG ACGGCCGCTA CGTGCAGCCG GGCCAGCCCG GGCAGCCGGG CGGCACGGGC GGCAGCCCGG CCGTGTTCTA TTCGACCGAT TTCTATACGC AGAAGCTGAT CTCGTACATC GATGCGAACA AGCAGGACGG CAAGCCGTTC TTCGCGTACG CGGCCTACAC GTCGCCGCAC TGGCCGCTGC AGGTGCCCGA TCCGTGGCTG CACAAGTACG CGGGCGTGTA CGACGCCGGC TACGATGCGA TCCGCAACGC GCGGATCGCG CGGCAGAAGG CGCTCGGCCT GATTCCGGCC GATTTCAAGC CGTTCGACGG GCTGCCCGAG ACGACGGTCG CCTCGCCCGC GACCGCGAAC AACGGCACGG CCAGCGCGAA GTACATCAAC GCGGTGCATT CGGCCGCCGA CGGCTACAGC GACTACGGCC CCGGCAAGGT CGACAAGCTG TGGTCGAGCC TGTCGCCGGC CGAGCGCAAG GCGCAGGCGC GCTACATGGA GATCTACGCG GGGATGGTCG AGAACCTCGA CTACAACATC GGCCTGCTGA TCCAGCACCT GAAGGACATC GGCGAATACG ACAACACGTT CATCATGTTC CAGTCGGACA ACGGCGCGGA AGGCTGGCCG ATCGATTCCG GCGCCGACCC GACCGCGACC GACACCGCGA ACGCGCAGGA GCCGACCTAT TCGGCGCTCG GCACCGACAA CGGCAAGCAG AATGCGCAGC GCCTGCAGTA CGGGCTGCGC TGGGCCGAAG TGAGCGCGGC GCCGTTCCGG CTCACGAAGG GCTATTCGGG CGAAGGCGGC GTATCGACGC CGACGATCGT GCACCTGCCG GGCCAGTCGC AATCGTTGCC GACGCTGCGC GCGTTCACGC ACGTGACCGA CAACACGGCG ACGTTCCTCG CGGTCGCGGG CGTCACGCCG CCGTCGCAGC CGGCGCCGCC GCTGGTGAAC ACGCTGACGG GCGTCGATCA GAACAAGGGC AAGGTGATCT ACAACAACCG CTATGTGTAT CCGGTCACGG GCCAGTCGCT GCTGCCGGTG CTCACCGGTT CGGCGACGGG CGAAGTGCAC ACGACGCCGT TCGGCGACGA AGCGTACGGC CGCGCGTATC TGCGCAGCGC CGACGGCCGC TGGAAGGCAT TGTGGACCGA GCCGCCGCTC 
GGCCCGCTCG ACGGTCACTG GCAGCTGTAC GACCTCGCGT CGGATCGCGG CGAGACGACC GACGTGTCCG CGCAGAACCC GTCGGTGATC GGCACGCTGG TCGACCAGTG GAAGACCTAC ATGGGCAACG TCGGCGGCGT CGAACCGCTG CGTCCGCGCG GCTACTACTG A
|
Protein sequence | MKKSASPLHA FRFRVVCAAI AGAVSLASCG GVDSDAPPSQ AGVTPTPTPT QAAKRPNILY IMADDLGYSD IHAFGGEINT PNLDALVASG RILSNHHTGT VCAITRAMLV SGTDHHLVGE GTMGVPTDER RGLPGYEGYL NDRALSFAQL LKDAGYHTYI AGKWHIGSGI VGSATGSGQT PDQWGFERSY VLLGGAATNH FAHEPAGSSN YTEDGRYVQP GQPGQPGGTG GSPAVFYSTD FYTQKLISYI DANKQDGKPF FAYAAYTSPH WPLQVPDPWL HKYAGVYDAG YDAIRNARIA RQKALGLIPA DFKPFDGLPE TTVASPATAN NGTASAKYIN AVHSAADGYS DYGPGKVDKL WSSLSPAERK AQARYMEIYA GMVENLDYNI GLLIQHLKDI GEYDNTFIMF QSDNGAEGWP IDSGADPTAT DTANAQEPTY SALGTDNGKQ NAQRLQYGLR WAEVSAAPFR LTKGYSGEGG VSTPTIVHLP GQSQSLPTLR AFTHVTDNTA TFLAVAGVTP PSQPAPPLVN TLTGVDQNKG KVIYNNRYVY PVTGQSLLPV LTGSATGEVH TTPFGDEAYG RAYLRSADGR WKALWTEPPL GPLDGHWQLY DLASDRGETT DVSAQNPSVI GTLVDQWKTY MGNVGGVEPL RPRGYY
|
| |
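The relationship between the Gene sequence and Protein sequence fields can be illustrated with a minimal translation sketch. It assumes the sequence is read in frame from its first base and that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which this sketch does not model). The `translate` helper is hypothetical. Whitespace is stripped because the record prints the sequence in blocks of ten.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence from this record.
    print(translate("ATGAAAAAGT CCGCGTCGCC GTTGCATGCT"))
```

Applied to the first 30 bases of the gene sequence, the sketch yields the first ten residues of the listed protein sequence (MKKSASPLHA).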