Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcen2424_5650 |
Symbol | |
ID | 4452131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia HI2424 |
Kingdom | Bacteria |
Replicon accession | NC_008543 |
Strand | - |
Start bp | 2764093 |
End bp | 2765904 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639697711 |
Product | sulfatase |
Protein accession | YP_839276 |
Protein GI | 116693743 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.372268 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.201991 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTCCC CCCTGACCCG GCGATTCCGC CGGCATATGC CGTCACGCGT CATCGCGTGC GCGATGGCCT GGCTGCTGAT GCTGTGCGCC GGCGCCGCGC ATGCCGGTGC ATCGCGACCG AACATCGTGT GGATCACCGT CGAGGACATC ACGACCTTCA TCGGCGGATA CGGCGACCCG CAGGTCAAGA CGCCGAACAT CGACCGGCTG GCACGCGAAG GCGTGCTGTA CACGCATGCG TACCAGGTGT CGGGCGTGTG CGCGCCGTCG CGTTCCGCGC TGATCACCGG CGTGTACCCG ACCTCGGTGG GCGCGCAGCA TCACCGGACC GGGCCGGGCG AGATCTCGGT TCCCGGCGTG ACGGCGAAGG ACAAGCCGAA CGGCGTGCCG GCGACCTATT CGGTGGTGCT GCCGCCCGAC GTGAAGGCGT TCCCCGAGCT GCTGCGCAAG GCCGGCTACT ACACGTCGAA CAACCAGAAG ACCGACTACC AGTTCGTGCC GCCGGTGACG GTGTGGGACG AGAACGGCCC CGCCGCGTCC TACCGCTATC GCCCGAAGGA CAAGCCGTTT TTCGCGGTGT TCAACTTCTT CGTCACGCAC GAGTCGATGA TCACCTATCG CAAGGACCCG CTGCGCGCCG ATCCCGCGTC GATCACGGTG CCGCCGATCT ATCCGGACAC GCCGGCGGTG CGCGGCGACA TCGCGCGCAT GTACACCAAC ATCGAGACGA TGGACCGGCA GGTCGGCGAG CTGATCGAGA TGCTCAAGCG CGACGGCGTG TACGACAACA CGATCATCTT CTTCTTCGCG GACAACGGCG GCACGCTGCC GTGGATGAAG CGCGAGGTGC TCGAGCGCGG CACGCGCGTA CCGCTGATCA TTCGCTTCCC CGGCGCGCCG CGAGGCGGGT CCACCGATGC GCAGCTCGTG AGCGGCGTCG ATCTCGCGCC GACCGTGCTG TCGCTGGCCG GCGTGCCGAT TCCGTCGTAC ATGCAGGGGC AGGCGTTCCT CGGGCCGGCG CGCGCATCGG CGCCGCGCCG CTACGTGTTT GCCGCGCGTG ACCGGATGGA CAACGAATAC GACCGCGTGC GGATGGTGCG CGATCAACGC TTCCGCTATC TGTACAACTA CATGCCGGAG AAGCCGTACT ACCAGCCGAT CCGGTTCCGC GAAAGCATGC CGATGATGCG CGACATCCTG CGGCTGAAGG ATGAAGGCAA GCTGCCGCCG GCCACCGCGG CGTGGTTCGG CACGAAGCCG GTCGAGGAGC TGTACGACGC CGATCGGGAC CCGTGCGAGC TGCACAACCT CGCGGACGAT CCGCGCTATC GCGCCAAGCT CGACGAACTG CGTGCCGCCT TCCACGCGTG GACCGATCGT TACGGCGACA TGGGTGGCAT ACCGGAACCC GAAATGATCT CGCGGATGTG GCTCGGCGGC TCGGCGCCCC CCGCCACGGC GACCCCCGAG ATCCGGCCGG CGCCGGGCGG CGTGACGATC GCATGCGCGA CCCAGGGCGC GTCGATCGGC TACTGGGTCG AACGTCGCGA CGACCCGGCG CCGCGCCTCT CGCACACCGT GCTCAGCTGG GACTTCGAAC GGCTCGCCGG CGAAATGCTG CCGCCGAAGC TCGGTGCGCG CTTCGCCCAT CTCGGCGATC AGCGGCCCGT GTCGCCGGCC TGGTCCGTGT ACGACGCGGG GCGTGTGATT CCGTTGCGCC CCGGCGACGT GCTGCACGTC AACGCGATGC GGATCGGCTA TACGGCCGCG ACGCTCGACT ACCCGTTCCC GCAGACGGAA GCGCGCCGCT AG
|
Protein sequence | MNSPLTRRFR RHMPSRVIAC AMAWLLMLCA GAAHAGASRP NIVWITVEDI TTFIGGYGDP QVKTPNIDRL AREGVLYTHA YQVSGVCAPS RSALITGVYP TSVGAQHHRT GPGEISVPGV TAKDKPNGVP ATYSVVLPPD VKAFPELLRK AGYYTSNNQK TDYQFVPPVT VWDENGPAAS YRYRPKDKPF FAVFNFFVTH ESMITYRKDP LRADPASITV PPIYPDTPAV RGDIARMYTN IETMDRQVGE LIEMLKRDGV YDNTIIFFFA DNGGTLPWMK REVLERGTRV PLIIRFPGAP RGGSTDAQLV SGVDLAPTVL SLAGVPIPSY MQGQAFLGPA RASAPRRYVF AARDRMDNEY DRVRMVRDQR FRYLYNYMPE KPYYQPIRFR ESMPMMRDIL RLKDEGKLPP ATAAWFGTKP VEELYDADRD PCELHNLADD PRYRAKLDEL RAAFHAWTDR YGDMGGIPEP EMISRMWLGG SAPPATATPE IRPAPGGVTI ACATQGASIG YWVERRDDPA PRLSHTVLSW DFERLAGEML PPKLGARFAH LGDQRPVSPA WSVYDAGRVI PLRPGDVLHV NAMRIGYTAA TLDYPFPQTE ARR
|
| |