Gene Bcen2424_5650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcen2424_5650 
Symbol 
ID4452131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia cenocepacia HI2424 
KingdomBacteria 
Replicon accessionNC_008543 
Strand
Start bp2764093 
End bp2765904 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content68% 
IMG OID639697711 
Productsulfatase 
Protein accessionYP_839276 
Protein GI116693743 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.372268 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.201991 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCCC CCCTGACCCG GCGATTCCGC CGGCATATGC CGTCACGCGT CATCGCGTGC 
GCGATGGCCT GGCTGCTGAT GCTGTGCGCC GGCGCCGCGC ATGCCGGTGC ATCGCGACCG
AACATCGTGT GGATCACCGT CGAGGACATC ACGACCTTCA TCGGCGGATA CGGCGACCCG
CAGGTCAAGA CGCCGAACAT CGACCGGCTG GCACGCGAAG GCGTGCTGTA CACGCATGCG
TACCAGGTGT CGGGCGTGTG CGCGCCGTCG CGTTCCGCGC TGATCACCGG CGTGTACCCG
ACCTCGGTGG GCGCGCAGCA TCACCGGACC GGGCCGGGCG AGATCTCGGT TCCCGGCGTG
ACGGCGAAGG ACAAGCCGAA CGGCGTGCCG GCGACCTATT CGGTGGTGCT GCCGCCCGAC
GTGAAGGCGT TCCCCGAGCT GCTGCGCAAG GCCGGCTACT ACACGTCGAA CAACCAGAAG
ACCGACTACC AGTTCGTGCC GCCGGTGACG GTGTGGGACG AGAACGGCCC CGCCGCGTCC
TACCGCTATC GCCCGAAGGA CAAGCCGTTT TTCGCGGTGT TCAACTTCTT CGTCACGCAC
GAGTCGATGA TCACCTATCG CAAGGACCCG CTGCGCGCCG ATCCCGCGTC GATCACGGTG
CCGCCGATCT ATCCGGACAC GCCGGCGGTG CGCGGCGACA TCGCGCGCAT GTACACCAAC
ATCGAGACGA TGGACCGGCA GGTCGGCGAG CTGATCGAGA TGCTCAAGCG CGACGGCGTG
TACGACAACA CGATCATCTT CTTCTTCGCG GACAACGGCG GCACGCTGCC GTGGATGAAG
CGCGAGGTGC TCGAGCGCGG CACGCGCGTA CCGCTGATCA TTCGCTTCCC CGGCGCGCCG
CGAGGCGGGT CCACCGATGC GCAGCTCGTG AGCGGCGTCG ATCTCGCGCC GACCGTGCTG
TCGCTGGCCG GCGTGCCGAT TCCGTCGTAC ATGCAGGGGC AGGCGTTCCT CGGGCCGGCG
CGCGCATCGG CGCCGCGCCG CTACGTGTTT GCCGCGCGTG ACCGGATGGA CAACGAATAC
GACCGCGTGC GGATGGTGCG CGATCAACGC TTCCGCTATC TGTACAACTA CATGCCGGAG
AAGCCGTACT ACCAGCCGAT CCGGTTCCGC GAAAGCATGC CGATGATGCG CGACATCCTG
CGGCTGAAGG ATGAAGGCAA GCTGCCGCCG GCCACCGCGG CGTGGTTCGG CACGAAGCCG
GTCGAGGAGC TGTACGACGC CGATCGGGAC CCGTGCGAGC TGCACAACCT CGCGGACGAT
CCGCGCTATC GCGCCAAGCT CGACGAACTG CGTGCCGCCT TCCACGCGTG GACCGATCGT
TACGGCGACA TGGGTGGCAT ACCGGAACCC GAAATGATCT CGCGGATGTG GCTCGGCGGC
TCGGCGCCCC CCGCCACGGC GACCCCCGAG ATCCGGCCGG CGCCGGGCGG CGTGACGATC
GCATGCGCGA CCCAGGGCGC GTCGATCGGC TACTGGGTCG AACGTCGCGA CGACCCGGCG
CCGCGCCTCT CGCACACCGT GCTCAGCTGG GACTTCGAAC GGCTCGCCGG CGAAATGCTG
CCGCCGAAGC TCGGTGCGCG CTTCGCCCAT CTCGGCGATC AGCGGCCCGT GTCGCCGGCC
TGGTCCGTGT ACGACGCGGG GCGTGTGATT CCGTTGCGCC CCGGCGACGT GCTGCACGTC
AACGCGATGC GGATCGGCTA TACGGCCGCG ACGCTCGACT ACCCGTTCCC GCAGACGGAA
GCGCGCCGCT AG
 
Protein sequence
MNSPLTRRFR RHMPSRVIAC AMAWLLMLCA GAAHAGASRP NIVWITVEDI TTFIGGYGDP 
QVKTPNIDRL AREGVLYTHA YQVSGVCAPS RSALITGVYP TSVGAQHHRT GPGEISVPGV
TAKDKPNGVP ATYSVVLPPD VKAFPELLRK AGYYTSNNQK TDYQFVPPVT VWDENGPAAS
YRYRPKDKPF FAVFNFFVTH ESMITYRKDP LRADPASITV PPIYPDTPAV RGDIARMYTN
IETMDRQVGE LIEMLKRDGV YDNTIIFFFA DNGGTLPWMK REVLERGTRV PLIIRFPGAP
RGGSTDAQLV SGVDLAPTVL SLAGVPIPSY MQGQAFLGPA RASAPRRYVF AARDRMDNEY
DRVRMVRDQR FRYLYNYMPE KPYYQPIRFR ESMPMMRDIL RLKDEGKLPP ATAAWFGTKP
VEELYDADRD PCELHNLADD PRYRAKLDEL RAAFHAWTDR YGDMGGIPEP EMISRMWLGG
SAPPATATPE IRPAPGGVTI ACATQGASIG YWVERRDDPA PRLSHTVLSW DFERLAGEML
PPKLGARFAH LGDQRPVSPA WSVYDAGRVI PLRPGDVLHV NAMRIGYTAA TLDYPFPQTE
ARR