Gene Bcen2424_3543 details


Gene Information

Locus tag: Bcen2424_3543
Symbol:
ID: 4452812
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia cenocepacia HI2424
Kingdom: Bacteria
Replicon accession: NC_008543
Strand:
Start bp: 383275
End bp: 385245
Gene Length: 1971 bp
Protein Length: 656 aa
Translation table: 11
GC content: 68%
IMG OID: 639695602
Product: sulfatase
Protein accession: YP_837175
Protein GI: 116691642
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
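
The coordinates and lengths listed above can be cross-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted as a residue; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 383275
end_bp = 385245

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the result agrees with the listed gene length (1971 bp) and protein length (656 aa).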


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGT CCGCGTCGCC GTTGCATGCT TTTCGTTTCC GTGTCGTCTG CGCCGCGATT 
GCCGGCGCGG TGTCGCTCGC GTCGTGCGGC GGCGTCGACA GCGACGCGCC GCCGTCGCAG
GCCGGCGTCA CGCCGACGCC CACCCCGACC CAGGCCGCGA AACGCCCGAA CATCCTGTAC
ATCATGGCCG ACGATCTCGG CTATTCCGAC ATCCATGCGT TCGGCGGCGA GATCAACACG
CCGAACCTCG ACGCGCTCGT CGCATCGGGC CGCATCCTGT CGAACCATCA CACGGGCACC
GTCTGCGCGA TCACGCGCGC GATGCTGGTG TCCGGCACCG ATCACCATCT CGTCGGCGAA
GGCACGATGG GCGTGCCGAC CGACGAGCGG CGCGGGCTGC CCGGCTACGA GGGCTACCTG
AACGATCGCG CGCTGTCGTT CGCGCAACTG CTGAAGGACG CCGGCTATCA CACGTATATC
GCGGGCAAGT GGCACATCGG CTCGGGCATC GTCGGCAGTG CGACGGGCAG CGGGCAGACG
CCCGACCAGT GGGGCTTCGA GCGCAGCTAC GTGCTGCTCG GCGGCGCGGC GACGAACCAC
TTCGCGCACG AGCCGGCCGG CTCGTCGAAC TACACGGAAG ACGGCCGCTA CGTGCAGCCG
GGCCAGCCCG GGCAGCCGGG CGGCACGGGC GGCAGCCCGG CCGTGTTCTA TTCGACCGAT
TTCTATACGC AGAAGCTGAT CTCGTACATC GATGCGAACA AGCAGGACGG CAAGCCGTTC
TTCGCGTACG CGGCCTACAC GTCGCCGCAC TGGCCGCTGC AGGTGCCCGA TCCGTGGCTG
CACAAGTACG CGGGCGTGTA CGACGCCGGC TACGATGCGA TCCGCAACGC GCGGATCGCG
CGGCAGAAGG CGCTCGGCCT GATTCCGGCC GATTTCAAGC CGTTCGACGG GCTGCCCGAG
ACGACGGTCG CCTCGCCCGC GACCGCGAAC AACGGCACGG CCAGCGCGAA GTACATCAAC
GCGGTGCATT CGGCCGCCGA CGGCTACAGC GACTACGGCC CCGGCAAGGT CGACAAGCTG
TGGTCGAGCC TGTCGCCGGC CGAGCGCAAG GCGCAGGCGC GCTACATGGA GATCTACGCG
GGGATGGTCG AGAACCTCGA CTACAACATC GGCCTGCTGA TCCAGCACCT GAAGGACATC
GGCGAATACG ACAACACGTT CATCATGTTC CAGTCGGACA ACGGCGCGGA AGGCTGGCCG
ATCGATTCCG GCGCCGACCC GACCGCGACC GACACCGCGA ACGCGCAGGA GCCGACCTAT
TCGGCGCTCG GCACCGACAA CGGCAAGCAG AATGCGCAGC GCCTGCAGTA CGGGCTGCGC
TGGGCCGAAG TGAGCGCGGC GCCGTTCCGG CTCACGAAGG GCTATTCGGG CGAAGGCGGC
GTATCGACGC CGACGATCGT GCACCTGCCG GGCCAGTCGC AATCGTTGCC GACGCTGCGC
GCGTTCACGC ACGTGACCGA CAACACGGCG ACGTTCCTCG CGGTCGCGGG CGTCACGCCG
CCGTCGCAGC CGGCGCCGCC GCTGGTGAAC ACGCTGACGG GCGTCGATCA GAACAAGGGC
AAGGTGATCT ACAACAACCG CTATGTGTAT CCGGTCACGG GCCAGTCGCT GCTGCCGGTG
CTCACCGGTT CGGCGACGGG CGAAGTGCAC ACGACGCCGT TCGGCGACGA AGCGTACGGC
CGCGCGTATC TGCGCAGCGC CGACGGCCGC TGGAAGGCAT TGTGGACCGA GCCGCCGCTC
GGCCCGCTCG ACGGTCACTG GCAGCTGTAC GACCTCGCGT CGGATCGCGG CGAGACGACC
GACGTGTCCG CGCAGAACCC GTCGGTGATC GGCACGCTGG TCGACCAGTG GAAGACCTAC
ATGGGCAACG TCGGCGGCGT CGAACCGCTG CGTCCGCGCG GCTACTACTG A
 
Protein sequence
MKKSASPLHA FRFRVVCAAI AGAVSLASCG GVDSDAPPSQ AGVTPTPTPT QAAKRPNILY 
IMADDLGYSD IHAFGGEINT PNLDALVASG RILSNHHTGT VCAITRAMLV SGTDHHLVGE
GTMGVPTDER RGLPGYEGYL NDRALSFAQL LKDAGYHTYI AGKWHIGSGI VGSATGSGQT
PDQWGFERSY VLLGGAATNH FAHEPAGSSN YTEDGRYVQP GQPGQPGGTG GSPAVFYSTD
FYTQKLISYI DANKQDGKPF FAYAAYTSPH WPLQVPDPWL HKYAGVYDAG YDAIRNARIA
RQKALGLIPA DFKPFDGLPE TTVASPATAN NGTASAKYIN AVHSAADGYS DYGPGKVDKL
WSSLSPAERK AQARYMEIYA GMVENLDYNI GLLIQHLKDI GEYDNTFIMF QSDNGAEGWP
IDSGADPTAT DTANAQEPTY SALGTDNGKQ NAQRLQYGLR WAEVSAAPFR LTKGYSGEGG
VSTPTIVHLP GQSQSLPTLR AFTHVTDNTA TFLAVAGVTP PSQPAPPLVN TLTGVDQNKG
KVIYNNRYVY PVTGQSLLPV LTGSATGEVH TTPFGDEAYG RAYLRSADGR WKALWTEPPL
GPLDGHWQLY DLASDRGETT DVSAQNPSVI GTLVDQWKTY MGNVGGVEPL RPRGYY
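
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch of codon-by-codon translation. It is not the method used to produce this record. The helper names `translate` and `gc_fraction` are hypothetical, and the codon table is written out by hand on the assumption that translation table 11 assigns codons to amino acids in the same way as the standard code (it differs in the start codons it permits).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Assumption: table 11 uses the
# same codon-to-amino-acid assignments as the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGAAAAAGT CCGCGTCGCC GTTGCATGCT"
print(translate(first_blocks))  # MKKSASPLHA
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.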