Gene Bcen2424_5072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcen2424_5072 
Symbol 
ID4452956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia cenocepacia HI2424 
KingdomBacteria 
Replicon accessionNC_008543 
Strand
Start bp2096014 
End bp2097564 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content69% 
IMG OID639697128 
Productsulfatase 
Protein accessionYP_838698 
Protein GI116693165 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR03417] choline-sulfatase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.263154 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAACC CGACACCCAA CATCCTGATC CTGATGGCCG ACCAGCTCAC GCCGTTCGCG 
CTGCCGGCGT ACGGCAACCG CGTCGCCCGT ACGCCGACGC TCGACCGGCT CGCCGCCGAA
GGCGTGGTGT TCGACGCCGC GTACTGCGCG AGCCCTTTGT GCGCGCCGTC GCGCTTCTCG
CTGCTGACCG GCAAGCTGCC GTCGGGGATC GGCGCCTACG ATAACGCCGC CGAATTGCCG
GCGCAAACGC TGACGTTCGC ACACTACCTG CGCGCGGGCG GCTACCGGAC GATGCTGTCC
GGCAAGATGC ATTTCTGCGG GCCCGATCAG TTGCACGGCT TCGAGGAGCG CCTCACGACC
GACATCTATC CGGCCGATTT CGGCTGGGTG CCGGACTGGG ATCAACCGAC CGAGCGGCCG
AGCTGGTATC ACAACATGAG CTCGGTGCTC GATGCCGGCC CGTGCGTGCG TACGAACCAG
CTCGACTTCG ACGACGAAGT GACGTTCGCC GCGAAACAGA AGCTGTACGA CGTCGCGCGC
GAACGTGCGG CCGGACACGA TACGCGGCCG TTCTGCATGG TCGTGTCGCT GACCCATCCG
CACGACCCGT ATGCGATCAC GCGCGAATAC TGGGATCTGT ACCGCGACGA AGACATCGAC
ATGCCGGCCG TGCGGCTCGA TGCGGCCGAA AGCGATCCGC ATTCGCAGCG GCTGCGCTTC
GTCTGCGAGA ACGACCGCAC GCCGCCGACC GACGCGCAGA TCCGCGCCGC CCGCCGCGCG
TATTACGGCG CGACGTCCTA CGTCGACACG CAGTTCGGCA GCGTGCTGGC CGCCCTCGAG
CAATGCGGGT TCGCCGACGA CACGATCGTG ATCGTCACGT CCGACCACGG CGACATGCTC
GGCGAACGCG GGCTCTGGTA CAAGATGACG TTCTTCGAAG GCGGCTGCCG CGTGCCGCTG
ATCGTGCATG CGCCGGGCCG CTTCGGCGCT GCGCGGGTGC GCGGGCCCGT GTCGCACGTC
GACCTGCTGC CGACGCTCGT CGAGCTGGCC GGCGCCACGC CCGCCGGCGG CTGGCCGGAC
GCCGGATGGC CGGACCCGGT CGACGGTGCG AGCCTCGTGC CGCACCTGCA CGGCACGCCC
GCGCACGATG TCGCGCTCGG CGAATACCTC GCGGAAGGCG CGCTCGCGCC GGTCGTGATG
ATCCGCCGCG GCGACTGGAA ATACGTGCAT TGCCTGGCCG ATCCCGACCA GCTCTACCAC
CTGTCGGACG ACCCGCGCGA GCTGACGAAC CTGGCCGGGC AGCCGGAAGC CGCCGACGTG
CTCGCCGCGT TCCGCGTGGA GGCCGCACAG CGCTGGAACC TGCCCGAGCT GGACCGGCAG
GTGCGCGCGA GCCAGCGGCG CCGGCGCTTC CATTACGCGG CGACGACGCA GGGCCGCATC
CACGCGTGGG ACTGGCAGCC GTTCACCGAC GCGAGCCAGC GCTACATGCG CAATCACATC
GAACTCGACA CGCTCGAGGC GATGGCGCGT TTTCCGCGCG TCGGGCGCTG A
 
Protein sequence
MTNPTPNILI LMADQLTPFA LPAYGNRVAR TPTLDRLAAE GVVFDAAYCA SPLCAPSRFS 
LLTGKLPSGI GAYDNAAELP AQTLTFAHYL RAGGYRTMLS GKMHFCGPDQ LHGFEERLTT
DIYPADFGWV PDWDQPTERP SWYHNMSSVL DAGPCVRTNQ LDFDDEVTFA AKQKLYDVAR
ERAAGHDTRP FCMVVSLTHP HDPYAITREY WDLYRDEDID MPAVRLDAAE SDPHSQRLRF
VCENDRTPPT DAQIRAARRA YYGATSYVDT QFGSVLAALE QCGFADDTIV IVTSDHGDML
GERGLWYKMT FFEGGCRVPL IVHAPGRFGA ARVRGPVSHV DLLPTLVELA GATPAGGWPD
AGWPDPVDGA SLVPHLHGTP AHDVALGEYL AEGALAPVVM IRRGDWKYVH CLADPDQLYH
LSDDPRELTN LAGQPEAADV LAAFRVEAAQ RWNLPELDRQ VRASQRRRRF HYAATTQGRI
HAWDWQPFTD ASQRYMRNHI ELDTLEAMAR FPRVGR