Gene Bcen_3296 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcen_3296 
Symbol 
ID4093831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia cenocepacia AU 1054 
KingdomBacteria 
Replicon accessionNC_008061 
Strand
Start bp382099 
End bp383649 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content69% 
IMG OID638016595 
Productsulfatase 
Protein accessionYP_623164 
Protein GI107025653 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR03417] choline-sulfatase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.787928 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAACC CGACACCCAA CATCCTGATC CTGATGGCCG ACCAGCTCAC GCCGTTCGCG 
CTGCCGGCGT ACGGCAACCG CGTCGCCCGT ACGCCGACGC TCGACCGGCT CGCCGCCGAA
GGCGTGGTGT TCGACGCCGC GTACTGCGCG AGCCCTTTGT GCGCGCCGTC GCGCTTCTCG
CTGCTGACCG GCAAGCTGCC GTCGGGGATC GGCGCCTACG ATAACGCCGC CGAATTGCCG
GCGCAAACGC TGACGTTCGC ACACTACCTG CGCGCGGGCG GCTACCGGAC GATGCTGTCC
GGCAAGATGC ATTTCTGCGG GCCCGATCAG TTGCACGGCT TCGAGGAGCG CCTCACGACC
GACATCTATC CGGCCGATTT CGGCTGGGTG CCGGACTGGG ATCAACCGAC CGAGCGGCCG
AGCTGGTATC ACAACATGAG CTCGGTGCTC GATGCCGGCC CGTGCGTGCG TACGAACCAG
CTCGACTTCG ACGACGAAGT GACGTTCGCC GCGAAACAGA AGCTGTACGA CGTCGCGCGC
GAACGTGCGG CCGGACACGA TACGCGGCCG TTCTGCATGG TCGTGTCGCT GACCCATCCG
CACGACCCGT ATGCGATCAC GCGCGAATAC TGGGATCTGT ACCGCGACGA AGACATCGAC
ATGCCGGCCG TGCGGCTCGA TGCGGCCGAA AGCGATCCGC ATTCGCAGCG GCTGCGCTTC
GTCTGCGAGA ACGACCGCAC GCCGCCGACC GACGCGCAGA TCCGCGCCGC CCGCCGCGCG
TATTACGGCG CGACGTCCTA CGTCGACACG CAGTTCGGCA GCGTGCTGGC CGCCCTCGAG
CAATGCGGGT TCGCCGACGA CACGATCGTG ATCGTCACGT CCGACCACGG CGACATGCTC
GGCGAACGCG GGCTCTGGTA CAAGATGACG TTCTTCGAAG GCGGCTGCCG CGTGCCGCTG
ATCGTGCATG CGCCGGGCCG CTTCGGCGCT GCGCGGGTGC GCGGGCCCGT GTCGCACGTC
GACCTGCTGC CGACGCTCGT CGAGCTGGCC GGCGCCACGC CCGCCGGCGG CTGGCCGGAC
GCCGGATGGC CGGACCCGGT CGACGGTGCG AGCCTCGTGC CGCACCTGCA CGGCACGCCC
GCGCACGATG TCGCGCTCGG CGAATACCTC GCGGAAGGCG CGCTCGCGCC GGTCGTGATG
ATCCGCCGCG GCGACTGGAA ATACGTGCAT TGCCTGGCCG ATCCCGACCA GCTCTACCAC
CTGTCGGACG ACCCGCGCGA GCTGACGAAC CTGGCCGGGC AGCCGGAAGC CGCCGACGTG
CTCGCCGCGT TCCGCGTGGA GGCCGCACAG CGCTGGAACC TGCCCGAGCT GGACCGGCAG
GTGCGCGCGA GCCAGCGGCG CCGGCGCTTC CATTACGCGG CGACGACGCA GGGCCGCATC
CACGCGTGGG ACTGGCAGCC GTTCACCGAC GCGAGCCAGC GCTACATGCG CAATCACATC
GAACTCGACA CGCTCGAGGC GATGGCGCGT TTTCCGCGCG TCGGGCGCTG A
 
Protein sequence
MTNPTPNILI LMADQLTPFA LPAYGNRVAR TPTLDRLAAE GVVFDAAYCA SPLCAPSRFS 
LLTGKLPSGI GAYDNAAELP AQTLTFAHYL RAGGYRTMLS GKMHFCGPDQ LHGFEERLTT
DIYPADFGWV PDWDQPTERP SWYHNMSSVL DAGPCVRTNQ LDFDDEVTFA AKQKLYDVAR
ERAAGHDTRP FCMVVSLTHP HDPYAITREY WDLYRDEDID MPAVRLDAAE SDPHSQRLRF
VCENDRTPPT DAQIRAARRA YYGATSYVDT QFGSVLAALE QCGFADDTIV IVTSDHGDML
GERGLWYKMT FFEGGCRVPL IVHAPGRFGA ARVRGPVSHV DLLPTLVELA GATPAGGWPD
AGWPDPVDGA SLVPHLHGTP AHDVALGEYL AEGALAPVVM IRRGDWKYVH CLADPDQLYH
LSDDPRELTN LAGQPEAADV LAAFRVEAAQ RWNLPELDRQ VRASQRRRRF HYAATTQGRI
HAWDWQPFTD ASQRYMRNHI ELDTLEAMAR FPRVGR