Gene Bcenmc03_5213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcenmc03_5213 
Symbol 
ID6128024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia cenocepacia MC0-3 
KingdomBacteria 
Replicon accessionNC_010515 
Strand
Start bp2311767 
End bp2313317 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content70% 
IMG OID641652304 
Productcholine-sulfatase 
Protein accessionYP_001778832 
Protein GI170737572 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR03417] choline-sulfatase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAACC CGACACCCAA CATCCTGATC CTGATGGCCG ACCAGCTCAC GCCGTTTGCG 
CTGCCGGCGT ACGGCAACCG CGTCGCCCGT ACGCCGACGC TCGACCGGCT CGCCGCCGAA
GGCGTGGTGT TCGACGCCGC GTACTGCGCG AGCCCTTTGT GCGCGCCGTC GCGCTTCTCG
CTGCTGACCG GCAAGCTGCC GTCGGGAATC GGCGCCTACG ATAACGCCGC CGAATTGCCG
GCGCAAACGC TGACGTTCGC GCACTACCTG CGCGCGGGCG GCTACAGGAC GATGCTGTCC
GGCAAGATGC ATTTCTGCGG ACCCGACCAG TTGCACGGCT TCGAGGAGCG CCTCACGACC
GACATCTATC CGGCCGATTT CGGCTGGGTG CCGGACTGGG ACCAGCCGAC CGAGCGGCCG
AGCTGGTATC ACAACATGAG CTCGGTGCTC GATGCCGGCC CGTGCGTGCG CACGAACCAG
CTCGACTTCG ACGACGAAGT GACGTTCGCC GCGAAGCAGA AGCTGTACGA CGTCGCGCGC
GAGCGCGCGG CCGGGCACGA TGCGCGGCCG TTCTGCATGG TCGTATCGCT GACCCATCCG
CACGACCCGT ATGCGATCAC GCGCGAATAC TGGGATCTGT ACCGCGACGA AGACATCGAC
ATGCCGGCCG TGCGGCTCGA TGCGGCCGAA AGCGATCCGC ATTCGCAGCG GCTGCGCTTC
GTCTGCGAGA ACGACCGCAC GCCGCCGACC GATGCGCAGA TCCGCGCGGC CCGCCGCGCG
TATTACGGTG CGACGTCCTA CGTCGACACG CAGTTCGGCA GCGTGCTGGC CGCGCTCGAG
CAATGCGGGT TCGCCGACGA CACGATCGTG ATCGTCACGT CCGACCACGG CGACATGCTC
GGCGAACGCG GGCTCTGGTA CAAGATGACG TTCTTCGAAG GCGGCTGCCG CGTGCCGCTG
ATCGTGCATG CGCCGGGCCG CTTCGGCGCT GCGCGGGTGC GCGGGCCCGT GTCGCACGTC
GACCTGCTGC CGACGCTCGT CGAGCTGGCC GGCGCGGCGC CCGCCGGCGG CTGGCCGGAC
GCCGGATGGC CGGACCCGGT CGACGGCACG AGCCTCGTGC CGCACCTGCA CGGCACGCCC
GCGCACGATG TCGCGCTCGG CGAATACCTC GCGGAAGGCG CGCTCGCGCC GGTCGTGATG
ATCCGCCGCG GCGACTGGAA ATACGTGCAT TGCCCGGCCG ATCCCGATCA GCTCTACCAC
CTGTCGGACG ACCCGCGCGA GCTGACGAAC CTGGCCGGGC AGCCGGAAGC CGCCGACGTG
CTCGCCGCGT TTCGCGCGGA GGCCGCGCAG CGCTGGAACC TGCCCGAACT GGACCGGCAG
GTGCGCGCGA GCCAGCGGCG CCGGCGCTTC CATTACGCGG CGACGACGCA GGGCCGCATC
CACGCGTGGG ACTGGCAGCC GTTCACCGAC GCGAGCCAGC GCTACATGCG CAATCACATC
GAACTCGACG CGCTCGAGGC GATGGCGCGT TTTCCGCGCG TCGGGCGCTG A
 
Protein sequence
MTNPTPNILI LMADQLTPFA LPAYGNRVAR TPTLDRLAAE GVVFDAAYCA SPLCAPSRFS 
LLTGKLPSGI GAYDNAAELP AQTLTFAHYL RAGGYRTMLS GKMHFCGPDQ LHGFEERLTT
DIYPADFGWV PDWDQPTERP SWYHNMSSVL DAGPCVRTNQ LDFDDEVTFA AKQKLYDVAR
ERAAGHDARP FCMVVSLTHP HDPYAITREY WDLYRDEDID MPAVRLDAAE SDPHSQRLRF
VCENDRTPPT DAQIRAARRA YYGATSYVDT QFGSVLAALE QCGFADDTIV IVTSDHGDML
GERGLWYKMT FFEGGCRVPL IVHAPGRFGA ARVRGPVSHV DLLPTLVELA GAAPAGGWPD
AGWPDPVDGT SLVPHLHGTP AHDVALGEYL AEGALAPVVM IRRGDWKYVH CPADPDQLYH
LSDDPRELTN LAGQPEAADV LAAFRAEAAQ RWNLPELDRQ VRASQRRRRF HYAATTQGRI
HAWDWQPFTD ASQRYMRNHI ELDALEAMAR FPRVGR