Gene Bcep18194_B0587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_B0587 
Symbol 
ID3752351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007511 
Strand
Start bp658337 
End bp659872 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content68% 
IMG OID637765435 
Productsulfatase 
Protein accessionYP_371345 
Protein GI78061437 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR03417] choline-sulfatase 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAACC CGACACCCAA CATCCTGATC CTGATGGCCG ACCAGCTCAC GCCGTTCGCG 
CTGCCGGCCT ATGGCAATCG CGTCGCGCGC ACGCCGACGC TGGACCGTCT CGCCGCGCAA
GGCGTCGTAT TCGACGCCGC ATACTGCGCG AGCCCGCTGT GCGCGCCGTC GCGTTTCTCG
CTGCTGACCG GCAAGCTGCC GTCGGGGATC GGCGCCTACG ATAACGCCGC CGAATTGCCG
GCGCAAACAT TGACGTTCGC ACACTACCTG CGCGCGGGCG GCTACCGGAC GATGCTGTCC
GGCAAGATGC ATTTCTGCGG GCCCGACCAG TTGCACGGCT TCGAGGAGCG GCTGACGACC
GACATCTATC CGGCCGATTT CGGCTGGGTG CCCGACTGGG ACAGCCCGAC CGAGCGGCCG
AGCTGGTATC ACAACATGAG TTCGGTGCTG GAGGCCGGCC CGTGCGTGCG CACGAACCAG
CTCGACTTCG ACGACGAAGT CACGTTTGCC GCGAAGCAGA AGCTGTACGA CGTCGCGCGC
GAGCGTGCGG CCGGGCACGA TGCGCGGCCG TTCTGCATGG TCGTGTCGCT GACTCATCCG
CACGACCCGT ATGCGATCAC GCGCGAATAC TGGGATCAAT ACAGCGACGA CGAGATCGAC
ATGCCGGCCG TGCACCTCGA TGCGGCGGAA AGCGACCCGC ATTCGCAGCG GCTGCGCTTC
GTCTGCGAAA ACGACCGCAC GCCGCCCACC GACGCGCAGA TTCGCGCCGC GCGCCGTGCC
TACTACGGTG CGACGTCCTA CGTCGACGCG CAGTTCGGCA GCGTGCTGGG CGCGCTCGAA
CAGTGCGGAT TCGCCGACGA TACGATCGTG ATCGTCACGT CCGACCACGG CGACATGCTC
GGGGAGCGCG GGCTCTGGTA CAAGATGACG TTCTTCGAAG GCGGCTGCCG CGTGCCGCTG
ATCGTGCATG CGCCGGGCCG CTTCGATGCC GCGCGTGTGC GCGGGCCTGT GTCGCACGTC
GACCTGCTGC CGACGCTCGT CGATCTCGCC GGCGCCGCAC CGGCCGGCGG CTGGCCTGAC
CCGGTCGATG GCGCGAGTCT CGTGCCGCAC CTGCACGGCA CGCCGGCGCA CGACGTCGCG
CTCGGCGAAT ACCTTGCGGA AGGCGCAGTT GCACCGGTCG TGATGATCCG CCGCGGCGAC
TGGAAGTACG TCCATTGCCC GGCCGATCCC GACCAGCTCT ACAACCTCTC CGACGACCCG
CGCGAGCTCA CGAACCTCGC CGACACGCCG GAAGCGGCCG ACGTGCTCGC TACGTTCCGC
GCGCAAGCCG CGCAGCGCTG GAACCTGCCC GAGCTGGACC GGCAGGTGCG CGCGAGCCAG
CGGCGCCGGC GCTTCCATTA CGCGGCGACG ACGCAGGGCC GCATCCAGGC GTGGGACTGG
CAGCCGTTCA CCGACGCGAG CCAGCGCTAC ATGCGCAATC ACATCGAACT CGACACGCTC
GAGGCGATGG CGCGTTTTCC GCGCGTCGGG CGCTGA
 
Protein sequence
MTNPTPNILI LMADQLTPFA LPAYGNRVAR TPTLDRLAAQ GVVFDAAYCA SPLCAPSRFS 
LLTGKLPSGI GAYDNAAELP AQTLTFAHYL RAGGYRTMLS GKMHFCGPDQ LHGFEERLTT
DIYPADFGWV PDWDSPTERP SWYHNMSSVL EAGPCVRTNQ LDFDDEVTFA AKQKLYDVAR
ERAAGHDARP FCMVVSLTHP HDPYAITREY WDQYSDDEID MPAVHLDAAE SDPHSQRLRF
VCENDRTPPT DAQIRAARRA YYGATSYVDA QFGSVLGALE QCGFADDTIV IVTSDHGDML
GERGLWYKMT FFEGGCRVPL IVHAPGRFDA ARVRGPVSHV DLLPTLVDLA GAAPAGGWPD
PVDGASLVPH LHGTPAHDVA LGEYLAEGAV APVVMIRRGD WKYVHCPADP DQLYNLSDDP
RELTNLADTP EAADVLATFR AQAAQRWNLP ELDRQVRASQ RRRRFHYAAT TQGRIQAWDW
QPFTDASQRY MRNHIELDTL EAMARFPRVG R