Gene Bcep18194_B2584 details

Gene Information

Locus tag: Bcep18194_B2584
Symbol:
ID: 3754351
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007511
Strand:
Start bp: 2938524
End bp: 2940482
Gene Length: 1959 bp
Protein Length: 652 aa
Translation table: 11
GC content: 67%
IMG OID: 637767432
Product: sulfatase
Protein accession: YP_373339
Protein GI: 78063431
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
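
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(2938524, 2940482)
print(length_bp)                  # 1959
print(protein_length(length_bp))  # 652
```

Both values agree with the Gene Length and Protein Length fields of this record.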


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGT CCGCGTCGCC GTTACATGCT TTTCGTTTCC GTGTCGTCTG CGCCGCGATT 
GCCGGCGCGC TGTCGCTCGC ATCGTGCGGC GGCGTCGACA GCGATCCGCC GCCGTCACCC
ACCAGCACCA CGCCGCCGCT GGCTCAGAAG CGCCCGAACA TCCTGTACAT CATGGCCGAC
GATCTCGGCT ATTCCGACAT CCATGCATTC GGCGGCGAGA TCAACACGCC GAACCTCGAC
GCGCTCGTCG CGTCGGGCCG CATCCTGTCG AACCATCACA CGGGCACCGT CTGCGCGATC
ACGCGCGCGA TGCTGGTGTC CGGCACCGAC CACCATCTCG TCGGCGAAGG CACGATGGGC
GTGCCGACCG ACGAACGGCG CGGGCTGCCC GGCTACGAGG GCTACCTGAA CGATCGTGCA
TTGTCGTTCG CACAACTATT GAAGGATGCC GGCTATCACA CGTACATCGC GGGCAAGTGG
CACATCGGCT CGGGGATCGT CGGCAGCGCG ACGGGCAGCG GGCAGACGCC GGACCAGTGG
GGCTTCGAGC GCAGCTACGT GCTGCTCGGC GGCGCGGCGA CGAACCACTT CGCGCACGAG
CCGGCCGGCT CGTCGAACTA CACGGAAGAC GGCCGCTACG TGCAGCCTGG CCAGCCCGGA
CAGCCGGGCG GCACGGGCGG CAGCCCGGCC GTGTTCTATT CGACCGACTT CTATACGCAG
AAGCTGATTT CGTACATCGA CTCGAACCAG CGCGACGGCA AGCCGTTCTT CGCGTACGCG
GCCTTCACGT CGCCGCACTG GCCGCTGCAG GTACCCGATC CGTGGCTGCA CAAGTACGCG
GGCGTGTACG ACGCCGGCTA CGACGCGATC CGCAACGCGC GGATCGCGCG GCAGAAGGCG
CTCGGCCTGA TCCCCGCCGA TTTCAAGCCG TTCGACGGCC TGCCTGAAAC GACGGCCGCG
TCGCCCGCGA CCGCGAACAA CGGCACGGCC GCCGCGAAAT ACATCAGCGC GGTGCATTCG
GCCGCGGACG GCTACAGCGA CTACGGCCCC GGCAAGGTCG ACAAGCTGTG GTCGAGCCTG
ACGCCGGCCG AGCGCAAGGC GCAGGCGCGC TACATGGAGA TCTATGCGGG GATGGTCGAG
AACCTCGACT ACAACATCGG CCTGCTGATC CAGCACCTGA AGGACATCGG CGAATACGAC
AACACGTTCA TCATGTTCCA GTCGGACAAC GGCGCGGAAG GCTGGCCGAT CGACGGCGGC
GCCGACCCGA CGGCGACCGA CACCGCGAAC GGGCAGGACC CGATCTATTC GACGCTCGGC
ACCGACAACG GCAAGCAGAA CGCGCAGCGC CTGCAGTACG GGTTGCGCTG GGCCGAAGTG
AGCGCGTCGC CGTTCCGGCT CACGAAGGGG TATTCCGCTG AAGGCGGCGT ATCGACGCCG
ACGATCGTCC GCCTGCCGGG CCAGACGCAG CAGTTGCCGA CGCTGCGCGC CTTCACGCAC
GTGACCGACA ACACGGCGAC GTTCCTCGCG GTCGCGGGCG TCACGCCGCC GTCGCAGCCG
GCGCCGCCGC TCGTCAACAC ACTGACGGGT GTCGACCAGA ACAAGGGCAA GGTCGTCTAC
AACAACCGCT ACGTGTATCC GGTCACCGGC CAGTCGTTGC TGCCGGTGCT GACGGGCACG
GCGACGGGCG AAGTGCATAC CGCGCCGTTC GGCGACGAAG CCTACGGCCG TGCATACCTG
CGCAGCGCCG ACGGCCGCTG GAAGGCCTTG TGGACCGAGC CGCCGCTCGG GCCGCCCGAC
GGTCACTGGC AGCTGTACGA CCTCGCGTCG GATCGCGGCG AGACGACCGA CGTGTCCGCG
CAGAACCCGT CGGTGATCAG CACGCTCGTC GACCAGTGGA AGACCTACAT GAGCAACGTC
GGCGGTGTCG AACCGCTGCG TCCGCGCGGC TACTACTGA
 
Protein sequence
MKKSASPLHA FRFRVVCAAI AGALSLASCG GVDSDPPPSP TSTTPPLAQK RPNILYIMAD 
DLGYSDIHAF GGEINTPNLD ALVASGRILS NHHTGTVCAI TRAMLVSGTD HHLVGEGTMG
VPTDERRGLP GYEGYLNDRA LSFAQLLKDA GYHTYIAGKW HIGSGIVGSA TGSGQTPDQW
GFERSYVLLG GAATNHFAHE PAGSSNYTED GRYVQPGQPG QPGGTGGSPA VFYSTDFYTQ
KLISYIDSNQ RDGKPFFAYA AFTSPHWPLQ VPDPWLHKYA GVYDAGYDAI RNARIARQKA
LGLIPADFKP FDGLPETTAA SPATANNGTA AAKYISAVHS AADGYSDYGP GKVDKLWSSL
TPAERKAQAR YMEIYAGMVE NLDYNIGLLI QHLKDIGEYD NTFIMFQSDN GAEGWPIDGG
ADPTATDTAN GQDPIYSTLG TDNGKQNAQR LQYGLRWAEV SASPFRLTKG YSAEGGVSTP
TIVRLPGQTQ QLPTLRAFTH VTDNTATFLA VAGVTPPSQP APPLVNTLTG VDQNKGKVVY
NNRYVYPVTG QSLLPVLTGT ATGEVHTAPF GDEAYGRAYL RSADGRWKAL WTEPPLGPPD
GHWQLYDLAS DRGETTDVSA QNPSVISTLV DQWKTYMSNV GGVEPLRPRG YY
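
The protein sequence can be recomputed from the gene sequence, and the GC content from the base counts. The sketch below is one way to do this in plain Python, not the method used by the source database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the sequence is given in the coding orientation with only A, C, G and T. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-base grouping."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAAAAGT CCGCGTCGCC GTTACATGCT"))  # MKKSASPLHA
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the result to be compared against the Protein sequence and GC content fields.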