Gene Bcep18194_B2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_B2021 
Symbol 
ID3753786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007511 
Strand
Start bp2316211 
End bp2318109 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content69% 
IMG OID637766869 
Productsulfatase 
Protein accessionYP_372778 
Protein GI78062870 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.46218 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.427112 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCC CCGGCACGAG CCGGCACGAT CAGGAGACAG CCGACGTGAA AGACGATCAG 
GACACCCCCG CCGCCGCCGA ACCCACCCCG CGGCGCCAGT TCCTCAAGCT GGCCGGCGCG
GCGGTCGCGG CGGCCGGCTT CGGCACCGAT GCGCTGGCCG CGAGCACGCC GCCGGCCGCC
CCGGTTGCCG CCGGCGCGGG CGTCGATCCC GTGCCCGCAT TCCACGGCGC GACGACCGCG
CCCGCCGCGC CGCCCGCCGG CTACAACATC CTGTTCATCC TGACCGACCA GGAACGCCAC
TTCGACCGCT GGCCGTTTCC GGTGCCCGGC CGCGAAGCGC TGCGCCGCGA CGGCATCACG
TTCATGCATC ACCAGATCGC CGCATGCGTG TGCTCGCCGT CGCGCTCGAC GGTCTACACG
GGGCAGCACA TCCAGCACAC CGGCGTGCTC GACAACGCGG GCGTGCCCTG GCAGAAGGAC
ATGTCGCCGG ACGTGCGCAC CGTCGGCCAC ATGCTGCGCG ACGCCGGCTA CTACGCCGCG
TATCTCGGCA AGTGGCACCT GAGCGCGTCG ATGCACGAAA CCGCAAGCCC GTACACGGCG
CCGGTGGCCG ACTACAACCG CACGATCCGG TCGTACGGTT TCGACGACTA TTTCGGCGTG
GGCGACCTGA TCGGGATGGT GCGCGGCGGC TACCAGTACG ACGGGATCAC GGCCGAGGCC
GCGGTGAGCT GGATGCGCAA CCACGCGCCG CGTCTCGCGA AGGAAGGCAA GCCGTGGTTC
CTCGCGGTGA ATCTGGTGAA CCCGCACGAC GCGATGTTCG TGAACACCGA CACCAACGGC
TCGACGGTGC AGGACGCGAA CCACCCGATG CTCGGCAACG CGCCGCCGCC GAACGACGCG
CTGTATCGCA CGTCGTGGCA CGACAAGCCG CTCGCGGCAT CGCGGCGGCA GCCGTACGAC
GAACCGGGAC GGCCGCCCGC GCACGGGATG TTCAACGCCG CGCATGCGAA CCTCGTCGGG
CGCTATCCGT TCACCGACGA ACGCCTGCGC ATCTATCAGG ACAACTACTT CAACTGCGTG
CGCGACTGCG ACACGCACGT CGTGCGCCTG CTGCAGTCGC TGCAGGCGCT CGGCCTCGAC
GAACGCACGA TCGTCGTGAT GACGGCCGAC CACGGCGATC ACATCGGCGC GCACCAGCTC
GTCGGCAAGG GCGCCACCGC GTACCAGCCG CAGAACCACG TGCCGCTCGT GATCCGCCAT
CCCGCGTATC CGGGCGGCAT GCAGTGCGAT GCGCTGACAT CGCACATCGA CATCGCGCCG
ACGCTGCTCG GGCTCACCGG TCTCGACGAC GCGCGTCTCG CGTCGATCCG CGGCAGCGCA
CTCAAGGGGC ACGACCTCAC GCGCTGGCTC GCGAAGCCGG CCGGGGCGAA GCTGCACGCG
GCGCGTGATG CCACGCTGTT CAACTACGCG ATGCTGCTCT ACTACGACAG CGAATGGATG
CTGAAGGAAC TGGGCACGAT GCGCCAGAAA GGCGTGCCGG AGGACGAGCT GCTGCGCCGC
GCGCTCGCGC AGCAGCCGGA TTTCCGGTTG CGCGGCACGA TCCGCAGCGT GTTCGACGGC
CGCTACCGGT TCACGCGCTA TTTCTCGCCG CTCGAATTCA ACCGGCCGAC GACGATGGAG
GACCTGTTCG CGCGCAACGA CGTCGAGCTG TTCGATATCG CGAGCGATCC GGGCGAGATG
CGCAATCTCG CGATGGACCG GAAACAGCAC GGCGAGCTGC TGCTCGCGAT GAACGGCCGC
CTGAACGACC TGATCGCGAG CGAAGTCGGC GACGACAGCC CCGACGTCAT GCCGATCCGC
GACGGCAAGG TGCAGGTGCA GATCCGCAAG TGGCATTGA
 
Protein sequence
MKTPGTSRHD QETADVKDDQ DTPAAAEPTP RRQFLKLAGA AVAAAGFGTD ALAASTPPAA 
PVAAGAGVDP VPAFHGATTA PAAPPAGYNI LFILTDQERH FDRWPFPVPG REALRRDGIT
FMHHQIAACV CSPSRSTVYT GQHIQHTGVL DNAGVPWQKD MSPDVRTVGH MLRDAGYYAA
YLGKWHLSAS MHETASPYTA PVADYNRTIR SYGFDDYFGV GDLIGMVRGG YQYDGITAEA
AVSWMRNHAP RLAKEGKPWF LAVNLVNPHD AMFVNTDTNG STVQDANHPM LGNAPPPNDA
LYRTSWHDKP LAASRRQPYD EPGRPPAHGM FNAAHANLVG RYPFTDERLR IYQDNYFNCV
RDCDTHVVRL LQSLQALGLD ERTIVVMTAD HGDHIGAHQL VGKGATAYQP QNHVPLVIRH
PAYPGGMQCD ALTSHIDIAP TLLGLTGLDD ARLASIRGSA LKGHDLTRWL AKPAGAKLHA
ARDATLFNYA MLLYYDSEWM LKELGTMRQK GVPEDELLRR ALAQQPDFRL RGTIRSVFDG
RYRFTRYFSP LEFNRPTTME DLFARNDVEL FDIASDPGEM RNLAMDRKQH GELLLAMNGR
LNDLIASEVG DDSPDVMPIR DGKVQVQIRK WH