Gene Bcav_4108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcav_4108 
Symbol 
ID7861494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeutenbergia cavernae DSM 12333 
KingdomBacteria 
Replicon accessionNC_012669 
Strand
Start bp4537666 
End bp4539936 
Gene Length2271 bp 
Protein Length756 aa 
Translation table11 
GC content69% 
IMG OID643868211 
Productsulfatase 
Protein accessionYP_002884111 
Protein GI229822585 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGTCCG AGGCCGCCTG GCCGCGGATC CCGACGGCGC CCGAGGGCGC CCCGAACATC 
GTCGTCATCC TGCTCGACGA CGTCGGCTTC GGTCAGACGT CGACGTTCGG CGGGCTGATC
CCGACCCCGA ACCTGGACAA GCTCGCGAGC GAGGGGCTGC GCTACAACCG CTTCCACACC
ACGGCGATCT GCGGCCCGTC ACGGGCCGCG CTGCTGACCG GGCGCAACCA CCACGACACC
GGCAACGGGT TCCTCATGGA GTGGGCCACG GGGTTCCCGA GCTACACGAC CATGATCCCG
AGGACGACCG CGACGGTGGC CGAGGTCCTC AAGGACAACG GCTACTCGAC CTGGTGGTTC
GGGAAGAACC ACAACACGCC TGACTGGGAG ACGAGCGTCG CCGGACCGTT CGACCGGTGG
CCGACGGGGA TGGGTTTCGA GTACTTCTAC GGCTTCAACG CGGGCGAGAC CCACCAGTAC
TACCCGGTGC TGTTCGAGAA CACGACTCCG GTGGAACCGG ACAAGGGGCC CGACGAGGGC
TACCACTTCA TGACCGACAT GACCGACCGG GCGATCTCAC GCATGCGGTA CGCGACGTCG
GTCGCGCCGG ACAAGCCGTT CTTCATGTAC TTCGCACCGG GTGCGATGCA CGCGCCGCAC
CACGTCACGA AGGAGTGGCG CGACCGCTTC ACGGGCGCGT TCGACATGGG GTGGGAGAAG
TACCGCGACG TGGTGTTCGC GAACCAGAAG AGGATGGGCA TCGTCCCGCC CGACGCCGAG
CTGACTCCGC GTCCCGACTG GGTGGCGGAG TGGGACTCGC TCAGCGAGCA GCAGAAGAGG
GTCTACTGCG CGCTGTTCGA GAACTATGCC GGGTACTTCG CGTTCACGGA CCACGAGGTG
GGGCGGCTCC TCGACGCGAT CAAGGAGCTG CCCGACGCCG AGAACACCCT CGTGCTCTAC
ATCGTGGGCG ACAACGGGGC GTCGTCGGAG GGCGGTCCCG ATGGCACCCT CGACGAGATC
AAGAACCTGA GCGGGATCCT GCCGTCGATC GAGGAGATCC TGGCGGACCT CGACAAGCTC
GGCGGACCCG AGACGGAGCC GCACTACCCG CTCGGCTGGG CGTGGGCGGG CAACACCCCG
TTCCAGTGGG TGAAGCAGGT CGCCTCGCAC CTCGGCGGCT CGCGCAACCC GATGGTGGTG
AGCTGGCCCG CTCGGGTGTC CCACGACCCG GTGCCCCGCG ACCCGTTCCT GCACCTCGTC
GACGTCGCGC CGACGCTGTA CGAGGCCGCC GGCGTCACGA TGCCGGACAC GGTGAACGGG
ATCGAGCAGA TGCCCCTCGC GGGCCGGTCG TTCCTCCCGA GCCTGACCGA CCCGGGATTC
GAGGGCCGCG GCGAGCAGTA CTTCGAGATC CTCAGCAACC GATCGATCTA CTCCGACGGC
TGGAAGGCCA ACGCTCAGCA CACGCTGCCG TGGCGGCAGG ACATCGCACC CGGTAATTGG
GACCAGGACC GCTGGGAGCT GTACCACCTG GAGCAGGACT TCTCGGAGGC GAAGGACCTG
GCCGAGGCGA TGCCCGAGAA GCTCGAGGAG ATGAAGCACA GGTTCGACGA GGCCGCGGAG
AAGTACCACG TCTACCCGCT GGACGATCGC GGAGTCGCTC GCGCGCTGAT CCCGAAGCCG
ACGGCGCCCG GGTCGGACCC TGAGGCGCTC GACTTCACGT TCTACGCCGG CGCCACGCGC
CTGCCCGAGA CGGCGGCGCC GTCGATGAAG AACCGCAGCT GGAAGCTCGC CGCACGTGTC
ACGATGGAGG GCGCGGCGAC GAACGGCGTC ATCATGGCGA TCGGCGGCGT CGCGGGCGGG
ATGTCGCTGT ACCTGAAGGA CGGCGTACCG ATCTTCGACT ACAACTACTA CGGCGAGCAC
ACGACGGTGC GGGCGGCGCA ACCGCTGCCG GCCGGTGACG CCGTAGTCGG CGTGGAGTTC
GCCTATGACG GCGGCGGCAT CGGCAAGGGG GCGGACGTCA CGCTCACCGT CGACGGCGCA
CCGGTCGGAT CCGGGCGCGC CGAGAGGACG GTGTTCGCAC GGTTCGGCGT CGACACGTTC
GGCATCGGCG AGGACACCGG CCAGCCGGTG ACGACCGACT ACCGTCCGCC GTTCCGGTTC
ACCGGGACGA TCGACCGCGT CGACATCCAG CTGGAGCCGG TTGAGCTCGC GCCGCACGAC
GCCGCGAAGG TCCACGAGAC GGAGCTCCGG GCGGTGCAGC GGCGGGAGTA G
 
Protein sequence
MESEAAWPRI PTAPEGAPNI VVILLDDVGF GQTSTFGGLI PTPNLDKLAS EGLRYNRFHT 
TAICGPSRAA LLTGRNHHDT GNGFLMEWAT GFPSYTTMIP RTTATVAEVL KDNGYSTWWF
GKNHNTPDWE TSVAGPFDRW PTGMGFEYFY GFNAGETHQY YPVLFENTTP VEPDKGPDEG
YHFMTDMTDR AISRMRYATS VAPDKPFFMY FAPGAMHAPH HVTKEWRDRF TGAFDMGWEK
YRDVVFANQK RMGIVPPDAE LTPRPDWVAE WDSLSEQQKR VYCALFENYA GYFAFTDHEV
GRLLDAIKEL PDAENTLVLY IVGDNGASSE GGPDGTLDEI KNLSGILPSI EEILADLDKL
GGPETEPHYP LGWAWAGNTP FQWVKQVASH LGGSRNPMVV SWPARVSHDP VPRDPFLHLV
DVAPTLYEAA GVTMPDTVNG IEQMPLAGRS FLPSLTDPGF EGRGEQYFEI LSNRSIYSDG
WKANAQHTLP WRQDIAPGNW DQDRWELYHL EQDFSEAKDL AEAMPEKLEE MKHRFDEAAE
KYHVYPLDDR GVARALIPKP TAPGSDPEAL DFTFYAGATR LPETAAPSMK NRSWKLAARV
TMEGAATNGV IMAIGGVAGG MSLYLKDGVP IFDYNYYGEH TTVRAAQPLP AGDAVVGVEF
AYDGGGIGKG ADVTLTVDGA PVGSGRAERT VFARFGVDTF GIGEDTGQPV TTDYRPPFRF
TGTIDRVDIQ LEPVELAPHD AAKVHETELR AVQRRE