Gene Bcav_4134 details

Gene Information

Locus tag: Bcav_4134
Symbol:
ID: 7858062
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beutenbergia cavernae DSM 12333
Kingdom: Bacteria
Replicon accession: NC_012669
Strand:
Start bp: 4566234
End bp: 4568600
Gene Length: 2367 bp
Protein Length: 788 aa
Translation table: 11
GC content: 70%
IMG OID: 643868236
Product: sulfatase
Protein accession: YP_002884136
Protein GI: 229822610
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
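
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so, but that reading reproduces the listed gene length) and that the CDS ends in a single stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
START_BP = 4566234
END_BP = 4568600

length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 2367 788
```

Both results agree with the Gene Length (2367 bp) and Protein Length (788 aa) fields.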


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCCCG ACCGGCACGC CCGCACGATG CTCCCGATCC CCGACCGACC GGCGGCGGGA 
CTCACGACCT ACGACGCCAA GGACCCGGAC ACGTCCTACC CGCCGATCGA GCCGCTCCTC
CCGCCCGACG GCGCCCCGAA CGTGCTCGTG ATCCTCATCG ACGACGTGGG CTTCGGTGCG
TCGAGCGCGT TCGGCGGGCC GTGCCGCACG CCGACGGCGG AGCGGCTCGC CGCGGGTGGG
CTGCGGTACA ACCGGTTCCA CACCACCGCG CTCTGCGCAC CGACCCGGCA GGCGCTGCTG
ACCGGCCGGA ACCACCACTC GGTCGGCATG GGCAGCATCA CGGAGACGGC GACGTCGGCG
CCCGGGAACA GCTCGCTGCG GCCGAACACG AAGGCGCCGC TCGCGCTGAC CCTGCGGCTC
AACGGGTACT CGACGGCGCA GTTCGGCAAG TGCCACGAGG TTCCCGTCTG GCAGACCTCA
CCGATGGGCC CGTTCGACGC GTGGCCCACG GGCGGCGGTG GCTTCGAGAC CTTCTACGGC
TTCATCGGCG GCGAGAACAA CCAGTGGGAC CCAGCACTGT ACGAGGGCAC GACGCCGATC
GAACCGCCGG CCACGGCCGA GGAGGGCTAC CACCTCACCG AAGACCTCAT CGACCACGCC
TGCGCCTGGG TGCGGCAGCA GAAGGCGCTC ATGCCGGACA AGCCGTTCTT CGTGTACGTC
GCCCCCGGTG CCACGCACGC GCCGCACCAC GTGCCGGCGG AGTGGATCGA GAAGTACCGG
GGAGCGTTCG ACGACGGCTG GGACGTGCAG CGGGAGCGCA CGTTCGCCCG GCAGAAGGAG
CTCGGCGTGA TCCCGGCCGA CGCCGAGCTG ACGGTGCGCC ACGACGAGAT CCCCGCGTGG
GACGACATGC CCGACGACCT CAAGCCCGTG CTGTCGCGGC AGATGGAGGT GTACGCCGCG
TTCCTGGAGC ACACCGACCA CCATGTCGGC CGGCTCATCG ACGCGCTCGA GGATCTCGAG
GTGCTCGGCG ACACGCTCGT CTACTACATC ATCGGCGACA ACGGCGCGTC GGCCGAGGGC
ACCGTCAACG GCGCCTTCAA CGAGATGGCC AACTTCAACG GCATGGCGGC GCTCGAGACG
CCGGACTTCA TGCGGAGCAA GATGGACGAG TTCGGCTCGC CGACCTCCTA CAACCACTAC
GCGGTGGGCT GGGCGTGGGC GATGGACACC CCGTTCCAGT GGACGAAGCA GGTCGCCTCC
CACTGGGGCG GCACCCGGAA CGGAACGATC GTGCACTGGC CGAGCACCAT CGCCGACCGC
GGCGGCCTGC GTACTCAGTT CACCCATGTC ATCGACGTCG CCCCGACCGT GCTCGAGGCC
GCGGGCCTCC CGGAGCCGGC CATGGTGAAC GGCGTCCTGC AGGCGCCCAT CGAGGGCACG
AGCATGCTGT ACACGTTCGA CGGCGCGGAC ATCCCGGAGC GGCACGACCT GCAGTACTTC
GAGATGTTCG CCAACCGCGG GGTCTACCAC CGCGGGTGGA GCGCCGTCAC CAAGCACCGC
ACGCCCTGGG TCATGGTCGG CGGCGAGCTG CCGGCGTTCG ACGACGACCT GTGGGAGCTC
TACGACGGGT CCTCCGACTA CAGCCAGGCG CACGACCTCG CCGCCGAGCG GCCGGACATG
CTCGCGAAGC TCCAGCGCCT GTGGCTCATC GAGGCCACGA AGTACAACGT GCTGCCGCTG
GACGACCGCA CCGGTGAGCG GCTCGAGCCG ACGATGGCCG GGCGCCCCAC GCTCATCCGC
GGCGACTCCC AGCTGTTCTT CCCTGGCATG GGGCGCCTCT CGGAGAACAG CGTCGTCACC
GTGAAGAACC GGTCGTTCTC CGTCACCGCT GAGGTCGAGG TGCCCGACGG CGGAGCGTCC
GGCGTGCTCA TCGCGCAGGG TGGGCGGTTC GGCGGCTGGA GCGTCTACCT CCGCGACGGC
CGGATGACGT TCTGCTACAA CGTGCTCGGC ATCCAGAGCT TCGTGGTGTC GTCGGAGGAG
CCGGTGCCGG CGGGCACCCA TCAGGCGCGG ATGGAGTTCG CGTACGACGG CGGCGGGCTC
GGCAAGGGGG GCGACGTCAC GCTGTACGTC GACGGCGGTC CTGTCGGCAC GGGCCGGGTC
GACGCCACTC AGGCGATGGT GTTCTCCGCC GACGAGACGA CCGACATCGG GTACGAGTCC
GGCACCACCG TCTCGCCGGA CTACACGGCC CGCGACAGCC GGTTCACCGG CAAGCTGCTC
TGGGTCCAGC TCGACGTCGG GACCGACGAC CACGACCACG TCATCAGCCC CGAGGAGCGC
CTGCGGGTGG CCATGGCCCG GCAGTAG
 
Protein sequence
MRPDRHARTM LPIPDRPAAG LTTYDAKDPD TSYPPIEPLL PPDGAPNVLV ILIDDVGFGA 
SSAFGGPCRT PTAERLAAGG LRYNRFHTTA LCAPTRQALL TGRNHHSVGM GSITETATSA
PGNSSLRPNT KAPLALTLRL NGYSTAQFGK CHEVPVWQTS PMGPFDAWPT GGGGFETFYG
FIGGENNQWD PALYEGTTPI EPPATAEEGY HLTEDLIDHA CAWVRQQKAL MPDKPFFVYV
APGATHAPHH VPAEWIEKYR GAFDDGWDVQ RERTFARQKE LGVIPADAEL TVRHDEIPAW
DDMPDDLKPV LSRQMEVYAA FLEHTDHHVG RLIDALEDLE VLGDTLVYYI IGDNGASAEG
TVNGAFNEMA NFNGMAALET PDFMRSKMDE FGSPTSYNHY AVGWAWAMDT PFQWTKQVAS
HWGGTRNGTI VHWPSTIADR GGLRTQFTHV IDVAPTVLEA AGLPEPAMVN GVLQAPIEGT
SMLYTFDGAD IPERHDLQYF EMFANRGVYH RGWSAVTKHR TPWVMVGGEL PAFDDDLWEL
YDGSSDYSQA HDLAAERPDM LAKLQRLWLI EATKYNVLPL DDRTGERLEP TMAGRPTLIR
GDSQLFFPGM GRLSENSVVT VKNRSFSVTA EVEVPDGGAS GVLIAQGGRF GGWSVYLRDG
RMTFCYNVLG IQSFVVSSEE PVPAGTHQAR MEFAYDGGGL GKGGDVTLYV DGGPVGTGRV
DATQAMVFSA DETTDIGYES GTTVSPDYTA RDSRFTGKLL WVQLDVGTDD HDHVISPEER
LRVAMARQ
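
The gene and protein sequences are printed in space-separated blocks of ten characters, so they need to be cleaned before use. The following is a minimal sketch, in plain Python with no external libraries, of how the blocks could be joined, the GC fraction computed, and the gene translated. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; the table also permits alternative start codons, which this sketch does not handle (the gene here starts with ATG, so that does not matter for this record). All function and variable names are illustrative.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First printed line of the gene sequence above.
first_line = clean(
    "ATGCGTCCCG ACCGGCACGC CCGCACGATG CTCCCGATCC CCGACCGACC GGCGGCGGGA"
)
print(translate(first_line))  # MRPDRHARTMLPIPDRPAAG
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence through `clean` and then `gc_fraction` or `translate` would allow the GC content and Protein sequence fields to be checked in the same way.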