Gene Bcav_0304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcav_0304 
Symbol 
ID7858996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeutenbergia cavernae DSM 12333 
KingdomBacteria 
Replicon accessionNC_012669 
Strand
Start bp326697 
End bp328190 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content74% 
IMG OID643864380 
Productsulfatase 
Protein accessionYP_002880330 
Protein GI229818804 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.567268 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACC GCCCGAACGT CCTGCTCGTC ATGACGGACC AGCAGCGCTG GGACACGCTC 
GGGTCCGCCG GGGGTCCCGT CGAGACGGCG AACCTCGACC ACCTGGCGGC GCAGGGCACC
ACGTTCACGC ACGCGTACTC GGCGACGCCG TCGTGCACCC CGGCGCGGGC GTCCCTGCTC
ACCGGGCAGG ACCCGTGGCA CACCGGCATC CTCGGCATGG GCGCCGGCCA GCCTCCGATG
GCGGGCCTGG AGAACACGCT CCCGGAGGCG CTCGCGGACG CGGGCTACCA CACGCAGGGC
GTCGGCAAGA TGCACTTCTC GCCGCAGCGG GCGCTGCACG GGTTCCACGC GACGACGATC
GACGAGTCGC TCCGCGTCGA GGAGCCGGGC TTCACCTCCG ACTACACGCA GTGGTTCGAG
CGCCACGCGC CGGCGGACGT GCGGCAGGCC GACCACGGGC TGGACTTCAA CTCGTGGCTG
GCGCGACCGT TCCACACCGG CGAGCACCTG CACCCGTCGA CCTGGACGGT GACGGAGTCG
ATCCGCTTCC TGGAGCGCCG CGACCCCACC CGGCCCTTCT TCCTCATGAC GTCGTTCGCG
CGGCCGCACT CGCCGTACGA CCCGCCCGCG TTCTACTACG AGCACTACCT GCGCCGGCAC
CACACCGGCG ACCTGCCGCC CGCCGTCGTC GGCGACTGGG CGTCCGTGCA CGATGTGGGC
GGCGCGGAGG GCATGGACCC CAACGCCTGG CGCGGCCGGC GGACCGCCGA CGAGATCGGG
CGCGCCCGCG CCGGCTACTA CGGGTCGATC CACCACATCG ACCACCAGAT CGGCCGGCTG
ATGCGGTACC TGCGCGACCG GCGTCTCGAC GCCGAGACGC TCGTCGTCTT CACCGCCGAC
CACGGCGACA TGCTCGGCGA CCACCACCTG TGGCGGAAGA CGTACGCGTA CGAGGGGTCG
GCGCACGTGC CGCTCGTCGT GCGGCTGCCC GCCGGCATGC GCTCCGCCGG CGACGCCGAG
GTGGTGGACG ATCCCGTGTG CCTGCAGGAC GTCATGCCGA CGATCCTCGA CGCGTGCGGC
GTCGACGTCC CGGCCAGCGT CGACGGCGCC AGCACGCTGC CGCTCGTCAC CGGCGAGCGC
GTGCCGTGGC GGGAGTTCGT GCACGGCGAG CACTCCACGT GCTACCACCC GAGCCAGGAG
ATGCAGTACC TCACCGACGG CGCCTGGAAG TACGTGTGGT TCCCGCGCGG GGACGGCCCC
GGCTCACCGC GCGAGCAGCT GTTCGACCTG CGCTCCGACC CGTACGAGGA GCGCGACCTC
GCGCCGCGGT CCGACCACGC CGCCGTCCTG CGGCGGTGGC GAGCACGCCT GGTCGACGTC
CTCGCCCCTC GGGACGCCGG CCTGACCGAC GGCGGGGCGC TCGTCCCGCA GGACGGGCGG
CCACCGCTCG TCTCGCCTCA CGCCGCGTCG CGCGTCGCGG AGCGGCTCGC GTGA
 
Protein sequence
MSDRPNVLLV MTDQQRWDTL GSAGGPVETA NLDHLAAQGT TFTHAYSATP SCTPARASLL 
TGQDPWHTGI LGMGAGQPPM AGLENTLPEA LADAGYHTQG VGKMHFSPQR ALHGFHATTI
DESLRVEEPG FTSDYTQWFE RHAPADVRQA DHGLDFNSWL ARPFHTGEHL HPSTWTVTES
IRFLERRDPT RPFFLMTSFA RPHSPYDPPA FYYEHYLRRH HTGDLPPAVV GDWASVHDVG
GAEGMDPNAW RGRRTADEIG RARAGYYGSI HHIDHQIGRL MRYLRDRRLD AETLVVFTAD
HGDMLGDHHL WRKTYAYEGS AHVPLVVRLP AGMRSAGDAE VVDDPVCLQD VMPTILDACG
VDVPASVDGA STLPLVTGER VPWREFVHGE HSTCYHPSQE MQYLTDGAWK YVWFPRGDGP
GSPREQLFDL RSDPYEERDL APRSDHAAVL RRWRARLVDV LAPRDAGLTD GGALVPQDGR
PPLVSPHAAS RVAERLA