Gene Bcav_2095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcav_2095 
Symbol 
ID7859375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeutenbergia cavernae DSM 12333 
KingdomBacteria 
Replicon accessionNC_012669 
Strand
Start bp2369550 
End bp2371088 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content67% 
IMG OID643866187 
Productsulfatase 
Protein accessionYP_002882110 
Protein GI229820584 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0111977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAACG GGAAGCCGAA CATCCTGGTG ATCTGGGGCG ACGACATCGG CATCACGAAC 
CTCAGCTGCT ACAGCGACGG CCTCATGGGG TACTGGACCC CGAACATCGA CCGCATCGCC
GCCGAGGGCA TGCGCTTCAC CGACTCGTAC GGGGAGCAGA GCTGTACCGC GGGCCGGTCG
TCGTTCATCA CCGGGCAGAG CGTGTTCCGC ACCGGCCTGA GCAAGGTCGG GATCCCGGGC
TCGCCCATCG GCCTGCAGGC GGAGGACCCC ACCATCGCGG AGCTGCTGAA GCCGCTCGGG
TACGCGACCG GACAGTTCGG GAAGAACCAC CTCGGCGACA AGAACGAGTT CCTCCCGACG
GCGCACGGGT TCGACGAGTT CTACGGCAAC CTTTACCACC TCAACGCCGA GGAGGAGCCC
GAGCTGCCGA ACTGGCCGTC GCCCGAGGAC TTCCCGGGGT TCAACGAGCG TGCACGCCCC
CGCGGGGTCA TCCACTCCTG GGCGACGGAC GTCGACGACC CGACCGAGGA CGGCCGCTTC
GGTCCGCGCG GCAAGCAGCG GATCGAGGAC ACCGGGGCGC TCACGAAGAA GCGGATGGAG
ACGGTCGACG AGGAGTTCGC CGCCGCCGCG CAGGACTTCA TCGCACGCCA GGTGGACGCG
GACACGCCGT TCTTCGTGTG GATGAACACG ACGCACATGC ACTTCAGGAC GCACCCGAAG
CCGGAGAGCG TGGGTCAGGC CGGACGGTGG CAGTCGCCGT ACCACGACAC GATGATCGAC
CACGACCGCG TCGTCGGGGG CCTGCTGGAC CAGCTCGACG AGCTCGGCAT CGCCGAGGAC
ACGATCGTCA TCTACTCGAC GGACAACGGG CCGCACATGA ACACGTGGCC CGACGGCGGG
ATGACGCCGT TCCGCAGCGA GAAGAACACG AACTGGGAGG GTGCGTTCCG GGTGCCCGAG
ATGATCCGGT GGCCCGGGCG GATCGCGGCC GGCGTCGTGT CGAACGAGAT CGTCCAGCAC
CACGACTGGC TGCCCACCTT GCTCGCGGCG GCCGGCGACA CGGGCGTCGT CGACGACCTC
AAGCAGGGGA AGACGATCGG CGACGTCACC TACAAGGTGC ACATCGACGG CTACAACCTG
CTCCCGTACC TGACCGGCGA GGCCGACGAG AGCCCGCGCA AGGGGATGGT CTACTTCTCC
GACGACGGCG ACGTGCTAGC GCTCCGGTTC GACAACTGGA AGGTCGTGTT CATGGAGCAG
CGGGTGCCCG GGACGCTCCG CGTGTGGGCC GAGCCGTTCG TACCGTTGCG GGTCCCGCTC
CTGTACAACT TGCGCACCGA CCCGTTCGAG CGGGCGACGA TCACGTCGAA CACCTACTAC
GACTGGCTGT TCGACAACGA CTATCTCGTC TTCGCGTCGC AGGTGATCAT GACGCAGTTC
CTGGCGACAT TCCGTGAGTA CCCGCCGCGC CAGCGCGCCG CGAGCTTCAG CATCGACCAG
GCGGTCGAGA AGCTCCAGTC GTTCCTCGGC AGCAACTGA
 
Protein sequence
MPNGKPNILV IWGDDIGITN LSCYSDGLMG YWTPNIDRIA AEGMRFTDSY GEQSCTAGRS 
SFITGQSVFR TGLSKVGIPG SPIGLQAEDP TIAELLKPLG YATGQFGKNH LGDKNEFLPT
AHGFDEFYGN LYHLNAEEEP ELPNWPSPED FPGFNERARP RGVIHSWATD VDDPTEDGRF
GPRGKQRIED TGALTKKRME TVDEEFAAAA QDFIARQVDA DTPFFVWMNT THMHFRTHPK
PESVGQAGRW QSPYHDTMID HDRVVGGLLD QLDELGIAED TIVIYSTDNG PHMNTWPDGG
MTPFRSEKNT NWEGAFRVPE MIRWPGRIAA GVVSNEIVQH HDWLPTLLAA AGDTGVVDDL
KQGKTIGDVT YKVHIDGYNL LPYLTGEADE SPRKGMVYFS DDGDVLALRF DNWKVVFMEQ
RVPGTLRVWA EPFVPLRVPL LYNLRTDPFE RATITSNTYY DWLFDNDYLV FASQVIMTQF
LATFREYPPR QRAASFSIDQ AVEKLQSFLG SN