Gene Bcav_3985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcav_3985 
Symbol 
ID7858048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeutenbergia cavernae DSM 12333 
KingdomBacteria 
Replicon accessionNC_012669 
Strand
Start bp4407778 
End bp4409214 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content72% 
IMG OID643868088 
Productsulfatase 
Protein accessionYP_002883988 
Protein GI229822462 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.135566 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGT CGAACGAGCA GGTGCGCGTA CCCGAGCCGG ACCGCCCCAA CATCGTGCTG 
GTGGTCGTGG ACGACCTGGG GTGGCGGGAC CTCGGCTGCT TCGGCTCGAC GTTCTACGAG
ACGCCGCACA TCGACGCGCT GGCCGCGAGC GGTACGCGGT TCACCCACTC CTACGCGGCG
GCTCCCGTCT GCTCGCCCAC GCGCGCGAGT CTCCTCACCG GGAAGTACCC GGCCCGCGTC
GGCGTGACGA ACTGGATCGG CGGGCACGCC ATCGGAGCGC TCAGGGACGT CCCGTACTTC
CACGGGCTCC CCCAGGACGA GTACGCACTG GCACGAGCTC TCAGGGCGGG TGGCTACCGC
ACCTGGCACG TCGGCAAGTG GCATCTCGGC GGCGGCCGCC ACCTGCCGGA GCACCACGGG
TTCGACCTCA ACGTCGGCGG CTCGGCGAGC GGCTCCCCGG TCAGCTACTA CGCGCCCTAC
GGTATCGGCG CGCTCGAGGA CGCACCCGAC GGCGAGTTCC TCACGGACCG CCTGACGGAC
GTGGCCGTCG ACCTCGTGCG GAGCAGCGAC GACGCCCCGT TCCTCCTCAA CCTGTGGCAC
TACGCGGTGC ACACGCCGAT CGAGGCGCCC GCGCACCTCG TCGAGAAGTA CCGGCACAAG
GCCGAGACCC TCGGCCTGCC CACCCACGGG CCTGACGCCG TCGAGGCGGG CGAGCACATG
CCCGCCCGGC ACCTGCGGTC CGAACGGGTG CGCCGTCGTC GCATCCAGTC GGACCCCACG
TACGCGGCGA TGCTGGAGAC GCTCGACGGC GCCGTCGGCC GCCTCGTGAC CGCACTCCGG
GACGTCGGGA AGCTCGACGA CACACTGATC GTCTTCACGT CGGACAACGG CGGCCTCTCC
ACGGCTGAGG GCTCACCCAC GTGCAACGCG CCCCTCAGCG AGGGCAAGGG CTGGATGGCC
GACGGCGGAA CCCGCGTACC GACGATCGTC TCCTGGCCCG GCCGGGTCCC CGCCGGCGCA
CGGTCCGACC TGCCCTTCAC GTCACCGGAC TTCTACCCGA CGCTGCTTGC TGCTGCCGGG
CTCACCCAGC TACCCGAGCA GCACGTCGAC GGCGTGAACC TGTGGCCGGC GTGGCAGGGC
GCACCACTGG ACCGCGGCCC GATCTTCTGG CACTACCCGC ACTACTCGAA CCAGGGCGGC
GCACCGTCGG CAGCCGTCCG GGACGGCCGC TGGAAGCTCG TGCGCCACTT CGGGATCGAG
CACGACGAGC TGTTCGACGT CGTCGCCGAC GTGTCGGAAA GCCACGACGT CAGCGGCCGG
CGCCGCGACG TCGTCGCCCG GCTGTCGGTG ACGCTCGACT CCTGGCTCGC CGACGTCGGT
GCGCTGATCC CGCGTCGTAC GACGCCGCCG CCCGACACGT TCGACAGACC GCAGTAG
 
Protein sequence
MTASNEQVRV PEPDRPNIVL VVVDDLGWRD LGCFGSTFYE TPHIDALAAS GTRFTHSYAA 
APVCSPTRAS LLTGKYPARV GVTNWIGGHA IGALRDVPYF HGLPQDEYAL ARALRAGGYR
TWHVGKWHLG GGRHLPEHHG FDLNVGGSAS GSPVSYYAPY GIGALEDAPD GEFLTDRLTD
VAVDLVRSSD DAPFLLNLWH YAVHTPIEAP AHLVEKYRHK AETLGLPTHG PDAVEAGEHM
PARHLRSERV RRRRIQSDPT YAAMLETLDG AVGRLVTALR DVGKLDDTLI VFTSDNGGLS
TAEGSPTCNA PLSEGKGWMA DGGTRVPTIV SWPGRVPAGA RSDLPFTSPD FYPTLLAAAG
LTQLPEQHVD GVNLWPAWQG APLDRGPIFW HYPHYSNQGG APSAAVRDGR WKLVRHFGIE
HDELFDVVAD VSESHDVSGR RRDVVARLSV TLDSWLADVG ALIPRRTTPP PDTFDRPQ