Gene Information |
Locus tag | Bcav_4108 |
Symbol | |
ID | 7861494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beutenbergia cavernae DSM 12333 |
Kingdom | Bacteria |
Replicon accession | NC_012669 |
Strand | - |
Start bp | 4537666 |
End bp | 4539936 |
Gene Length | 2271 bp |
Protein Length | 756 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643868211 |
Product | sulfatase |
Protein accession | YP_002884111 |
Protein GI | 229822585 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
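The coordinates and lengths listed above are mutually consistent, and the check is simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the stated 2271 bp) and that the last codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is illustrative only and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive, so add 1.
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the final codon is a stop codon and is excluded
    # from the protein length.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


gene_bp, protein_aa = cds_lengths(4537666, 4539936)
print(gene_bp, protein_aa)  # 2271 756
```

The strand does not affect these lengths; it only determines which end of the interval is the start of the coding sequence.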
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAGTCCG AGGCCGCCTG GCCGCGGATC CCGACGGCGC CCGAGGGCGC CCCGAACATC GTCGTCATCC TGCTCGACGA CGTCGGCTTC GGTCAGACGT CGACGTTCGG CGGGCTGATC CCGACCCCGA ACCTGGACAA GCTCGCGAGC GAGGGGCTGC GCTACAACCG CTTCCACACC ACGGCGATCT GCGGCCCGTC ACGGGCCGCG CTGCTGACCG GGCGCAACCA CCACGACACC GGCAACGGGT TCCTCATGGA GTGGGCCACG GGGTTCCCGA GCTACACGAC CATGATCCCG AGGACGACCG CGACGGTGGC CGAGGTCCTC AAGGACAACG GCTACTCGAC CTGGTGGTTC GGGAAGAACC ACAACACGCC TGACTGGGAG ACGAGCGTCG CCGGACCGTT CGACCGGTGG CCGACGGGGA TGGGTTTCGA GTACTTCTAC GGCTTCAACG CGGGCGAGAC CCACCAGTAC TACCCGGTGC TGTTCGAGAA CACGACTCCG GTGGAACCGG ACAAGGGGCC CGACGAGGGC TACCACTTCA TGACCGACAT GACCGACCGG GCGATCTCAC GCATGCGGTA CGCGACGTCG GTCGCGCCGG ACAAGCCGTT CTTCATGTAC TTCGCACCGG GTGCGATGCA CGCGCCGCAC CACGTCACGA AGGAGTGGCG CGACCGCTTC ACGGGCGCGT TCGACATGGG GTGGGAGAAG TACCGCGACG TGGTGTTCGC GAACCAGAAG AGGATGGGCA TCGTCCCGCC CGACGCCGAG CTGACTCCGC GTCCCGACTG GGTGGCGGAG TGGGACTCGC TCAGCGAGCA GCAGAAGAGG GTCTACTGCG CGCTGTTCGA GAACTATGCC GGGTACTTCG CGTTCACGGA CCACGAGGTG GGGCGGCTCC TCGACGCGAT CAAGGAGCTG CCCGACGCCG AGAACACCCT CGTGCTCTAC ATCGTGGGCG ACAACGGGGC GTCGTCGGAG GGCGGTCCCG ATGGCACCCT CGACGAGATC AAGAACCTGA GCGGGATCCT GCCGTCGATC GAGGAGATCC TGGCGGACCT CGACAAGCTC GGCGGACCCG AGACGGAGCC GCACTACCCG CTCGGCTGGG CGTGGGCGGG CAACACCCCG TTCCAGTGGG TGAAGCAGGT CGCCTCGCAC CTCGGCGGCT CGCGCAACCC GATGGTGGTG AGCTGGCCCG CTCGGGTGTC CCACGACCCG GTGCCCCGCG ACCCGTTCCT GCACCTCGTC GACGTCGCGC CGACGCTGTA CGAGGCCGCC GGCGTCACGA TGCCGGACAC GGTGAACGGG ATCGAGCAGA TGCCCCTCGC GGGCCGGTCG TTCCTCCCGA GCCTGACCGA CCCGGGATTC GAGGGCCGCG GCGAGCAGTA CTTCGAGATC CTCAGCAACC GATCGATCTA CTCCGACGGC TGGAAGGCCA ACGCTCAGCA CACGCTGCCG TGGCGGCAGG ACATCGCACC CGGTAATTGG GACCAGGACC GCTGGGAGCT GTACCACCTG GAGCAGGACT TCTCGGAGGC GAAGGACCTG GCCGAGGCGA TGCCCGAGAA GCTCGAGGAG ATGAAGCACA GGTTCGACGA GGCCGCGGAG AAGTACCACG TCTACCCGCT GGACGATCGC GGAGTCGCTC GCGCGCTGAT CCCGAAGCCG ACGGCGCCCG GGTCGGACCC TGAGGCGCTC GACTTCACGT TCTACGCCGG CGCCACGCGC CTGCCCGAGA CGGCGGCGCC GTCGATGAAG AACCGCAGCT GGAAGCTCGC CGCACGTGTC 
ACGATGGAGG GCGCGGCGAC GAACGGCGTC ATCATGGCGA TCGGCGGCGT CGCGGGCGGG ATGTCGCTGT ACCTGAAGGA CGGCGTACCG ATCTTCGACT ACAACTACTA CGGCGAGCAC ACGACGGTGC GGGCGGCGCA ACCGCTGCCG GCCGGTGACG CCGTAGTCGG CGTGGAGTTC GCCTATGACG GCGGCGGCAT CGGCAAGGGG GCGGACGTCA CGCTCACCGT CGACGGCGCA CCGGTCGGAT CCGGGCGCGC CGAGAGGACG GTGTTCGCAC GGTTCGGCGT CGACACGTTC GGCATCGGCG AGGACACCGG CCAGCCGGTG ACGACCGACT ACCGTCCGCC GTTCCGGTTC ACCGGGACGA TCGACCGCGT CGACATCCAG CTGGAGCCGG TTGAGCTCGC GCCGCACGAC GCCGCGAAGG TCCACGAGAC GGAGCTCCGG GCGGTGCAGC GGCGGGAGTA G
|
Protein sequence | MESEAAWPRI PTAPEGAPNI VVILLDDVGF GQTSTFGGLI PTPNLDKLAS EGLRYNRFHT TAICGPSRAA LLTGRNHHDT GNGFLMEWAT GFPSYTTMIP RTTATVAEVL KDNGYSTWWF GKNHNTPDWE TSVAGPFDRW PTGMGFEYFY GFNAGETHQY YPVLFENTTP VEPDKGPDEG YHFMTDMTDR AISRMRYATS VAPDKPFFMY FAPGAMHAPH HVTKEWRDRF TGAFDMGWEK YRDVVFANQK RMGIVPPDAE LTPRPDWVAE WDSLSEQQKR VYCALFENYA GYFAFTDHEV GRLLDAIKEL PDAENTLVLY IVGDNGASSE GGPDGTLDEI KNLSGILPSI EEILADLDKL GGPETEPHYP LGWAWAGNTP FQWVKQVASH LGGSRNPMVV SWPARVSHDP VPRDPFLHLV DVAPTLYEAA GVTMPDTVNG IEQMPLAGRS FLPSLTDPGF EGRGEQYFEI LSNRSIYSDG WKANAQHTLP WRQDIAPGNW DQDRWELYHL EQDFSEAKDL AEAMPEKLEE MKHRFDEAAE KYHVYPLDDR GVARALIPKP TAPGSDPEAL DFTFYAGATR LPETAAPSMK NRSWKLAARV TMEGAATNGV IMAIGGVAGG MSLYLKDGVP IFDYNYYGEH TTVRAAQPLP AGDAVVGVEF AYDGGGIGKG ADVTLTVDGA PVGSGRAERT VFARFGVDTF GIGEDTGQPV TTDYRPPFRF TGTIDRVDIQ LEPVELAPHD AAKVHETELR AVQRRE
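The gene sequence above begins with GTG while the protein begins with M. Under translation table 11, GTG is one of the permitted initiation codons and is read as methionine when it starts a CDS. The displayed gene sequence is already in coding orientation (it starts with GTG and ends with the stop codon TAG), so no reverse complement is needed even though the gene lies on the minus strand. The following is a minimal sketch in plain Python, not the database's own method: the function names are hypothetical, and `START_CODONS` lists only three common initiators rather than the full set defined for table 11.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# amino acid assignments as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Simplification (assumption): only the most common initiators are listed.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate_cds(seq: str) -> str:
    """Translate a CDS in coding orientation up to the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        # An initiation codon is translated as methionine.
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


# First two 10-base blocks of the gene sequence from this record.
print(translate_cds("GTGGAGTCCG AGGCCGCCTG"))  # MESEAA
```

Passing the full gene sequence to `translate_cds` and `gc_percent` allows the result to be compared against the listed protein sequence and GC content.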