Gene Information |
Locus tag | Bcav_1300 |
Symbol | |
ID | 7858763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beutenbergia cavernae DSM 12333 |
Kingdom | Bacteria |
Replicon accession | NC_012669 |
Strand | + |
Start bp | 1477110 |
End bp | 1478570 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643865384 |
Product | arylsulfatase |
Protein accession | YP_002881321 |
Protein GI | 229819795 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
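
The start, end, gene length, and protein length fields above can be cross-checked against each other. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1477110
end_bp = 1478570

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1461 486
```

Both values agree with the reported gene length (1461 bp) and protein length (486 aa).
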
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGAGC CCGAGCGGCC GAACGTCCTG CTCATCATGG CCGACCAGTG GCGGGGCGAC TGCCTGGGAT CGGCCGGGCA CCCGACGGTG CGGACGCCGT TCCTCGACCG CCTCGCCTCC CAGGGGACGA GGTACGCGAA GGCGTACTCG GCCACGCCGA CGTGCACGCC CGCACGCGCG TCGCTGATGA CGGGGCTTCG GCCGGCGACC CATGGCCGGG TCGGGTACGC CGACCACGTC GCGTGGGACT ACCCGACGAC GCTCGCCGGG GAGTTCACCC GCCACGGGTA CCAGACGCAG GCCGTGGGCA AGATGCACGT GTCGCCGGAG CGTGCGCAGC TCGGCTTCCA GAACGTCGTG CTGCACAGCC CGCTGGGGAT CGTCCGGTCC GCGCGGGAGC GGGGTCAGGA CCCGGACCTG GTCGACGACT ACCTGCCGTG GCTCCGGCTC CGGCTGGGTC GCGACGCCAC GTTCTTCGAC CACGGCATCG ACAGCAACTC CTGGGTCGCC CGGCCGTGGG ACAAGCCCGA GCACCTCCAC CCCACGAACT TCGTCGCCTC GGAGTCCGCG GACTTCCTCC GGCGCCGCGA CCCGACGAAG CCGTTCCTGC TGTTCGCCTC GTTCAACGCC CCGCACCCCC CGTTCGACCC ACCGGCCTGG GCGTTCGAGC AGTACCTGGA GACCGACATG CCGGACCCGC CGGTCGGCGA CTGGGCCGAG GCGTTCGACC CCTGGGCGAA CAGCGCCGAC CCGACGGCCC TGGTGGGCAC GATCCCGCCC GACCTGCTGC AGCGCGCGCG GGCGGGCTAC TACGGGCACA TGACCCACGT GGACCAGCAG ATCAACTTCC TCCTCGAGGA GCTCTCCCAC CGCGGTCTCC GGGACAACAC GCTCGTCTGC TTCCTCGCGG ACCACGGCGA GATGCTGGGC GACCACCACC TGTTCCGCAA GGGCTTCCCC TACGAGGGAT CCGCGCGGAT CCCGATGATC CTGAGCGGTC CCGGTGTCCC GGCCGGGCAG GTGCGCGACG ACGTCGTCGA GCTGGGGGAT GTCATGCCGA CGCTGCTCGA CGCCGCGGGC CTTCCGGTCC CTGACGTGGT CCAGGGGCGG AGCTTCCTCC CGGCCACGCT CGACCGAGCG GAGCCTCGCG CCTGGCTGCA CGGGGAGCAC ACACTCCTCG GCCAGTCGTT CCAGTGGCTC ACGGACGGGC ACGAGAAGTA CGTGTGGTGG AGCGGGACCG GTCGGGAGCA GCTCTTCGAC CTCGACTCCG ACCCCACCGA GAGCCATGAC CTCGCACCAT CGCCTGGCGC GAGCGCGCGA CTGGAGCGTT GGCGTGAGGT GCTCGTCGGC GAGCTCCGCG GGCGCGAGGA GGGGTTCACC GAAGGTGGCC GTCTCGTGGT CGGCCGGCCC GTGCGACCGA CCCTGCGACA CCCGATCTCG GGCGGCGCCG AACCGGGCTG A
|
Protein sequence | MTEPERPNVL LIMADQWRGD CLGSAGHPTV RTPFLDRLAS QGTRYAKAYS ATPTCTPARA SLMTGLRPAT HGRVGYADHV AWDYPTTLAG EFTRHGYQTQ AVGKMHVSPE RAQLGFQNVV LHSPLGIVRS ARERGQDPDL VDDYLPWLRL RLGRDATFFD HGIDSNSWVA RPWDKPEHLH PTNFVASESA DFLRRRDPTK PFLLFASFNA PHPPFDPPAW AFEQYLETDM PDPPVGDWAE AFDPWANSAD PTALVGTIPP DLLQRARAGY YGHMTHVDQQ INFLLEELSH RGLRDNTLVC FLADHGEMLG DHHLFRKGFP YEGSARIPMI LSGPGVPAGQ VRDDVVELGD VMPTLLDAAG LPVPDVVQGR SFLPATLDRA EPRAWLHGEH TLLGQSFQWL TDGHEKYVWW SGTGREQLFD LDSDPTESHD LAPSPGASAR LERWREVLVG ELRGREEGFT EGGRLVVGRP VRPTLRHPIS GGAEPG
|
| |
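
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted, which is not modelled here). The names `CODON_TABLE` and `translate` are hypothetical helpers for illustration, not part of any database tooling.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the coding sequence
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above, spaces included as in the record.
print(translate("ATGACGGAGC CCGAGCGGCC GAACGTCCTG"))  # MTEPERPNVL
```

Passing the full gene sequence to `translate` in the same way allows the result to be compared against the listed protein sequence.
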