Gene Information |
Locus tag | Bcav_3985 |
Symbol | |
ID | 7858048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beutenbergia cavernae DSM 12333 |
Kingdom | Bacteria |
Replicon accession | NC_012669 |
Strand | + |
Start bp | 4407778 |
End bp | 4409214 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643868088 |
Product | sulfatase |
Protein accession | YP_002883988 |
Protein GI | 229822462 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
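The length fields above follow from the coordinates. Start bp and End bp are both inclusive, so the gene length is End minus Start plus one, and the protein length is the codon count minus the stop codon. A minimal Python sketch of that arithmetic follows. The function names are hypothetical, and the single-stop-codon assumption suits an unspliced CDS like this one, not pseudogenes or partial genes.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature whose start and end coordinates are both inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming it ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(4407778, 4409214)
    print(length)                  # 1437
    print(protein_length(length))  # 478
```

Both results agree with the Gene Length and Protein Length fields of this record.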
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.135566 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGGCGT CGAACGAGCA GGTGCGCGTA CCCGAGCCGG ACCGCCCCAA CATCGTGCTG GTGGTCGTGG ACGACCTGGG GTGGCGGGAC CTCGGCTGCT TCGGCTCGAC GTTCTACGAG ACGCCGCACA TCGACGCGCT GGCCGCGAGC GGTACGCGGT TCACCCACTC CTACGCGGCG GCTCCCGTCT GCTCGCCCAC GCGCGCGAGT CTCCTCACCG GGAAGTACCC GGCCCGCGTC GGCGTGACGA ACTGGATCGG CGGGCACGCC ATCGGAGCGC TCAGGGACGT CCCGTACTTC CACGGGCTCC CCCAGGACGA GTACGCACTG GCACGAGCTC TCAGGGCGGG TGGCTACCGC ACCTGGCACG TCGGCAAGTG GCATCTCGGC GGCGGCCGCC ACCTGCCGGA GCACCACGGG TTCGACCTCA ACGTCGGCGG CTCGGCGAGC GGCTCCCCGG TCAGCTACTA CGCGCCCTAC GGTATCGGCG CGCTCGAGGA CGCACCCGAC GGCGAGTTCC TCACGGACCG CCTGACGGAC GTGGCCGTCG ACCTCGTGCG GAGCAGCGAC GACGCCCCGT TCCTCCTCAA CCTGTGGCAC TACGCGGTGC ACACGCCGAT CGAGGCGCCC GCGCACCTCG TCGAGAAGTA CCGGCACAAG GCCGAGACCC TCGGCCTGCC CACCCACGGG CCTGACGCCG TCGAGGCGGG CGAGCACATG CCCGCCCGGC ACCTGCGGTC CGAACGGGTG CGCCGTCGTC GCATCCAGTC GGACCCCACG TACGCGGCGA TGCTGGAGAC GCTCGACGGC GCCGTCGGCC GCCTCGTGAC CGCACTCCGG GACGTCGGGA AGCTCGACGA CACACTGATC GTCTTCACGT CGGACAACGG CGGCCTCTCC ACGGCTGAGG GCTCACCCAC GTGCAACGCG CCCCTCAGCG AGGGCAAGGG CTGGATGGCC GACGGCGGAA CCCGCGTACC GACGATCGTC TCCTGGCCCG GCCGGGTCCC CGCCGGCGCA CGGTCCGACC TGCCCTTCAC GTCACCGGAC TTCTACCCGA CGCTGCTTGC TGCTGCCGGG CTCACCCAGC TACCCGAGCA GCACGTCGAC GGCGTGAACC TGTGGCCGGC GTGGCAGGGC GCACCACTGG ACCGCGGCCC GATCTTCTGG CACTACCCGC ACTACTCGAA CCAGGGCGGC GCACCGTCGG CAGCCGTCCG GGACGGCCGC TGGAAGCTCG TGCGCCACTT CGGGATCGAG CACGACGAGC TGTTCGACGT CGTCGCCGAC GTGTCGGAAA GCCACGACGT CAGCGGCCGG CGCCGCGACG TCGTCGCCCG GCTGTCGGTG ACGCTCGACT CCTGGCTCGC CGACGTCGGT GCGCTGATCC CGCGTCGTAC GACGCCGCCG CCCGACACGT TCGACAGACC GCAGTAG
Protein sequence | MTASNEQVRV PEPDRPNIVL VVVDDLGWRD LGCFGSTFYE TPHIDALAAS GTRFTHSYAA APVCSPTRAS LLTGKYPARV GVTNWIGGHA IGALRDVPYF HGLPQDEYAL ARALRAGGYR TWHVGKWHLG GGRHLPEHHG FDLNVGGSAS GSPVSYYAPY GIGALEDAPD GEFLTDRLTD VAVDLVRSSD DAPFLLNLWH YAVHTPIEAP AHLVEKYRHK AETLGLPTHG PDAVEAGEHM PARHLRSERV RRRRIQSDPT YAAMLETLDG AVGRLVTALR DVGKLDDTLI VFTSDNGGLS TAEGSPTCNA PLSEGKGWMA DGGTRVPTIV SWPGRVPAGA RSDLPFTSPD FYPTLLAAAG LTQLPEQHVD GVNLWPAWQG APLDRGPIFW HYPHYSNQGG APSAAVRDGR WKLVRHFGIE HDELFDVVAD VSESHDVSGR RRDVVARLSV TLDSWLADVG ALIPRRTTPP PDTFDRPQ
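The sequences are displayed in space-separated blocks of ten, so they need cleaning before any computation. The sketch below, in plain Python with no external libraries, strips that layout, computes GC content, and translates a CDS with the amino-acid assignments of NCBI translation table 11. The helper names are hypothetical. It is a simplified illustration: it does not treat alternative start codons (such as GTG or TTG) as methionine, and it raises `KeyError` on ambiguous bases such as N.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a cleaned CDS codon by codon; '*' marks a stop codon."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


if __name__ == "__main__":
    # First three blocks of the gene sequence and the first block of the protein.
    gene_start = clean_sequence("ATGACGGCGT CGAACGAGCA GGTGCGCGTA")
    protein_start = clean_sequence("MTASNEQVRV")
    print(translate(gene_start) == protein_start)  # True
```

Passing the full cleaned gene sequence to `gc_percent` and `translate` gives values to compare against the GC content and Protein sequence fields. The translated string ends in `*` for the terminal TAG codon, which the protein field omits.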