Gene Information |
Locus tag | Bcav_0177 |
Symbol | |
ID | 7862049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beutenbergia cavernae DSM 12333 |
Kingdom | Bacteria |
Replicon accession | NC_012669 |
Strand | + |
Start bp | 184128 |
End bp | 185645 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643864254 |
Product | sulfatase |
Protein accession | YP_002880204 |
Protein GI | 229818678 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCCGC ACCACCACGA GAGGCCGGTC GCTTTGACCA ACATCCTGTT CTTCCTCACC GACCAGCACC GCAAGGACAC CCTCGGTGCG TACGGCAACG CGACTGTGCG CACGCCGAAC CTGGACGCGC TCGCCGCCGA CGGCACGACG TTCGACCGGT TCTACACGCC GACCGCCATC TGCACCCCGG CCCGCGCGAG CCTCCTCACC GGCGCGGCGC CGTTCCGCCA CAAGCTGCTC GCGAACTACG AGCGGAACGT CGGGTACCAG GAGGAGCTCT CCGAGGGCCA GTTCACGTTC AGCGAGGACC TGGCGGAGGC CGGCTACCAC CTCGGGCTCG TCGGAAAGTG GCACGTCGGC ACGCACCGCA CCGCCGGCGA TCTCGGGTTC GACGGCCCGC ACCTGCCGGG CTGGCACAAC CCGGTCGACC ACGCCGACTA CCTCGCCTAC CTCGAGGAGA ACGACCTCCC GCCGTATCGC ATCAGCGACG AGGTCCGCGG CACGTTCCCG AACGGCGCCC CGGGCAACCT GCTGGCCGCG CGGCTGCACC AGCCGCTGGA GGCCACGTTC GAGTACTTCC TCGCCGAGCG CGCGATCGAC CTGCTCCGCA CCTACGCGCG AGACCACCGC ACGAGTGGCC GCCCGTTCTT CCTCGCGACG CACTTCTTCG GCCCGCACCT GCCGTACATC CTCCCGAGCG AGTACCTCGA CATGTACGAC GCCGACGACG TCGAGCTGCC GCTCTCGGTC GCGGAGACGT TCGCCGGCAA GCCGCCCGTC CAGGGCAACT ACTCCGCGCA CTGGACGTTC GACACGCTCG GCGACGAGAC CTCCCGCAAG CTGATCGCCG CGTACTGGGG CTACGTCACG CTCGTCGACT CGCAGGTCGG CCGGATCCTC GACGCCGCCC GCGAGCTCGG CGTGTACGAC GACGCGGCAG TGTTCTTCTC CGCCGACCAC GGCGAGTTCA CCGGCGCGCA CCGACTGCAC GACAAGGGTC CGGCGATGTA CGAGGACATC TACACCATCC CCGGCATCGT CAAGCTGCCG GGCGGTGTCC CGGGCCAGCG CTCGGATCGG CTCGCGCACC TCATCGACCT GACGGCGACG ATCCTCGACG TCGCCGGCCG TGACCCGGCC CGCGCCGTCG ACGGCGTGCC CGTCACGCCG CTCGTGCGCG GCGAGGAGAC GCCGTGGCGC GAGGACCTCG TCGCGGAGTT CCACGGCCAC CACTTCCCGC ACCCGCAGCG GATGCTCGTC ACCGAGCGGT GGAAGCTCGT GGTCAACCCG GAGTCCGTCA ACGAGCTGTA CGACCTCGTC CGCGACCCCG ACGAGCTGCA GAACCGCTAC ACGCACCCGG AGACGGCGGC GGTCCGCGCC GAGCTGCTCG GCCGCCTGTA CCGCCAGCTG CGCGAGCGCG GCGACAACTT CTACCACTGG ATGACGTCGA TGTACCCGGT GGGCGAGAAG GACTACGACA CGTCCCTCAG CATGTTCGAA GGAGCGCACC GCCCATGA
|
Protein sequence | MTPHHHERPV ALTNILFFLT DQHRKDTLGA YGNATVRTPN LDALAADGTT FDRFYTPTAI CTPARASLLT GAAPFRHKLL ANYERNVGYQ EELSEGQFTF SEDLAEAGYH LGLVGKWHVG THRTAGDLGF DGPHLPGWHN PVDHADYLAY LEENDLPPYR ISDEVRGTFP NGAPGNLLAA RLHQPLEATF EYFLAERAID LLRTYARDHR TSGRPFFLAT HFFGPHLPYI LPSEYLDMYD ADDVELPLSV AETFAGKPPV QGNYSAHWTF DTLGDETSRK LIAAYWGYVT LVDSQVGRIL DAARELGVYD DAAVFFSADH GEFTGAHRLH DKGPAMYEDI YTIPGIVKLP GGVPGQRSDR LAHLIDLTAT ILDVAGRDPA RAVDGVPVTP LVRGEETPWR EDLVAEFHGH HFPHPQRMLV TERWKLVVNP ESVNELYDLV RDPDELQNRY THPETAAVRA ELLGRLYRQL RERGDNFYHW MTSMYPVGEK DYDTSLSMFE GAHRP
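The length fields in this record can be cross-checked from the coordinates with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence above ends in TGA). The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers introduced for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in cleaned) / len(cleaned)


# Values taken from the record above.
length_bp = gene_length(184128, 185645)   # 1518, matches "Gene Length"
length_aa = protein_length(length_bp)     # 505, matches "Protein Length"

# GC fraction of the first 10-base block of the gene sequence only.
first_block_gc = gc_fraction("ATGACCCCGC")  # 0.7
```

Passing the full gene sequence string to `gc_fraction` gives a figure that can be compared with the reported GC content. The record rounds that value to a whole percentage, so an exact match should not be expected.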