Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcav_0733 |
Symbol | |
ID | 7861831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beutenbergia cavernae DSM 12333 |
Kingdom | Bacteria |
Replicon accession | NC_012669 |
Strand | + |
Start bp | 808861 |
End bp | 811194 |
Gene Length | 2334 bp |
Protein Length | 777 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643864813 |
Product | sulfatase |
Protein accession | YP_002880756 |
Protein GI | 229819230 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.417874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGTCAGG TGTCCGAGGA GTTCTCTGGA GTCATCAAGC TCGACGTGCG GGACTCCGTC CCCGACTGGT CGCCGTACGA GCTGAAGCGC GCCCCGGAGG GCGCGCCGAA CGTGCTCGTC ATCCTGTACG ACGACACCGG TATGGCCTCG TGGTCGCCGT ACGGCGGCCG CATCAGCATG CCGACGCTCG ACCGTCTCGC CGCGAACGGG TTGACGTACA CGCAGTGGCA CACGACGGCG CTGTGCTCGC CGACGCGGTC GACGTTCCTG ACCGGACGCA ACCACAACGC CAACGGCATG GGCTCGATCA TGGAGACCAC GAACGGCTTC CCGGGATATG CGGGCCGCAT CCCCGAGGAG TGCGCCACGG TCGGCCAGGT CCTGCAGCAG AACGGCTACT CGACGTTCTG GGTGGGCAAG AACCACAACG TGCCGGAGGA GGACGTCTCG TCCGGCGGGA GCAAGTCGCA GTGGCCGCTG GCCATGGGCT TCGACCGGTT CTACGGCTTC CTCGGCGGCG AGACGAACAA CTGGTACCCC GACCTCGTCG AGGACAACCG GTTCGTGGAG CCGCCGTACA CGCCGGACGA GGGCTACCAC CTCTCGAAGG ATCTGGCCGA CCAGGCGATC CGGATGATCC GTGACCAGAA CTCCTCGAAC CCGTCGAAGC CCTGGTACAC GTGGTTCTGC CCGGGCGCGA ACCACGCACC GCACCATGCG CCGCAGGAGT ACATCGAGAA GTACAGGGGC GCGTTCGACG ACGGCTACGA CGCCTACCGC ACGTGGGTGC TCGACCGGAT GGTCGAACGC GGCGTCCTGC CATCGGGCAC GGCGCTCACG CCGTTCAACC CGATGCCGGA GGACCAGGCC AACCCGGCCG ACTACGTCAA GCCGTGGGAC TCCCTCTCCG ATGACGAGAG GCGCCTGTTC TCCCACATGG CCGAGGTGTT CGCCGGTTTC AGCGAGTACA CGGACGCGCA GGTCGGCCGC ATCGTCGACT ACCTCGAGGA GACGGGCCAG CTCGAGAACA CGCTGATCTT CTACTGCGCC GACAACGGCG CGTCCGGGGA GGGTTCCCCC GACGGCTCGG TCAACGAGAA CAAGTTCTTC AACGGCTACC CGGACGACCT CGCCGAGAAC CTCGCGAAGA TCGACGTGCT CGGCTCGCCC GAGACGTACA ACCACTACCC GACGGGGTGG GCGGCGGCGT TCTCGACGCC GTTCCAGATG TTCAAGCGCT ACTCGCAGTT CTCAGGCGGC ACCTGCGACC CGATGATCGT CCACTGGCCG GCCGGCATCA GGGCCAAGGG CGAGATCCGG CACCAGTACC ACCACTCGAC CGACATCGTC GCCACGGTCC TCGACGTCGT CGGCATCGAG ATGCCCGCCG AGTTCCGGGG CGTGACGCAG CGCCCGCTGG ACGGTGTGTC GATGAAGTAC AGCTTCGACG CCGAGCCGGA CGGTCCGACG GAGAAGACCG TTCAGTACTA CTCGATGCTC GGCACACGCG GCATCTGGAA GGACGGATGG AAGGCCTCCG CCATCCACGC ACCGCTCACC GGGCACGGCC ACTTCGACGA CGACCGCTGG GAGCTGTACC ACGTCGACGT CGATCGATCG GAATCGAAGG ACCTCGCCGC GGAGCATCCC GAGAAGCTGC AGGAGCTCAT CGCCGTCTGG TCCGAGGAGG CCGAGAGGAA CCACGTCCTG CCGCTCGACG ACCGTGCCGC GCTCGAGATC GTCACGATCG AACGACCACA GGCTGAGCCT CCCCGAACCC GATACGTCTA CTACCCGGAC ACGGCGGCCG TGCCCGAGAG CGTGGCGGTC AACGTCCGAG GCCGCTCGTT CAAGATCATC GCGGACGTGG TTCTCGACGA GGGGGCGCAG GGCGTGCTGT TCGCGCACGG GTCCCGGTTC GGAGGGCACG CGCTGTTCCT CAAGGACGAC CGGCTGCACT ACGTGTCCAA TTTCCTCGGC ATCCCGCCGG AGCAGATGTT CGCGTCGGAG CCTCTCGCTG CGGGACCGCA CACGCTCGGG ATGGAGTTCA TCCGCGAGAG CGCCGGGGAG CACGGCGAAT CCATCGGCAC GTGCACGCTC TACGTCGACG ACCAGGTCGT CGCGGAGGGC CCGATGCGGG CGCAGGTCGG AAAGTTCACG CTGTGCGGCG ACGGGCTGTG CGTCGGCTAC GACAGCGCCG ACGCCGTCAG CGGCCAGTAC ACGAACCCGT TCCCGTTCAC CGGCGGCAAG CTGCTCGGCG TCGGCATCGA CGTGAGCGAG GAGCAGTACC TCGACCTCGA GCTCGAGGCG GCGGCAGTCC TCGCGCGCGA GTAG
|
Protein sequence | MGQVSEEFSG VIKLDVRDSV PDWSPYELKR APEGAPNVLV ILYDDTGMAS WSPYGGRISM PTLDRLAANG LTYTQWHTTA LCSPTRSTFL TGRNHNANGM GSIMETTNGF PGYAGRIPEE CATVGQVLQQ NGYSTFWVGK NHNVPEEDVS SGGSKSQWPL AMGFDRFYGF LGGETNNWYP DLVEDNRFVE PPYTPDEGYH LSKDLADQAI RMIRDQNSSN PSKPWYTWFC PGANHAPHHA PQEYIEKYRG AFDDGYDAYR TWVLDRMVER GVLPSGTALT PFNPMPEDQA NPADYVKPWD SLSDDERRLF SHMAEVFAGF SEYTDAQVGR IVDYLEETGQ LENTLIFYCA DNGASGEGSP DGSVNENKFF NGYPDDLAEN LAKIDVLGSP ETYNHYPTGW AAAFSTPFQM FKRYSQFSGG TCDPMIVHWP AGIRAKGEIR HQYHHSTDIV ATVLDVVGIE MPAEFRGVTQ RPLDGVSMKY SFDAEPDGPT EKTVQYYSML GTRGIWKDGW KASAIHAPLT GHGHFDDDRW ELYHVDVDRS ESKDLAAEHP EKLQELIAVW SEEAERNHVL PLDDRAALEI VTIERPQAEP PRTRYVYYPD TAAVPESVAV NVRGRSFKII ADVVLDEGAQ GVLFAHGSRF GGHALFLKDD RLHYVSNFLG IPPEQMFASE PLAAGPHTLG MEFIRESAGE HGESIGTCTL YVDDQVVAEG PMRAQVGKFT LCGDGLCVGY DSADAVSGQY TNPFPFTGGK LLGVGIDVSE EQYLDLELEA AAVLARE
|
| |