Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3132 |
Symbol | |
ID | 5706342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3561001 |
End bp | 3563397 |
Gene Length | 2397 bp |
Protein Length | 798 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641272564 |
Product | sulfatase |
Protein accession | YP_001537931 |
Protein GI | 159038678 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000329231 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCAAAG CATTCAAGGG TGTCATCGGC CTGGGCGTCA AGGACTCGAC GCCGGACTGG GATCCCTACG CGCAACCACA GGCCCCGAAG GGGGCACCGA ACGTGCTGTT CCTTGTGTGG GACGACACCG GCTTTGGTTC CTGGGACTTC TTCGGCGGGC CCATCGAGAT GCCGAACATG AGCAAGCTCG CGAACAACGG GCTGAGGTAC ACCCAGTTCC ACACGACGGC GCTGTGTTCA CCCACCCGGG CCGCCCTGCT GAGCGGACGT AACCACACCA CGGTCGGAAT GTCCTGTGTC GCGGAGGCGA CCGAGGGGTT CCCCGGTTTG AACGGGCACA TTCCCGGTGA GGCAGCGCTC ATCGGCGAGA TTCTGAGCGA CCGGGGCTAC AACACCTATG CGCTCGGCAA GTGGCACTGT GTCGCCGAGG ACGAGACCAA CATGGCCTCC TCCAAGCGCA ACTGGCCCAC CAGTCGCGGC TTCGAGCGCT TCTACGGTTT CCTTGGTGGT GAGGCGAACC AGTACTACCC GAACCTCGTG CAGGACCAGC AGTTCATCGA CCAGACCGCC GACCCGGTCA GCATCGATGA GTGGAAGAAG GGCAAGGACG GTTATCTCCT CACCGCCGAC CTCGTCGATC GCGCTATCGG CATGATTTCC GATGCCAAGC AGGTCGCCCC GGACCGGCCG TTTTTCATGT ACTTCTGCCC CGGCGCCAAC CACGCCCCGC ATCACGTACC GAAAGCCTGG GCAGACAAGT ACAAGGGCAA GTTCGACATG GGCTACGAGG CCATCCGCGA GAAGATCCTG GCGAAACAGA TCAAGATGGG TATCCTTCCG AAGGGTACGG AACTCTCTCC GATCAACCCG CTCAGTGACG TGCGCAGTAC GGACGGCGAA CCGTCGCCGC CGATGAGCGA CGTACGCCCG TGGGATTCGC TCAGCGACGA CGAGAAGAAG TTGCAGACGC GCATGGCGGA GGTGTTCGCC GGCTTCTCCA GCTACGCGGA CCACGAGATC GGCCGGCTGA TCTCGTACCT GGAGGAGACC GGGCAACTGG ACAACACCCT CATCTTCGTG ATCTCGGACA ATGGAGCCTC GGGTGAGGGA GGACCAGATG GTGCGGTCAA TGAGAACACG TTCTTCAACA GTGTCCCCAG CAGTGTGGAA GAGAACCTGA AGCTGCTCGA CATCCTTGGT TCCCCAGGTA CGTACAACCA CTACTCGACG GGTTGGGCCT TTGCCTTCAA CACTCCGTTC AAGCTGTTCA AACAGGACGC GTGGGAGGGA GGCGTCTGCG ACCCGATGAT CGTACACTGG CCTGCGGGGA TCAAGGCGAA AGGCGAGATG CGCGACCAGT ACGCGCACGT CAGCGACATC GTGCCGACGG TGTACGAGTG CCTCGGCATC GACCTGCCGG AGACGGTCAA GGGCTTCACC CAGTGGCCCT TGGAAGGCAC CAGCTTCAAA CACACATTCG AGAAGCCCAA GGCGAAGACG GCGAAGCGTA GCCAGTTCTA CCAGATGCTG GGCACCCGTG CGCTGTGGCG CGACGGGTGG AAGGTGGACG CCCTGCATCC GAGTTCACCT GCCGACTGGG GGCACTTCGG GCAGGACAAG TGGGCGCTCT ACCACACCGA CGTCGACCGT GCCGAGATCC ACGACGTGGC CGACCAACAT CCCGAACTGG CCGCGGACCT GGTGGGCCTG TGGTACCACG AGGCCGGCAA GTTCTTCGGC CTGCCGATGG ACGACCGACC TATCGCAGAG ATCCTCAGTA CGTCGCGACC ACAGGTGGCG CCGCCCCGGG ACCACTACGT CTACTATCCG AACACGCTGG AGGTCCCGGA GGCGGTCGCG GTCAACATCC GCGGGCGGTC CTACATCATC GCGGCCGATG TCATCATCGA CGGCTCCGAC GCGGAAGGTG TTCTCTTCGC GCAGGGCTCC AACTTCGGCG GGCACGCACT CTACCTCAAG GACGGCAAAC TCAAGTACGT CTACAACTAC CTGGGGGAGA ACGAGCAGGT GATCACGGCG AACAGCGATG TGCCGAAGGG GAAGGTGGTG CTCGGTGTCG CGTTCGAGAA GGAGAAGTTG ACCACTCCGC CCGGTTCGGA CCGACCCAGT GCGTGCATCG GCAACGCATC GCTGTTCATC GGCAAGAAAA AGGTCGGTGA ATGTAAGGGT ATGCAAACCC AACTCGGCAA GTTCGCGCTC GCCGGGGAAG GTTTCAACGT AGGGCGGGAC CGGGGCGCAC CCGTCACCTA TGACTATTCC GGCGAGCGTC CCTGGAAGCT GACCGGGGCC ACGATCAAAC AGGTCATCGC CGACGTGTCG GGTGAGGCCT ACGTCGACGT CGAGCGGGAG GCAGCGGCCA TGATGGCGCG GGACTGA
|
Protein sequence | MSKAFKGVIG LGVKDSTPDW DPYAQPQAPK GAPNVLFLVW DDTGFGSWDF FGGPIEMPNM SKLANNGLRY TQFHTTALCS PTRAALLSGR NHTTVGMSCV AEATEGFPGL NGHIPGEAAL IGEILSDRGY NTYALGKWHC VAEDETNMAS SKRNWPTSRG FERFYGFLGG EANQYYPNLV QDQQFIDQTA DPVSIDEWKK GKDGYLLTAD LVDRAIGMIS DAKQVAPDRP FFMYFCPGAN HAPHHVPKAW ADKYKGKFDM GYEAIREKIL AKQIKMGILP KGTELSPINP LSDVRSTDGE PSPPMSDVRP WDSLSDDEKK LQTRMAEVFA GFSSYADHEI GRLISYLEET GQLDNTLIFV ISDNGASGEG GPDGAVNENT FFNSVPSSVE ENLKLLDILG SPGTYNHYST GWAFAFNTPF KLFKQDAWEG GVCDPMIVHW PAGIKAKGEM RDQYAHVSDI VPTVYECLGI DLPETVKGFT QWPLEGTSFK HTFEKPKAKT AKRSQFYQML GTRALWRDGW KVDALHPSSP ADWGHFGQDK WALYHTDVDR AEIHDVADQH PELAADLVGL WYHEAGKFFG LPMDDRPIAE ILSTSRPQVA PPRDHYVYYP NTLEVPEAVA VNIRGRSYII AADVIIDGSD AEGVLFAQGS NFGGHALYLK DGKLKYVYNY LGENEQVITA NSDVPKGKVV LGVAFEKEKL TTPPGSDRPS ACIGNASLFI GKKKVGECKG MQTQLGKFAL AGEGFNVGRD RGAPVTYDYS GERPWKLTGA TIKQVIADVS GEAYVDVERE AAAMMARD
|
| |