Gene Information |
Locus tag | Strop_2930 |
Symbol | |
ID | 5059394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 3350260 |
End bp | 3352656 |
Gene Length | 2397 bp |
Protein Length | 798 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640475181 |
Product | sulfatase |
Protein accession | YP_001159746 |
Protein GI | 145595449 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
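The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only a sanity check, and it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon; neither convention is stated in the record itself. The variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 3350260
end_bp = 3352656
gene_length_bp = 2397
protein_length_aa = 798

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, so the protein has
# one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the table: 3352656 - 3350260 + 1 = 2397 bp, and 2397 / 3 - 1 = 798 aa.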
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0511457 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAAAG AATTCAAGGG CGTTATCAAC CTGGGTGTCA CGGACTCGAC GCCGGACTGG GACCCCTACG CCCAACCCCA GGCCCCGAAA GGGGCGCCGA ACGTGCTGAT CCTGGTCTGG GACGACACGG GCTTCGGCTC CTGGGACTTC TTCGGCGGGC CCATCGAGAT GCCGAACATG AGTAAGCTCG CCAACAACGG ACTGAAGTAC ACCCAGTTCC ACACCACCGC GCTCTGCTCG CCCACTCGGG CCGCGCTGCT GAGTGGGCGT AACCACACCA CCGTCGGCAT GTCCTGCGTC GCCGAGGCGA CCCAGGGGTT CCCGGGGATG AATGGCCATA TCCCCGGGGA GGCGGCCCTT ATCGGGGAGA TCCTCAGCGA CCGGGGCTAC AACACCTACG CACTGGGCAA GTGGCACTGT GCCGGGGAAG ACGAGACCAA CATGGCCTCC TCCAAGCGGA ACTGGCCCAC CTACCGGGGC TTTGAACGCT TCTACGGCTT TCTCGGCGGC GAGGCCAACC AGTACTACCC GAATCTCGTG CAGGACCAGC AGTTCGTCGA CCAGCCCGCC GATCCGGTCA GCATTGACGA GTGGAAGGAG GGAAAGGACG GATATCTGCT CACCGCCGAC CTCGTCGACC GGGCGATCGG CATGATCGGC GACGCCAAGC AGACGGCGCC GGACCGACCG TTCTTCATGT ACTTCTGCCC CGGGGCGAAT CATGCCCCAC ACAGCGTGCC GAAGGCCTGG GCCGACAAGT ACAAGGGCAA GTTCGACATG GGGTACGAGG CGATCCGCGA GAAGATCCTC GCGAAGCAGA TCAAGATGGG TATCGTCCCG AAGGGTACCG AACTATCACC GGTCAACCCG TTCAGCGACG TACGCAGCGC GGACGGCAAA CCCTTTCCCT CCACGAGCGA GGTACGGCCG TGGGATTCGC TCAGCGACGA CGAGAAGAAG CTGCAGACGC GGATGGCGGA GGTCTTCGCC GGCTTCTCCA GCCACGCCGA CCACGAGATC GGTCGGCTGA TCTCCTACCT TGAGGAAACC GGAGAGCTGG ACAACACCCT CATCATCGTG ATCTCCGACA ACGGTGCGTC GGGTGAGGGT GGGCCGGATG GCGCCGTCAA TGAGAACACG TTCTTCAATG GTGTCCCGAG CAATGTCGAC GAGAATATCA AGATGATCGA TATTCTCGGC TCTCCCGGCA CGTACAACCA CTATTCGACC GGTTGGGCCT TTGCCTTCAA CACCCCGTTC AAGCTGTTCA AGCAGGACGT GTGGGAGGGG GGAATCTGCG ACCCGATGAT CGTGCACTGG CCGGCGGGGA TCAAGGCAAA AGGCGAGCTA CGCGACCAGT ACACACACGT CACCGACATC GTACCCACGG TGTATGAATG CCTCGGCATC GAGCTACCGG AGACGGTCAA GGGCTTCAAA CAGTGGCCGC TGGAGGGCAC CAGCTTCAAA CACACCTTCG AGAAGGCCAA GGCGAAGACG GCCAAGCGCA GCCAGTTCTA CCAGATGCTG GGGACGCGGG CGCTGTGGCG GGACGGCTGG AAGGTGGACG CCCTGCACCC GAGCTCGCCA TCGGACTGGG GTCACTTTGG CCTGGACAAG TGGGCGCTCT ACCACACCGA CGTCGACCGT GCCGAGATCC ACAACGTGGC CGACCAGCAC CCCGATCTGG CCGCGGAGCT GGTGGCGCTC TGGTACTACC AGGCCGGTAC GTTCTCTGGT CTACCGATGG AGGACCGGCC TATCGCCGAG TTGCTCAGCA CGCCGCGACC GCAGGTGGCG 
CCGCCGCGAG ACCACTACGT CTACTATCCG AACACGCTGG AGGTTCCCGA GGCGGTCGCG GTCAACATTC GGGGGCGTTC CTACATCCTC GCGGCCGATG TCGTCATCGA CGGTCCCGAC GCTGAAGGTG TCCTCTTCGC GCAGGGCTCC AACTTCGGCG GTCACGCTCT CTACCTCAAA GACGGCAAAC TGAAGTACGT CTACAACTAC CTGGGGGAGA AGGAGCAGGT CATTGCCTCG AACATCGACG TGCCGAAGGG GAAGGTGGTG CTCGGCATCG CGTTCGAGAA GGAGAAGTTG ATAACTCCGC CGAACTCCGA TCAGCCCAGT GCCTGCATCG GTAACGCCTC GCTGTTCATC GGCAAGAAGA AGGTGGGTGA GTGCAAGGGG ATGCAGACCC AACTCGGCAA CTTCGCGCTC GCCGGGGAAG GCTTCAACGT CGGCCAGGAC CGAGGAGCGC CGGTCACCTA CGACTACTCC GGTGCTCGTC CCTGGAAGCT GACCGGCGCC ACGATCAAAC AGGTCATCGC GGATGTTTCT GGTGAGGCCT ACGTCGACGT CGAGCGGGAG GCGGCCGCCA TGATGGCCCG GGACTGA
|
Protein sequence | MSKEFKGVIN LGVTDSTPDW DPYAQPQAPK GAPNVLILVW DDTGFGSWDF FGGPIEMPNM SKLANNGLKY TQFHTTALCS PTRAALLSGR NHTTVGMSCV AEATQGFPGM NGHIPGEAAL IGEILSDRGY NTYALGKWHC AGEDETNMAS SKRNWPTYRG FERFYGFLGG EANQYYPNLV QDQQFVDQPA DPVSIDEWKE GKDGYLLTAD LVDRAIGMIG DAKQTAPDRP FFMYFCPGAN HAPHSVPKAW ADKYKGKFDM GYEAIREKIL AKQIKMGIVP KGTELSPVNP FSDVRSADGK PFPSTSEVRP WDSLSDDEKK LQTRMAEVFA GFSSHADHEI GRLISYLEET GELDNTLIIV ISDNGASGEG GPDGAVNENT FFNGVPSNVD ENIKMIDILG SPGTYNHYST GWAFAFNTPF KLFKQDVWEG GICDPMIVHW PAGIKAKGEL RDQYTHVTDI VPTVYECLGI ELPETVKGFK QWPLEGTSFK HTFEKAKAKT AKRSQFYQML GTRALWRDGW KVDALHPSSP SDWGHFGLDK WALYHTDVDR AEIHNVADQH PDLAAELVAL WYYQAGTFSG LPMEDRPIAE LLSTPRPQVA PPRDHYVYYP NTLEVPEAVA VNIRGRSYIL AADVVIDGPD AEGVLFAQGS NFGGHALYLK DGKLKYVYNY LGEKEQVIAS NIDVPKGKVV LGIAFEKEKL ITPPNSDQPS ACIGNASLFI GKKKVGECKG MQTQLGNFAL AGEGFNVGQD RGAPVTYDYS GARPWKLTGA TIKQVIADVS GEAYVDVERE AAAMMARD
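The gene and protein sequences above can be related with a short translation sketch. The listed gene sequence begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand; that reading is an assumption. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for internal codons (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence and the first
# 10-residue block of the protein sequence, copied from the record.
gene_prefix = "ATGAGTAAAG AATTCAAGGG CGTTATCAAC"
protein_prefix = "MSKEFKGVIN"

print("prefix translates as listed:", translate(gene_prefix) == protein_prefix)
print("GC fraction of prefix:", round(gc_fraction(gene_prefix), 2))
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.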