Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_1580 |
Symbol | |
ID | 8882769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 1656122 |
End bp | 1657501 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003510376 |
Protein GI | 291299098 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.37197 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGCTC GGCCGAACGT CATCGTGGTC TTCACCGACC AGCAGCGCTG GGACACCACG GGGGCGCAGG GAAATCCGCT GGGACTGACG CCGAACTTCG ACCGAATGGC CGACACCGGC ACCCACGCGC GGCTGGCGGT GACCCCCAAT CCGGTGTGCG GCCCGGCGCG GGCGGCGTTG CAGACCGGCC GGTATCCGAC GGCCAACGGG TGCTACCGCA ACGCGATTCC GCTGCCGGAG TCGGAGCGCA CGCTGGCGCA CCACTTCGCC GACGCCGGGT ACGACACCGG TTACGTGGGC AAATGGCACC TGGCCGACGC CGATCCGGTG CCGGAGTCGC AGCGCGGCGG CTACGGGGAA TGGCTGGCGG CCAACACGCT CGAGTTCACC TCCGACGCGT ATCGGACCAT TGTGTACGGG CAGGACGGCG AGCCGGTGCT GCTGCCGGGA TATCGCTCCG ACGCGTGTTT CGACGCCGCG ATCCGGTTCG TCACCGACCA CCACGATCGT CCGTTCTACC TTTTCTTGTC GCTGTTGGAG CCGCACCACC AAAACGAGGT GGACGACTAT CCGGCGCCCG ACGGGTACGA GCAGCGGTAC CAGGGGCGTT GGATGCCACC GGATCTAGCG GCGCTGTCGG CGAACGGCGG CACCGCGCAC CGGCACATGG GCGGGTATCT GGGTCAGATC GCCCGTGTCG ACGAGGGTCT GGGGCGGTTG CACGACGCGT TGCGCAGTCT GGGGCTGGCC GAGGACACGA TCGTCGCGTA CACCTCGGAT CACGGTTGTC ACTTCAAGAC CCGCAACTCC GAGTACAAGC GGTCGGCGCA CGACGCCTCG ATCCGGGTGC CGCTGGCGAT CTCGGGGCCG GGTTTTACCG GCGGGAGCCG CATCGACCGG CCGGTCAGCA CTGTGGACCT GCCGCCGACG CTGCTGGACG CCGCCGGGAT CGCGGTGCCG GAGGCGATGC AGGGGACGTC GTTCCTGCCG CTGGTGCGCG ATCCGGGGGC CGAGTTCCCC GACGAGGCGT TCATCCAGGT CAGCGAGGCC CAGTGCGGGC GCGCGATCCG GACGAAGCGG TGGCTGTACT ACGTGTCGGA TCCCGACGCG GATGGCTGGG ACGACGCGGC CAGTTCCCGT TACGTCGAGA CCGAGTTGTA CGACCTGGAG CACGATCCGC ACCAGTTGAA CAACCTGGCC GGGTATCCGT CGCACCGTGG GGTGTGTGAT GAGCTGCGGT CGCGGTTGTT GGCGCGGCTG GCAGCCGCCG GTGAGGATGC GGCCGAGATC GTGGCGGCTC CCGAGCCAGT CGGGGCGCCG GTGCGGCACG TGGATCCGGT GGCGCGTTCG CTGTCGGTGC CGCCGATTCG GTTCAGCTGA
|
Protein sequence | MSARPNVIVV FTDQQRWDTT GAQGNPLGLT PNFDRMADTG THARLAVTPN PVCGPARAAL QTGRYPTANG CYRNAIPLPE SERTLAHHFA DAGYDTGYVG KWHLADADPV PESQRGGYGE WLAANTLEFT SDAYRTIVYG QDGEPVLLPG YRSDACFDAA IRFVTDHHDR PFYLFLSLLE PHHQNEVDDY PAPDGYEQRY QGRWMPPDLA ALSANGGTAH RHMGGYLGQI ARVDEGLGRL HDALRSLGLA EDTIVAYTSD HGCHFKTRNS EYKRSAHDAS IRVPLAISGP GFTGGSRIDR PVSTVDLPPT LLDAAGIAVP EAMQGTSFLP LVRDPGAEFP DEAFIQVSEA QCGRAIRTKR WLYYVSDPDA DGWDDAASSR YVETELYDLE HDPHQLNNLA GYPSHRGVCD ELRSRLLARL AAAGEDAAEI VAAPEPVGAP VRHVDPVARS LSVPPIRFS
|
| |