Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal_0138 |
Symbol | |
ID | 4841812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS155 |
Kingdom | Bacteria |
Replicon accession | NC_009052 |
Strand | + |
Start bp | 158586 |
End bp | 160550 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640117351 |
Product | sulfatase |
Protein accession | YP_001048541 |
Protein GI | 126172392 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000529062 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATCTG GTTTATCACC TCGCGGGCAC AACAATGCCC ATGGGCCTTT CCGCGCCATT TTTATTTTCT CGCTATTAGT GCTTTTCATT ACCACCGCGA GCCGTATCGC TTTTGGGCTG TGGCAAGCCG ATCGCGTGGC CGCTGTGGAT GGTTGGTCGC ATCTCTTAAT CCAAGGTTTT CGCATCGATA TCGCTACCCT GTGTTGGTTG TGGGGGATTG CCGCCTTAGG CACCGCATTA TTTTCGGGTG ATCATTTAAT TGGCCGTCTG TGGCAGCCGA TTTTACGGGT GTGGTTAACT GTCGGTCTGT GGATCATCCT CTTTTTAGAA GCATCGACCC CTGCGTTTAT TGAAGAATAC GGTATTCGCC CGAATCGTTT GTATGTGGAA TATCTGATCT ATCCGAAAGA AGTGCTTTCT ATGCTGTGGG CGGGTCGCAA ACTTGAGCTA ATTTTCTCCG TGCTATTAAC TATCGGTACG CTTTGGGGGG GCTGGGTGTT AAGCGGTAAG CTCACTAAAA ATCTACGTTT CCCGCGCTGG TACTGGCGTC CTGTATTGGC AGTGCTTATT ATCGCCACGA CACTGTTGGG TGCGCGTTCA ACCTTAGGCC ATAGACCGAT TAACCCTGCT ATGGTGGCGT TTGCCGACGA TCCATTAGTG AACTCGTTAG TCATCAACTC AGCCTATTCA TTAGTGTTTG CCATCAAGCA GATGGGTAGT GAAGAAGATG CCTCTAAAGT GTATGGCAAG TTAGATAACG CTGAGATTAT TGCGACCATA AGACAGGAAA GTGGTCGTCC TGAAAGTGCA TTTACCTCAA CAGATATCCC ATCGCTGAGT TTTAACCAAG CCAGTTATAC CGGAAAGCCA AAGAACTTAG TGATCCTACT GCAAGAGAGT TTAGGCGCAC GTTTTGTGGG GAGTTTAGGC GGTTTACCGC TGACTCCGAA TATCGATGCT TTATCCAAAG AAGGTTGGTA TTTCGATAAT TTGTACGCCA CTGGTACTCG TTCAGTGCGC GGGATAGAAG CCGTAACGAC AGGTTTTACC CCGACGCCAG CTCGTGCTGT GGTGAAACTG GGTAAGAGCC AAGTTGGCTT CTTCAGCATA GCTGAATTAC TTAAAAATCA TGGTTATACC ACGCAGTTTA TCTATGGTGG TGAGAGCCAT TTCGACAATA TGCGTAGCTT CTTTTTGGGC AACGGCTTTA GTGACATCAT AGATCAGAAA GATTATAAAT CTCCGGCCTT TGTGGGTTCG TGGGGCGCCT CTGACGAAGA CTTAATGCGT AAGGCTAATA GTGAGTTTGA GCGTCTACAC AGTGAAGGTA AGCCTTTCTT TAGTCTAGTT TTTAGCTCGA GCAACCACGA TCCATTCGAA TTCCCTGATG ATCGTATCGA GCTGTACGAG CAACCTAAGC AAACCCGCAA CAATGCAGCG AAATATGCCG ATTATGCGAT TGGTGAGTTT TTCAAATTGG CGAAAAACGC GGACTATTGG AAAGACACGA TTTTTATCGT GGTTGCCGAC CATGACAGCC GAGTGGGTGG TGCAGATCTG GTGCCAGTAT CGCGTTTTCG GATTCCGGGT TTAATCCTTG GGGATAATTT AGCGCCAAAA CGCGATCATC GTATTGTGAG CCAGATTGAT TTACCGCCAA CACTGTTATC ATTGATTGGT ATTTCAGACT CTTATCCTAT GCTAGGCCGA GATTTGACCC AGGTCAGCGA TGATTGGCCT GGACGCGCGT TAATGCAATA CGATAAAAAC TTTGCCCTGA TGGAAGGTAA AGATGTAGTG ATCCTGCAGC CAGAAAAAGC GGCTCAAGGT TTCGAATACA ATGAAAAAAC TGAGCAGTTA ACGCCTTATG CGCCAGCTGC GGCAGCGTTA GAGAAGAAAG CCTTAAGTTG GGCATTATGG GGCAGTTTGG CCTACCAGCA AGAGCTGTAT CGTTTGCCTA AATAA
|
Protein sequence | MQSGLSPRGH NNAHGPFRAI FIFSLLVLFI TTASRIAFGL WQADRVAAVD GWSHLLIQGF RIDIATLCWL WGIAALGTAL FSGDHLIGRL WQPILRVWLT VGLWIILFLE ASTPAFIEEY GIRPNRLYVE YLIYPKEVLS MLWAGRKLEL IFSVLLTIGT LWGGWVLSGK LTKNLRFPRW YWRPVLAVLI IATTLLGARS TLGHRPINPA MVAFADDPLV NSLVINSAYS LVFAIKQMGS EEDASKVYGK LDNAEIIATI RQESGRPESA FTSTDIPSLS FNQASYTGKP KNLVILLQES LGARFVGSLG GLPLTPNIDA LSKEGWYFDN LYATGTRSVR GIEAVTTGFT PTPARAVVKL GKSQVGFFSI AELLKNHGYT TQFIYGGESH FDNMRSFFLG NGFSDIIDQK DYKSPAFVGS WGASDEDLMR KANSEFERLH SEGKPFFSLV FSSSNHDPFE FPDDRIELYE QPKQTRNNAA KYADYAIGEF FKLAKNADYW KDTIFIVVAD HDSRVGGADL VPVSRFRIPG LILGDNLAPK RDHRIVSQID LPPTLLSLIG ISDSYPMLGR DLTQVSDDWP GRALMQYDKN FALMEGKDVV ILQPEKAAQG FEYNEKTEQL TPYAPAAAAL EKKALSWALW GSLAYQQELY RLPK
|
| |