Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal195_4333 |
Symbol | |
ID | 5756164 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS195 |
Kingdom | Bacteria |
Replicon accession | NC_009997 |
Strand | - |
Start bp | 5116175 |
End bp | 5118139 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641290689 |
Product | sulfatase |
Protein accession | YP_001556751 |
Protein GI | 160877435 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATCTG GTTTATCACC TCGCGGGCGA CACAATGCCC ATGGGCCTTT CCGCGCCATC TTTATCTTCT CGCTATTAGT GCTTTTTATT GCGACCGCAA GCCGTATTGC TTTGGGGCTG TGGCAAGCCG ATCGCGTGGC CGCTGTTGAT GGTTGGTCGC ATCTCTTAAT CCAAGGTTTA CGCGTCGATA TCGCGACCCT GTGCTGGTTA TGGGGTATTG CTGCCTTAGG AACCGCATTG TTTTCGGGTG ATCATTTTAT TGGCCGTCTG TGGCAGCCGA TTTTACGGGT GTGGTTAACT GTCGGTCTGT GGATCATCCT CTTTTTAGAA GCATCGACCC CTGCGTTTAT TGAAGAATAC GGTATTCGCC CAAATCGTTT GTATGTGGAA TATCTGATCT ACCCGAAAGA AGTGCTTTCT ATGCTGTGGG CGGGTCGCAA ACTTGAGCTG ATCTTCTCCG TGCTATTAAC TATAGGTACA CTATGGGGCG GCTGGGTGTT AAGCGGTAAG CTCACTAAAA ATCTACGTTT TCCGCGCTGG TACTGGCGTC CAGTATTGGC AGTGCTTGTT ATCGCCATGA CGTTATTGGG TGCGCGTTCA ACCTTAGGCC ATAGACCGAT TAACCCTTCT ATGGTGGCGT TTGCCGACGA TCCATTAGTG AACTCTTTAG TCATCAACTC AGCCTATTCA TTAGTGTTTG CCATCAAGCA GATGGGCAGT GAAGAAGATG CCTCTAAAGT GTATGGCAAG TTAGATAACG CTGAGATTAT TGCGACCATA AGACAGGAAA GTGGTCGTCC TGAAAGTGCA TTTACCTCAA CGGATATCCC ATCGCTGAGT TTTAACCAAG CCAGTTATAC CGGAAAGCCA AAGAACTTAG TGATCCTACT GCAAGAGAGT TTAGGCGCAC GTTTTGTGGG GAGTTTAGGT GGTTTACCCC TGACTCCGAA TATCGATGCC TTATCCAAAG AAGGTTGGTA TTTCGATAAT TTGTACGCCA CTGGTACTCG TTCAGTGCGC GGGATAGAAG CCGTAACGAC AGGTTTTACC CCGACGCCAG CTCGTGCTGT GGTGAAACTG GGTAAGAGCC AAGTTGGCTT CTTCAGTATT GCTGAATTAC TTAAAAATCA TGGTTATACC ACGCAGTTTA TTTATGGTGG TGAGAGCCAT TTCGACAATA TGCGTAGCTT CTTTTTGGGC AACGGCTTTA GTGACATCAT AGATCAGAAA GATTATAAAT CTCCGGCCTT TGTGGGTTCG TGGGGCGCCT CTGACGAAGA CTTAATGCGT AAGGCGAATA GTGAGTTTGA GCGTCTACAC AGTGAAGGTA AGCCTTTCTT TAGTCTAGTT TTTAGCTCGA GCAACCACGA TCCATTTGAA TTCCCAGATG ATCGTATCGA GCTGTACGAG CAACCTAAGC AAACCCGTAA TAATGCGGCG AAATATGCCG ACTATGCGAT TGGTGAGTTT TTCAAACTGG CGAAAAATGC AGACTACTGG AAAGATACGA TTTTTATCGT GGTTGCCGAC CATGACAGTC GAGTGGGTGG CGCGGATCTG GTGCCAGTGT CACGTTTTCG TATTCCGGGT TTAATCCTTG GGGATAATTT AGCGCCAAAA CGCGATCATC GCATTGTGAG CCAAATTGAT TTACCGCCGA CACTTTTATC ATTGATTGGT ATTTCAGACT CTTATCCTAT GCTGGGCCGA GATTTGACTC AGGTCAGCGA TGATTGGCCT GGACGCGCGT TAATGCAATA CGATAAAAAC TTTGCCCTGA TGGAAGGTAA AGATGTAGTG ATCCTGCAGC CAGAAAAAGC GGCTCAAGGT TTCGAATATA ACGAAAAAAC TGAGCAGTTA ACGCCTTATG CGCCAGCTGC AGCAGCGTTA GAGAAGAAAG CCTTAAGTTG GGCATTATGG GGCAGTTTGG CCTACCAGCA AGAGCTGTAT CGTTTGCCTA AATAA
|
Protein sequence | MQSGLSPRGR HNAHGPFRAI FIFSLLVLFI ATASRIALGL WQADRVAAVD GWSHLLIQGL RVDIATLCWL WGIAALGTAL FSGDHFIGRL WQPILRVWLT VGLWIILFLE ASTPAFIEEY GIRPNRLYVE YLIYPKEVLS MLWAGRKLEL IFSVLLTIGT LWGGWVLSGK LTKNLRFPRW YWRPVLAVLV IAMTLLGARS TLGHRPINPS MVAFADDPLV NSLVINSAYS LVFAIKQMGS EEDASKVYGK LDNAEIIATI RQESGRPESA FTSTDIPSLS FNQASYTGKP KNLVILLQES LGARFVGSLG GLPLTPNIDA LSKEGWYFDN LYATGTRSVR GIEAVTTGFT PTPARAVVKL GKSQVGFFSI AELLKNHGYT TQFIYGGESH FDNMRSFFLG NGFSDIIDQK DYKSPAFVGS WGASDEDLMR KANSEFERLH SEGKPFFSLV FSSSNHDPFE FPDDRIELYE QPKQTRNNAA KYADYAIGEF FKLAKNADYW KDTIFIVVAD HDSRVGGADL VPVSRFRIPG LILGDNLAPK RDHRIVSQID LPPTLLSLIG ISDSYPMLGR DLTQVSDDWP GRALMQYDKN FALMEGKDVV ILQPEKAAQG FEYNEKTEQL TPYAPAAAAL EKKALSWALW GSLAYQQELY RLPK
|
| |