Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_4167 |
Symbol | |
ID | 5714682 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009959 |
Strand | - |
Start bp | 15174 |
End bp | 16799 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641277062 |
Product | sulfatase |
Protein accession | YP_001542358 |
Protein GI | 159046690 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.255365 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCCAA GCCCCCTGCG CCTCGGTCTC GGCGCGCTCG TGCTGCATCT GGTGCTGGTG CAACCGAACC ATCCGGCGGC CCTGACCTGG GGGGCGCTGG CGATGTTTCC CCTGGAATTG CCGGTGATCC TCCTGGTGCT CGCCGCCCTG CCCCCCGGGC GGGTCACCGC GTGGCTGCGC GCCGGGCTGA CGGCGCTGCT GGTGCTGATC GCGGTGCTGA AGACGGCGGA TTTCGCGATG TTCTCGGCCC TGGGGCGGGG GTTCAACCCG ATCTCGGACA TGGCTTTGGT GGAGGCCGGT TTGCGGCTCT CCACCGGGGC GATCGGCCCG GTTCTGACCG GGCTGGCGGT GGTCGCGGCA CTGCTGGCGG TGGCGGGCGT GGCCTTGGCG ATCTGGTGGG CGACCGGGGT CTGGGCCGGG CTGCGCCTGC CGCGGCGCGC GGGGCTCGGC CTAGGCGTGG CGGCGGGGCT TGCCGCCGGG GTCGCCGGGG CGGAGATCGG GCAGGCCATG GGCCGCTGGT CCCTGCCGGT CACGCCCCCG GGGGCGGCGT TTACCGCCCG CGTCGGGGTC GAGCGGATGG GCATGGCCCG CGCCACCCTC GCCGACTTGC GCGCCTTCGA GATCGCGGCG GCCACGGACC CGCTGGCCGG GCGCGCGGAC CTGCTGGGCG CCATCGACCG GGACGTTCTG GTCGTCTTTG TCGAAAGCTA CGGGCGCGCC AGCCTCGACA CCCCGCTTTA TGCCGAGACC CATCGCGCGA CCCTGGCGGC GGCCGAGGCG CGGCTCGGGG CGCTGGGGCT GTCCATGCGA TCGGGCCTGC TGACCGCGCC CACGCGGGGC GGGCAGAGCT GGCTGAGCCA CGCGACCTTT GCCAACGGGC TGTGGGTGGA CAACCAGACG AGCTATGGCG CGGCGCTGGC CAGCGGGCGG CGGACGCTGT TTCACCTCGC CGCCGAGGCC GGGTTTCACA CCGCCGCGGT GATGCCGCAG ATCACCCTGG ACTGGCCCGA GGCCGACCTG ATGGGGTTCG AGACCGTGCT GGCGGCGGCG GATCTCGGCT ATGCCGGGCA GCCCTTCAAC TGGGTGACGA TGCCGGACCA GTTCACCTTC GCCGCGATGG ACCGCCTGCT GCGCGACCGG GCGGAGACGC GGCCCTATTT CGTGCAGATG GCGCTGGGGT CGTCCCATGC GCCCTGGGTG CCGGTGCCCG AGCTGGTGCC GTGGGAGGCA ATCGGCGATG GCACGATCTT CGATCCCATG GCGGCGGCGG GCGATCCGCC GGACGTGGTC TGGCGCGACC GCGACCGGGT GCGGGAGCAG TACCGCCTCG CCCTCGACTA CGCCCTGCGG GTGGTGTTCG ACTACGCCGC GCGGCACGCG GGCGACCCGC CGCTGATCCT GGTGCTGGGC GATCACCAGG CGGCCGGATT CGTGGCGCTG GACGAGCGGG CCGAGGTGCC GGTGCACCTG ATCGGACCGG CGGATCTGGT CGAGGTCGCC GCCGGTTGGG GCTGGTCCCC GGGGCTGATC CCGGGGCCGG AGGCCGCGCC CCTGCGGATG GACGAAATGC GCGACCTGAT CCTGCAATCC TTCGCCAGCC AGGCGCCCCC GGAGGGCGAG AGTTGA
|
Protein sequence | MIPSPLRLGL GALVLHLVLV QPNHPAALTW GALAMFPLEL PVILLVLAAL PPGRVTAWLR AGLTALLVLI AVLKTADFAM FSALGRGFNP ISDMALVEAG LRLSTGAIGP VLTGLAVVAA LLAVAGVALA IWWATGVWAG LRLPRRAGLG LGVAAGLAAG VAGAEIGQAM GRWSLPVTPP GAAFTARVGV ERMGMARATL ADLRAFEIAA ATDPLAGRAD LLGAIDRDVL VVFVESYGRA SLDTPLYAET HRATLAAAEA RLGALGLSMR SGLLTAPTRG GQSWLSHATF ANGLWVDNQT SYGAALASGR RTLFHLAAEA GFHTAAVMPQ ITLDWPEADL MGFETVLAAA DLGYAGQPFN WVTMPDQFTF AAMDRLLRDR AETRPYFVQM ALGSSHAPWV PVPELVPWEA IGDGTIFDPM AAAGDPPDVV WRDRDRVREQ YRLALDYALR VVFDYAARHA GDPPLILVLG DHQAAGFVAL DERAEVPVHL IGPADLVEVA AGWGWSPGLI PGPEAAPLRM DEMRDLILQS FASQAPPEGE S
|
| |