Gene Dshi_4167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_4167 
Symbol 
ID5714682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009959 
Strand
Start bp15174 
End bp16799 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content73% 
IMG OID641277062 
Productsulfatase 
Protein accessionYP_001542358 
Protein GI159046690 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.255365 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCCAA GCCCCCTGCG CCTCGGTCTC GGCGCGCTCG TGCTGCATCT GGTGCTGGTG 
CAACCGAACC ATCCGGCGGC CCTGACCTGG GGGGCGCTGG CGATGTTTCC CCTGGAATTG
CCGGTGATCC TCCTGGTGCT CGCCGCCCTG CCCCCCGGGC GGGTCACCGC GTGGCTGCGC
GCCGGGCTGA CGGCGCTGCT GGTGCTGATC GCGGTGCTGA AGACGGCGGA TTTCGCGATG
TTCTCGGCCC TGGGGCGGGG GTTCAACCCG ATCTCGGACA TGGCTTTGGT GGAGGCCGGT
TTGCGGCTCT CCACCGGGGC GATCGGCCCG GTTCTGACCG GGCTGGCGGT GGTCGCGGCA
CTGCTGGCGG TGGCGGGCGT GGCCTTGGCG ATCTGGTGGG CGACCGGGGT CTGGGCCGGG
CTGCGCCTGC CGCGGCGCGC GGGGCTCGGC CTAGGCGTGG CGGCGGGGCT TGCCGCCGGG
GTCGCCGGGG CGGAGATCGG GCAGGCCATG GGCCGCTGGT CCCTGCCGGT CACGCCCCCG
GGGGCGGCGT TTACCGCCCG CGTCGGGGTC GAGCGGATGG GCATGGCCCG CGCCACCCTC
GCCGACTTGC GCGCCTTCGA GATCGCGGCG GCCACGGACC CGCTGGCCGG GCGCGCGGAC
CTGCTGGGCG CCATCGACCG GGACGTTCTG GTCGTCTTTG TCGAAAGCTA CGGGCGCGCC
AGCCTCGACA CCCCGCTTTA TGCCGAGACC CATCGCGCGA CCCTGGCGGC GGCCGAGGCG
CGGCTCGGGG CGCTGGGGCT GTCCATGCGA TCGGGCCTGC TGACCGCGCC CACGCGGGGC
GGGCAGAGCT GGCTGAGCCA CGCGACCTTT GCCAACGGGC TGTGGGTGGA CAACCAGACG
AGCTATGGCG CGGCGCTGGC CAGCGGGCGG CGGACGCTGT TTCACCTCGC CGCCGAGGCC
GGGTTTCACA CCGCCGCGGT GATGCCGCAG ATCACCCTGG ACTGGCCCGA GGCCGACCTG
ATGGGGTTCG AGACCGTGCT GGCGGCGGCG GATCTCGGCT ATGCCGGGCA GCCCTTCAAC
TGGGTGACGA TGCCGGACCA GTTCACCTTC GCCGCGATGG ACCGCCTGCT GCGCGACCGG
GCGGAGACGC GGCCCTATTT CGTGCAGATG GCGCTGGGGT CGTCCCATGC GCCCTGGGTG
CCGGTGCCCG AGCTGGTGCC GTGGGAGGCA ATCGGCGATG GCACGATCTT CGATCCCATG
GCGGCGGCGG GCGATCCGCC GGACGTGGTC TGGCGCGACC GCGACCGGGT GCGGGAGCAG
TACCGCCTCG CCCTCGACTA CGCCCTGCGG GTGGTGTTCG ACTACGCCGC GCGGCACGCG
GGCGACCCGC CGCTGATCCT GGTGCTGGGC GATCACCAGG CGGCCGGATT CGTGGCGCTG
GACGAGCGGG CCGAGGTGCC GGTGCACCTG ATCGGACCGG CGGATCTGGT CGAGGTCGCC
GCCGGTTGGG GCTGGTCCCC GGGGCTGATC CCGGGGCCGG AGGCCGCGCC CCTGCGGATG
GACGAAATGC GCGACCTGAT CCTGCAATCC TTCGCCAGCC AGGCGCCCCC GGAGGGCGAG
AGTTGA
 
Protein sequence
MIPSPLRLGL GALVLHLVLV QPNHPAALTW GALAMFPLEL PVILLVLAAL PPGRVTAWLR 
AGLTALLVLI AVLKTADFAM FSALGRGFNP ISDMALVEAG LRLSTGAIGP VLTGLAVVAA
LLAVAGVALA IWWATGVWAG LRLPRRAGLG LGVAAGLAAG VAGAEIGQAM GRWSLPVTPP
GAAFTARVGV ERMGMARATL ADLRAFEIAA ATDPLAGRAD LLGAIDRDVL VVFVESYGRA
SLDTPLYAET HRATLAAAEA RLGALGLSMR SGLLTAPTRG GQSWLSHATF ANGLWVDNQT
SYGAALASGR RTLFHLAAEA GFHTAAVMPQ ITLDWPEADL MGFETVLAAA DLGYAGQPFN
WVTMPDQFTF AAMDRLLRDR AETRPYFVQM ALGSSHAPWV PVPELVPWEA IGDGTIFDPM
AAAGDPPDVV WRDRDRVREQ YRLALDYALR VVFDYAARHA GDPPLILVLG DHQAAGFVAL
DERAEVPVHL IGPADLVEVA AGWGWSPGLI PGPEAAPLRM DEMRDLILQS FASQAPPEGE
S