Gene Shew185_4201 details

Gene Information

Locus tag: Shew185_4201
Symbol:
ID: 5373558
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS185
Kingdom: Bacteria
Replicon accession: NC_009665
Strand:
Start bp: 5004461
End bp: 5006425
Gene Length: 1965 bp
Protein Length: 654 aa
Translation table: 11
GC content: 47%
IMG OID: 640832466
Product: sulfatase
Protein accession: YP_001368380
Protein GI: 153002699
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily
TIGRFAM ID:
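
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; those conventions are assumptions, not something the record states. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 5004461
end_bp = 5006425
protein_length_aa = 654

# Assumption: coordinates are inclusive, so both endpoints count as bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose last codon is a stop codon,
# so the protein is one residue shorter than the codon count.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(gene_length_bp)       # 1965
print(gene_length_bp % 3)   # 0, the length is a whole number of codons
print(expected_protein_aa)  # 654
```

Under these assumptions the computed values agree with the listed gene length (1965 bp) and protein length (654 aa).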


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000207245
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAATCTG GTTTATCATC TCGCGGGCAC AACAATGCCC ATGGGCCTTT CCGCGCCATC 
TTTATTTTCT CGCTATTAGT GCTTTTTATT GCCACCGCGA GCCGTATCGC TTTAGGGCTG
TGGCAAGCCG ATCGCGTGGC CGCTGTGGAT GGTTGGTCGC ATCTCTTAAT CCAAGGTTTA
CGCGTCGATA TCGCTACCCT GTGCTGGTTA TGGGGTATTG CTGCCTTAGG AACCGCATTA
TTTTCGGGTG ATCATTTTAT TGGCCGTCTG TGGCAGCCGA TTTTACGGGT GTGGTTAACT
GTCGGTCTGT GGATCATCCT CTTTTTAGAA GCATCGACCC CTGCGTTTAT TGAAGAATAC
GGTATTCGCC CGAATCGCCT GTATGTGGAA TATCTGATCT ACCCGAAAGA AGTGCTTTCA
ATGCTGTGGG CGGGTCGCAA ACTTGAGCTG ATCTTCTCCG TGCTATTAAC TATCGGTACG
CTTTGGGGTG GCTGGGTGTT AAGCGGTAAG CTCACTAAAA ATCTACGTTT CCCACGCTGG
TACTGGCGTC CTGTATTGGC AGTGCTTGTT ATCGCCATGA CGTTATTGGG TGCGCGTTCA
ACCTTAGGCC ATAGACCGAT TAACCCTGCT ATGGTGGCGT TTGCCGACGA TCCATTAGTG
AACTCTTTAG TCATCAACTC AGCCTATTCA TTAGTGTTTG CCATCAAGCA GATGGGCAGT
GAAGAAGATG CCTCTAAAGT GTATGGCAAG TTAGATAACG CTGAGATTAT TACGACCATA
AGACAGGAAA GTGGTCGTCC TGAAAGTGTA TTTACCTCAA CGGATATCCC ATCGCTAAGC
TTTAACCAAG CCAGTTATAC CGGAAAGCCA AAGAACTTAG TGATCCTACT GCAAGAGAGT
TTAGGCGCAC GTTTTGTGGG GAGTTTAGGT GGTTTACCGC TGACTCCGAA TATCGATGCC
TTATCCAAAG AAGGTTGGTA TTTCGATAAT TTGTACGCCA CTGGTACTCG TTCAGTGCGC
GGGATAGAAG CTGTAACGAC AGGCTTTACC CCGACGCCAG CTCGTGCTGT GGTGAAACTG
GGTAAGAGCC AAGTTGGCTT CTTCAGTATA GCTGAATTAC TTAAAAATCA TGGTTATACC
ACGCAGTTTA TCTATGGTGG CGAGAGCCAT TTCGACAATA TGCGTAGCTT CTTTTTGGGC
AATGGCTTTA GTGACATCAT AGATCAGAAA GATTATAAAT CTCCGGCCTT TGTGGGCTCG
TGGGGCGCCT CTGACGAAGA CTTAATGCGT AAGGCGAATA GTGAGTTTGA GCGTCTACAC
AGTGAAGGTA AGCCTTTCTT TAGTTTAGTG TTTAGCTCGA GCAACCACGA TCCATTTGAA
TTCCCAGATG ATCGTATCGA GCTGTACGAG CAACCTAAGC AAACCCGTAA TAATGCGGCG
AAATATGCCG ACTATGCGAT TGGTGAGTTT TTCAAACTGG CGAAAAATGC GGACTACTGG
AAAGATACGA TTTTTATCGT GGTTGCCGAC CATGACAGCC GAGTAGGTGG TGCGGATCTG
GTGCCAGTGT CACGTTTTCG TATTCCGGGT TTAATCCTTG GGGATAATTT AGCGCCAAAA
CGCGATCATC GTATTGTGAG CCAAATTGAT TTACCGCCAA CACTGTTATC ATTGATTGGT
ATTTCAGACT CTTATCCTAT GCTGGGCCGA GATTTGACCC AGGTCAGCGA TGATTGGCCT
GGACGCGCGT TAATGCAATA CGATAAAAAC TTTGCCCTGA TGGAAGGTAA AGATGTAGTG
ATCCTGCAGC CAGAAAAAGC GGCTCAAGGT TTCGAATATA ACGAAAAAAC TGAGCAGTTA
ACGCCTTATG CGCCAGCTGC AGCAGCGTTA GAGAAGAAAG CCTTAAGTTG GGCATTATGG
GGCAGTTTGG CCTACCAGCA AGAGCTGTAT CGTTTGCCTA AATAA
 
Protein sequence
MQSGLSSRGH NNAHGPFRAI FIFSLLVLFI ATASRIALGL WQADRVAAVD GWSHLLIQGL 
RVDIATLCWL WGIAALGTAL FSGDHFIGRL WQPILRVWLT VGLWIILFLE ASTPAFIEEY
GIRPNRLYVE YLIYPKEVLS MLWAGRKLEL IFSVLLTIGT LWGGWVLSGK LTKNLRFPRW
YWRPVLAVLV IAMTLLGARS TLGHRPINPA MVAFADDPLV NSLVINSAYS LVFAIKQMGS
EEDASKVYGK LDNAEIITTI RQESGRPESV FTSTDIPSLS FNQASYTGKP KNLVILLQES
LGARFVGSLG GLPLTPNIDA LSKEGWYFDN LYATGTRSVR GIEAVTTGFT PTPARAVVKL
GKSQVGFFSI AELLKNHGYT TQFIYGGESH FDNMRSFFLG NGFSDIIDQK DYKSPAFVGS
WGASDEDLMR KANSEFERLH SEGKPFFSLV FSSSNHDPFE FPDDRIELYE QPKQTRNNAA
KYADYAIGEF FKLAKNADYW KDTIFIVVAD HDSRVGGADL VPVSRFRIPG LILGDNLAPK
RDHRIVSQID LPPTLLSLIG ISDSYPMLGR DLTQVSDDWP GRALMQYDKN FALMEGKDVV
ILQPEKAAQG FEYNEKTEQL TPYAPAAAAL EKKALSWALW GSLAYQQELY RLPK
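
The gene and protein sequences are printed in space-separated blocks of ten characters. As a minimal sketch of how the two could be compared, the Python below strips the whitespace, translates the DNA codon by codon, and computes a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and written for this illustration. The codon table is the standard one; as far as the amino-acid assignments go, translation table 11 is understood to match it and to differ only in which start codons are permitted, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard genetic code).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and normalize to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence and first two blocks of the protein.
first_dna_line = "ATGCAATCTG GTTTATCATC TCGCGGGCAC AACAATGCCC ATGGGCCTTT CCGCGCCATC"
first_protein_blocks = "MQSGLSSRGH NNAHGPFRAI"

print(translate(first_dna_line))  # MQSGLSSRGHNNAHGPFRAI
print(translate(first_dna_line) == clean(first_protein_blocks))  # True
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.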