Gene Shewmr4_3808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_3808 
Symbol 
ID4254371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp4545246 
End bp4547210 
Gene Length1965 bp 
Protein Length654 aa 
Translation table11 
GC content51% 
IMG OID638120453 
Productsulfatase 
Protein accessionYP_735928 
Protein GI113972135 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.062671 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTCAG GTTCATCCGC TCGTGGACGC CAGTCCGCCC ATGGGCCATT TCGCGTCATT 
CTTGTTTTCA GTCTGTTAGT ACTCGCTATT GCTACGGCAA GCCGTATCGG CTTAGGTCTG
TGGCAGGGCG AACGTGTGGC CGCTGTTGAC GGTTGGTCGC ATCTGCTACT ACAAGGTATT
CGTGTCGATA TCGCAACCCT GTGTTGGTTA TGGGGCGTCG CCGCTTTAGG TACGGCGTTG
TTTTCGGGCG ATCATCTGCT TGGTCGAGTT TGGCAATGGG TGCTGCGCCT ATGGCTAACC
TTAGGTCTGT GGGCCATTGT GTTTCTTGAG GTATCAACGC CAGCCTTTAT TGAGGAATAC
GGCATTCGCC CGAACCGCTT GTATGTAGAG TATTTGATTT ATCCTAAAGA AGTGCTTTCT
ATGCTGTGGG CGGGACGTAA GCTTGAGCTG ATTTTTTCGG TGCTGCTTAG CATAGTCACC
CTGTGGGGCG GTTGGAAACT CAGCGGTAAG TTGTCTAAAA ATTTACGTTT TCCACGTTGG
TATTGGCGTC CTGTGTTAGC TGTCATGGTG ATCGTCGTGA CCCTATTGGG CGCGCGCTCA
ACCTTAGGCC ATAGACCTAT CAACCCAGCA ATGGTGGCAT TTGCCGACGA TCCCTTGGTT
AACTCCCTAG TGATTAACTC TGCTTATTCG TTGGTGTTTG CGATTAAGCA AATGGGTAGC
GAAGAAGATG CCTCTAAAGT GTATGGCTCG TTAGACAAAG ATGAGATCAT CAAGACCATC
AGACAAGAGA GTGGTCGCCC AGAGACAGCC TTTACCTCGA ACGAAGTTCC CTCATTGAGC
TTTAACCAAG CCAGTTACAG CGGTAAACCT AAAAACCTAG TGATCCTGCT GCAGGAAAGC
TTAGGCGCCC GTTTTGTCGG GAGTTTAGGC GGCATGCCCC TAACCCCGAA TATTGATGCG
CTCTCACAGG AAGGCTGGTA TTTCGATAAT CTTTACGCCA CAGGCACACG TTCAGTGCGC
GGGATTGAGG CGGTCACCAC AGGGTTTACC CCAACGCCGG CCCGCGCCGT GGTGAAACTC
GGTAAGAGCC AGACGGGCTT TTTCACCCTC GCCGAATTGC TGAAAAACCA CGGCTATACC
ACCCAATTTA TCTATGGTGG TGAGAGCCAC TTCGACAATA TGCGCAGTTT CTTCCTCGGC
AATGGATTTA GCGACATTAT CGATCAGAAG GACTATCAGT CGCCTGCTTT TGTAGGCTCT
TGGGGCGCTT CCGATGAAGA TTTAATGCGT AAGGCGAATA GTGAGTTTGA GCGTCTTCAC
AGTGAAGGTA AGCCATTCTT TAGCTTGGTG TTTAGCTCGA GTAACCACGA CCCATTCGAA
TTCCCCGACG GTCGTATTGA GCTGTATGAA CAGCCAAAGC AAACCCGTAA TAACGCGGCA
AAATATGCCG ATTATGCGAT TGGTGAGTTC TTTAAGTTGG CGAAAAATGC AGACTACTGG
AAGGATACTA TCTTCATCGT GGTAGCCGAT CACGACAGTC GTGTCGGTGG CGCCGATCTT
GTGCCTGTAC CGCGTTTTCG TATTCCGGGG TTGATCCTTG GGGATAATGT TGCGCCAAAA
CGCGACCACC GTATCGTGAG CCAAATCGAC TTGCCACCAA CCCTGTTATC TTTGATTGGT
ATTTCAGACT CTTACCCTAT GTTAGGTCGT GACTTAACTC AGGTCAGCGA GGATTGGCCG
GGTCGTGCGC TAATGCAATA CGATAAAAAC TTTGCCCTGA TGGAAGGTAA AGATGTGGTT
ATCCTGCAGC CAGAGAAAGC GGCGCAGGGC TTCCAGTACG ATGAAAAGAC CGAGCACTTA
ACGCCTTATG CCCCTGCGGC GCAGGCGTTG GAGAAAAAGG CCTTAGGTTG GGCACTGTGG
GGCAGCCTAG CCTACCAGCA AGAGCTGTAT CGCTCGGGTA AATAA
 
Protein sequence
MQSGSSARGR QSAHGPFRVI LVFSLLVLAI ATASRIGLGL WQGERVAAVD GWSHLLLQGI 
RVDIATLCWL WGVAALGTAL FSGDHLLGRV WQWVLRLWLT LGLWAIVFLE VSTPAFIEEY
GIRPNRLYVE YLIYPKEVLS MLWAGRKLEL IFSVLLSIVT LWGGWKLSGK LSKNLRFPRW
YWRPVLAVMV IVVTLLGARS TLGHRPINPA MVAFADDPLV NSLVINSAYS LVFAIKQMGS
EEDASKVYGS LDKDEIIKTI RQESGRPETA FTSNEVPSLS FNQASYSGKP KNLVILLQES
LGARFVGSLG GMPLTPNIDA LSQEGWYFDN LYATGTRSVR GIEAVTTGFT PTPARAVVKL
GKSQTGFFTL AELLKNHGYT TQFIYGGESH FDNMRSFFLG NGFSDIIDQK DYQSPAFVGS
WGASDEDLMR KANSEFERLH SEGKPFFSLV FSSSNHDPFE FPDGRIELYE QPKQTRNNAA
KYADYAIGEF FKLAKNADYW KDTIFIVVAD HDSRVGGADL VPVPRFRIPG LILGDNVAPK
RDHRIVSQID LPPTLLSLIG ISDSYPMLGR DLTQVSEDWP GRALMQYDKN FALMEGKDVV
ILQPEKAAQG FQYDEKTEHL TPYAPAAQAL EKKALGWALW GSLAYQQELY RSGK