Gene Information |
Locus tag | Shewmr4_3808 |
Symbol | |
ID | 4254371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 4545246 |
End bp | 4547210 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 638120453 |
Product | sulfatase |
Protein accession | YP_735928 |
Protein GI | 113972135 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
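The coordinate, gene-length, and protein-length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the annotated CDS includes its stop codon (the listed gene sequence ends in TAA). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(4545246, 4547210)
print(length)                     # 1965
print(protein_length_aa(length))  # 654
```

Both results agree with the Gene Length (1965 bp) and Protein Length (654 aa) fields in this record.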
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.062671 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCAGTCAG GTTCATCCGC TCGTGGACGC CAGTCCGCCC ATGGGCCATT TCGCGTCATT CTTGTTTTCA GTCTGTTAGT ACTCGCTATT GCTACGGCAA GCCGTATCGG CTTAGGTCTG TGGCAGGGCG AACGTGTGGC CGCTGTTGAC GGTTGGTCGC ATCTGCTACT ACAAGGTATT CGTGTCGATA TCGCAACCCT GTGTTGGTTA TGGGGCGTCG CCGCTTTAGG TACGGCGTTG TTTTCGGGCG ATCATCTGCT TGGTCGAGTT TGGCAATGGG TGCTGCGCCT ATGGCTAACC TTAGGTCTGT GGGCCATTGT GTTTCTTGAG GTATCAACGC CAGCCTTTAT TGAGGAATAC GGCATTCGCC CGAACCGCTT GTATGTAGAG TATTTGATTT ATCCTAAAGA AGTGCTTTCT ATGCTGTGGG CGGGACGTAA GCTTGAGCTG ATTTTTTCGG TGCTGCTTAG CATAGTCACC CTGTGGGGCG GTTGGAAACT CAGCGGTAAG TTGTCTAAAA ATTTACGTTT TCCACGTTGG TATTGGCGTC CTGTGTTAGC TGTCATGGTG ATCGTCGTGA CCCTATTGGG CGCGCGCTCA ACCTTAGGCC ATAGACCTAT CAACCCAGCA ATGGTGGCAT TTGCCGACGA TCCCTTGGTT AACTCCCTAG TGATTAACTC TGCTTATTCG TTGGTGTTTG CGATTAAGCA AATGGGTAGC GAAGAAGATG CCTCTAAAGT GTATGGCTCG TTAGACAAAG ATGAGATCAT CAAGACCATC AGACAAGAGA GTGGTCGCCC AGAGACAGCC TTTACCTCGA ACGAAGTTCC CTCATTGAGC TTTAACCAAG CCAGTTACAG CGGTAAACCT AAAAACCTAG TGATCCTGCT GCAGGAAAGC TTAGGCGCCC GTTTTGTCGG GAGTTTAGGC GGCATGCCCC TAACCCCGAA TATTGATGCG CTCTCACAGG AAGGCTGGTA TTTCGATAAT CTTTACGCCA CAGGCACACG TTCAGTGCGC GGGATTGAGG CGGTCACCAC AGGGTTTACC CCAACGCCGG CCCGCGCCGT GGTGAAACTC GGTAAGAGCC AGACGGGCTT TTTCACCCTC GCCGAATTGC TGAAAAACCA CGGCTATACC ACCCAATTTA TCTATGGTGG TGAGAGCCAC TTCGACAATA TGCGCAGTTT CTTCCTCGGC AATGGATTTA GCGACATTAT CGATCAGAAG GACTATCAGT CGCCTGCTTT TGTAGGCTCT TGGGGCGCTT CCGATGAAGA TTTAATGCGT AAGGCGAATA GTGAGTTTGA GCGTCTTCAC AGTGAAGGTA AGCCATTCTT TAGCTTGGTG TTTAGCTCGA GTAACCACGA CCCATTCGAA TTCCCCGACG GTCGTATTGA GCTGTATGAA CAGCCAAAGC AAACCCGTAA TAACGCGGCA AAATATGCCG ATTATGCGAT TGGTGAGTTC TTTAAGTTGG CGAAAAATGC AGACTACTGG AAGGATACTA TCTTCATCGT GGTAGCCGAT CACGACAGTC GTGTCGGTGG CGCCGATCTT GTGCCTGTAC CGCGTTTTCG TATTCCGGGG TTGATCCTTG GGGATAATGT TGCGCCAAAA CGCGACCACC GTATCGTGAG CCAAATCGAC TTGCCACCAA CCCTGTTATC TTTGATTGGT ATTTCAGACT CTTACCCTAT GTTAGGTCGT GACTTAACTC AGGTCAGCGA GGATTGGCCG GGTCGTGCGC TAATGCAATA CGATAAAAAC TTTGCCCTGA TGGAAGGTAA AGATGTGGTT 
ATCCTGCAGC CAGAGAAAGC GGCGCAGGGC TTCCAGTACG ATGAAAAGAC CGAGCACTTA ACGCCTTATG CCCCTGCGGC GCAGGCGTTG GAGAAAAAGG CCTTAGGTTG GGCACTGTGG GGCAGCCTAG CCTACCAGCA AGAGCTGTAT CGCTCGGGTA AATAA
Protein sequence | MQSGSSARGR QSAHGPFRVI LVFSLLVLAI ATASRIGLGL WQGERVAAVD GWSHLLLQGI RVDIATLCWL WGVAALGTAL FSGDHLLGRV WQWVLRLWLT LGLWAIVFLE VSTPAFIEEY GIRPNRLYVE YLIYPKEVLS MLWAGRKLEL IFSVLLSIVT LWGGWKLSGK LSKNLRFPRW YWRPVLAVMV IVVTLLGARS TLGHRPINPA MVAFADDPLV NSLVINSAYS LVFAIKQMGS EEDASKVYGS LDKDEIIKTI RQESGRPETA FTSNEVPSLS FNQASYSGKP KNLVILLQES LGARFVGSLG GMPLTPNIDA LSQEGWYFDN LYATGTRSVR GIEAVTTGFT PTPARAVVKL GKSQTGFFTL AELLKNHGYT TQFIYGGESH FDNMRSFFLG NGFSDIIDQK DYQSPAFVGS WGASDEDLMR KANSEFERLH SEGKPFFSLV FSSSNHDPFE FPDGRIELYE QPKQTRNNAA KYADYAIGEF FKLAKNADYW KDTIFIVVAD HDSRVGGADL VPVPRFRIPG LILGDNVAPK RDHRIVSQID LPPTLLSLIG ISDSYPMLGR DLTQVSEDWP GRALMQYDKN FALMEGKDVV ILQPEKAAQG FQYDEKTEHL TPYAPAAQAL EKKALGWALW GSLAYQQELY RSGK
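As a rough illustration of how the GC content and protein sequence relate to the gene sequence, the sketch below computes a GC fraction and translates codons using only the Python standard library. It assumes the sequence is written 5' to 3' on the coding strand (the listing starts with ATG even though the gene lies on the minus strand). It also relies on NCBI translation table 11 assigning the same amino acids to codons as the standard code, differing only in the permitted start codons. The helper names are hypothetical, and only the first 30 bases of the gene are used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI tables 1 and 11 share these).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group bases in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


fragment = "ATGCAGTCAG GTTCATCCGC TCGTGGACGC"  # first 30 bases of the gene sequence
print(translate(fragment))              # MQSGSSARGR
print(round(gc_fraction(fragment), 2))  # 0.6
```

The translated fragment matches the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give the value to compare against the reported GC content. Ambiguous bases such as N are not handled and would raise a `KeyError` in `translate`.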