Gene Shewmr7_1901 details

Gene Information

Locus tag: Shewmr7_1901
Symbol:
ID: 4258781
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-7
Kingdom: Bacteria
Replicon accession: NC_008322
Strand:
Start bp: 2244706
End bp: 2247075
Gene Length: 2370 bp
Protein Length: 789 aa
Translation table: 11
GC content: 46%
IMG OID: 638122556
Product: sulfatase
Protein accession: YP_737947
Protein GI: 114047397
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
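
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2244706
END_BP = 2247075
GENE_LENGTH_BP = 2370
PROTEIN_LENGTH_AA = 789


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold for this record: the span is 2370 bp, which corresponds to 790 codons, or 789 residues plus a stop codon.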


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGTTT TTAAAACAAC CCTAACTGTC AGTAGCTTAC TGGCGCCGTG TCTGACATCA 
GCGCAAACTA TTGACAGAAC ACAGCTGCCA ATTGCTGATA TTGAACCTCA AACCTACGAC
CAACTCGATG TACGTGATGT TCAGCGTCCA ACTCCAGCGA GCCGGGTTAA GGCGCCAGAT
GGAGCGCCGA ACGTGTTAGT GATTTTGCTC GACGATGTTG GTTTTAGTCA AAGCTCACTC
TTTGGTGGGG CTGTGGATAT GCCAACTTTA GATGCATTGG CGGCACAGGG ATTGATTTAC
AATCAGTTTC ATACCACAGG TGTCAGCTCG GCGACACGAA CCGCGTTGTT GACAGGGCGA
AACCATCATC AAAACAACAT GGGATCGATT GCAGAAACCT CGACGGCCTT TCCTGGTAAT
ACGGGAGCGC GTCCAAATTA TATTGCCGCA TTACCTAAAG TGTTAAAATA CAACGGCTAT
AGCACTGCGA TGTTTGGCAA AAACCATGAG ATTCCGCCCT GGGAAACCGG TCCAGCAGCC
AACCAGAGTT TATGGCCCAG CCAAATTGGT TTTGAGAAGT TTTATGGTTT TTTTGGCGGT
GAAACAGATC AATTCCAACC CGTGCTTATC GATGGCAATA CTCGTATAAA AACACCGAGA
AAGGAAAACT ACCATTTCAC AACCGATATG ACAGATCAAA CCATTCAATG GTTAAATCTG
CAGCAAAGTT ATAATGCCGA TAAACCTTTC TTCGTTTACT TTGCCCCAGG TGCAGCACAC
GCGCCGCATC AAGCGCCTAA AGAATGGATT GATAAGTTCA AAGGAAAATT CTCGATGGGT
TGGGACAAAC TCAGACAAGA CACCTTTGAG CGTCAAAAGG CTGCGGGGAT TATTCCTAAA
GACACCATTC TCCCCCCCAT GCCTGAGCAA GTGCCGCGAT GGGATTCACT CACCCCAGAT
GAAAAACGTG TGTTTGAACG CCAAATGGAA GTTTACGCGG GCTTTTTGAC ACACACCGAT
CATGAGATAG GGCGAATCGT AGATACCCTG AAGAAAAATG GTAAATTCGA CAATACGCTC
ATTTTCTACA TAGTTGGTGA TAATGGGGCA AGTGGCGAAG GTAACCGCAA CGGTAGCTTC
AATTCGCTCG CATTTTATAA CGGTATCGAA GAGGACACAA AAACAGTACT CGACAACATT
GATAAGCTAG GTGGGCCCGA TAGCTTCGGT CACTATGCTG CGGGATGGTC AATTGCAGGA
GATACGCCCT TTGTATGGAT GAAGGGCATG GCATCCGATC TCGGCGGTAC TCGTAATGGC
ATGGTCGTCA GTTGGCCCAA GGGCATCAAG TCGAAAGGAG AAGAGATCCG TAATCAATGG
TCACACGTGA TTGATATTGC GCCTACGATA TTGGAGGTGG CCGACCTTCC CGACCCCAAA
ATGGTCGATG GCGTGAAACA ATTACCTATC GCTGGAGTCA GTTTTGCTGA CACCTTCAAC
AATGCTCAAG CCAAAACCAA ACATACCACT CAGTATTTTG AATTGGGTGG TAATCGCGCT
ATCTATAATG ATGGTTGGTT GGCTCGTGTA ATACATTTTC CATTGTGGGA AGACTCTAAA
AAGTTCGCAA CTCTGCAAAC TGATAAATGG GAACTGTTTG ACACACGTAA AGATTGGTCA
TTGTCTACCG ATCTTGCAAA AAACAACCCT GATAAGCTGA AAGAGCTGCG CTCGATATTC
GACAAAGAAG CTGAAATCAA TCATGTGTAT CCCATTGATG ACCGAACTTT AGAGCGCATG
AATGCCGAAG TTGCAGGCAG ACCAGACGCC CTATTTGGAA AGAAGAGTTT AACTCTCTAT
GAAGGTGCAA AGGGGATCCC CGAAAACTCA TTCTTAAACA TCAAAAATAA ATCCTTCGAT
CTTGTCGCGA AAGTCATGAT TGACAATGTT GACAATACCA ATGGGGTGAT CATTGCTCAA
GGCGGTAATT TCGCAGGATG GAGTTTGTAT GTGATGAAGG GAATACCTAC CTTTGAATAC
AACTGGTTAA CCTATGAATA CACCAAGCTT TCTGGCGCAA AACTCCAGCC TGGCGAGAAT
GAGATCACAA TGAAGTTCCG TTATGACGAA AATGGCGTTG GCGGAAAAGG AAACTCAGTC
GGTACCGGGA AAGGAGGTAA TGCCTATCTT TACGTCAATG GCAAATTGGT AGAGAAGAAA
CTGATCCCCA ATACCATTAG TCGGCTTTAC TCACTGGATG ATGGTGTGGG TATCGGTGAA
GACGAAGGCG GTTCTGTCAG TCGAGATTAT CAAGCCCCAT TTGAGTTCTC TCAGCGTATT
GAAAGTGTAA CAACGTCAAT CGTGGAATAA
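
The GC content field can be recomputed from a sequence printed in the 10-base blocks used above. This is a small sketch, assuming GC content is defined as the percentage of G and C bases over the whole gene sequence; `gc_percent` is a hypothetical helper name, and the database may round or compute the value differently.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First block of the gene sequence above: 3 of its 10 bases are G or C.
print(gc_percent("ATGGATGTTT"))
```

Passing the full gene sequence, with its spaces and line breaks left in, gives the figure to compare with the GC content field.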
 
Protein sequence
MDVFKTTLTV SSLLAPCLTS AQTIDRTQLP IADIEPQTYD QLDVRDVQRP TPASRVKAPD 
GAPNVLVILL DDVGFSQSSL FGGAVDMPTL DALAAQGLIY NQFHTTGVSS ATRTALLTGR
NHHQNNMGSI AETSTAFPGN TGARPNYIAA LPKVLKYNGY STAMFGKNHE IPPWETGPAA
NQSLWPSQIG FEKFYGFFGG ETDQFQPVLI DGNTRIKTPR KENYHFTTDM TDQTIQWLNL
QQSYNADKPF FVYFAPGAAH APHQAPKEWI DKFKGKFSMG WDKLRQDTFE RQKAAGIIPK
DTILPPMPEQ VPRWDSLTPD EKRVFERQME VYAGFLTHTD HEIGRIVDTL KKNGKFDNTL
IFYIVGDNGA SGEGNRNGSF NSLAFYNGIE EDTKTVLDNI DKLGGPDSFG HYAAGWSIAG
DTPFVWMKGM ASDLGGTRNG MVVSWPKGIK SKGEEIRNQW SHVIDIAPTI LEVADLPDPK
MVDGVKQLPI AGVSFADTFN NAQAKTKHTT QYFELGGNRA IYNDGWLARV IHFPLWEDSK
KFATLQTDKW ELFDTRKDWS LSTDLAKNNP DKLKELRSIF DKEAEINHVY PIDDRTLERM
NAEVAGRPDA LFGKKSLTLY EGAKGIPENS FLNIKNKSFD LVAKVMIDNV DNTNGVIIAQ
GGNFAGWSLY VMKGIPTFEY NWLTYEYTKL SGAKLQPGEN EITMKFRYDE NGVGGKGNSV
GTGKGGNAYL YVNGKLVEKK LIPNTISRLY SLDDGVGIGE DEGGSVSRDY QAPFEFSQRI
ESVTTSIVE
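
The protein sequence follows from the gene sequence by codon-wise translation. Below is a minimal sketch, assuming the amino acid assignments of translation table 11 match those of the standard genetic code (the two tables differ in permitted start codons, which this sketch does not handle); `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order (standard code assignments,
# which translation table 11 shares); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 0, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGATGTTT TTAAAACAAC CCTAACTGTC"))  # MDVFKTTLTV
# Last 30 bases, ending in the TAA stop codon.
print(translate("GAAAGTGTAA CAACGTCAAT CGTGGAATAA"))  # ESVTTSIVE
```

The two outputs match the first ten and last nine residues of the protein sequence listed above.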