Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_2074 |
Symbol | |
ID | 4252647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 2467905 |
End bp | 2470274 |
Gene Length | 2370 bp |
Protein Length | 789 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 638118698 |
Product | sulfatase |
Protein accession | YP_734204 |
Protein GI | 113970411 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.33405 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGTTT TTAAAACAAC CTTAACTGTC AGTAGCTTAC TGGCGCCGTG TCTGACATCA GCGCAAACTA TTGACAGAAC ACAGCTGCCA ATTGCTGATA TTGAACCTCA AACCTACGAC CAACTCGATG TACGTGATGT TCAGCGTCCA ACTCCAGCGA GCCGGATTAA GGCGCCAGAT GGAGCGCCGA ACGTGTTAGT GATTTTGCTC GACGATGTTG GTTTTAGTCA AAGCTCACTC TTTGGTGGGG CTGTGGATAT GCCAACTTTA GATGCATTGG CGGCACAGGG ATTGATTTAC AATCAGTTTC ATACCACAGG TGTCAGCTCG GCGACACGAA CCGCGTTGTT GACGGGGCGA AACCATCATC AAAACAACAT GGGATCGATT GCAGAAACCT CGACGGCCTT TCCTGGTAAT ACGGGAGCGC GTCCAAATTA TATTGCCGCA TTACCTAAAG TGTTAAAATA CAACGGCTAT AGCACAGCGA TGTTTGGCAA AAACCATGAG ATTCCGCCCT GGGAAACCGG TCCAGCAGCC AACCAGAGTT TATGGCCCAG CCAAATTGGT TTTGAGAAGT TCTATGGTTT TTTTGGCGGT GAAACAGATC AATTCCAACC CGTGCTTATC GATGGCAATA CCCGTATAAA AACACCGAGA AAGGAAAACT ACCATTTCAC AACCGATATG ACAGATCAAA CCATTCAATG GTTAAATCTG CAGCAAAGTT ATAATGCCGA TAAACCTTTC TTCGTTTACT ATGCCCCAGG TGCAGCACAC GCGCCGCATC AAGCGCCTAA AGAATGGATT GATAAGTTCA AAGGAAAATT CTCGATGGGT TGGGACAAAC TCAGACAAGA CACCTTTGAG CGTCAAAAGG CTGCGGGGAT TATTCCTAAA GACACCATTC TCCCCCCCAT GCCTGAGCAA GTGCCGCGAT GGGATTCACT CACCCCAGAT GAAAAACGTG TGTTTGAACG CCAAATGGAA GTTTACGCGG GCTTTTTGAC TCACACCGAT CATGAGATAG GGCGAATCGT AGATACCCTG AAGAAAAATG GTAAGTTCGA CAATACGCTC ATTTTCTACA TAGTTGGTGA TAATGGGGCA AGTGGCGAAG GTAACCGCAA CGGTAGCTTC AATTCGCTCG CATTTTATAA CGGTATCGAA GAGGACACAA AAACAGTACT CGACAACATT GATAAGCTAG GTGGGCCCGA TAGCTTCGGT CACTATGCTG CGGGATGGTC AATTGCCGGA GATACGCCCT TTGTATGGAT GAAGGGCATG GCATCCGATC TCGGCGGTAC TCGTAATGGC ATGGTCGTCA GTTGGCCCAA GGGCATCAAG TCGAAAGGAG AAGAGATCCG TAATCAATGG TCACACGTGA TTGATATTGC ACCTACGATA TTGGAGGTAG CCGACCTTCC CGACCCCAAA ATGGTCGATG GCGTGAAACA ATTACCTATC GCTGGAGTCA GTTTTGCTGA CACCTTCAAC AATGCTCAAG CCAAAACCAA ACATACCACT CAGTATTTTG AATTGGGTGG TAATCGCGCT ATCTATAATG ATGGTTGGTT GGCTCGTGTA ATACATTTTC CATTGTGGGA AGACTCTAAA AAGTTCGCAA CTCTGCAAAC TGATAAATGG GAACTGTTTG ACACACGTAA AGATTGGTCA TTGTCTACCG ATCTTGCAAA AAACAACCCT GATAAGCTGA AAGAGCTGCG CTCGATATTC GACAAAGAAG CTGAAATCAA TCATGTGTAT CCCATTGATG ACCGAACTTT AGAGCGCATG AATGCCGAAG TTGCAGGCAG ACCAGACGCC CTATTTGGAA AGAAGAGTTT AACTCTCTAT GAAGGTGCAA AGGGGATCCC CGAAAACTCA TTCTTAAACA TCAAAAATAA ATCCTTCGAT CTTGTCGCGA AAGTCATGAT TGACGATGTT GACAATACCA ATGGGGTGAT CATTGCTCAA GGCGGTAATT TCGCAGGATG GAGTTTGTAT GTGATGAAGG GAATACCTAC CTTTGAATAC AACTGGTTAA CCTATGAATA CACCAAGCTT TCTGGCGCAA AACTCCAGCC TGGCGAGAAT GAGATCACAA TGAAGTTCCG TTATGACGAA AATGGCGTTG GCGGAAAAGG AAACTCAGTC GGTACCGGGA AAGGAGGTAA TGCCTATCTT TACGTCAATG GCAAATTGGT AGAGAAGAAA CTGATCCCCA ACACCATTAG TCGGCTTTAC TCACTGGATG ATGGTGTGGG TATCGGTGAA GACGAAGGCG GTTCTGTCAG TCGAGATTAT CAAGCCCCAT TTGAGTTCTC TCAGCGTATT GAAAGTGTAA CAACATCAAT CGTGGAATAA
|
Protein sequence | MDVFKTTLTV SSLLAPCLTS AQTIDRTQLP IADIEPQTYD QLDVRDVQRP TPASRIKAPD GAPNVLVILL DDVGFSQSSL FGGAVDMPTL DALAAQGLIY NQFHTTGVSS ATRTALLTGR NHHQNNMGSI AETSTAFPGN TGARPNYIAA LPKVLKYNGY STAMFGKNHE IPPWETGPAA NQSLWPSQIG FEKFYGFFGG ETDQFQPVLI DGNTRIKTPR KENYHFTTDM TDQTIQWLNL QQSYNADKPF FVYYAPGAAH APHQAPKEWI DKFKGKFSMG WDKLRQDTFE RQKAAGIIPK DTILPPMPEQ VPRWDSLTPD EKRVFERQME VYAGFLTHTD HEIGRIVDTL KKNGKFDNTL IFYIVGDNGA SGEGNRNGSF NSLAFYNGIE EDTKTVLDNI DKLGGPDSFG HYAAGWSIAG DTPFVWMKGM ASDLGGTRNG MVVSWPKGIK SKGEEIRNQW SHVIDIAPTI LEVADLPDPK MVDGVKQLPI AGVSFADTFN NAQAKTKHTT QYFELGGNRA IYNDGWLARV IHFPLWEDSK KFATLQTDKW ELFDTRKDWS LSTDLAKNNP DKLKELRSIF DKEAEINHVY PIDDRTLERM NAEVAGRPDA LFGKKSLTLY EGAKGIPENS FLNIKNKSFD LVAKVMIDDV DNTNGVIIAQ GGNFAGWSLY VMKGIPTFEY NWLTYEYTKL SGAKLQPGEN EITMKFRYDE NGVGGKGNSV GTGKGGNAYL YVNGKLVEKK LIPNTISRLY SLDDGVGIGE DEGGSVSRDY QAPFEFSQRI ESVTTSIVE
|
| |