Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssed_4001 |
Symbol | |
ID | 5611074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sediminis HAW-EB3 |
Kingdom | Bacteria |
Replicon accession | NC_009831 |
Strand | + |
Start bp | 4898273 |
End bp | 4900048 |
Gene Length | 1776 bp |
Protein Length | 591 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640934955 |
Product | arylsulfate sulfotransferase |
Protein accession | YP_001475733 |
Protein GI | 157377133 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0413696 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTAAAA GAACAATTAT AGCAACAGCA ATAGCCACTA TTTCATTGGG CGCATCTGCC GCAGGATTTA AGCCTGCACC GGCCGCAGGT CAGCTTGGTG CGGTTCTTGT GAACCCATAC GGAAACTCTC CACTTACTGC ACTTATCGAT TTACGCAGTA AGCAACCAAC TGACGTTGTA GTTACAGTTA AAGGCAAGGG TCGCAACGGT GTTGATATCA AATACCCAGT TGGACAGAGG ACCATTAACA CACATGACGG TATCCCAGTA TTCGGTCTGT ATGCTAACCA CAACAATGTT ATAAAGCTTA CATACAAGCT AGAAGGCAAA AAAGTATCAG AGACATATAA AGCCCTTACT GGCGCTATCG TTAACAACTA TATCGATAAC CGCAACGTAA CGGCACTTCC CGAAGTTCAG GTTAAGAAAG TTGCCAAGGG CTTTGGAGAC CGCTTATACC TAGTGAACTC TCACACGTAC AACCAGCAAG GCTCTGATCT TCATTGGTCT GGTCAGAAGA GCAAAGACGC CGGTATCTTT GAAGGCTCTC CAGCAATGGG CGCACTTCCA TTTGAAAACC CACCGATGAC CTATGTTGTC GACACTGAAG GTGAAGTCCG TTGGTGGTTA AACCAAGACG CCACTTATGA CGCGACAAGC CTGGACATTG AGAAGCGCGG TTACTTAATG GGCTTCCAGG ATACCGGAGA AGGTAGTTAC ACGTTCGTGC AGGGCCAGCA TTACGGTACA TTTAACCTGT TAGGTCAGAT TGACTCACAG CGTTTGCCTC GTGGCTATGT CGATGCATCA CACGAGCACA ATGTTATGCC TAATGGTCAT ACGCTAGTTC GTGCAGCAAA GGCTAACTAC GTTAACGATC GTGGAGATAC GGTTCATACA ATACGTGATC ATGTATTAGA ACTTGATAAA GACGGCAACC TCGTTGACGT TTGGAACGTT GCAACAATCC TTGATCCATA CCGTGATGCA CTTCTTGAAG CATTGGATAT GGGTGCTGTT TGTCTGAACG TTGATATGGA CCACTTGGGC CAGACAGCGA AGATGGAAGT AGACGCTCCT TACGGCGATA TTCCAGGTGT CGGCGCTGGT CGTAACTGGG CTCACATCAA CTCTATCGAA TACGATCCAA AGGGTGACGG CATTATCGTT TCACTACGCC ACCAAGGCGT AGCGAAGATT AACCGCAATA AAGAGGTTGT CTGGATTCAG GCGCCACGCG AAGGCTGGAA CAAAGAGCTT GCTAAGAAAG TCCTTACTCC TATCGATTCT AACGGCAATA AGATCAAGTG TACTGAGAAA GGTGTTTGTG AGGGTGACTT CGACTTCACC TACACACAGC ATACTGCTTG GTTGAACAAT AAGAACGGCA ACCTGACAGT ATTCGACAAC GGTGATGGTC GTGGTCACGA GCAGCCAGCG CTAGGCAGCA TGAAGTATAG CCGTTTCGTT GAGTACAAGA TTGACGAAGA AGACATGACC ATCGAGCAGG TCTGGGAATA CGGTAAGGAG CGTGGCTACG ATTGGTATAG CGCCATTACA TCAAACGTAG AGTACATGGA AGATAAAGAC ACCATGTTCG GCTTTAGTGC TGCAATCCAC CTTTACAATC CAGGCGAGCG CACGATCGGT AAGATCAACG AGATTGGTCG CACTGATGGC AAGGTTAAAG TCGAGATTGA CGTCTTATCT GATAAGCCTA ACACGCCTCA TTACCGCGCA AGCCTAGTAA ACCTAACAAG CCAGTTCGGT AAATAA
|
Protein sequence | MLKRTIIATA IATISLGASA AGFKPAPAAG QLGAVLVNPY GNSPLTALID LRSKQPTDVV VTVKGKGRNG VDIKYPVGQR TINTHDGIPV FGLYANHNNV IKLTYKLEGK KVSETYKALT GAIVNNYIDN RNVTALPEVQ VKKVAKGFGD RLYLVNSHTY NQQGSDLHWS GQKSKDAGIF EGSPAMGALP FENPPMTYVV DTEGEVRWWL NQDATYDATS LDIEKRGYLM GFQDTGEGSY TFVQGQHYGT FNLLGQIDSQ RLPRGYVDAS HEHNVMPNGH TLVRAAKANY VNDRGDTVHT IRDHVLELDK DGNLVDVWNV ATILDPYRDA LLEALDMGAV CLNVDMDHLG QTAKMEVDAP YGDIPGVGAG RNWAHINSIE YDPKGDGIIV SLRHQGVAKI NRNKEVVWIQ APREGWNKEL AKKVLTPIDS NGNKIKCTEK GVCEGDFDFT YTQHTAWLNN KNGNLTVFDN GDGRGHEQPA LGSMKYSRFV EYKIDEEDMT IEQVWEYGKE RGYDWYSAIT SNVEYMEDKD TMFGFSAAIH LYNPGERTIG KINEIGRTDG KVKVEIDVLS DKPNTPHYRA SLVNLTSQFG K
|
| |