Gene Ssed_4001 details

Gene Information

Locus tag: Ssed_4001
Symbol:
ID: 5611074
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sediminis HAW-EB3
Kingdom: Bacteria
Replicon accession: NC_009831
Strand:
Start bp: 4898273
End bp: 4900048
Gene Length: 1776 bp
Protein Length: 591 aa
Translation table: 11
GC content: 47%
IMG OID: 640934955
Product: arylsulfate sulfotransferase
Protein accession: YP_001475733
Protein GI: 157377133
COG category:
COG ID:
TIGRFAM ID:
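
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. Both are assumptions rather than something the record states, although they agree with the values listed here. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4898273, 4900048)
print(length, protein_length(length))  # 1776 591
```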


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0413696
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTAAAA GAACAATTAT AGCAACAGCA ATAGCCACTA TTTCATTGGG CGCATCTGCC 
GCAGGATTTA AGCCTGCACC GGCCGCAGGT CAGCTTGGTG CGGTTCTTGT GAACCCATAC
GGAAACTCTC CACTTACTGC ACTTATCGAT TTACGCAGTA AGCAACCAAC TGACGTTGTA
GTTACAGTTA AAGGCAAGGG TCGCAACGGT GTTGATATCA AATACCCAGT TGGACAGAGG
ACCATTAACA CACATGACGG TATCCCAGTA TTCGGTCTGT ATGCTAACCA CAACAATGTT
ATAAAGCTTA CATACAAGCT AGAAGGCAAA AAAGTATCAG AGACATATAA AGCCCTTACT
GGCGCTATCG TTAACAACTA TATCGATAAC CGCAACGTAA CGGCACTTCC CGAAGTTCAG
GTTAAGAAAG TTGCCAAGGG CTTTGGAGAC CGCTTATACC TAGTGAACTC TCACACGTAC
AACCAGCAAG GCTCTGATCT TCATTGGTCT GGTCAGAAGA GCAAAGACGC CGGTATCTTT
GAAGGCTCTC CAGCAATGGG CGCACTTCCA TTTGAAAACC CACCGATGAC CTATGTTGTC
GACACTGAAG GTGAAGTCCG TTGGTGGTTA AACCAAGACG CCACTTATGA CGCGACAAGC
CTGGACATTG AGAAGCGCGG TTACTTAATG GGCTTCCAGG ATACCGGAGA AGGTAGTTAC
ACGTTCGTGC AGGGCCAGCA TTACGGTACA TTTAACCTGT TAGGTCAGAT TGACTCACAG
CGTTTGCCTC GTGGCTATGT CGATGCATCA CACGAGCACA ATGTTATGCC TAATGGTCAT
ACGCTAGTTC GTGCAGCAAA GGCTAACTAC GTTAACGATC GTGGAGATAC GGTTCATACA
ATACGTGATC ATGTATTAGA ACTTGATAAA GACGGCAACC TCGTTGACGT TTGGAACGTT
GCAACAATCC TTGATCCATA CCGTGATGCA CTTCTTGAAG CATTGGATAT GGGTGCTGTT
TGTCTGAACG TTGATATGGA CCACTTGGGC CAGACAGCGA AGATGGAAGT AGACGCTCCT
TACGGCGATA TTCCAGGTGT CGGCGCTGGT CGTAACTGGG CTCACATCAA CTCTATCGAA
TACGATCCAA AGGGTGACGG CATTATCGTT TCACTACGCC ACCAAGGCGT AGCGAAGATT
AACCGCAATA AAGAGGTTGT CTGGATTCAG GCGCCACGCG AAGGCTGGAA CAAAGAGCTT
GCTAAGAAAG TCCTTACTCC TATCGATTCT AACGGCAATA AGATCAAGTG TACTGAGAAA
GGTGTTTGTG AGGGTGACTT CGACTTCACC TACACACAGC ATACTGCTTG GTTGAACAAT
AAGAACGGCA ACCTGACAGT ATTCGACAAC GGTGATGGTC GTGGTCACGA GCAGCCAGCG
CTAGGCAGCA TGAAGTATAG CCGTTTCGTT GAGTACAAGA TTGACGAAGA AGACATGACC
ATCGAGCAGG TCTGGGAATA CGGTAAGGAG CGTGGCTACG ATTGGTATAG CGCCATTACA
TCAAACGTAG AGTACATGGA AGATAAAGAC ACCATGTTCG GCTTTAGTGC TGCAATCCAC
CTTTACAATC CAGGCGAGCG CACGATCGGT AAGATCAACG AGATTGGTCG CACTGATGGC
AAGGTTAAAG TCGAGATTGA CGTCTTATCT GATAAGCCTA ACACGCCTCA TTACCGCGCA
AGCCTAGTAA ACCTAACAAG CCAGTTCGGT AAATAA
 
Protein sequence
MLKRTIIATA IATISLGASA AGFKPAPAAG QLGAVLVNPY GNSPLTALID LRSKQPTDVV 
VTVKGKGRNG VDIKYPVGQR TINTHDGIPV FGLYANHNNV IKLTYKLEGK KVSETYKALT
GAIVNNYIDN RNVTALPEVQ VKKVAKGFGD RLYLVNSHTY NQQGSDLHWS GQKSKDAGIF
EGSPAMGALP FENPPMTYVV DTEGEVRWWL NQDATYDATS LDIEKRGYLM GFQDTGEGSY
TFVQGQHYGT FNLLGQIDSQ RLPRGYVDAS HEHNVMPNGH TLVRAAKANY VNDRGDTVHT
IRDHVLELDK DGNLVDVWNV ATILDPYRDA LLEALDMGAV CLNVDMDHLG QTAKMEVDAP
YGDIPGVGAG RNWAHINSIE YDPKGDGIIV SLRHQGVAKI NRNKEVVWIQ APREGWNKEL
AKKVLTPIDS NGNKIKCTEK GVCEGDFDFT YTQHTAWLNN KNGNLTVFDN GDGRGHEQPA
LGSMKYSRFV EYKIDEEDMT IEQVWEYGKE RGYDWYSAIT SNVEYMEDKD TMFGFSAAIH
LYNPGERTIG KINEIGRTDG KVKVEIDVLS DKPNTPHYRA SLVNLTSQFG K
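
The sequences above are printed in space-separated blocks of 10 characters, 60 per line. To recompute fields such as the gene length or GC content, the blocks first have to be joined. The sketch below shows one way to do that using only the standard library. The helper names `join_blocks` and `gc_percent` are hypothetical, and the example input is just the first line of the gene sequence, not the full gene.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCTTAAAA GAACAATTAT AGCAACAGCA ATAGCCACTA TTTCATTGGG CGCATCTGCC"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence text through `join_blocks` and then `gc_percent` gives a value that can be compared with the GC content field of the record.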