Gene SO_3872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSO_3872 
Symbol 
ID1171515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella oneidensis MR-1 
KingdomBacteria 
Replicon accessionNC_004347 
Strand
Start bp4018056 
End bp4019846 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content38% 
IMG OID637345636 
Productarylsulfate sulfotransferase 
Protein accessionNP_719404 
Protein GI24375361 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA ACATATTAGG CGTTGCAGTT ATATTGGCAC TATCTCAATC TATTATTTCT 
TCAGAATCAA ATGCAGCAGG TTTCAAACCA GCTCCACCTG CAGGACAATT AGGTGCCATT
CTTGTTAATC CATATGGTAA TTCACCATTA ACAGCTATAT TAGAACTCAG CAGTAAAAAA
CCAACTAATG TTACAGTAAC TGTTCATGGC AAGGGTAATA ATGGTGTAGA TATTAGTTAT
CCTGTAGGTC AAAGAACAAT GAATACACAT GATGGTATTC CATTATTCGG TTTATATGCC
GACCATAATA ACCAAGTTAC CGTAAAATAT ACTCTAGATG GTAAAAATTT AACTGAAAAA
TATAACGTAT TAACGGGAGC GATATCAAAT AAATATGTTG ACAACCGAAG CACGACAGTT
ATGCAGGAAG TAAAAGTAAA AACCGTAGCT AAAGGTTTTG AAAACCGTCT ATATATGGTC
AACAGCCATA CATATAACCA ACAGGGATCT GATATTCACT GGTCTGGTCA AAAAGGCAAA
GATGCGGGAA TTTTTGAATC GACACCAAGC ATGGGTTCAT TAACCTTTGA TAACGCTCCA
ATGACCTACG TTGTAGATAC TAAAGGTGAA ATTCGTTGGT GGTTAGAGCA AGATGCCGTT
TACGATGGTT CAGATGTAGA TCTAAATAAG CGTGGATATT TCATGGGCTT GCATGATAAT
GGCAAGGGTG GGATTACCTT CGTACAGGGA CAACGGTACG GCCATTTTGA TCTCTTAGGA
AATGTTGAAT CAAGACGTCT ACCAAGAGGC TACATTGATG CTTCCCATGA GCACAATCTA
ATGCCTAATG GACACTCACT TATCCGCGCA GCAAAATCAA ACTACGAAAA TGATCGTGGT
GATGTCGTTC ACACTGTAAG GGATCATATT CTAGAAATTG ACCAACAAGG TAATTTAGTT
GATGTATGGA ACCTTGCAAC TATTTTGGAT CCTTACAGGG ATGCCCTGCT AAATGCATTA
GATATGGGCG CAGTATGCTT AAACGTCGAT ATGGATAATA TTGGTAAAAC AGCTGAAGTT
AAAATTGATG CACCTTACGG AGATATTCCA GGTGTTGTAG CTGGTAGAAA CTGGGCACAC
ATTAACTCAG TTGAATATGA TCCAAAGGAT GACTCTATAA TTCTATCATT CCGTCATCAA
GGTGTAGCTA AAGTAACCCG CGATAAAAAA GTTAAATGGA TTCTTGCTCC ATCTGAAGGT
TGGAATAAAG AACTATCAAC TAAACTATTA ACACCAGTTG ATTCAAAAGG TAAAAAAATT
TCATGCACAA GCAAAGGGGT TTGCGATGGT GATTTCGATT TCACTTATAC CCAACATACA
GCATGGTTAA ATAATAAAAC GGGTACACTG ACCGTACTTG ACAACGGTGA TGGCCGTGGT
TACGAACAGC CCGCGTTACC TACGATGAAA TATTCTCGCT TCGTTGAATA TAAAATTAAC
GAAGAGAATA TGACTGTTGA ACAAATCTGG GAATACGGTA AAGAACGTGG TTACGAATGG
TATAGCCCAA TTACTTCTAA TGTTGAATAT ATGGAAGATA AGGATACCAT GTTTGGATTT
GGTGGTTCAG TAGACCTTTA TAATCCGGGA AAACCAACTA TTGGTCGAAT TAATGAAATA
GGTTATGACG ATAAAAAGGT CAAAGTTGAA ATTGACGTTT TATCAGACAA GCCTAACTCT
CCACATTACC GTGCAATTAT TCTAAATCCA TCAAGTCAAT TTGGTAATTA A
 
Protein sequence
MKKNILGVAV ILALSQSIIS SESNAAGFKP APPAGQLGAI LVNPYGNSPL TAILELSSKK 
PTNVTVTVHG KGNNGVDISY PVGQRTMNTH DGIPLFGLYA DHNNQVTVKY TLDGKNLTEK
YNVLTGAISN KYVDNRSTTV MQEVKVKTVA KGFENRLYMV NSHTYNQQGS DIHWSGQKGK
DAGIFESTPS MGSLTFDNAP MTYVVDTKGE IRWWLEQDAV YDGSDVDLNK RGYFMGLHDN
GKGGITFVQG QRYGHFDLLG NVESRRLPRG YIDASHEHNL MPNGHSLIRA AKSNYENDRG
DVVHTVRDHI LEIDQQGNLV DVWNLATILD PYRDALLNAL DMGAVCLNVD MDNIGKTAEV
KIDAPYGDIP GVVAGRNWAH INSVEYDPKD DSIILSFRHQ GVAKVTRDKK VKWILAPSEG
WNKELSTKLL TPVDSKGKKI SCTSKGVCDG DFDFTYTQHT AWLNNKTGTL TVLDNGDGRG
YEQPALPTMK YSRFVEYKIN EENMTVEQIW EYGKERGYEW YSPITSNVEY MEDKDTMFGF
GGSVDLYNPG KPTIGRINEI GYDDKKVKVE IDVLSDKPNS PHYRAIILNP SSQFGN