Gene Swoo_3659 details

Gene Information

Locus tag: Swoo_3659
Symbol:
ID: 6117993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella woodyi ATCC 51908
Kingdom: Bacteria
Replicon accession: NC_010506
Strand:
Start bp: 4460119
End bp: 4461573
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 46%
IMG OID: 641635210
Product: sulfatase
Protein accession: YP_001762016
Protein GI: 170727990
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
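
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4460119
END_BP = 4461573


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the stop codon encodes no residue


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1455 484
```

Both results agree with the Gene Length and Protein Length fields reported in the record.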


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000330438
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.826759
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAAAACCA ACAAAACGCT TTTAACAGCA CTTTTACTTG GGCTAGTGAT AAGCACCAAG 
TTAAACGCAT CATTTTCTCC CTTAAAAAAG GAGGAGAGCA AGCTTAAGCA GGCTAATGTT
GTCATCATCT ATGTCGATGA TCTCGGCATT ATGGATACAG GCATCTACGG TTCGGCCCAA
TATCCAACCC CCAATATCGA TAAGCTAGCC AATTCAGGGG TTCGATTTAC TCAAGCATAT
GCCAATGCCG CCAATTGTGC TCCCAGTAGA GCCAGCCTAA TGACAGGTCT AACACCAGCA
GAACACGGTA TCTTAACCGT AGGTAGTTCT GAGCGCGGAG AGAGTCAATA TCGTAAATTA
ATTCCTGTCA CCAATAACAC CGAACTCAAT CCTGATCTCA CCACTATTGC CGACCTATTT
AAACAGCAAG GATACGCCAC TGCCGTCATC GGGAAATGGC ACCTTGGTAA GACTGCGCCC
ACAGAGTACG GTTTTGATAC TGCCATTGCA GCCTCCCATT TAGGTCATCC CCCCAGTTAC
TTCTATCCCT ACAGTAAAGG AAAACGCAAA CTCATAGGGC TTGAAGAGGG AGGGCTTAAA
GATGAGTACC TCAGTAACCG AATAACTCGA GAGGCTGTAA ACTATATCTC ATCACAACGG
CAACCTTTCT TTCTTTATCT CCCCTTCTAC GCTGTCCACA CTCCGATAGA AGCCCCAAAA
GAGTGGGTCA ACCAACACAA TGCCAGACAG CAAGCTGGAG AGATCAAGAG CGCAGCTTAT
GCGGCGATGA TCGCCAACCT TGATAGAGAT GTGGGTAAGC TTCTACAAGC CTTAGATAAA
AGCGGACAGC GTGAAAATAC CTTAGTGGTG TTTGCATCCG ATAACGGCGC CTATGATCCC
GCCACCTCAT CTCTGCCTTA TCGTGGCTAC AAAAGTAGCT TATTTGAAGG GGGTATTAAG
ATCCCACTTG TTCTCTCTTG GCCCAAACAG ATACCACCGA ATAGTCAAAA CAGAACTCCA
GTGCAGATGA GCGATCTCTT TTTAGGTATA AAACACCTCC TGCAGCCTAA ACTAGCGCTC
CACCGCCAAG ATATCATTTC ACTAGCCGAG CAAGGCAAGG AGCAGCCAGA GCGCCCCCTT
TACTGGCATG CTCCTATCTA TATCGATCAA TTTGCTCCCT ATCGTGGTCA ACCTAACCAT
CCTTACTGGA AACACACACC TGCAGCTGCA ATACGCTTAG GACACTATAA ATTAATCCAC
AGCTATGAAA CGGGTAAACA GCTACTGTTC GACCTAGACA AAGACAGCCA AGAGAAAAAT
AACCTTGTTA ATCAAAACCC TGAAATAAGA GAGAAGCTGT TTAAAGCCTT GCAACAGTGG
CAGGAGTCAG TGAATGCCCC CATGGTTTCT GAATTAAACC CTAACTATCA ATCTCAAGCT
ACTAGCCACT ATTAA
 
Protein sequence
MKTNKTLLTA LLLGLVISTK LNASFSPLKK EESKLKQANV VIIYVDDLGI MDTGIYGSAQ 
YPTPNIDKLA NSGVRFTQAY ANAANCAPSR ASLMTGLTPA EHGILTVGSS ERGESQYRKL
IPVTNNTELN PDLTTIADLF KQQGYATAVI GKWHLGKTAP TEYGFDTAIA ASHLGHPPSY
FYPYSKGKRK LIGLEEGGLK DEYLSNRITR EAVNYISSQR QPFFLYLPFY AVHTPIEAPK
EWVNQHNARQ QAGEIKSAAY AAMIANLDRD VGKLLQALDK SGQRENTLVV FASDNGAYDP
ATSSLPYRGY KSSLFEGGIK IPLVLSWPKQ IPPNSQNRTP VQMSDLFLGI KHLLQPKLAL
HRQDIISLAE QGKEQPERPL YWHAPIYIDQ FAPYRGQPNH PYWKHTPAAA IRLGHYKLIH
SYETGKQLLF DLDKDSQEKN NLVNQNPEIR EKLFKALQQW QESVNAPMVS ELNPNYQSQA
TSHY
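
The gene and protein sequences above can be cross-checked with a short Python sketch. It assumes that translation table 11 uses the standard codon-to-amino-acid assignments and that the first codon of the CDS (here TTG) is read as methionine because it acts as the initiation codon. The helper names `clean`, `gc_content`, and `translate_cds` are hypothetical, and the demo uses only the first 30 bases of the gene sequence rather than the full record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in tens and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS up to its first stop codon.

    Assumption: the first codon is a valid initiator and is therefore
    emitted as M, whatever amino acid it would encode internally.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGAAAACCA ACAAAACGCT TTTAACAGCA"))  # prints: MKTNKTLLTA
```

The ten residues printed match the start of the protein sequence in the record. Passing the full gene sequence to `gc_content` and `translate_cds` would allow the same comparison against the GC content and Protein Length fields.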