Gene Swoo_3655 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwoo_3655 
Symbol 
ID6117989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella woodyi ATCC 51908 
KingdomBacteria 
Replicon accessionNC_010506 
Strand
Start bp4452007 
End bp4453587 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content46% 
IMG OID641635206 
Productsulfatase 
Protein accessionYP_001762012 
Protein GI170727986 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0999851 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACGTT TATTACTCGC TTCTGTCCTA GTGTGTTTGC TGCCCTTTAG TAGCTCACAA 
GCTCAAGATA ATTTACTGTT TATCATGACA GATGAGATGA AGTGGAACGT GATGGGCGTG
GCGGGACACC CTGTTGTGAA AACCCCAAAT CTGGATAGGC TTGCATCAGA GGGCACCTAT
TATAAAACCG CTTATACAGT GGCTCCAATC TGTTCACCAT CACGACGATC TTTTTTTACC
TCACGCTATA CCCATGTCCA TGGGGTCATA GATAACAGTA AACAGGCCTT AGCCAATGAT
GGAGAGGTTG ATCTACAGAC AATACTGAAA CACCAAGGTT ACCGCACCGC GATATCGGGG
AAGCTGCATT TCTATCCTGA GTGGCACGAT TGGGGGTTCG ATGAGTTTTG GGCACGTAGC
AGCGAAGGCC CAAATCGTTT GGAAACCTAT CGTCAGTATA TGGTGGCTAA ACATGGTGAT
GATGCATTTA AGCCTATTAA AGGCAGCGTC ACCTACCCAA AGGATCCTCT AGGCCATGAT
TTGGGGCGAT ACAGATTCGG TAAAGAGGAT TTTGAGACCT ATTGGTTGAC GGACAAAGCG
TTGGATTATC TGGCCAGAAA GGAGAAGAAG CCTTTTTTCT TATTTTTAAG TTATAACGAG
CCCCATAGCC CCTACATGGT AACTGAGCCT TATGCGTCCA TGTATGACCC AAAAACCTTG
CCAGTGCCTG TGATCCCTGC AAGTGCTAAA GCCGAGCGCA AAGTGGCGCT GGAAAAGAAG
ATAAAAGGTA AGTCTCGCCA TCTGATTGAT GATGAGCAGA TGATGCGGGA CTTAACGGCT
CAATATCTCG GCCATGTGTC TAATGTAGAT GACAATGTGG GGCGAGTACT GAGTTACTTA
GATAGCTCAG GGCTCGCTGA CAATACCATA GTGGTTTTTA CTGCAGATCA CGGCAATATG
CTTGGTGACC ATGGTAAGTG GTTTAAAGGC GTCATGCATG AAGGCTCAAG TCGTATTCCA
TTGATTATTC GCGCAGGTAA GCACACCCGT TATGCCAAGG TAATGAATCG CGGTAGGGTG
GTTGAGCAAG TGGTGGAGAG TATCGATGTT ATGCCGACCC TATTGGAGAT GCTTGATATC
AAGGCGCCGA GGGGCATGCA GGGAGAATCT CTACTTTCCC TCACTGCCGG AGAGGCTAAA
AATTGGAAAA ATAGGGCATT CTCTCAACGC TCCGACTTTA TGTTCATAGA GGGGGATTTT
AAGCTCATTA TGCCAGCCAA AGCGGGCAAG AAAGGGAAGC TTGAGTTATA CAATTTAGCT
AATGATCCCC TTGAAAATCA TAATTTGGCA GGGATGACCG AGTATCAAGC TAAGGTCAAA
TCGATGCAGC AAAGCATACA GGTTTGGCAA GCCGATAAAC CAGCCCCAAT CCGTGTAGAG
GGATTAACGC CACCAGAGCA TCTATTTAAT TCAGAGCTAT TACGAAACAA ACACAGCCAA
TCGTTTAAAG CCATGATGTT TAATCATCCA AAGCGTTTTA AGGATGAGAG CAAAACAGTA
AAGAGCAGTG CCGGAGAGTA A
 
Protein sequence
MGRLLLASVL VCLLPFSSSQ AQDNLLFIMT DEMKWNVMGV AGHPVVKTPN LDRLASEGTY 
YKTAYTVAPI CSPSRRSFFT SRYTHVHGVI DNSKQALAND GEVDLQTILK HQGYRTAISG
KLHFYPEWHD WGFDEFWARS SEGPNRLETY RQYMVAKHGD DAFKPIKGSV TYPKDPLGHD
LGRYRFGKED FETYWLTDKA LDYLARKEKK PFFLFLSYNE PHSPYMVTEP YASMYDPKTL
PVPVIPASAK AERKVALEKK IKGKSRHLID DEQMMRDLTA QYLGHVSNVD DNVGRVLSYL
DSSGLADNTI VVFTADHGNM LGDHGKWFKG VMHEGSSRIP LIIRAGKHTR YAKVMNRGRV
VEQVVESIDV MPTLLEMLDI KAPRGMQGES LLSLTAGEAK NWKNRAFSQR SDFMFIEGDF
KLIMPAKAGK KGKLELYNLA NDPLENHNLA GMTEYQAKVK SMQQSIQVWQ ADKPAPIRVE
GLTPPEHLFN SELLRNKHSQ SFKAMMFNHP KRFKDESKTV KSSAGE