Gene Swoo_3657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwoo_3657 
Symbol 
ID6117991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella woodyi ATCC 51908 
KingdomBacteria 
Replicon accessionNC_010506 
Strand
Start bp4455426 
End bp4456910 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content45% 
IMG OID641635208 
Productsulfatase 
Protein accessionYP_001762014 
Protein GI170727988 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000376885 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0602445 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATCG GAATTGTCAT CATCCTGTTG TTATTCATGG TTGGTTGTCA GGCCACAGAG 
GGCGATAAGG TTTCAAAAGA GATAGCAAAG CAGCCCAATA TTCTGTGGAT CTATGTGGAA
GATATGAATG ACTGGATGGG GGCTTATGGT GACAAGACTG TACCGACGCC AAATATTGAT
CAGCTTGCGA GCCAAGGGGT ACGCTTTGAT AAGGTGATAA TGCCAGCAGC AGTTTGTTCT
GCTGTCCGCT CCGCCATTAT TTCTGGTGAG ATGCAAACCA CTTTAGGTTT CCATAACCAT
CGTAGTGGCC GATTCGACTA TAACCCTATT GCTTTACCCC AAGGTCATAA AACGGTACCT
GAGCTATTTC GCGATAACGG TTACGAAACC TTTAATATTG GTAAGGATGA TTACAACTTT
CACTACGATC GCAGCCAATT ATACTCTCTA CATCCAGGCC CGATAGCCGG TCATCAAGGG
GCAAAAAATG GTCCAGATTT CGATTGGGGT AAAAGACTAG CTCAATCTGG TAAGCCTTTC
TTTGGTCAGA TCCAGTTACG TGGCGGGAAA TATAAAGTCA AGAACCCCCC TGTTAAAGTC
GATCGCGCCA GCGTGACGCT TCCGCCTTAC TATAATGATC AACCTTTAAC TCGCGACGCT
TGGGCGCGCC ACTATGAGAA TATCCATCTG ACAGATCTTG ATGTGGGAGA GATAGTTAAA
GAGCTCAAAG ATAATAACCT ACTGGAAAAT ACAATCGTAT TCTTCTTTAC TGATCACGGC
ATGGGACTGT TGAGACATAA GCAGTTCCTC TATGACGGTG GCCTTCAAGT TCCTTTAGTG
ATCAGCTGGA TGAATGGTAA CGACAAGTTA CGTGAGCTAG GAGCTGAGCG TAAGGAGTTG
ATCCGCGGTC TTGATATTGG TGGTAGTAGC TTAGGCCTTG CCGGTATCGA TATTCCAGCA
TATATGACAA CTGAAAACTT CTTTGCTGCA GACTACCAAG CTAAGCCTTG GGTTATCTCT
GCTCGAGATC GCTGTGACTA TACCTTTGAA AAGATGCGTT CAGTGCGCAC CGACAGGTTT
AAATATATCC GAAATTACTT CCCCGAACGT CCCTATATGC AAGCGCAGTA CCGTGATAAA
TGGCCTCTGG TAAAGGAGTA TAAAAAAGCC TTCGCAGCTG GAGAGTTTAA TGAGATTGAG
GCGCAGTTAA TGGCTGAGCG TAAGCCTGCT GAAGAGTTAT ATGATTTGGA TAATGACCCC
CATGAAGTCA GTAATTTAGC GGGTGTTGGG GCCTATAAAA GTCAATTAAC CAAGATGCGT
GGAATTCTAA ATAACTGGGT TAAGGAGACG GGTGATAAAG GTCAGCTCCC TGAGTCCGAT
AATGGTATCC GAGAGGTGCT GGATTTCTAC CATGACAAAT GCCAAAGCCC AGAGTGCCAA
AGCTACCGTA CCCGCCATCA GTTAAGTGGC AATAAGAGTA AATAG
 
Protein sequence
MRIGIVIILL LFMVGCQATE GDKVSKEIAK QPNILWIYVE DMNDWMGAYG DKTVPTPNID 
QLASQGVRFD KVIMPAAVCS AVRSAIISGE MQTTLGFHNH RSGRFDYNPI ALPQGHKTVP
ELFRDNGYET FNIGKDDYNF HYDRSQLYSL HPGPIAGHQG AKNGPDFDWG KRLAQSGKPF
FGQIQLRGGK YKVKNPPVKV DRASVTLPPY YNDQPLTRDA WARHYENIHL TDLDVGEIVK
ELKDNNLLEN TIVFFFTDHG MGLLRHKQFL YDGGLQVPLV ISWMNGNDKL RELGAERKEL
IRGLDIGGSS LGLAGIDIPA YMTTENFFAA DYQAKPWVIS ARDRCDYTFE KMRSVRTDRF
KYIRNYFPER PYMQAQYRDK WPLVKEYKKA FAAGEFNEIE AQLMAERKPA EELYDLDNDP
HEVSNLAGVG AYKSQLTKMR GILNNWVKET GDKGQLPESD NGIREVLDFY HDKCQSPECQ
SYRTRHQLSG NKSK