Gene Swoo_3651 details

Gene Information

Locus tag: Swoo_3651
Symbol:
ID: 6117985
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella woodyi ATCC 51908
Kingdom: Bacteria
Replicon accession: NC_010506
Strand:
Start bp: 4444848
End bp: 4446299
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 47%
IMG OID: 641635202
Product: sulfatase
Protein accession: YP_001762008
Protein GI: 170727982
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
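
The coordinate and length fields in this record agree with each other, which a little arithmetic confirms. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (the record states neither explicitly; the variable names are illustrative only):

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4444848
end_bp = 4446299

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which is not translated.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, "bp")
print(protein_length_aa, "aa")
```

Under those assumptions the computed values match the Gene Length and Protein Length fields.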


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000643806
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0204343
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTCTGA TGCTTGGTAG CTCAGCTATA GCAGCGCAGC AGACACCGCC CAATGTTGTC 
ATCGTGTTGG CTGATGACAT GGGATTTGGT CATGTGGCCA TGAACCTGGA CTTGGCAACA
GCTGATAGCT ATAACCCTCA AAATCTTAAA CGGGATAGTC AGCGTCATAA ACCAGAGCTT
GCTCGTTCTT ACGCAAAAAA AGCGACGCCA ACCTTAACTC AATTAGCCAA TGAAGGGGTT
CGCTTTACTA ATGCTTATGT TCCTAGCCCA TTATGCGGTC CTAGCCGAGC AGCCTTGATG
ACTGGTCGCT ACCCGCAAAG GTTTGGTATA TATAATAATG CAGATGTTAA GGCTGCTGGT
TTACCCGTTG AGGAAAATGT ACTGGCAAAC AACTTCCGTA AAGCGGGTTA CCGTACTGGC
GCTGTTGGTA AGTGGCATCT GACAAAGGGA GAGAAAAAAG CCTCTTATAC GTTAGCTCAG
CACCCGCTAG ATCGTGGGTT CGATTTCTTT TTCGGTTTTG ACCGTTCAGG CACCCCCTAT
TATGACTCCA AAATTCTCGA ACTTAATCGT AAACCTGTGA AGGCTGAAGG CTATCTGACC
GATCAGCTGA CCAATCATGC TATTGATTTC ATTAACCAAG ACAAGAGTAA GCCTTTCTTC
CTCTATATGG CTTATAACGC CGTACACGGC CCCTTAAATA AGGCAGCTCC CAAAGAGTAT
CAGGCACCTT TTAACAGTGG TGATCGATAT CTGGATTATT TCTACTCCTA CCTCTACGCG
TTAGATCAAG GAGTGGCCAA AATTATCAAG CAGTTGGACA GTAATGGTCA GCTAGACAAC
ACCATCATCA TGTTTCTAAG TGACAATGGT GCGCCGGGTG GTAAGCCTTT CCCTCTACCT
GCCAATGCCC CTTTTACCGG TTATAAGGGA CAGGTATGGC AAGGCGGTAC TAGGGTTCCT
GTCGTCATTT GGGGGCCTAA AGCCTTAGTT AATGGTGGGC GTGTTGATGA TGCCGTTATC
TCTTCAATGG ATTTGATACC GACTGCTCTC GCAGCTGCTG GTGTGGATTT ATCAGACAAT
CTTGATGGAA ACAATTTACT GCCTAAGCTG AAGAGGGTTG AAGAGGATGA GCGTCAGCTT
TTCTGGGCAA GTCAGTTGTC TCATCACTGG GGATTCATTC GTGATGCCAA GGGGAAGAAG
ATTGATGACA AATCCACCGC AGAGCCTGCT TGGGCCGTCA GAAGTGGTGA GTGGATGCTT
AGATATTGGG CCGATAGCAA GAAGACTGAG CTGTTTAATG TGAGTACAGA TCATGCTGAG
CACCACGATA TTGCCAATAA GCATCCTCAA GTTGTGAAGC AACTGACTGC GGATTACAAA
GTCTGGTTTG ATACGTTGGC AAAGCCAGCT GGCTGGGACA AACGTTATTG GGAGCAGCTA
GAAGTTAAGT AA
 
Protein sequence
MGLMLGSSAI AAQQTPPNVV IVLADDMGFG HVAMNLDLAT ADSYNPQNLK RDSQRHKPEL 
ARSYAKKATP TLTQLANEGV RFTNAYVPSP LCGPSRAALM TGRYPQRFGI YNNADVKAAG
LPVEENVLAN NFRKAGYRTG AVGKWHLTKG EKKASYTLAQ HPLDRGFDFF FGFDRSGTPY
YDSKILELNR KPVKAEGYLT DQLTNHAIDF INQDKSKPFF LYMAYNAVHG PLNKAAPKEY
QAPFNSGDRY LDYFYSYLYA LDQGVAKIIK QLDSNGQLDN TIIMFLSDNG APGGKPFPLP
ANAPFTGYKG QVWQGGTRVP VVIWGPKALV NGGRVDDAVI SSMDLIPTAL AAAGVDLSDN
LDGNNLLPKL KRVEEDERQL FWASQLSHHW GFIRDAKGKK IDDKSTAEPA WAVRSGEWML
RYWADSKKTE LFNVSTDHAE HHDIANKHPQ VVKQLTADYK VWFDTLAKPA GWDKRYWEQL
EVK
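
The gene and protein sequences above are printed in blocks of ten letters separated by spaces, so they need to be joined before any length or composition check. The sketch below shows one way to do that in Python and to compute a GC percentage from the joined nucleotide string. The helper names `join_blocks` and `gc_percent` are hypothetical and not part of any database tool, and the demo input uses only the first and last lines of the gene sequence to keep the example short:

```python
def join_blocks(lines):
    """Concatenate sequence lines, dropping the spaces between blocks."""
    return "".join("".join(line.split()) for line in lines)


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide string."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# Demo input: first and last lines of the gene sequence, copied verbatim.
first_line = "ATGGGTCTGA TGCTTGGTAG CTCAGCTATA GCAGCGCAGC AGACACCGCC CAATGTTGTC "
last_line = "GAAGTTAAGT AA"

demo = join_blocks([first_line, last_line])
print(len(demo), "bases in the demo fragment")
print(round(gc_percent(demo), 1), "% GC in the demo fragment")
```

Passing all lines of the gene sequence to `join_blocks` should give a string whose length can be compared with the Gene Length field, and `gc_percent` on that string can be compared with the GC content field.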