Gene Sala_0302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0302 
Symbol 
ID4082865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp301155 
End bp302786 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content67% 
IMG OID638008660 
Productsulfatase 
Protein accessionYP_615358 
Protein GI103485797 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.139649 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAATA AATGGGTGGC GCTGGGTGCC ATGACGCTGG CGGCGGTCGG TGGCTATTGG 
GCCTATGACG CGAACAAATA TCGCATCCCC GGCATCGTGC AGGACTGGCG CGAGCCGGTG
CAGCCGAACC GGGCGATCGC GTGGCAGCAA GGGCCGGCGG CGGCGCCGGA GGGCGCGCGG
CCGCCGAACA TCATCCTGAT CGTCGCCGAC GACCTGGGCT ATAACGACAT CAGCCTGAAC
GGCGGCGGCG TCGCGGGGAT CGTCAAGACG CCGAACATCG ATGCGCTGGC GCGTGAGGGG
GTCCATTTCA CCACCGCCTA TGCCGCCAAC GCGACCTGCT CGCCCTCGCG CGCGGCGATG
ATGACGGGGC GCTATCCGAC GCGCTTCGGC TTCGAGTTCA CCGCGGTGCC GATCGAGTTC
GCCGAAAATC TGGCGCATGG CGAGGGTGTC GGGCCGCACC GCGCGATCTT TCACGACGAA
CTGGTGACGC CCGACATCCC GCCCTATCCC CAGATGGGGG TGCCCGCGAG CGAGGTGACA
ATTGCCGAAG CGGTGAAGGC GGCGGGCTAT CACACGGTTC ACATCGGCAA ATGGCATCTG
GGCGAAGCCC CCGAATTGCA GCCGCACGCC CAGGGCTTCG ACGAAAGCCT GGCGGTGCTG
GCGGGCGCGG CAATGCTGCT GCCCGAGGAT GACCCCGACG CAGTCAACGC CAAGCTGCCG
TGGGATCCGA TCGACCGCTT CATCTGGGCC AATCTGCGCC ACGCAGTGAC CTTCAACGGC
AGCAAGCGGT TTGCCGCGCA GGGGCATATG ACCGACTATT TCGCCGACGA GGCGATCAAG
GCGATCGAAG CGAACCGGAA CCGGCCCTTT TTCCTCTATC TTGCCTTCAC CGCGCCGCAC
ACGCCGCTGC AGGCGACGCG CGCCGATTAT GACCGGCTCG CGGCGATCAA GGATCACAGG
ACGCGCGTTT ATGGCGCGAT GATCGCGCAG ATGGACCGGC GGATCGGCGA CGTGATGGCC
AAGCTGAAGG AGGCCGGGAT CGACGACAAT ACGCTCGTCA TCTTCACCAG CGACAATGGC
GGCGCCTGGT ACAACGGGAT GCCGGGGCTG AATGCGCCGT TCCGCGGGTG GAAAGCGACC
TTTTTCGAAG GCGGCATCCG GGCGCCGCTG TTCATGCGCT GGCCAGCGCG CATCGCGCCG
GGGACCGAGC GCGGCGACGT GACGGGCCAT CTCGACCTTT TCGCGACGAT TGCCGCCGCG
GCGGGCGCGG CGCTGCCCGC GGACCGGACG ATCGACAGCG AGGATATATT GGCCGGTCCC
GCCAAGCGTC CGGCGATGTT CTGGCGCTCG GGCGATTATC GCGCGGTGCG CGCGGGCGAC
TGGAAATTGC AGGTGACGAA GCGACCCGAA AAGGCGCGCC TCTATAACCT TGCCGCCGAT
CCGACCGAAC GGACCGACCT GTCGGCGCGC GAGCCGGCGC GCGTCGCCGA ACTCGGCGCG
ATGATCGAGG CGCAGAACCG GGGCATGGCG ACGCCGATCT GGCCGGGGTT GGTCGAAGGG
CCGGTGCGCA TCGACGTGCC GCTGAACACG CCGTGGCAGG ACGGGCAGGA TTATATCTAT
TGGACCAACT GA
 
Protein sequence
MANKWVALGA MTLAAVGGYW AYDANKYRIP GIVQDWREPV QPNRAIAWQQ GPAAAPEGAR 
PPNIILIVAD DLGYNDISLN GGGVAGIVKT PNIDALAREG VHFTTAYAAN ATCSPSRAAM
MTGRYPTRFG FEFTAVPIEF AENLAHGEGV GPHRAIFHDE LVTPDIPPYP QMGVPASEVT
IAEAVKAAGY HTVHIGKWHL GEAPELQPHA QGFDESLAVL AGAAMLLPED DPDAVNAKLP
WDPIDRFIWA NLRHAVTFNG SKRFAAQGHM TDYFADEAIK AIEANRNRPF FLYLAFTAPH
TPLQATRADY DRLAAIKDHR TRVYGAMIAQ MDRRIGDVMA KLKEAGIDDN TLVIFTSDNG
GAWYNGMPGL NAPFRGWKAT FFEGGIRAPL FMRWPARIAP GTERGDVTGH LDLFATIAAA
AGAALPADRT IDSEDILAGP AKRPAMFWRS GDYRAVRAGD WKLQVTKRPE KARLYNLAAD
PTERTDLSAR EPARVAELGA MIEAQNRGMA TPIWPGLVEG PVRIDVPLNT PWQDGQDYIY
WTN