Gene Sala_0299 details


Gene Information

Locus tag: Sala_0299
Symbol:
ID: 4082862
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 297402
End bp: 299231
Gene Length: 1830 bp
Protein Length: 609 aa
Translation table: 11
GC content: 67%
IMG OID: 638008657
Product: sulfatase
Protein accession: YP_615355
Protein GI: 103485794
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
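
The coordinates and lengths listed above are consistent with one another, and the arithmetic can be checked directly. The following is a minimal sketch, assuming the start and end coordinates are inclusive (an assumption, although it reproduces the listed gene length) and that the final codon is a stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(297402, 299231)
print(length, protein_length(length))  # 1830 609
```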


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.142586
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCAG GCTGGAGGAC CACCAGGCGG GGCCGGACAG GTGCGTCCCG GCGTGCTGCC 
ATCATCGCGC TCGCATGCCT GTTGGGAAGC ACATCGGGCC TGGCGCAGTC ACGCGCGGCG
CCGCCGCGCC AGCCCAATAT CGTCATCCTG CTCGCCGACG ACTGGGGGTT TTCGGACGTC
GGTGCCTTTG GCTCCGAAAT CGCGACACCG CATATCGACG CGCTCGCACG CGCCGGAATG
CGCTTTGCGA ACTTCCATGT CTCGGGTTCC TGCTCGCCGA CGCGCGCGAT GCTCCAGACG
GGGGTGATGA ACCACCGCAA CGGCCTCGGC AACATGCCCG AGACGATCCC CGACGAACAT
CGCGGCAAGC CCGGTTACGA CACGGTGATG AACCTGCGCG TCGTGACGAT CGCAGAGTTG
ATGAAGGCCG CGGGATACCG CACCTACCTG ACCGGCAAAT GGCATCTGGG CAGCGACGCA
AAGCGGCTGC CCGAAGCGCG CGGATACGAC CGCGCCTTCA GTCTCGCCGA TGCGGGCGCC
GACAATTTCG AGCAGCGACC GATCGAAGGG CTGTACGACA AGGCGAACTG GACCGAGAAC
GGCCGCCCCG CGACCCTGCC CCGCGATTAT TATTCATCCA CCTTCGTCGT CGAAAAGATG
ATCGAATATA TCGAGGCGGA TCGCGATAGC GGCAAACCCT TCCTCGCCTC GATCAACTTC
CTCGCCAATC ATATCCCGGT GCAGGCGCCC GACAGCGACA TCGCGCGCTA TGCGGCGATG
TATCAGGACG GCTGGACGGC GCTGCGCGAG GCACGCGCGC GGCGTGCGGC GGCACTCGGC
ATCGTGCCGG CGGGCACGCC GATGGTGACG ATGCCGACGA CGCGCGACTG GCAGAGGCTG
GACGCCGACG AACGCGCGGC GGCGGTGCGC GTGATGCAGG CCTATGGCGG CATGGCGACC
GCGATGGACC GCGAGATCGG GCGACTCGTC GCGCACCTCA AGACCACGGG CGATTACGAC
AACACGATCT TCGTCTTCCT GTCCGACAAT GGCGCCGAGC CGACGAATCC CTTTGCCAGT
CTGCGCAACC GGCTGTTCCT GCGGATGCAA TATGATCTTT CGACCGATAA CATCGGGCGG
CGGGGCAGTT TTTCGGCGAT CGGGCCGGGC TGGGCAAGCG CCGCGGCGTC GCCCTTGTCA
GGTTACAAGT TCAGCGCCGC CGAGGGCGGG CTGCGCGTCC CGCTGATCAT CGCCTGGCCG
GGGCATGGCG CGATCCCGGC GGGCGCGATC AACGACGGGC TGGCGCATGT CACCGACCTC
TTGCCGACGC TTGCCGAACT GGCGGACGTG CCGCTGCACG AAGGGACATG GCAGGGGCGG
AGCGTCGAGC CGATCACGGG GCGCAGCCTC GTCCCGATGC TGAAGGGTGC TGCGGGCAGC
GTCCATGGCG ACGCGCCGCT CGGTTACGAG CTGTCGGGCA ATGCCGCGCT GTTTCGCGGC
GATTACAAGC TGGTGCGCAA CCTGCCGCCG ACCGGCGACG GCCGGTGGCG GCTCTATGAC
ATCAAGACGG ACCCCGGCGA GACCCGCGAC CTGTCGGCGG CGATGCCCGA TCGGTTCGCC
GCGATGCTGT CCGACTACCG CGCCTATGCC AGGGCGAACG GCGTGCTCGA CATGCCGGCG
GGTTATACCG CCGACGAACA GATCAACCGT TATGCGTGGG AGCAGCAGGG GCGCAAACGC
GCGATCAAGG CCGGGCTGTG GCTGGGCGGC GGGTTGATGG CGCTGGCGTT GCTGGTCTGG
AGCTGGCGGC GGCGGCGCGC GCGGGGGTAA
 
Protein sequence
MAAGWRTTRR GRTGASRRAA IIALACLLGS TSGLAQSRAA PPRQPNIVIL LADDWGFSDV 
GAFGSEIATP HIDALARAGM RFANFHVSGS CSPTRAMLQT GVMNHRNGLG NMPETIPDEH
RGKPGYDTVM NLRVVTIAEL MKAAGYRTYL TGKWHLGSDA KRLPEARGYD RAFSLADAGA
DNFEQRPIEG LYDKANWTEN GRPATLPRDY YSSTFVVEKM IEYIEADRDS GKPFLASINF
LANHIPVQAP DSDIARYAAM YQDGWTALRE ARARRAAALG IVPAGTPMVT MPTTRDWQRL
DADERAAAVR VMQAYGGMAT AMDREIGRLV AHLKTTGDYD NTIFVFLSDN GAEPTNPFAS
LRNRLFLRMQ YDLSTDNIGR RGSFSAIGPG WASAAASPLS GYKFSAAEGG LRVPLIIAWP
GHGAIPAGAI NDGLAHVTDL LPTLAELADV PLHEGTWQGR SVEPITGRSL VPMLKGAAGS
VHGDAPLGYE LSGNAALFRG DYKLVRNLPP TGDGRWRLYD IKTDPGETRD LSAAMPDRFA
AMLSDYRAYA RANGVLDMPA GYTADEQINR YAWEQQGRKR AIKAGLWLGG GLMALALLVW
SWRRRRARG
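
The protein sequence can be re-derived from the gene sequence, and the GC content recomputed, with a few lines of code. The sketch below is an illustration under stated assumptions rather than the method used to build this record: it strips the spaces that separate the 10-base blocks, translates codon by codon, and stops at the first stop codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which start codons are permitted; since this gene begins with ATG, the standard assignments are used throughout. The helper names (`clean`, `gc_content`, `translate`) are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_bases = "ATGGCAGCAG GCTGGAGGAC CACCAGGCGG"
print(translate(first_bases))  # MAAGWRTTRR
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared against the protein sequence and the GC content listed in this record.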