Gene Daci_5404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaci_5404 
Symbol 
ID5751019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDelftia acidovorans SPH-1 
KingdomBacteria 
Replicon accessionNC_010002 
Strand
Start bp5997613 
End bp5998986 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content69% 
IMG OID641300532 
Productsulfatase 
Protein accessionYP_001566418 
Protein GI160900836 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.268396 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.742595 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCG CCGCTTCCCG CCCACAACCC TGCACAGAGC GCATTTGCAT GTCCCGTCCC 
AATATCCTCT TCATCGTGGC CGACGACCTC GGCTATGCCG ACCTCGGCTG CTACGGCGGC
CGCGCGGCCG ACTTCGGAGC GGTGTCCCCG GTGCTTGACC GCCTGGCCGC CGGCGGCCTC
AGGCTCACCC AGGGCTATGC CAACTCGCCC GTGTGCTCTC CCACGCGCTT TGCCCTGGCC
ACGGCGCGCT ACCAGTACCG CCTGCGCGGT GCGGCCGAGG AGCCCATCAA CAGCAAGACA
CGCGGCACGC CACTGGGCGA AAAGCTGGGC CTGCCGCCGG ACATGCCCAC CGTGGCCTCC
ATGCTCAGGG ATGCGGGCTA CCGCACGGCG CTGATCGGCA AATGGCACCT GGGCTACCCG
CCGCACTTCG GCCCGCTGCG CTCGGGCTAC GAGGAATACT TCGGCCCCAT GTCGGGCGGC
GTGGACTACT TCACCCACCT GAGCAGCTCG GGCCAGCACG ACCTGTGGGT GGGCGAGGAG
GAACACCATG ACGAGGGCTA CCTGACCGAC CTGCTGTCGC AGCGCAGCGT GGACTTCGTC
CACCGCATGG CCCAAGGCGA TGCGCCCTTC TTCCTGAGCC TGCACTACAC GGCGCCGCAC
TGGCCCTGGG AAACGCGCGA TGACCGCAGC ACGGCCGAGG CGCTGGGCGC AGGCATTGCC
CACCTGGACG GCGGCAACAT CCACCAGTAC CGCCGCATGA TCCACCACAT GGACGAAGGC
ATAGGCTGGA TCGTCGAGGC GTTGCGCGCC AACGGGCAGC TGGACAACAC CCTCATCGTC
TTCACCAGCG ACAACGGCGG CGAACGCTTC TCCGACAACT GGCCCCTGGT CGGCGGCAAG
ATGGACCTGA CCGAGGGCGG CATACGCGTG CCCTGGATCG CGCACTGGCC GGCCGTGATC
GCTCCGGGCC GCAGCAGCCC CCAGCACTGC ATGAGCATGG ACTGGTCGGC CACGGTGCTG
GATGCCGCCG GCGTGCAGGC GCCAGAGGGC CATGCGCTGG ACGGCATCTC GCTGCTGCCC
GTGCTGCGCG CCGAAGATGC CGAATTCCCG CGCACCCTGC ACTGGCGCAT GAAGCACCGC
GGCCAACGTG CCCTGCGCGA TGGCGACTGG AAGTACCTGC GCGTGGACGG CATCGACTAC
CTGTTCGACC TTGCCGCCGA CGAGCGCGAG CGCGCCAACC AGGCAGCGCG CGCGCCCGAG
CGTCTGGCCG CCATGCGCAG CGCCTGGGAA GACTGGAACC AGGGCATGCC GCCCATCCCC
GAGGACGCCA CGGTCAGCCT GGTCTCTTCG GCCCGGGACA TGCCCCAGCG CTGA
 
Protein sequence
MSAAASRPQP CTERICMSRP NILFIVADDL GYADLGCYGG RAADFGAVSP VLDRLAAGGL 
RLTQGYANSP VCSPTRFALA TARYQYRLRG AAEEPINSKT RGTPLGEKLG LPPDMPTVAS
MLRDAGYRTA LIGKWHLGYP PHFGPLRSGY EEYFGPMSGG VDYFTHLSSS GQHDLWVGEE
EHHDEGYLTD LLSQRSVDFV HRMAQGDAPF FLSLHYTAPH WPWETRDDRS TAEALGAGIA
HLDGGNIHQY RRMIHHMDEG IGWIVEALRA NGQLDNTLIV FTSDNGGERF SDNWPLVGGK
MDLTEGGIRV PWIAHWPAVI APGRSSPQHC MSMDWSATVL DAAGVQAPEG HALDGISLLP
VLRAEDAEFP RTLHWRMKHR GQRALRDGDW KYLRVDGIDY LFDLAADERE RANQAARAPE
RLAAMRSAWE DWNQGMPPIP EDATVSLVSS ARDMPQR