Gene Daci_3115 details


Gene Information

Locus tag: Daci_3115
Symbol:
ID: 5748699
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Delftia acidovorans SPH-1
Kingdom: Bacteria
Replicon accession: NC_010002
Strand:
Start bp: 3432014
End bp: 3433336
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 68%
IMG OID: 641298218
Product: cerebroside-sulfatase
Protein accession: YP_001564138
Protein GI: 160898556
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
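
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `span_length` and `protein_length` are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3432014
END_BP = 3433336
GENE_LENGTH_BP = 1323
PROTEIN_LENGTH_AA = 440


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under those assumptions the record is internally consistent: 3433336 - 3432014 + 1 = 1323 bp, and 1323 / 3 - 1 = 440 aa.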


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.100154
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCCAC CAAACATCGT GCTCATCGTC GCGGACAACC TGGGCTGGGG CGAGCTGGGC 
TGCTACGGCG GCGGTGCGCT GCGCGGCGCG CCCACTCCGC GTATCGACCA GCTGGCCACC
GAGGGACTGC TGCTGCAGAA CTTCAACGTG GAAAGCGACT GCGTGCCCAC GCGCTCGGCC
CTGATGACGG GGCGCCATCC CATCCGCACG GGCGCCCTGC AATCGGTGCC GGCCGGGCTG
CCCCAGGGCC TGACGCGCGG CGAGACCACG CTGGCCCAGT TGCTGTCGGC CCAGGGATAT
GCCACGGCGC ATTTCGGCAA ATGGCATCTG GGGGACATTC CGGGCCGTTA TCCCTCGGAC
CGGGGTTTCG ACGAGTGGTA CGGCATTCCG CGCACCACGG ACGAAAGCCA GTTCACCGCG
ACCACGGGCT ACGACCCTCT GGTGGCACCC CTGCCCCACA TCATGGAAGG CCGCGCGGGC
GCGCCCTCGG ACGATGTCAA GGTCTACGAC CTGGAATCGC GGCGCAACAT CGATGCCGAG
CTGGTGGAGC GCTCGCTGGA CTTCATGCGC CGCAACCACG CCCGGGACCG CCGCTTCTTC
CTGTACCTGC CCCTGGTGCA CCTGCACTTT CCCACGCTGC CGCATGCGGA TTTCGCGGGC
CGCACCGGCC AGGGCGACTT TGCCGACTCC ATGGTGGAGA TGGACCACCG CGTGGGCCAG
ATCTCGGACG AGATCGACCG GCTCGGCCTG CGCGAGGACA CGCTGTTCAT CTTCTGCAGC
GACAACGGCC CCGAGTTCCG CCAGCCCTAC CGCGGCACGG CCGGCCCCTG GCGCGGCACC
TACCACACGG CCATGGAGGG CAGCCTGCGC GTGCCCTTCA TCGCACGCTG GCCGGGCCGC
ATCCAGGCCG GGCGCACCAG CAATGAAATC ATGCACGTGA CCGACATCTT CTCCACCCTG
GCGGCGGTGG CGGGCGCTGC CGTGCCGCAG GACCGGCCCA TCGACGGCCT GGACCAGACG
GCCTTCCTGC TGGGCGAGCC CCGGTCCGCG CGCGAGGGCT TTCTCTTCTA CATCAAGGAC
CAGCTGCGGG CCGCGAAGTG GCGCGACTGG AAGCTGCATT TCTACTGGGA GCCCGAGGTC
AACGAGAGCA AGGGCAAGCT GGAGTCCCCC TACCTGTTCA ACATCGTGCG CGATCCCAAG
GAGGAGCATG ACGTGCTGAT CTTCAACACC TGGGTGCTCA ACCCCATGCT GACCATGGTC
CAGCGCTTCA ACGAAAGCTG CAAGGCACAC CCCAACACCG CGCCCGGCGC CCCCGACCAA
TGA
 
Protein sequence
MNPPNIVLIV ADNLGWGELG CYGGGALRGA PTPRIDQLAT EGLLLQNFNV ESDCVPTRSA 
LMTGRHPIRT GALQSVPAGL PQGLTRGETT LAQLLSAQGY ATAHFGKWHL GDIPGRYPSD
RGFDEWYGIP RTTDESQFTA TTGYDPLVAP LPHIMEGRAG APSDDVKVYD LESRRNIDAE
LVERSLDFMR RNHARDRRFF LYLPLVHLHF PTLPHADFAG RTGQGDFADS MVEMDHRVGQ
ISDEIDRLGL REDTLFIFCS DNGPEFRQPY RGTAGPWRGT YHTAMEGSLR VPFIARWPGR
IQAGRTSNEI MHVTDIFSTL AAVAGAAVPQ DRPIDGLDQT AFLLGEPRSA REGFLFYIKD
QLRAAKWRDW KLHFYWEPEV NESKGKLESP YLFNIVRDPK EEHDVLIFNT WVLNPMLTMV
QRFNESCKAH PNTAPGAPDQ
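
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch only, not the method used by the source database: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for internal codons. Alternative start codons, which table 11 permits, are not handled; this gene begins with ATG, so that case does not arise here.

```python
# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGAACCCAC CAAACATCGT GCTCATCGTC"
print(translate(first_blocks))  # MNPPNIVLIV, the first 10 residues of the protein
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the listed protein sequence and GC content to be compared against values computed directly from the nucleotides.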