Gene Daci_4638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaci_4638 
Symbol 
ID5750228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDelftia acidovorans SPH-1 
KingdomBacteria 
Replicon accessionNC_010002 
Strand
Start bp5093957 
End bp5095117 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content70% 
IMG OID641299741 
Productalkanesulfonate monooxygenase 
Protein accessionYP_001565652 
Protein GI160900070 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.211396 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00000269993 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAGGTTT TCTGGTTCAT CCCCACCCAC GGCGACAGCC GCTACCTCGG CACGGCCGAG 
GGCGCCCGCC CGCTCAGCCA CGACTATGTC AAGCAGGTCG CCATCGCGGC CGACAGCCTG
GGCTACGAGG GCGTGCTGAT TCCCACGGGC CGCTCCTGCG AAGACCCCTG GGTCGTCGCC
TCCAGCCTGA TTCCGGTCAC CAGGCGCCTG AAGTTCCTGG TCGCCGTGCG GCCCGGCCTG
CACCAGCCCA GCCTGGCGGC GCGCATGGCG GCCTCGTTCG ACCGCCTGTC GGGCGGGCGG
CTGCTGATCA ACCTGGTCAC GGGCGGCGAC CGCGCCGAGC TGGAGGGCGA CGGCGTCTTC
CTCGACCATG CGCAGCGCTA CGAGCAATCG GCCGAGTTCA TCCGCATCTG GCGCGAGATC
CTTGAGCGCA GCCATGAGGG CGGCACGCTC GACTACGAAG GTGAACACCT CTCGGTCAAG
GGCGCCAAGC TGTTGTTCCC GCCGCTGCAA AAGCCGTATC CGCCCGTGTA CTTCGGCGGT
TCGTCCGAAG CCGCGCACGA CCTGGCCGCC GAGCAGGTCG ATGCCTACCT GACCTGGGGC
GAGCCCCCGG CCGAGGTGGC CAAAAAGATT GCCGATGTGC GCGAGAAGGC GGCGCGCCAT
GGCCGCAGCG TGAAGTTCGG CATCCGCCTG CACGTCATCG TGCGCGAGAC CGAGGACGAG
GCCTGGGCCG ATGCCGATCG CCTGATCAGC CGCCTCAAGG ACGAGACCGT GGTCCAGGCC
CAGGCCGCCT TCGCGCGCAT GGACTCGGAA GGCCAGCGCC GCATGGCCGC CCTGCATGCC
GGCGGCAGCC GCCGCACGCG CGCCGAGCTG GAGATCAGCC CCAACCTCTG GGCCGGCGTG
GGCCTGGTGC GCGGCGGCGC GGGCACGGCC CTGGTGGGCG ATGCGCAGAC CGTGGCCGAC
CGCATCAAGG AGTACGCGGA CCTGGGCATA GACACCTTCG TGCTGTCCGG CTATCCGCAC
CTGGAAGAGG CCTACCGCTT TGCCGAGCTG GTCTTCCCGC TGCTGCCGCT GTCCGTGCGC
GACAGGCTGG CTGGCGGCGT GGGCGGGCCG CTGGGCGAGA CCGTTGCCAA CCTGTACTCG
CCGCGCGCGT CGCAAAGCTG A
 
Protein sequence
MQVFWFIPTH GDSRYLGTAE GARPLSHDYV KQVAIAADSL GYEGVLIPTG RSCEDPWVVA 
SSLIPVTRRL KFLVAVRPGL HQPSLAARMA ASFDRLSGGR LLINLVTGGD RAELEGDGVF
LDHAQRYEQS AEFIRIWREI LERSHEGGTL DYEGEHLSVK GAKLLFPPLQ KPYPPVYFGG
SSEAAHDLAA EQVDAYLTWG EPPAEVAKKI ADVREKAARH GRSVKFGIRL HVIVRETEDE
AWADADRLIS RLKDETVVQA QAAFARMDSE GQRRMAALHA GGSRRTRAEL EISPNLWAGV
GLVRGGAGTA LVGDAQTVAD RIKEYADLGI DTFVLSGYPH LEEAYRFAEL VFPLLPLSVR
DRLAGGVGGP LGETVANLYS PRASQS