Gene Daci_4639 details


Gene Information

Locus tag: Daci_4639
Symbol:
ID: 5750229
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Delftia acidovorans SPH-1
Kingdom: Bacteria
Replicon accession: NC_010002
Strand:
Start bp: 5095156
End bp: 5096211
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 72%
IMG OID: 641299742
Product: aliphatic sulfonate ABC transporter periplasmic ligand-binding protein
Protein accession: YP_001565653
Protein GI: 160900071
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
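
The coordinate and length fields in this record are related by simple arithmetic, and a short check can confirm they agree. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the record.
START_BP = 5095156
END_BP = 5096211


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1056, matching "Gene Length"
    print(protein_length(length))  # 351, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.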


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.00000215932
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATGCCA ATTCCAAGAA GCCCCGCCTG GACCGCGTGC GCCGCCAGGC CCTGCAACTG 
CTGGGCACCA CCGCCTTCAG CTGGAGCCTG GCCGGCCAGT ACGCACGCGC CGAGACGCCG
GCCGCCGTGG CCGGGCCCGA GCAGTTGCGC ATCGGCTATC AGAAGTCGGC CGTCAACCTG
GTCATCCTCA AGCAGCAGGG CGTGCTGGAA AAGCGCTTTG CGGGCACCAA GGTCAGCTGG
CTGGAGTTCC CGGCCGGCCC CCAGCTGCTG GAGGCCCTGG CCGCAGGCAG CCTGGATTTC
GGCCTGACCG GCGATTCGCC CCCGGTCTTT GCCCAGGCCG CGGGCCGCGA CCTGCTGTAC
GTGGGCGCCG AGCCGCCCAA GCCCGAGAGC TCGGCCATCC TCGTGCCATC GGACTCGCCG
CTGCGCACCC TGGCCGATCT CAAGGGCCGG CGCGTGGCGC TGCAAAAAGG CTCCAGCGCC
CATTACCTGC TGGTGCGCGC GCTGGACAAG GCGGGCCTGG CCTGGAACGA GATCCAGCCC
GTGTACCTGG CCCCGGCCGA TGCGCGCGCC GCCTTCGAAC GCAAGAGCGT GGACGCCTGG
GCCATCTGGG ACCCGTTCTA CGCGGCCACC GAGCTGGCGA TTCGCCCGCG CGTGCTGGCC
AATGGCGAAG GCCTGTCGGG CAACGCCTCG TTCTACCTGG CCGCGCGCGG ACTGGTGGAG
CGCCATCCGC AGGTGCTGCG CGCGCTGTTC GACGAGCTCA CGCGCGCCGA TCGCCTGGCC
CAGAGCGCGC GCCAGGAGGC CGTGGCCCTG GTGGCCGGCT TCAGCGGCCT GGATGCGGCG
GTGGTCAGCC GCTTCATTGC GCGCCGGCCC AGCTCCCCCG TGGGCCTGCT GGCTGCGCAG
ACCGTGGTGG ACCAGCAGCG CGTGGCCGAT GCATTTTTTC GGCTGGGCCT GATTCCGCGC
CAGGTGCAGG TGGCCGACAT CGTCTGGCGG CCCTCGGCGG CCGAGTACGC GCGGCTGGCC
GAGCCTGCGG CGGCACGGCC TTCTTCCGCC CTTTGA
 
Protein sequence
MNANSKKPRL DRVRRQALQL LGTTAFSWSL AGQYARAETP AAVAGPEQLR IGYQKSAVNL 
VILKQQGVLE KRFAGTKVSW LEFPAGPQLL EALAAGSLDF GLTGDSPPVF AQAAGRDLLY
VGAEPPKPES SAILVPSDSP LRTLADLKGR RVALQKGSSA HYLLVRALDK AGLAWNEIQP
VYLAPADARA AFERKSVDAW AIWDPFYAAT ELAIRPRVLA NGEGLSGNAS FYLAARGLVE
RHPQVLRALF DELTRADRLA QSARQEAVAL VAGFSGLDAA VVSRFIARRP SSPVGLLAAQ
TVVDQQRVAD AFFRLGLIPR QVQVADIVWR PSAAEYARLA EPAAARPSSA L
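
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The following is a minimal sketch of how one might do that and run basic sanity checks (GC percentage, length divisible by three, start and stop codons). The helper names are hypothetical; the stop codons are those of translation table 11, and the default start codons are only a common subset of the starts that table allows.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already-joined sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(
    seq: str,
    starts=("ATG", "GTG", "TTG"),   # common subset of table 11 starts (assumption)
    stops=("TAA", "TAG", "TGA"),    # table 11 stop codons
) -> bool:
    """True if the sequence is a whole number of codons with a start and a stop."""
    return len(seq) % 3 == 0 and seq[:3] in starts and seq[-3:] in stops


if __name__ == "__main__":
    # First and last lines of the gene sequence, copied from the record.
    first_line = "ATGAATGCCA ATTCCAAGAA GCCCCGCCTG GACCGCGTGC GCCGCCAGGC CCTGCAACTG"
    last_line = "GAGCCTGCGG CGGCACGGCC TTCTTCCGCC CTTTGA"
    print(len(join_blocks(first_line)))    # 60
    print(join_blocks(last_line)[-3:])     # TGA
```

Passing the full gene sequence through `join_blocks` and then `gc_percent` gives a value to compare against the GC content field.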