Gene Daci_4640 details

Gene Information

Locus tag: Daci_4640
Symbol:
ID: 5750230
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Delftia acidovorans SPH-1
Kingdom: Bacteria
Replicon accession: NC_010002
Strand:
Start bp: 5096235
End bp: 5097245
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 67%
IMG OID: 641299743
Product: aliphatic sulfonate ABC transporter periplasmic ligand-binding protein
Protein accession: YP_001565654
Protein GI: 160900072
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
            [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
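
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the record itself does not state either convention. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5096235
end_bp = 5097245
reported_gene_length = 1011     # bp
reported_protein_length = 336   # aa

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminating stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1011 0 336
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.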


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.374104
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.00000179955
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCCGTCA TGGTCTCGTT TGATTCTTTC CCCCGACACA CAGCCAACTG GCGCCGACGC 
CAACTGCTGG GCGCGGGCCT TGCCGCAGCA GTCGCCACGG TGGGCGGCCC GGCCCTGGCC
CAGGACAAGG CGGGCGACCG CGTGCTGCGC GTGGGCCACC AAAAGGGCTG GCTGTCCATC
CTGAAGAGCC GGGGCACGCT GGAAAAGCGG CTGGCGCCGC TGGGCGTATC CGTGCGCTGG
ATCGAGTTCA ACGCCGGCCC CGTGCAGCTG GAGGCGCTGA ACGTGGGCTC CATCGACTTC
GGCGATGTGG GCGAGGCCCC GCCCATCTTT GCGCAGGCCG CAGGCGCGCC TTTGGTCTAT
GCGGGCGCCA CCGTGCCGCG CCCCGGGCTG GAGGCGGTGA TCGTGCCCAA GGATTCGCCC
ATTCGCAGCG TGCAGGACCT CAAGGGCAAG CGCGTGGCCT ACAACAAGGG CTCGAACGTG
CAGTACTTCC TGGTCAAGCT GCTGGAAAAG CACGGCCTGA AGTACGGCGA TGTGCAGTCC
GTCTTCCTGG CGCCGGCCGA TGCGCGTGCC GCCTTCGAGC GCGGCTCCGT CGATGCCTGG
CTGATCTGGG ACCCCTTCCT GGCTGCTGCG CAAAAGACGC TGGACGCCCG CCTGCTGGCC
GATGCCACGG GCGTGGTCAA CAACCGCGCC TACTACTTCA CCTCGCGCGA TTTCGCCACG
CGCAATGCCG ATGTGCTGCG CATTGCCATC GAAGAGGTCG ATGGGATTGA CCGCTGGGCC
TCGAAGAACC AGACGGCCGC TGCCGCCGAG CTGTCCAGCA TCCTGGGCCT GGACAAGTCC
ATCACCGAGC TGTACCTGAG CCGCGCACGC TTCGGCACCT CGTCCGTGAC ACGCGAGATC
CTGGCCGAGC AGCAGCAGAT CGCCGACACC TTCTTCGAGC TCAAGCTCAT CCCCCGCAAG
CTCAACCTGC TGCATGCCGC GCCGGTGGAC CTGATCGCGT CGCGCCCCTG A
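
The gene sequence is printed in space-separated blocks of 10 bases, so the grouping has to be removed before any calculation. The following is a minimal sketch of how a GC fraction could be computed from such text. How the database derives its reported GC content is not stated, and the helper names `clean_sequence` and `gc_fraction` are hypothetical. The example input is only the first two blocks of the sequence, so its result is not expected to match the value reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to group the sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence above, used only as a small example.
first_blocks = "ATGTCCGTCA TGGTCTCGTT"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))  # 20 0.5
```

Passing the whole gene sequence text to `clean_sequence` and then to `gc_fraction` would give the figure to compare against the reported GC content.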
 
Protein sequence
MSVMVSFDSF PRHTANWRRR QLLGAGLAAA VATVGGPALA QDKAGDRVLR VGHQKGWLSI 
LKSRGTLEKR LAPLGVSVRW IEFNAGPVQL EALNVGSIDF GDVGEAPPIF AQAAGAPLVY
AGATVPRPGL EAVIVPKDSP IRSVQDLKGK RVAYNKGSNV QYFLVKLLEK HGLKYGDVQS
VFLAPADARA AFERGSVDAW LIWDPFLAAA QKTLDARLLA DATGVVNNRA YYFTSRDFAT
RNADVLRIAI EEVDGIDRWA SKNQTAAAAE LSSILGLDKS ITELYLSRAR FGTSSVTREI
LAEQQQIADT FFELKLIPRK LNLLHAAPVD LIASRP
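
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as start codons, so the standard codon-to-residue mapping is used here. The function name `translate` is hypothetical, and this sketch does not handle alternative start codons or ambiguous bases. The two inputs are the first three blocks and the final line of the gene sequence above.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; last base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


gene_start = "ATGTCCGTCA TGGTCTCGTT TGATTCTTTC"
gene_end = "CTCAACCTGC TGCATGCCGC GCCGGTGGAC CTGATCGCGT CGCGCCCCTG A"

print(translate(gene_start))  # MSVMVSFDSF
print(translate(gene_end))    # LNLLHAAPVDLIASRP
```

The final line of the gene sequence begins at base 961, which is in frame, so it can be translated on its own. Both outputs match the corresponding ends of the protein sequence listed above.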