Gene Sala_1810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1810 
Symbol 
ID4082201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1906291 
End bp1907814 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content69% 
IMG OID638010185 
Productaldehyde dehydrogenase 
Protein accessionYP_616855 
Protein GI103487294 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGAG CGAAAGGCTG GGCAATGGCC GGGATCGGCG AAGAGATCGG GCAATTGCTC 
GACGGGTTGG ACGTGGACCG TGCGCTGTGG ACCGACGGGT CGATGCCCGC GGCGACGCCG
CTGACGGGCG AGCGTCTCGG CAGGGTGCGG GTCGCTGATG CCGCTGCGAT CGATCAGGCG
CTGGACAGGG CGACCTCGGC CTTTCGCGCC TGGCGCCATG TCCCTGCACC GCGGCGCGGC
GAACTGGTGC GACTGTTCGG CGAGGAATTG CGCGCCGCGA AGGATGATCT GGCGCGGCTG
GTGACGATCG AGGCGGGCAA GATTCCGTCC GAAGGCGCGG GCGAGGTGCA GGAGATGATC
GACATCTGCG ACTTCGCGGT CGGTCTTTCG CGGCAATTAT ACGGCCTCAC CATCGCGACC
GAGCGACCGG GGCACCGGAT GATGGAGGTG TGGCACCCGC TGGGCGTCGT CGGGGTGATT
TCGGCGTTCA ATTTTCCCGT CGCGGTGTGG GCGTGGAATG CGGCGCTCGC ACTCGTGTGC
GGCAACAGCG TGGTGTGGAA ACCGTCCGAA AAGACGCCGC TGACGGCGCT CGCGACGCAG
GCGATTTTCG AGCGTGCGCT CGCGCGCTTC GGCGAGGCGC CCGAAGGTTT GTCGCAACTG
CTGATCGGCG GGCGTGAGGC GGGCGAGGCG CTGGTCGATG ACCGCCGCGT CGCGCTCGTT
TCGGCGACGG GATCGACCCG CATGGGCCGC GCGGTCGCGC CGCGGCTGGC GCAGCGCTTT
GCGCGAGCGA TCCTCGAGCT GGGCGGCAAT AATGGCGTGA TCGTCGCCCC CTCGGCCGAC
CTCGACCTCG CGCTGCGGGG GGTCGCGTTC GGCGCGATGG GGACGGCGGG GCAGCGCTGC
ACGACGACGC GGCGGCTGTT CGTTCACGAC AGCATTTACG ATGCTTTCGT CGCGCGATTG
AAGGCCGCCT ATGCCAGCGT CGCGGTCGGC AATCCACTGG AGAACGACGT TCTCGTCGGG
CCGCTGATCG ATCGCGCCGC CCATGACGCG ATGCAGGATG CGCTGGCCGC GGCGAAGGCG
GCGGGCGGCG TCGTGCAGGG CGGCGAACGG GTCGGCGAGG GCGCCGCCTA TTATGTCCGT
CCGGCGCTCG TCGAGATGCC GGGACAGGTC GGGCCGGTGC TGGAGGAGAC GTTCGCGCCG
ATCCTCTATG TCATGCGTTA TGACGATCTG GACGCCGCGA TCCGGCTGCA CAATGATGTC
GCCGCGGGGC TGTCGTCGGC GATCTTCACC ACCGACATGC GCGAGGCCGA GCGCTTTCTC
GCGGCGAGCG ATTGCGGCAT CGCGAACGTC AATCTGGGGA CGAGCGGCGC CGAGATCGGC
GGGGCGTTCG GCGGCGAGAA GGAAACCGGC GGCGGTCGCG AAAGCGGGTC GGATGCGTGG
CGCCAATATA TGCGGCGCGC CACGAACACG ATCAACTATT CGGACGCGCT GCCGCTGGCG
CAGGGGGTGT CGTTCGCGCT CTAG
 
Protein sequence
MTRAKGWAMA GIGEEIGQLL DGLDVDRALW TDGSMPAATP LTGERLGRVR VADAAAIDQA 
LDRATSAFRA WRHVPAPRRG ELVRLFGEEL RAAKDDLARL VTIEAGKIPS EGAGEVQEMI
DICDFAVGLS RQLYGLTIAT ERPGHRMMEV WHPLGVVGVI SAFNFPVAVW AWNAALALVC
GNSVVWKPSE KTPLTALATQ AIFERALARF GEAPEGLSQL LIGGREAGEA LVDDRRVALV
SATGSTRMGR AVAPRLAQRF ARAILELGGN NGVIVAPSAD LDLALRGVAF GAMGTAGQRC
TTTRRLFVHD SIYDAFVARL KAAYASVAVG NPLENDVLVG PLIDRAAHDA MQDALAAAKA
AGGVVQGGER VGEGAAYYVR PALVEMPGQV GPVLEETFAP ILYVMRYDDL DAAIRLHNDV
AAGLSSAIFT TDMREAERFL AASDCGIANV NLGTSGAEIG GAFGGEKETG GGRESGSDAW
RQYMRRATNT INYSDALPLA QGVSFAL