Gene Sala_3049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_3049 
Symbol 
ID4082898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp3200419 
End bp3201912 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content66% 
IMG OID638011435 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_618086 
Protein GI103488525 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.205238 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACAGA TCGACCATCA CATTGTCGGC GGGGCCGCCG GTTCCGCCCG CTTCGGCGAC 
GTCTTCGACC CCAATAATGG CGGCGTGCAG GCGCGGGTCG CGCTCGGCGA TCGCGCCATC
CTCGACCGCG CCGTGGCGGC GGCAAAGGCC GCGCAGCCCG CGTGGGCCGC GACCAACCCG
CAGCGCCGCG CGCGCGTGAT GTTCGATTTC AAGCGCCTCG TCGAAGCCAA TATGAACCAG
CTCGCCGAAA TGCTGTCGAG CGAACATGGC AAGGTGATCG CGGACTCGAA GGGCGACATC
CAGCGCGGCC TCGAAGTCAT CGAGTTCTGC TGCGGCATCC CGCATGTGTT GAAGGGCGAA
TATACGCAGG GTGCGGGCCC CGGCATCGAC GTCTATTCGA TGCGCCAGCC GGTCGGCATC
GGCGCGGGGA TCACGCCGTT CAACTTCCCC GCGATGATCC CGCTGTGGAT GGGGGGCGTC
GCGACCGCGG TGGGCAACGC CTTCATCCTG AAGCCCAGCG AGCGCGACCC GTCGGTGCCC
GTGCGCCTGT CGGAACTCTT CCTCGAAGCC GGGATGCCCG AGGGGATTTT CCAGACCGTC
CACGGCGACA AGGAAATGGT CGACGCGATC CTCGACCATC CCGACATCGG CGCGGTCAGC
TTCGTCGGTT CGTCGGACAT CGCGCATTAT GTCTATAATC GCGGCGTTGC GAACGGCAAG
CGCGTGCAGG CAATGGGCGG GGCCAAGAAC CATGGCATCG TCATGCCCGA CGCCGATCTC
GACCAGGTGG TGAACGACCT GACCGGCGCG GCCTTCGGCT CGGCGGGCGA ACGCTGCATG
GCGCTGCCCG TCGTCGTTCC CGTCGGTGAG GACACCGCGA ACCGCCTGCG CGCAAAACTC
GTCCCCGCGA TCGAGGCGCT GCGCGTCGGC GTGTCGACCG ATACCGAGGC GCATTACGGC
CCGGTGGTGA CAGAGGCGCA CAAGGAAAAG GTCGAAGGCT GGATCGCCAA ATGCGCCGAC
GAAGGTGCCG AGCTGGTCAT CGACGGCCGC GGCTTCACGC TGCAGGGCCA CGAAAAGGGC
TTCTTCGTCG GCCCGACGCT GTTCGACCAT GTCACCCCCG ACATGGAATC ATACAAGGAA
GAGATTTTCG GCCCCGTGCT CCAGATCGTC CGCGCGCCCG ATTTCGAAAC CGCGCTCGAA
CTGCCGTCGA AGCATCAATA TGGCAATGGC GTCGCGATCT TTACGCGCAA CGGCCACGCC
GCGCGCGAAT TTGCGGCGCG GGTCAATGTC GGCATGGTCG GCATCAACGT GCCGATCCCG
GTGCCGGTCG CCTATCACAG CTTCGGCGGC TGGAAACGGT CGGCGTTCGG CGACACCAAC
CAGCATGGCA TGGAAGGCGT GAAGTTCTGG ACCAAGGTGA AGACCGTGAC CGCGCGCTGG
CCCGACGGAT CGCCTGATGG CGGCAACGCC TTCGTTATCC CGACGATGGG TTGA
 
Protein sequence
MRQIDHHIVG GAAGSARFGD VFDPNNGGVQ ARVALGDRAI LDRAVAAAKA AQPAWAATNP 
QRRARVMFDF KRLVEANMNQ LAEMLSSEHG KVIADSKGDI QRGLEVIEFC CGIPHVLKGE
YTQGAGPGID VYSMRQPVGI GAGITPFNFP AMIPLWMGGV ATAVGNAFIL KPSERDPSVP
VRLSELFLEA GMPEGIFQTV HGDKEMVDAI LDHPDIGAVS FVGSSDIAHY VYNRGVANGK
RVQAMGGAKN HGIVMPDADL DQVVNDLTGA AFGSAGERCM ALPVVVPVGE DTANRLRAKL
VPAIEALRVG VSTDTEAHYG PVVTEAHKEK VEGWIAKCAD EGAELVIDGR GFTLQGHEKG
FFVGPTLFDH VTPDMESYKE EIFGPVLQIV RAPDFETALE LPSKHQYGNG VAIFTRNGHA
AREFAARVNV GMVGINVPIP VPVAYHSFGG WKRSAFGDTN QHGMEGVKFW TKVKTVTARW
PDGSPDGGNA FVIPTMG