Gene Sala_0902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0902 
Symbol 
ID4082802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp910795 
End bp912270 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content64% 
IMG OID638009263 
Productbetaine-aldehyde dehydrogenase 
Protein accessionYP_615953 
Protein GI103486392 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.864011 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATTTC TGAAATATGC CGACGCCATC TATGTGGGCG GCGAATGGGA GAAGACCGAT 
CGGCGCGAGG CGGTGATCAA CCCCGCGGAC GAAAGTCTGC TGATCGAGGC ACCGGTCGGC
AGCGCGCGGC AGGTCGAAGC CGCAATCGGC GCTGCTCGCC ATGCTTTCGA CAGAAGCGAC
TGGTCGCATC TGCCGGTCGC CGAACGGCAG AAAATCCTGA CCCGCTTTCT GGACGCGCTG
GATGCCCGCA AGGGCGCAAT CGTCGACATG ATCGTTGCCG AGGCGGGCGC GACGCGGATG
CTGGCGGAGT TCCTGCAATA TGGCATCCCG ATGAAACATG CGCGGCGCAC GGTGGAACTG
GCATCGCGGC CCGCCGTCAC CCCGCTGCCC GTCGAACTCA CGCCGAACGC GCAGGGCCGC
ACAACGCTGG GCACCGGCGT GGTCAGCCGC GAGCCGGTCG GGGTGGTGGC CGCCATTTCC
CCCTATAATT TCCCCTTCTT CCTGAACGTC GGCAAGGTCG TTCCGGCGCT TGCGGTGGGC
TGCACGGTGG TTCTGAAGCC GTCGCCCTAC ACCCCGATGG AAGCGCTGAT CCTGGGCGAA
ATCGCCGACG AGGTGGGATT GCCGAAAGGC GTTCTCAGCA TCGTGACCGG CGACATCGAA
ACCGGCAAGC TGCTCACCAC CGATCCGCGC GTCGATCTGG TGCATTTCAC CGGGTCGGAC
AAGGTCGGCG CGATGATCCA GGCGCAGGCG GCGCCGACGC TGAAACGGAT CGTGATGGAA
CTGGGAGGCA AATCGGCGCT GATCGTCCGC AGCGACGCGG ACATTCAAAA GGCCGCCGCA
GCGGGATTGA TGGGGTTCAC CACCCACTGC GGCCAGGGCT GCGCGCTCAC CACCCGCCAT
TTGGTCCACA ACAGCGTCCG GCCGCAATTT GTCGAAGCGC TGAAAGGGAT GCTGACGCAT
ATCAGGATCG GCAACCCCGC CGACCCCGCG GTCAATTACG GCCCGTTGAT CCGCGAAGTC
GCGCGCAAAC GGACCGAGGA TTATGTCGCA ATCGCCCGCG ACGAGGGCGC GACGCTGGTG
TCGGGCGGAA AACGCCCGGA AGGGCTGGAC AAGGGCTTTT ATTTCGAGCC GACGCTGTTC
GACAATGTCA AGAATGACAG CCGGCTGGCC CAGGAAGAGG TTTTCGGACC GATCGGGGCG
GTCATCGGTT TCGACGATGA CGACGAAGCG ATTGCCCTGG CCAATGCCAG CGATTTTGGC
CTGTCCGGCG CGATCTATTC GGCCGATGCC GGGCAGGCCT ATCGGATGGC GCTCAAAATC
CGGACCGGCG GCGTATCGAT CAACGGCGGC GCCGGCACGA TGCAGTCCGA TGCCCCCTTC
GGCGGTATCA AGCGGTCGGG CTATGGCCGC GAATATGGCG AGGATGGCCT GAACGAATTC
ACCTATCAAA AGGTGATCGG TTTCCACGCC GAATAG
 
Protein sequence
MPFLKYADAI YVGGEWEKTD RREAVINPAD ESLLIEAPVG SARQVEAAIG AARHAFDRSD 
WSHLPVAERQ KILTRFLDAL DARKGAIVDM IVAEAGATRM LAEFLQYGIP MKHARRTVEL
ASRPAVTPLP VELTPNAQGR TTLGTGVVSR EPVGVVAAIS PYNFPFFLNV GKVVPALAVG
CTVVLKPSPY TPMEALILGE IADEVGLPKG VLSIVTGDIE TGKLLTTDPR VDLVHFTGSD
KVGAMIQAQA APTLKRIVME LGGKSALIVR SDADIQKAAA AGLMGFTTHC GQGCALTTRH
LVHNSVRPQF VEALKGMLTH IRIGNPADPA VNYGPLIREV ARKRTEDYVA IARDEGATLV
SGGKRPEGLD KGFYFEPTLF DNVKNDSRLA QEEVFGPIGA VIGFDDDDEA IALANASDFG
LSGAIYSADA GQAYRMALKI RTGGVSINGG AGTMQSDAPF GGIKRSGYGR EYGEDGLNEF
TYQKVIGFHA E