Gene Saro_0855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0855 
Symbol 
ID3915911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp905920 
End bp907416 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content66% 
IMG OID640443588 
Productmethylmalonate-semialdehyde dehydrogenase [acylating] 
Protein accessionYP_496134 
Protein GI87198877 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.345114 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCTGA TTGACCATTT CATCGTCGGC GGCCCCGGTG GCGCGCCTGC CCGCAAGAGC 
CCGATCTTCG ATCCCAACAA TGGCGGCGTG CAGGCCGAAG TCGCGCTCGG CACCGCAGAG
ACGCTCGAGC GCGCGGTGCA GGCCGCGCTG AAGGCGCAGC CGGCATGGGC CGCGACCAAT
CCGCAGCGCC GCGCCCGCGT GATGTTCCGC TTCAAGGAAC TGGTCGAGGC CAACATGGAC
AGCCTCGCCC ACATGCTCTC GTCCGAGCAC GGCAAGGTCA TCGCAGACTC CAGGGGGGAC
ATCCAGCGCG GCCTCGAGGT CGTCGAGTTC GCCTGCGGCA TCCCGCACGT CCTCAAGGGC
GAATATACTC ACGGCGCGGG GCCGGGCATC GACGTCTATT CGACCCGCCA GCCCATCGGC
ATCGGAGCCG GCATCACGCC GTTCAACTTC CCCGGCATGA TCCCGCTGTG GATGAGCGCG
ATTGCCATCG CCACCGGCAA CGCCTTCATC ATCAAGCCGT CCGAGCGCGA TCCTTCGGTG
CCGGTGCGCC TCGCTGAGCT GTTCATCGAA GCAGGTCTTC CCGAAGGCAT CTGCCAGGTC
GTCCACGGCG ACAAGGAAAT GGTCGACGCG ATCCTCGATC ACCCGGCGAT CGGCGCCATC
AGCTTCGTTG GATCGTCGGA CATCGCGCAC TACGTGTACA ATCGCGGCGT TGCCGCCGGA
AAGCGCGTGC AGGCGATGGG CGGGGCCAAG AACCACGGCG TGGTCATGCC CGATGCCGAT
CTCGACCAGG TCGTGAACGA CCTGTGCGGC GCCGCTTTCG GCTCTGCCGG CGAACGCTGC
ATGGCACTGC CAGTGGTCGT GCCCGTCGGC CATGACACCG CCGAGCGCCT GCGCGCCAAG
CTGATCCCGG CGATCCACGC GCTCAAGGTC GGCATCTCGA CCGATCCCGA GGCGCACTAC
GGTCCGGTGG TGACCCAGGC GCACAAGGAA AAGGTCGAAG GCTGGATCGA CAAGTGCATC
GAGGAAGGCG GCGAACTCGT CGTCGACGGT CGCGGCTTCA CCCTGCAGGG GCACGAGAAC
GGCTTCTTCG TCGGCCCGAC GCTGATCGAC CATGTCACGC CCGACATGGA CAGCTACCAC
AACGAGATCT TCGGCCCGGT GCTGCAGATC GTGCGCGCCG AGAACTTCGA GCAGGCGCTC
GAACTGCCGA GCAAGCACCA GTACGGAAAC GGCGTCGCCA TCTTCACCCG CAACGGCCAC
GCCGCGCGTG AATTTGCCGC CCGCGTCAAC GTCGGCATGG TTGGCATCAA CGTGCCGATC
CCGGTGCCTG TGGCCTACCA CACCTTCGGC GGGTGGAAGC GTTCGGCGTT CGGTGACACC
AACCAGCACG GCATGGAAGG CGTGAAGTTC TGGACCAAGG TCAAGACCGT CACGCAGCGC
TGGCCGGATG GCTCGCCCGA CGGCGGCAAC GCCTTCGTCA TCCCGACGAT GGGCTGA
 
Protein sequence
MRLIDHFIVG GPGGAPARKS PIFDPNNGGV QAEVALGTAE TLERAVQAAL KAQPAWAATN 
PQRRARVMFR FKELVEANMD SLAHMLSSEH GKVIADSRGD IQRGLEVVEF ACGIPHVLKG
EYTHGAGPGI DVYSTRQPIG IGAGITPFNF PGMIPLWMSA IAIATGNAFI IKPSERDPSV
PVRLAELFIE AGLPEGICQV VHGDKEMVDA ILDHPAIGAI SFVGSSDIAH YVYNRGVAAG
KRVQAMGGAK NHGVVMPDAD LDQVVNDLCG AAFGSAGERC MALPVVVPVG HDTAERLRAK
LIPAIHALKV GISTDPEAHY GPVVTQAHKE KVEGWIDKCI EEGGELVVDG RGFTLQGHEN
GFFVGPTLID HVTPDMDSYH NEIFGPVLQI VRAENFEQAL ELPSKHQYGN GVAIFTRNGH
AAREFAARVN VGMVGINVPI PVPVAYHTFG GWKRSAFGDT NQHGMEGVKF WTKVKTVTQR
WPDGSPDGGN AFVIPTMG