Gene Information |
Locus tag | Saro_0855 |
Symbol | |
ID | 3915911 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 905920 |
End bp | 907416 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640443588 |
Product | methylmalonate-semialdehyde dehydrogenase [acylating] |
Protein accession | YP_496134 |
Protein GI | 87198877 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01722] methylmalonic acid semialdehyde dehydrogenase |
|
|
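The coordinate and length fields above are mutually consistent if one assumes 1-based, inclusive coordinates and a gene length that counts the terminal stop codon (which contributes no residue to the protein). The following minimal sketch checks that arithmetic; the function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record for Saro_0855.
length = gene_length_bp(905920, 907416)
print(length, protein_length_aa(length))  # 1497 498
```

Both results match the "Gene Length" and "Protein Length" rows of the record.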
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.345114 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCTGA TTGACCATTT CATCGTCGGC GGCCCCGGTG GCGCGCCTGC CCGCAAGAGC CCGATCTTCG ATCCCAACAA TGGCGGCGTG CAGGCCGAAG TCGCGCTCGG CACCGCAGAG ACGCTCGAGC GCGCGGTGCA GGCCGCGCTG AAGGCGCAGC CGGCATGGGC CGCGACCAAT CCGCAGCGCC GCGCCCGCGT GATGTTCCGC TTCAAGGAAC TGGTCGAGGC CAACATGGAC AGCCTCGCCC ACATGCTCTC GTCCGAGCAC GGCAAGGTCA TCGCAGACTC CAGGGGGGAC ATCCAGCGCG GCCTCGAGGT CGTCGAGTTC GCCTGCGGCA TCCCGCACGT CCTCAAGGGC GAATATACTC ACGGCGCGGG GCCGGGCATC GACGTCTATT CGACCCGCCA GCCCATCGGC ATCGGAGCCG GCATCACGCC GTTCAACTTC CCCGGCATGA TCCCGCTGTG GATGAGCGCG ATTGCCATCG CCACCGGCAA CGCCTTCATC ATCAAGCCGT CCGAGCGCGA TCCTTCGGTG CCGGTGCGCC TCGCTGAGCT GTTCATCGAA GCAGGTCTTC CCGAAGGCAT CTGCCAGGTC GTCCACGGCG ACAAGGAAAT GGTCGACGCG ATCCTCGATC ACCCGGCGAT CGGCGCCATC AGCTTCGTTG GATCGTCGGA CATCGCGCAC TACGTGTACA ATCGCGGCGT TGCCGCCGGA AAGCGCGTGC AGGCGATGGG CGGGGCCAAG AACCACGGCG TGGTCATGCC CGATGCCGAT CTCGACCAGG TCGTGAACGA CCTGTGCGGC GCCGCTTTCG GCTCTGCCGG CGAACGCTGC ATGGCACTGC CAGTGGTCGT GCCCGTCGGC CATGACACCG CCGAGCGCCT GCGCGCCAAG CTGATCCCGG CGATCCACGC GCTCAAGGTC GGCATCTCGA CCGATCCCGA GGCGCACTAC GGTCCGGTGG TGACCCAGGC GCACAAGGAA AAGGTCGAAG GCTGGATCGA CAAGTGCATC GAGGAAGGCG GCGAACTCGT CGTCGACGGT CGCGGCTTCA CCCTGCAGGG GCACGAGAAC GGCTTCTTCG TCGGCCCGAC GCTGATCGAC CATGTCACGC CCGACATGGA CAGCTACCAC AACGAGATCT TCGGCCCGGT GCTGCAGATC GTGCGCGCCG AGAACTTCGA GCAGGCGCTC GAACTGCCGA GCAAGCACCA GTACGGAAAC GGCGTCGCCA TCTTCACCCG CAACGGCCAC GCCGCGCGTG AATTTGCCGC CCGCGTCAAC GTCGGCATGG TTGGCATCAA CGTGCCGATC CCGGTGCCTG TGGCCTACCA CACCTTCGGC GGGTGGAAGC GTTCGGCGTT CGGTGACACC AACCAGCACG GCATGGAAGG CGTGAAGTTC TGGACCAAGG TCAAGACCGT CACGCAGCGC TGGCCGGATG GCTCGCCCGA CGGCGGCAAC GCCTTCGTCA TCCCGACGAT GGGCTGA
|
Protein sequence | MRLIDHFIVG GPGGAPARKS PIFDPNNGGV QAEVALGTAE TLERAVQAAL KAQPAWAATN PQRRARVMFR FKELVEANMD SLAHMLSSEH GKVIADSRGD IQRGLEVVEF ACGIPHVLKG EYTHGAGPGI DVYSTRQPIG IGAGITPFNF PGMIPLWMSA IAIATGNAFI IKPSERDPSV PVRLAELFIE AGLPEGICQV VHGDKEMVDA ILDHPAIGAI SFVGSSDIAH YVYNRGVAAG KRVQAMGGAK NHGVVMPDAD LDQVVNDLCG AAFGSAGERC MALPVVVPVG HDTAERLRAK LIPAIHALKV GISTDPEAHY GPVVTQAHKE KVEGWIDKCI EEGGELVVDG RGFTLQGHEN GFFVGPTLID HVTPDMDSYH NEIFGPVLQI VRAENFEQAL ELPSKHQYGN GVAIFTRNGH AAREFAARVN VGMVGINVPI PVPVAYHTFG GWKRSAFGDT NQHGMEGVKF WTKVKTVTQR WPDGSPDGGN AFVIPTMG
|
| |
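The gene and protein sequences above are printed in blocks of ten characters, so the spaces have to be stripped before any computation. The sketch below is one possible way to recompute the GC fraction and to translate the coding sequence; it is not the tool the database used. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table in the conventional TCAG ordering. The amino-acid assignments
# are those of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing used in the record and normalise case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a CDS from its first base, stopping at the first stop codon.

    Assumption: the CDS starts with ATG (true for this record). A GTG or TTG
    start codon would need to be read as M, which this sketch does not handle.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three ten-base blocks of the gene sequence in the record.
first_30 = "ATGCGCCTGA TTGACCATTT CATCGTCGGC"
print(translate(first_30))  # MRLIDHFIVG
```

The printed residues are the first ten-residue block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate` in the same way allows the "GC content" and "Protein sequence" fields to be checked.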