Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3560 |
Symbol | |
ID | 5077709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 176934 |
End bp | 178376 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640481284 |
Product | dehydrogenase catalytic domain-containing protein |
Protein accession | YP_001165946 |
Protein GI | 146275786 |
COG category | [C] Energy production and conversion |
COG ID | [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes |
TIGRFAM ID | [TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0000824091 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAATA TCCGCCCGTT CTGCATGCCC AAGTGGGGCA TCGAAATGAC CGAGGGCACC ATCGCCGAAT GGATGGTCAA GGAAGGCGAG GCCTTCAACA AGGGTCAGGT CCTCTGCCTG ATCGAGACCG CCAAGATCAC CAACGAGGTG GAAGCGGAAT ACGACGCGGT CCTGAAGCGC CTGCTGACGC CTGCGAGCGA CGAGGCCCAC CCCGTGGGCG CGCTGCTGGC GGTCTTTGCC GATGCCGACA CGACCGACGC CGAGGTCGAT GAGTTCATCG CCGGCTTCAA GCCCGCCGAG ACCTCGGTTG CGGCCAAGAG CGGCGGCGGT TCGGCCCCGG CCCCCGCCCC GGCTGCGGCG GCGCCTGCTC CGGCGGCTCC CGCGCGCACG CCGACGAAGA TCGTCACCAA CCGCGCAATC AGCCCCGAGG CATTGAAGCT GGCCGAGGCC GAAGGCGTCG ACATCGAACC CATCGAAGGT TCGGGCCGCA ATGGCCGGAT CACCTACCAG GATGTGGTTC AGGCCCTGCG CCCGGAGCGC GCCTTGTCAT ACAAGGGCAG CGCGCAGCTG GTGGAGGACA GCCCCGAGGC CTTCGCGTCG CCGCTGGCCC GCCGCATCGC GGCCCAGCAC GGCATAGCGC TGGCCGGGAT CAAGGGCACC GGCGCGCGCG GCCGTATCTC GAAAGCGGAC GTGATGGCGC TGGTCAAGCC GACCACGGCC GCGGCACCGG TCTTCGGCGC GCCGTTCGAA CTGGTTGCCA ACCAGCCGCA GGTCCAGCCG TTCGACAAGG TACGCAAGGT CGTCGCGCGC CGCCTGACCG AGGCGAAGCA GACGATCCCG CACTTCTACC TGCGCGTCTC GGCCTCGGTC GACGCGCTGA TGGACTTGCG CAAGACGGCC AACCTCGTGC TCGGCACCAA GGCTTCGATC AACGACTACC TGGTCAAGGC CGTGGCGCTG GCGCTGGTGC GGCATCCCGA CGTCAACGTG CAGGTCCATG GCGACAGCGT CCACAGCTTC CCCCACGCCG ATGTCGCCAT CGCGGTTGCC AGCCCCAAGG GCCTGGTCAC CCCGATCGTG CGACAGGCGG ATCGCATGCA CATCGCGCAG ATCGCGGCCA CTACCCGCGC ACTGATCGAC AAGGCACAGG CGGGCCGGCT CGGCTATGAG GACATGGACG GCGGGACCTT CTCGGTGTCG AACCTCGGCA TGTTCGGGAT CGAGCAGTTC GATGCGATCA TCAACCCGCC GCAGGGCGCG ATCCTTGCGG TCGGCGGGGT GAACCGCGTG GCGGTGGAAG CGGCGAACGG CGACATCGCT TTCGAAAACC GCATCCAGCT GACCATGTCG GTCGATCATC GCGCAATCGA TGGCGCTGCG GGCGCGAAGT TCCTGCAGAC GCTCAAGGGC CTGCTCGAAG CGCCGGAAGG ACTGTTCGCA TGA
|
Protein sequence | MANIRPFCMP KWGIEMTEGT IAEWMVKEGE AFNKGQVLCL IETAKITNEV EAEYDAVLKR LLTPASDEAH PVGALLAVFA DADTTDAEVD EFIAGFKPAE TSVAAKSGGG SAPAPAPAAA APAPAAPART PTKIVTNRAI SPEALKLAEA EGVDIEPIEG SGRNGRITYQ DVVQALRPER ALSYKGSAQL VEDSPEAFAS PLARRIAAQH GIALAGIKGT GARGRISKAD VMALVKPTTA AAPVFGAPFE LVANQPQVQP FDKVRKVVAR RLTEAKQTIP HFYLRVSASV DALMDLRKTA NLVLGTKASI NDYLVKAVAL ALVRHPDVNV QVHGDSVHSF PHADVAIAVA SPKGLVTPIV RQADRMHIAQ IAATTRALID KAQAGRLGYE DMDGGTFSVS NLGMFGIEQF DAIINPPQGA ILAVGGVNRV AVEAANGDIA FENRIQLTMS VDHRAIDGAA GAKFLQTLKG LLEAPEGLFA
|
| |