Gene Information |
Locus tag | Saro_1045 |
Symbol | |
ID | 3915827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1083306 |
End bp | 1085522 |
Gene Length | 2217 bp |
Protein Length | 738 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640443779 |
Product | branched-chain alpha-keto acid dehydrogenase E1 component |
Protein accession | YP_496324 |
Protein GI | 87199067 |
COG category | [C] Energy production and conversion |
COG ID | [COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit; [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit |
TIGRFAM ID | |
|
|
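The Start bp, End bp, Gene Length and Protein Length fields above agree with one another. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the gene span includes the stop codon (neither is stated in the record itself). The helper name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the span ends with a stop codon that is not translated.
    """
    gene_len = end_bp - start_bp + 1      # inclusive coordinates
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # drop the stop codon
    return gene_len, protein_len


# Values taken from the Gene Information table for Saro_1045.
gene_len, protein_len = cds_lengths(1083306, 1085522)
print(gene_len, protein_len)
```

With the table's values this gives 2217 bp and 738 aa, matching the listed Gene Length and Protein Length.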
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCTTG ATGCCGCCGA GCAGGTCCAC CGGCAGTTCC TTGATGCGCT GGACAAGGGA ACCGCACGAC GGCGTTCCAA CCTTGGGCTG AAGGATGTGG GGCTGGCGCC GGAGAGGGCG GCAGCGCTGT TCCGGTCGCA AGCGCTGTCG CGCCAGCTTG ACCGGCTCAG CCGCAAATTG CAGGCGCGGG GCGAGGGGTT CTATACGATC GGATCGTCGG GCCACGAGGG CAATGCAGTG CTGGCCGAAG TGCTGCGCAT GGATGACATG GCGTTCCTTC ACTATCGCGA CGCCGCGTTC CAGATCCACC GCGCCCACCG CGTGCCGGGC GAGAATCCGG CGTGGGACAT GCTGCTGAGC TTCACGGCCA GCATGGAGGA CCCGATTTCG GGCGGGCGCC ACAAGGTGCT GGGATCCAAG CGGCTGTTCA TTCCGCCGCA GACGTCGACC ATCGCCAGCC ACTTGCCCAA GGCGGTGGGC GCGGCCTTTT CCATCGGCAT CGCGCGCAGG ATGGGATTCG ACGACACCGT GCTGTCGAAG GACGGGGTCG TGCTGTGCAG CTTCGGCGAT GCTAGCGCGA ACCATTCGAC CGCGCTGGGC GCGATCAACA CTGCGTGCTG GGCGGCGTTT CAGGGCACGC CAATGCCGAT CATCTTCCTG TGCGAGGACA ACGGCATCGG CATCTCCACA CGCACGCCGC CGGGATGGAT CGAGGCGAAC TTCTCGGGCA GGGCGGGGCT GAACTACATT CCCTGCGACG GATCGGACCT TGTCGATACC TGCGCGGCCG CAAGACAGGC ACTGGAGATC GCGAGGCGGC AGCGAAAGCC GGTGTTCCTG CACATGAAGA CGGTGCGGCT CTACGGCCAC GCGGGCAATG ACGTGCAGCT TGCCTATCGC AGCAAGGAGG AGATCCGAGC CGAGGAAGAG CGCGATCCGC TGCTGGCGAG CGCGGCCTTG CTGATCGAGG AAGGCGTCAT GTCGGCGGCG CAGGTGCGCG GCGTCTATGA CGAGATCGAG GCCACGCTGG AACGGCAGGT GGAGCTTGCC ATCAAGCGCC CCAAGCTGCC CGACGCGGCG GCGGTGATGG CCAGCATCGT GCCCCCCAGG CGCGAAGGGG CGGCGCGCCC TCAGGCTTCG GCGCACGAGC GTGCCGCGCT CTTTGCCGAC GATGCCGCCG CGATGGACAA GCCGCAGCAT ATGGCGAAGC TCATCAGTTG GGCCATGGCG GACCTGCTGT TGCAGTACCC CAACGCGATC GTCTGCGGCG AGGACGTGGG GCCGAAGGGC GGGGTCTATG CCGCGACGCA AAAGCTGCAC GCGCGGTTCG GATCGGCGCG GGTGATCAAT ACCCTCCTCG ACGAGCAGGC AATTCTCGGG CTTGCCATCG GGGCTGCGCA CAACGGGCTG CTGCCGATGC CCGAGATCCA GTTCCTTGCC TATGTCCACA ACGCCGAGGA CCAGATCCGG GGCGAGGCGG CGACACTCTC GTTCTTTTCG AACGGGCAAT ACACCAACCC GATGGTCGTG CGGATCGCGG GCCTGCCCTA CCAGAAGGGG TTTGGCGGGC ACTTCCACAA CGACAACTCG CTGGCCGTCT TCCGCGACAT TCCGGGCGTG GTGCTGGCGG TGCCGTCGAA CGGGCGCGAT GCCGTGGCGA TGCTGCGCGA ATGCGTGAGG CTGGCGCACG ATGAAGGGCG CGTGGTCGTG TTCGTGGAGC CGATCGCGCT CTACATGACG CGCGACCTGC ATGAGCCGGG CGATGGCATG TGGTCGAGCG TCTACCAACC GCCGGGCGAG 
GGAGAGATCG CGTTTGGCGA GATCGGCGTT TTCGACTCTG GGCGAGGCGA AGGCACCGAC CTGGCCGTGG TGACCTATGG CAATGGCTTT TACCTTTCGC TCCAGGCGCA GAAGTTGCTG TCAGAGCGCG GCGTTAACGT GCGGGTGATC GATCTGCGCT GGCTGGGGCC GGTGAACGAG GCGGCGGTGC TCGATGCGGT CGCGCCGTGT TCGCGCGTGC TGGTGGTGGA CGAATGCCGG ATCACCGGGG GGCAGAACGA GGCGCTGATG GCCCTGCTGG CCGAGCGAGC GCCGGGCAAG GCCATCGCGC GGATGGCGGC GACCGACAGC TTCATCCCGC TCGCGCGCGC GGCAACGCAT ACGCTGCCGA GCCGGGACGG GATCGTGGTC AAGGTGCTGG AGATGGTGCG TGGCTAA
|
Protein sequence | MSLDAAEQVH RQFLDALDKG TARRRSNLGL KDVGLAPERA AALFRSQALS RQLDRLSRKL QARGEGFYTI GSSGHEGNAV LAEVLRMDDM AFLHYRDAAF QIHRAHRVPG ENPAWDMLLS FTASMEDPIS GGRHKVLGSK RLFIPPQTST IASHLPKAVG AAFSIGIARR MGFDDTVLSK DGVVLCSFGD ASANHSTALG AINTACWAAF QGTPMPIIFL CEDNGIGIST RTPPGWIEAN FSGRAGLNYI PCDGSDLVDT CAAARQALEI ARRQRKPVFL HMKTVRLYGH AGNDVQLAYR SKEEIRAEEE RDPLLASAAL LIEEGVMSAA QVRGVYDEIE ATLERQVELA IKRPKLPDAA AVMASIVPPR REGAARPQAS AHERAALFAD DAAAMDKPQH MAKLISWAMA DLLLQYPNAI VCGEDVGPKG GVYAATQKLH ARFGSARVIN TLLDEQAILG LAIGAAHNGL LPMPEIQFLA YVHNAEDQIR GEAATLSFFS NGQYTNPMVV RIAGLPYQKG FGGHFHNDNS LAVFRDIPGV VLAVPSNGRD AVAMLRECVR LAHDEGRVVV FVEPIALYMT RDLHEPGDGM WSSVYQPPGE GEIAFGEIGV FDSGRGEGTD LAVVTYGNGF YLSLQAQKLL SERGVNVRVI DLRWLGPVNE AAVLDAVAPC SRVLVVDECR ITGGQNEALM ALLAERAPGK AIARMAATDS FIPLARAATH TLPSRDGIVV KVLEMVRG
|
| |