Gene Information |
Locus tag | Saro_1908 |
Symbol | |
ID | 3917131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2020312 |
End bp | 2021457 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640444654 |
Product | pyruvate dehydrogenase (lipoamide) |
Protein accession | YP_497182 |
Protein GI | 87199925 |
COG category | [C] Energy production and conversion |
COG ID | [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit |
TIGRFAM ID | [TIGR03182] pyruvate dehydrogenase E1 component, alpha subunit |
| |
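The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Saro_1908.
length_bp = gene_length(2020312, 2021457)
print(length_bp)                  # 1146
print(protein_length(length_bp))  # 381
```

Both results agree with the Gene Length (1146 bp) and Protein Length (381 aa) fields, which is consistent with the inclusive-coordinate assumption.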
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.591814 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCCAAGT CCGCCAGGCC CAGAGCCGCC AAACCCGCCG CCAAACCTGC TGCAAAATCC GCAGGCAAAT CGCCCGCTTC GGCTGCGGCC GATCCGGTGA TCGCTTCCTC GGTTGCCGAA GAGGCAGCCT TCGCACTACG CAGCCTCCAG CAGGCCCATG CGAACAACAA GCGTTACGAC GCGAGCGATG CGGAACTGCT GAAGTTCTAT GAGCAGATGG TCCTGATCCG CCGGTTCGAG GAAAAGGCGG GCCAGCTTTA CGGCCTCGGC CTCATCGGCG GGTTCTGCCA TCTTTACATC GGCCAGGAAG CCGTGGCAGT CGGTCTTCAG TCGGCGCTGA AGGAAGGTCA CGATAGCGTG ATCACCGGCT ATCGCGATCA CGGGCACATG CTTGCCTACG GCATCGATCC GAAGGTGATC ATGGCAGAGT TGACCGGTCG CGGCGCGGGC ATCTCGCGCG GCAAGGGCGG TTCGATGCAC ATGTTCAGCA CGGACCACAA GTTCTACGGC GGTCACGGCA TCGTCGGAGC GCAGGTTCCG CTCGGAGCGG GCCTTGCCTT TGCACACAAG TATCGCGGTG ACGACGGCGT GTGCATGGCT TACTTCGGCG ACGGCGCGGC AAACCAGGGC CAGGTCTACG AGACCTTCAA CATGGCCGCC CTGTGGAAGC TGCCGATCAT CTTCGTGGTC GAGAACAACG GCTACGCCAT GGGAACCGCG GTCAAGCGGG GGTCGGCAGA GACCGAGTTC TATCGCCGTG GCACCGCGTT CCGCATTCCA GGCATGGACG TCAACGGCAT GGACGTTCTC GAAGTGCGCC AAGCCGCCGA GGTCGCGCTC GAGTATGTTC GTGCGGGCAA CGGCCCCGTG CTCATGGAAC TCAACACCTA CCGTTACCGC GGGCATTCGA TGTCCGACCC CGCAAAGTAT CGCAGTCGCG AGGAAGTGCA GGAAATGCGG GACAAGCACG ATCCTATCGA AGGCGCCAAG GCAGAACTGC TGAAGCGGGG CGTGACCGAG GACAAGATCA AGGAAATCGA CAAGCGCATT CGCCAGATCG TCGCGGAATC GGCCGACTTT GCCGAAACCT CGCCCGAGCC GGACATGGCC GAGCTCTACA CTGACGTGCT GGTGGAGAAG TACTGA
|
Protein sequence | MAKSARPRAA KPAAKPAAKS AGKSPASAAA DPVIASSVAE EAAFALRSLQ QAHANNKRYD ASDAELLKFY EQMVLIRRFE EKAGQLYGLG LIGGFCHLYI GQEAVAVGLQ SALKEGHDSV ITGYRDHGHM LAYGIDPKVI MAELTGRGAG ISRGKGGSMH MFSTDHKFYG GHGIVGAQVP LGAGLAFAHK YRGDDGVCMA YFGDGAANQG QVYETFNMAA LWKLPIIFVV ENNGYAMGTA VKRGSAETEF YRRGTAFRIP GMDVNGMDVL EVRQAAEVAL EYVRAGNGPV LMELNTYRYR GHSMSDPAKY RSREEVQEMR DKHDPIEGAK AELLKRGVTE DKIKEIDKRI RQIVAESADF AETSPEPDMA ELYTDVLVEK Y