Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1909 |
Symbol | |
ID | 3917132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2021459 |
End bp | 2022844 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640444655 |
Product | pyruvate dehydrogenase subunit beta |
Protein accession | YP_497183 |
Protein GI | 87199926 |
COG category | [C] Energy production and conversion |
COG ID | [COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0434985 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAATCG AACTGAAGAT GCCCGCGCTC TCCCCGACGA TGGAGGAGGG CACCTTGGCC AAGTGGCTCG TCAAGGCTGG CGACGAAGTC CGGTCCGGCG ACATTCTTGC AGAGATCGAG ACCGACAAGG CCACGATGGA ATTCGAAGCC GTGGATGAAG GCGTGATCGC CGAAATCCTC GTCGCCGAAG GCACCGAGGG CGTGAAGGTC GGCACCGTGA TCGCGACGAT CCAGGGCGAG GGCGAGGATG CCGCACCCGC AGCGGCAACT CCTGCCGTTG AACAGAAGGT CGAAATGAGC GAAGCCGCGC CAAGCGTCGA GGCCAGGGCC GCACCCGCAG TTGCGATCGC GCCGAAGGTC GACGCAAAGC CGGCAGTAGA TCCTGAGATT CCCGCCGGCA CGGCGATGGT CCCGACCACC GTTCGCGAAG CCCTGCGCGA CGCTATGGCC GAGGAAATGC GCGCCGACGA CCGCGTCTTC GTGATGGGCG AGGAAGTGGC CGAGTACCAA GGTGCCTACA AGGTTACCCA AGGCCTGCTC GATGAATTCG GTCCGCGCCG TGTGATTGAC ACCCCGATCA CCGAGTACGG TTTTGCGGGC ATTGGCGCAG GCGCCGCGAT GGGCGGCCTT CGCCCGATCA TCGAATTCAT GACGTTCAAC TTCGCCATGC AGGCGATCGA CCACATCATC AATTCGGCGG CCAAGACCAA CTACATGTCC GGTGGCCAGA TGCGCTGCCC GATCGTGTTC CGTGGCCCCA ACGGCGCTGC AAGCCGCGTC GGTGCACAGC ACTCCCAGAA CTACGGCCCG TGGTACGCCA ACGTTCCCGG GCTGGTTGTC ATCGCGCCAT ATGACAGCGC CGATGCCAAA GGCCTGATGA AGGCTGCAAT CCGCAGTGAA GACCCGGTGG TCTTCCTGGA AAACGAGCTT GTCTACGGAC GCACTTTCGA CGTGCCGCAG ATGGACGACT TTGTACTGCC CATCGGCAAG GCGCGCATCG TCCGTCAGGG CAAGGACGTG ACCATCGTAT CATATTCGAT CGGCGTCGGG CTTGCTCTTG AAGCGGCGGA GACTCTTGCC GCAGAGGGTA TCGATGCCGA AGTGATCGAT CTTCGCACGC TGCGTCCGCT CGACAAGGAC ACGGTGCTGG CCTCGCTTGC CAAGACGAAT CGTCTGGTTG TGGCGGAAGA AGGGTTCCCG GTCTGCTCGA TTGCTTCGGA AATCATGGCA ATCTGCATGG AGGACGGGTT CGATCATCTC GACGCACCGG TCCTGCGCGT ATGCGACGAA GACGTGCCAC TGCCTTATGC CGCCAACCTC GAGAAGGCTG CGCTTATCGA TGCCGGCAAG ATCGCTGCGG CCGTGCGCAA GGTCTGCTAT CGCTGA
|
Protein sequence | MAIELKMPAL SPTMEEGTLA KWLVKAGDEV RSGDILAEIE TDKATMEFEA VDEGVIAEIL VAEGTEGVKV GTVIATIQGE GEDAAPAAAT PAVEQKVEMS EAAPSVEARA APAVAIAPKV DAKPAVDPEI PAGTAMVPTT VREALRDAMA EEMRADDRVF VMGEEVAEYQ GAYKVTQGLL DEFGPRRVID TPITEYGFAG IGAGAAMGGL RPIIEFMTFN FAMQAIDHII NSAAKTNYMS GGQMRCPIVF RGPNGAASRV GAQHSQNYGP WYANVPGLVV IAPYDSADAK GLMKAAIRSE DPVVFLENEL VYGRTFDVPQ MDDFVLPIGK ARIVRQGKDV TIVSYSIGVG LALEAAETLA AEGIDAEVID LRTLRPLDKD TVLASLAKTN RLVVAEEGFP VCSIASEIMA ICMEDGFDHL DAPVLRVCDE DVPLPYAANL EKAALIDAGK IAAAVRKVCY R
|
| |