Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2778 |
Symbol | |
ID | 4076546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2936897 |
End bp | 2938138 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638008103 |
Product | branched-chain alpha-keto acid dehydrogenase E1 component |
Protein accession | YP_614772 |
Protein GI | 99082618 |
COG category | [C] Energy production and conversion |
COG ID | [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.602804 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.653489 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACA ACGAAACCTA TGCACCGCTC GCACTGAATG TCCCGGAGCC GGGCTGCCGT CCGGGCGATA CGCCTGATTT TTCGTATTTC GAGGTGCCCC GCGCCGGAGC CGTGGCGCGG CCTTCGGTCG ATGTGGACCC TGACGACATG CGTGAGATGG CGTTTTCCAT CGTGCGTGTC CTCAACAAAG AGGGCGAAGC GGTGGGCGAC TGGGCCGGAA CGCTCTCTCC GGAAGAGTTG CGCGAAGGAT TGCGGGACAT GATGCTTCTG CGCGCCTTTG ATGCGCGGAT GCTGAACGCG CAGCGCCAGG GCAAGACCAG TTTCTACATG CAGCATCTTG GCGAGGAGGC CGTCAGCTGT GCCTTCTCTC GCGCGCTCAA GGATGGGGAT ATGAATTTTC CCACCTATCG GCAGGCGGGC CTGTTGATTG CGCGGGACTA TCCGCTGGTC ACCATGATGA ACCAGATCTA TTCCAACGCC GACGACCCGT TGCACGGGCG GCAGTTGCCG ATCATGTACT CCTCAAAAGA ACACGGCTTC TTTTCGATTT CGGGCAACCT CGGCACCCAG TTCGTGCAGG CTGTCGGCTG GGCGATGGCC TCAGCGATTT CCGGTGATAC CAAGATCGCC GCAGGCTGGA TTGGCGATGG TTCCACCGCC GAGAGCGACT TTCACGCCGC GATGGTGTTT GCCTCGACTT ATAATGCGCC GGTGGTGCTC AATATCGTCA ACAACCAGTG GGCGATCTCG ACCTTTCAGG GCATCGCACG CGGTGGGGTG GGCACCTTTG CTGCGCGCGG GCATGGCTTT GGCATCGCGT CGATCCGGGT AGATGGAAAC GACTATCTCG CGGTCAACGC GGTGGCCAAA TGGGCGGCCG AACGCGCACG TCTGGGCCTT GGCCCCACCT TGATCGAGCA TGTGACCTAT CGCGCGGGTG GCCATTCCAC CAGCGATGAC CCTTCGGCCT ATCGCTCCGC AGATGAGGGC GCGGCATGGC CGCTTGGTGA TCCCATCGAC CGTCTCAAAC GCCACCTTAT CCGGATCGGC GAGTGGAGTG AGGAGCGCCA CAGTCAGGCT GAGGCGGAGT TGATGGATCA GGTCATCACC GCCCAGAAAG AGGCCGAAAA GGTCGGCACG CTCGGTGGTG GCAAGGGCCC CTCGCCGCGC GACATGTTCG AGGGCGTGTT CGAGAAAATG CCGCCTCATC TGATTAGGCA ACGACAGGAA GCGGGATACT GA
|
Protein sequence | MSDNETYAPL ALNVPEPGCR PGDTPDFSYF EVPRAGAVAR PSVDVDPDDM REMAFSIVRV LNKEGEAVGD WAGTLSPEEL REGLRDMMLL RAFDARMLNA QRQGKTSFYM QHLGEEAVSC AFSRALKDGD MNFPTYRQAG LLIARDYPLV TMMNQIYSNA DDPLHGRQLP IMYSSKEHGF FSISGNLGTQ FVQAVGWAMA SAISGDTKIA AGWIGDGSTA ESDFHAAMVF ASTYNAPVVL NIVNNQWAIS TFQGIARGGV GTFAARGHGF GIASIRVDGN DYLAVNAVAK WAAERARLGL GPTLIEHVTY RAGGHSTSDD PSAYRSADEG AAWPLGDPID RLKRHLIRIG EWSEERHSQA EAELMDQVIT AQKEAEKVGT LGGGKGPSPR DMFEGVFEKM PPHLIRQRQE AGY
|
| |