Gene TM1040_2778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2778 
Symbol 
ID4076546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2936897 
End bp2938138 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content61% 
IMG OID638008103 
Productbranched-chain alpha-keto acid dehydrogenase E1 component 
Protein accessionYP_614772 
Protein GI99082618 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.602804 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.653489 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA ACGAAACCTA TGCACCGCTC GCACTGAATG TCCCGGAGCC GGGCTGCCGT 
CCGGGCGATA CGCCTGATTT TTCGTATTTC GAGGTGCCCC GCGCCGGAGC CGTGGCGCGG
CCTTCGGTCG ATGTGGACCC TGACGACATG CGTGAGATGG CGTTTTCCAT CGTGCGTGTC
CTCAACAAAG AGGGCGAAGC GGTGGGCGAC TGGGCCGGAA CGCTCTCTCC GGAAGAGTTG
CGCGAAGGAT TGCGGGACAT GATGCTTCTG CGCGCCTTTG ATGCGCGGAT GCTGAACGCG
CAGCGCCAGG GCAAGACCAG TTTCTACATG CAGCATCTTG GCGAGGAGGC CGTCAGCTGT
GCCTTCTCTC GCGCGCTCAA GGATGGGGAT ATGAATTTTC CCACCTATCG GCAGGCGGGC
CTGTTGATTG CGCGGGACTA TCCGCTGGTC ACCATGATGA ACCAGATCTA TTCCAACGCC
GACGACCCGT TGCACGGGCG GCAGTTGCCG ATCATGTACT CCTCAAAAGA ACACGGCTTC
TTTTCGATTT CGGGCAACCT CGGCACCCAG TTCGTGCAGG CTGTCGGCTG GGCGATGGCC
TCAGCGATTT CCGGTGATAC CAAGATCGCC GCAGGCTGGA TTGGCGATGG TTCCACCGCC
GAGAGCGACT TTCACGCCGC GATGGTGTTT GCCTCGACTT ATAATGCGCC GGTGGTGCTC
AATATCGTCA ACAACCAGTG GGCGATCTCG ACCTTTCAGG GCATCGCACG CGGTGGGGTG
GGCACCTTTG CTGCGCGCGG GCATGGCTTT GGCATCGCGT CGATCCGGGT AGATGGAAAC
GACTATCTCG CGGTCAACGC GGTGGCCAAA TGGGCGGCCG AACGCGCACG TCTGGGCCTT
GGCCCCACCT TGATCGAGCA TGTGACCTAT CGCGCGGGTG GCCATTCCAC CAGCGATGAC
CCTTCGGCCT ATCGCTCCGC AGATGAGGGC GCGGCATGGC CGCTTGGTGA TCCCATCGAC
CGTCTCAAAC GCCACCTTAT CCGGATCGGC GAGTGGAGTG AGGAGCGCCA CAGTCAGGCT
GAGGCGGAGT TGATGGATCA GGTCATCACC GCCCAGAAAG AGGCCGAAAA GGTCGGCACG
CTCGGTGGTG GCAAGGGCCC CTCGCCGCGC GACATGTTCG AGGGCGTGTT CGAGAAAATG
CCGCCTCATC TGATTAGGCA ACGACAGGAA GCGGGATACT GA
 
Protein sequence
MSDNETYAPL ALNVPEPGCR PGDTPDFSYF EVPRAGAVAR PSVDVDPDDM REMAFSIVRV 
LNKEGEAVGD WAGTLSPEEL REGLRDMMLL RAFDARMLNA QRQGKTSFYM QHLGEEAVSC
AFSRALKDGD MNFPTYRQAG LLIARDYPLV TMMNQIYSNA DDPLHGRQLP IMYSSKEHGF
FSISGNLGTQ FVQAVGWAMA SAISGDTKIA AGWIGDGSTA ESDFHAAMVF ASTYNAPVVL
NIVNNQWAIS TFQGIARGGV GTFAARGHGF GIASIRVDGN DYLAVNAVAK WAAERARLGL
GPTLIEHVTY RAGGHSTSDD PSAYRSADEG AAWPLGDPID RLKRHLIRIG EWSEERHSQA
EAELMDQVIT AQKEAEKVGT LGGGKGPSPR DMFEGVFEKM PPHLIRQRQE AGY