Gene TM1040_1079 details

Gene Information

Locus tag: TM1040_1079
Symbol:
ID: 4076312
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1155145
End bp: 1156158
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 60%
IMG OID: 638006383
Product: pyruvate dehydrogenase (lipoamide)
Protein accession: YP_613074
Protein GI: 99080920
COG category: [C] Energy production and conversion
COG ID: [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit
TIGRFAM ID: [TIGR03182] pyruvate dehydrogenase E1 component, alpha subunit
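
The start, end, gene length and protein length fields in this record are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
# Values copied from the record above.
START_BP = 1155145
END_BP = 1156158
GENE_LENGTH_BP = 1014
PROTEIN_LENGTH_AA = 337


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))  # 1014
print(coding_codons(GENE_LENGTH_BP))  # 337
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.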


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.846482
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGCAA GAAAAAGCGC TCAGAAATCA AATGTTTCCG CCGAAGAACT GACCAAGTAC 
TACCGCGAAA TGCTGTTGAT CCGGCGATTC GAGGAAAAGG CGGGTCAGCT CTACGGCATG
GGGTTGATTG GCGGTTTCTG CCACCTCTAC ATCGGTCAGG AAGCCGTTGT GGTCGGCCTT
GAGGCCGCCG CAGAAGAAGG CGACAAACGC GTCACCTCAT ATCGCGACCA CGGGCACATG
CTCGCCTGCG GCATGGATGC CGGCGGCGTG ATGGCCGAAC TCACCGGTCG CGAGGGCGGC
TACTCCAAGG GCAAGGGCGG CTCCATGCAC ATGTTCTCCA AGGAGAAGCA TTTCTATGGT
GGCCACGGCA TCGTCGGCGC GCAGGTGCCG CTCGGCGCAG GCCTTGCCTT TTCCGACAAA
TACAAGGGCA ACGACCGCGT GACCTTCACC TATTTTGGCG ATGGCGCGGC GAACCAGGGC
CAGGTCTACG AGACCTACAA CATGGCGCAG CTCTGGGATC TGCCGGTGAT TTTTGTCATT
GAAAATAACC AATACGCCAT GGGCACGAGC GTCCAGCGCT CCACCAAGTC GCCCGCGCTC
TGGAAGCGCG GCGAGGCCTA CGGCATCAAG GGCGAAGAAG TGGACGGCAT GAACGTTCTG
GCCGTGAAAG AGGCCGGCGA GCGCGCCGTG GCCCACTGCC GCGCGGGCAA GGGTCCCTAT
ATCCTCGAGG TCAAAACCTA CCGCTATCGC GGCCACTCCA TGTCGGACCC GGCGAAATAC
CGGACCCGCG AGGAAGTGCA GAAAATGCGC GAGGAACGCG ATCCGATCGA ACAGGTCCGC
GAGATGCTGC TCACCGGCAA GCACGCCTCC GAGGAAGACC TCAAAGCCAT CGACAAAGAG
ATCAAGGATA TCGTCAACAA GTCCGCTGAT TTCGCCAAAG AGAGCCCCGA GCCCGCGCTC
GAGGAGCTTT GGACCGATAT TTACGCCGAC GATATTCCGC AAAAGAGCGC CTGA
 
Protein sequence
MAARKSAQKS NVSAEELTKY YREMLLIRRF EEKAGQLYGM GLIGGFCHLY IGQEAVVVGL 
EAAAEEGDKR VTSYRDHGHM LACGMDAGGV MAELTGREGG YSKGKGGSMH MFSKEKHFYG
GHGIVGAQVP LGAGLAFSDK YKGNDRVTFT YFGDGAANQG QVYETYNMAQ LWDLPVIFVI
ENNQYAMGTS VQRSTKSPAL WKRGEAYGIK GEEVDGMNVL AVKEAGERAV AHCRAGKGPY
ILEVKTYRYR GHSMSDPAKY RTREEVQKMR EERDPIEQVR EMLLTGKHAS EEDLKAIDKE
IKDIVNKSAD FAKESPEPAL EELWTDIYAD DIPQKSA
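
The gene and protein sequences above can be cross-checked with a few lines of code. The following is a minimal sketch, not the method used to produce this record: it strips the spacing used in the listing, computes a GC fraction, and translates codons with the standard amino-acid assignments, which NCBI translation table 11 shares with table 1. Alternative start codons permitted by table 11 are not handled, which does not matter here because this gene begins with ATG. All function and variable names are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses the same amino-acid assignments as table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence listing above (60 bases).
FIRST_ROW = "ATGGCCGCAA GAAAAAGCGC TCAGAAATCA AATGTTTCCG CCGAAGAACT GACCAAGTAC"
print(translate(FIRST_ROW))  # MAARKSAQKSNVSAEELTKY
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the Protein Length and GC content fields.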