Gene TM1040_1077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1077 
Symbol 
ID4076310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1152399 
End bp1153739 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content65% 
IMG OID638006381 
Productbranched-chain alpha-keto acid dehydrogenase subunit E2 
Protein accessionYP_613072 
Protein GI99080918 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID[TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.144082 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.644909 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCACTG AAATTCTCAT GCCCGCCCTC TCTCCCACCA TGGAGGAAGG CACGCTGGCG 
AAATGGCTCG TCAAAGAAGG CGACACCGTC TCCTCTGGCG ATCTGATTGC CGAAATCGAA
ACCGACAAGG CCACGATGGA ATTCGAGGCC GTGGACGAAG GTGTCGTTGG CAAGATCCTG
ATCGCCGAAG GCTCCGAGGG TGTGAAGGTC AACACCCCCA TCGCGGTGCT GCTGGAAGAC
GGCGAAAGCG CCGATGATAT CGACACATCG GCGGCAACAC CGGAAGCGGC CCCCGCCGCT
GACGCGGCTG CCGAGGAGGC GCCTGCCGCC GCCGAGAAAG CCGCCGCCCC GGCTGCGGCA
ACCCCTGCCC CGGCAGCGCC CGCTGCGGCA GATGGCTCGC GCATCTTTGC CTCGCCACTG
GCGCGTCGCA TCGCTGCCGA CAAGGGACTC GACCTGAGCG CCATCAAAGG CTCCGGCCCC
CGTGGTCGCA TCATCAAGGT GGACGTGGAA AACGCCACCG CCGCGCCCAA GGCCGACGCA
CAGACCGACG CGCAGGCTGC CGCCGCCCCT GCGGCAAGTG CCTCCCCCGC GCCAGTCGCA
GCCCCCGCCG GCCCCTCCGC CGATCAGGTG GCCAAGATGT ACGAGGGCCG CAGCTTCGAG
GAAGTCAAAC TCGACGGGAT GCGCAAGACC ATTGCCGCGC GTCTCACCGA AGCCAAGCAG
ACCATCCCGC ATTTCTACCT GCGCCGCGAC ATCCAGCTCG ACGCGCTGTT GAAATTCCGC
GCGCAGCTCA ACAAGCAGCT TGAAGGCCGC GGTGTGAAGC TCTCGGTCAA CGACTTCATC
ATCAAGGCCG TGGCGCTGGC GCTGCAATCG GTGCCGGACG CCAACGCCGT GTGGGCCGGG
GATCGTGTGC TCAAGATGAA AGCCTCCGAT GTGGCCGTTG CGGTCGCCAT CGACGGCGGT
CTCTTCACGC CGGTCCTGCA AGACGCCGAC ATGAAGTCGC TGTCGGCCCT GTCGAGCGAA
ATGAAAGACC TCGCCACCCG TGCGCGCGAC CGCAAGCTTG CGCCGCATGA ATACCAGGGC
GGCTCCTTCG CGATCTCCAA CCTCGGCATG TTCGGCATCG ACAATTTCGA CGCCATCGTG
AACCCGCCGC ATGCGGGTAT TCTGGCCGTC GGCTCCGGCG TCAAGAAACC CGTGGTGGGC
GCCGATGGCG AGCTGACCGT TGCCACCGTC ATGAGCGTCA CCATGTCCGT GGATCACCGC
GTGATCGACG GCGCATTGGG CGCGGACCTC TTGAAGGCCA TCGTCGACAA TCTGGAAAAC
CCGATGGTGA TGCTGGCCTG A
 
Protein sequence
MPTEILMPAL SPTMEEGTLA KWLVKEGDTV SSGDLIAEIE TDKATMEFEA VDEGVVGKIL 
IAEGSEGVKV NTPIAVLLED GESADDIDTS AATPEAAPAA DAAAEEAPAA AEKAAAPAAA
TPAPAAPAAA DGSRIFASPL ARRIAADKGL DLSAIKGSGP RGRIIKVDVE NATAAPKADA
QTDAQAAAAP AASASPAPVA APAGPSADQV AKMYEGRSFE EVKLDGMRKT IAARLTEAKQ
TIPHFYLRRD IQLDALLKFR AQLNKQLEGR GVKLSVNDFI IKAVALALQS VPDANAVWAG
DRVLKMKASD VAVAVAIDGG LFTPVLQDAD MKSLSALSSE MKDLATRARD RKLAPHEYQG
GSFAISNLGM FGIDNFDAIV NPPHAGILAV GSGVKKPVVG ADGELTVATV MSVTMSVDHR
VIDGALGADL LKAIVDNLEN PMVMLA