Gene RSP_4050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_4050 
SymbolpdhB 
ID3720099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007493 
Strand
Start bp1121645 
End bp1122973 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content69% 
IMG OID640070663 
Productbranched-chain alpha-keto acid dehydrogenase subunit E2 
Protein accessionYP_352544 
Protein GI77463040 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID[TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.605618 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACCG AGATCCTGAT GCCCGCGCTG TCTCCGACGA TGGAGGAGGG GACGCTCGCG 
AAATGGCTGA AGAAGGAAGG GGATGAGGTC CGCTCGGGCG ACATCATCGC CGAGATCGAG
ACCGACAAGG CCACCATGGA GTTCGAGGCG GTCGACGAGG GCATCCTCGG CAAGATCCTG
ATCGCCGAGG GCACGGCAGG CGTGAAGGTC AACACGCCCA TCGCCGTGCT GGTGGAAGAG
GGCGAGAGCG TGGACGCCGT GTCCTCCGCC AAGGTGCCGG AGCCGCAGGA ACCGGCCGAC
GAGGCCGCAC CCGCGCAGGG GGCTCCGAAG GAGGCCCCTG CCCCGGCCGC CAAGGCGCCC
GCGGCGCAGG CGGCCCGATC CGAGGGAGAG CGCGTCTTCG CCTCGCCGCT CGCCCGCCGG
ATCGCCAAGG AGAAGGGGAT CGACCTTGCC GCGGTGCAGG GCTCGGGCCC GCGCGGCCGG
ATCGTGAAGG CCGATGTCGA GGGGGCGCAA CCCTCGGCCG CTCCCGCCGC CAAGGCGGAC
GCCGCGGCAC CGAAGGCAGA AGCGCCCGCC GCTGCGGCCG CGCCCGTCGC CGCGCCGGCC
GCCTCCGCGG CTTCGGTGGC GAAGCTCTTC GCGGATCGCG ACTATGAGGA AGTGACCCTC
GACGGGATGC GCAAGACCAT TGCCGCGCGT CTGTCCGAGG CCAAGCAGAC CATCCCGCAC
TTCTACCTCC GGCGCGAGGT GGCTCTGGAT GCGCTGATGG CTTTCCGCGC CGATCTCAAT
GCGAAGCTCG AGAGCCGGGG CGTAAAGCTC TCGGTCAACG ACTTCATCAT CAAGGCCTGT
GCGGTGGCGC TCCAGCAGGT GCCGAACGCG AATGCCGTCT GGGCCGGAGA CCGGATCCTG
CGGCTGAAGC CCTCGGACGT GGCGGTGGCC GTGGCGATCG AGGGCGGGCT CTTCACGCCG
GTCCTGCGCG ATGCGCACCA GAAGAGCCTG TCGGCGCTGT CGGCCGAGAT GAAGGATCTC
GCCGCCCGCG CCCGCACGAA GAAGCTCGCA CCGCACGAAT ATCAGGGCGG CAGCTTCGCG
ATCTCGAACC TCGGCATGTT CGGGGTCGAG AATTTCGATG CGGTCATCAA CCCGCCGCAC
GGCTCGATCC TCGCCGTCGG CGCAGGCATC CGCAAGCCGG TGGTGGGCAA GGACGGCGCG
ATCACGACGG CCACCATGAT GTCGATGACG CTCTCGGTGG ACCACCGGGT GATCGACGGC
GCGCTGGGGG CCGAGTTCCT GAAGGCGATC GTCGAGAATC TCGAGAACCC GATCGCCATG
CTGGCCTGA
 
Protein sequence
MATEILMPAL SPTMEEGTLA KWLKKEGDEV RSGDIIAEIE TDKATMEFEA VDEGILGKIL 
IAEGTAGVKV NTPIAVLVEE GESVDAVSSA KVPEPQEPAD EAAPAQGAPK EAPAPAAKAP
AAQAARSEGE RVFASPLARR IAKEKGIDLA AVQGSGPRGR IVKADVEGAQ PSAAPAAKAD
AAAPKAEAPA AAAAPVAAPA ASAASVAKLF ADRDYEEVTL DGMRKTIAAR LSEAKQTIPH
FYLRREVALD ALMAFRADLN AKLESRGVKL SVNDFIIKAC AVALQQVPNA NAVWAGDRIL
RLKPSDVAVA VAIEGGLFTP VLRDAHQKSL SALSAEMKDL AARARTKKLA PHEYQGGSFA
ISNLGMFGVE NFDAVINPPH GSILAVGAGI RKPVVGKDGA ITTATMMSMT LSVDHRVIDG
ALGAEFLKAI VENLENPIAM LA