Gene Rsph17029_1147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1147 
Symbol 
ID4895466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1188556 
End bp1189884 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content69% 
IMG OID640111733 
Productbranched-chain alpha-keto acid dehydrogenase subunit E2 
Protein accessionYP_001043029 
Protein GI126461915 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID[TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.16247 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACCG AGATCCTGAT GCCCGCGCTG TCTCCGACGA TGGAGGAGGG GACGCTCGCG 
AAATGGCTGA AGAAGGAAGG GGATGAGGTC CGCTCGGGCG ACATCATCGC CGAGATCGAG
ACCGACAAGG CCACCATGGA GTTCGAAGCG GTCGACGAGG GCATCCTCGG CAAGATCCTG
ATCGCCGAGG GCACGGCAGG CGTGAAGGTC AACACGCCCA TCGCCGTGCT GGTGGAAGAG
GGCGAGAGCG TGGACGCCGT GTCCTCCGCC AAGGTGCCGG AGCCGCAGGA ACCGGCCGAC
GAGGCCGCGC CCGCGCAGGA GGCTCCGAAG GCGGCCCCTG CCCCGGCCGC CAAGGCGCCC
GAGGCGCAGG CGGCCCGGTC CGAGGGAGAG CGCGTCTTCG CCTCGCCGCT CGCCCGCCGG
ATCGCCAAGG AGAAGGGGAT CGACCTTGCC GCGGTGCAGG GCTCGGGCCC CCGCGGCCGG
ATCGTGAAGG CCGATGTCGA GGGGGCGCGA CCCTCGGCCG CGCCCGCCGC CAAGGCGGAT
GTCGCGGCAC CGAAGGCAGA AGCGCCCGCC GCTGCGGCCG CGCCCGTCGC CGCGCCGGCC
GCCTCCGCGG CTTCGGTGGC GAAGCTCTTC GCGGATCGCG ACTATGAGGA AGTGACGCTC
GACGGGATGC GCAAGACCAT TGCCGCGCGT CTCTCCGAGG CCAAGCAGAC CATCCCGCAC
TTCTACCTCC GGCGCGAGGT GGCTCTGGAT GCGCTGATGG CCTTCCGCGC CGATCTCAAC
GCGAAGCTCG AGAGCCGGGG AGTGAAGCTC TCGGTCAACG ACTTCATCAT CAAGGCCTGT
GCGGTGGCGC TCCAGCAGGT GCCGAACGCG AATGCCGTCT GGGCCGGTGA CCGGATCCTG
CGGCTGAAGC CCTCGGACGT GGCGGTGGCC GTGGCCATCG AGGGCGGGCT CTTCACGCCG
GTCCTGCGCG ATGCGCACCA GAAGAGCCTG TCGGCGCTGT CGGCCGAGAT GAAGGATCTC
GCCGCCCGCG CCCGCACGAA GAAGCTCGCG CCTCACGAAT ATCAGGGCGG CAGCTTCGCG
ATCTCGAACC TCGGCATGTT CGGGGTGGAG AATTTCGATG CGGTCATCAA CCCGCCGCAC
GGCTCGATCC TCGCCGTCGG GGCAGGCATC CGCAAGCCGG TGGTGGGCAA GGACGGGGCG
ATCACGACCG CCACCATGAT GTCGATGACG CTCTCGGTGG ACCACCGGGT GATCGACGGC
GCGCTGGGGG CCGAGTTCCT GAAGGCGATC GTCGAGAATC TCGAGAACCC GATCGCGATG
CTGGCCTGA
 
Protein sequence
MATEILMPAL SPTMEEGTLA KWLKKEGDEV RSGDIIAEIE TDKATMEFEA VDEGILGKIL 
IAEGTAGVKV NTPIAVLVEE GESVDAVSSA KVPEPQEPAD EAAPAQEAPK AAPAPAAKAP
EAQAARSEGE RVFASPLARR IAKEKGIDLA AVQGSGPRGR IVKADVEGAR PSAAPAAKAD
VAAPKAEAPA AAAAPVAAPA ASAASVAKLF ADRDYEEVTL DGMRKTIAAR LSEAKQTIPH
FYLRREVALD ALMAFRADLN AKLESRGVKL SVNDFIIKAC AVALQQVPNA NAVWAGDRIL
RLKPSDVAVA VAIEGGLFTP VLRDAHQKSL SALSAEMKDL AARARTKKLA PHEYQGGSFA
ISNLGMFGVE NFDAVINPPH GSILAVGAGI RKPVVGKDGA ITTATMMSMT LSVDHRVIDG
ALGAEFLKAI VENLENPIAM LA