Gene Rpal_3206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3206 
Symbol 
ID6410876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp3453317 
End bp3454723 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content66% 
IMG OID642713083 
Productbranched-chain alpha-keto acid dehydrogenase subunit E2 
Protein accessionYP_001992184 
Protein GI192291579 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID[TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.267356 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGATCA ACATTCTGAT GCCCGCGCTG TCGCCGACCA TGGAGAAGGG CAACCTTGCG 
AAGTGGCTGA AGAAGGAGGG CGACAAGGTC AAGAGCGGCG ACGTCATCGC CGAGATCGAG
ACCGACAAGG CGACGATGGA AGTCGAAGCC GCCGACGAGG GGACGCTCGC CAAGATCATC
GTGCCCGAAG GCACCCAGGA CGTGCCGGTC AACGACGTGA TCGCGGTGCT GGCTGCGGAC
GGCGAGGACG TCAAGGCCGC GGGGGCAGGC TGGAAGGCCA GTGCCGGCGG TGCCTCCTCA
CCCCAGCCCT CTCCCCAGAG GGAAGAGGGA GCCGGCCCCG CGGGCGGCAA GGCCGAGGCC
AATTCGCACA TACAGGACAA GGCTGACCAA AGGCCGACGC CACAGCCGCC TTCACCTCTC
CCTAACGGGG ACAGGTCGCC GCCGCAGGCG GCGGGTGAGG GGGCTCCAGC GCCGGCGAAT
GGCCGCGTGT TCGCTTCGCC CTTGGCGCGG CGGCTGGCGA AGGATGCCGG CATCGATATC
GCCCGGGTCA CCGGCACCGG ACCGCACGGC CGCGTCATTG CCCGCGACGT CGAGCAGGCC
AAGAGCGGTG GCGGCCTGAA GGCAGCGGCT GCGGCGCCTG CGGCCGGTCC CGCGATCGCG
CCGGCGATGT CGGATCAGCA GATCCGCGCG CTGTATCCGG AAGGCTCCTA TGAAGTCGTG
CCGCACGACG GCATGCGCCG GACCATTGCG CAAAGGTTGA CGCAGTCGAC CCAGACCATC
CCGCATTTCT ATCTGACGAT CGACTGCAAC CTCGATCGCC TGATGGCCGC GCGCGAGGAC
ATCAACGCCG CCGCGCCGAA GGACAAGGAC GGCAAGCCGG CCTACAAGCT GTCGGTCAAT
GACTTCATCA TCAAGGCGAT GGCGATCGCC CTGCAGCGTA TCCCCGACGC CAACGTATCA
TGGACCGAAG GCGGGATGTT GAAGCACAAG CATTCCGACA TCGGCGTTGC GGTGGCGATG
CCGGGCGGAC TGATCACGCC GATCATCCGC AGCGCCGAAA CTCAGTCGCT GTCGTCGATC
TCGGCGCAGA TGAAGGATTT TGCCGCACGT GCGCGGGCCC GCAAGCTGAA GCCTGAAGAG
TACCAGGGCG GCACCACCGC GGTGTCCAAT CTGGGGATGT TCGGAATCAA GGACTTCACC
GCCGTGATCA ACCCGCCGCA TGCTACGATC CTGGCGGTCG GCACCGGCGA GCAGCGCCCG
ATCGCCCGGG ACGGCAAGAT CGAGATCGCT ACCATGATGA GTGTGACGCT GAGTTGCGAT
CACCGCGCCG TCGACGGTGC GCTCGGCGCC GAACTGATCG GCGCCTTCAA GACGCTGATC
GAAAATCCCG TGATGATGAT GGTGTGA
 
Protein sequence
MPINILMPAL SPTMEKGNLA KWLKKEGDKV KSGDVIAEIE TDKATMEVEA ADEGTLAKII 
VPEGTQDVPV NDVIAVLAAD GEDVKAAGAG WKASAGGASS PQPSPQREEG AGPAGGKAEA
NSHIQDKADQ RPTPQPPSPL PNGDRSPPQA AGEGAPAPAN GRVFASPLAR RLAKDAGIDI
ARVTGTGPHG RVIARDVEQA KSGGGLKAAA AAPAAGPAIA PAMSDQQIRA LYPEGSYEVV
PHDGMRRTIA QRLTQSTQTI PHFYLTIDCN LDRLMAARED INAAAPKDKD GKPAYKLSVN
DFIIKAMAIA LQRIPDANVS WTEGGMLKHK HSDIGVAVAM PGGLITPIIR SAETQSLSSI
SAQMKDFAAR ARARKLKPEE YQGGTTAVSN LGMFGIKDFT AVINPPHATI LAVGTGEQRP
IARDGKIEIA TMMSVTLSCD HRAVDGALGA ELIGAFKTLI ENPVMMMV