Gene P9303_16211 details


Gene Information

Locus tag: P9303_16211
Symbol: pdhB
ID: 4778398
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1420071
End bp: 1421054
Gene Length: 984 bp
Protein Length: 327 aa
Translation table: 11
GC content: 51%
IMG OID: 640087130
Product: pyruvate dehydrogenase E1 beta subunit
Protein accession: YP_001017630
Protein GI: 124023323
COG category: [C] Energy production and conversion
COG ID: [COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit
TIGRFAM ID:
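
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1420071
end_bp = 1421054
gene_length_bp = 984
protein_length_aa = 327


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 984
print(expected_protein_length(gene_length_bp))  # 327
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields reported for this record.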


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.220601
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCAGGGA CGCTTCTCTT TAATGCTCTT CGAGATGCCA TCGATGAAGA GATGGCCAGA 
GATTCGCATG TTTGTGTGAT GGGAGAGGAC GTCGGCCAAT ACGGCGGCTC CTACAAGGTC
ACCAAGGATC TCTACGAGAA ATATGGCGAG TTGCGGGTGT TGGATACACC GATTGCCGAG
AACAGTTTTA CGGGTATGGC CGTTGGCGCC GCCATGACTG GCCTACGCCC GATTGTGGAG
GGCATGAACA TGGGTTTTCT GCTGCTTGCT TTCAACCAGA TCTCCAACAA CATGGGAATG
CTTCGTTACA CCAGTGGCGG AAATTTCACA ATTCCCACCG TGGTGCGTGG GCCTGGTGGT
GTGGGGCGCC AACTCGGTGC TGAACATAGT CAGCGACTTG AGGCCTATTT TCACGCTGTG
CCTGGGATCA AGATCGTTGC TTGCAGCACG CCAACCAATG CCAAGGGCTT GATGAAAGCC
GCGATCCGAG ACAACAATCC AGTTCTCTTT TTCGAGCATG TGCTGCTCTA CAACCTGATT
GAGGAGCTCC CAGACGGTGA TTATGTCTGT GCCCTAGATC AAGCAGATCT GGTTCGTGAG
GGTAAAGACG TCACGATCCT CACCTATTCG CGTATGCGTC ATCACTGTCT CAAGGCTGTT
GAACAGTTGG AGGCAGACGG CATCGATGTG GAATTGATCG ATTTGATTAG TCTCAAGCCC
TTCGATATGG AGACCATTGT TCGCTCCATC CGTAAAACCC ATCGGGTGAT TGTGGTTGAG
GAGTGTATGA AAACTGGTGG GATTGGTGCT GAGTTGATTG CGCTGATTAC TGAGCAGTGT
TTTGACGAAC TCGATGCTCG CCCAATTCGC CTCTCCAGTC AGGACATTCC CACTCCATAT
AACGGCAAAT TGGAGAATTT CACGATCATT CAGCCTCATC AGATTGTTGA AGCGGCTCAG
CAGATTGTTC TTAAGGGGCT TTGA
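
The gene sequence is printed in blocks of ten bases, so it has to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC percentage. It is illustrative only: the helper names are hypothetical, and the example runs on the first 60-base line of the sequence rather than the full gene, so its result is not expected to equal the GC content reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "GTGTCAGGGA CGCTTCTCTT TAATGCTCTT CGAGATGCCA TCGATGAAGA GATGGCCAGA"
fragment = clean_sequence(first_line)

print(len(fragment))         # 60
print(gc_content(fragment))  # 50.0
```

Passing the complete sequence through the same two functions would give the gene-wide figure to compare against the GC content field.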
 
Protein sequence
MSGTLLFNAL RDAIDEEMAR DSHVCVMGED VGQYGGSYKV TKDLYEKYGE LRVLDTPIAE 
NSFTGMAVGA AMTGLRPIVE GMNMGFLLLA FNQISNNMGM LRYTSGGNFT IPTVVRGPGG
VGRQLGAEHS QRLEAYFHAV PGIKIVACST PTNAKGLMKA AIRDNNPVLF FEHVLLYNLI
EELPDGDYVC ALDQADLVRE GKDVTILTYS RMRHHCLKAV EQLEADGIDV ELIDLISLKP
FDMETIVRSI RKTHRVIVVE ECMKTGGIGA ELIALITEQC FDELDARPIR LSSQDIPTPY
NGKLENFTII QPHQIVEAAQ QIVLKGL
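
The protein sequence can be reproduced from the gene sequence with the translation table given in the record (11). The sketch below is a minimal, hand-rolled translator, not the database's own method. It assumes that table 11 assigns codons to amino acids as the standard code does, that an initial codon from the table's start-codon set (here GTG) is read as methionine, and that translation stops at the first stop codon. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed start-codon set for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS; an initial start codon is read as M, stop ends translation."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still encode Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from this record.
first_line = "GTGTCAGGGA CGCTTCTCTT TAATGCTCTT CGAGATGCCA TCGATGAAGA GATGGCCAGA"
print(translate_cds(first_line))  # MSGTLLFNALRDAIDEEMAR
```

The output matches the first 20 residues of the protein sequence above, including the leading M that comes from the GTG start codon.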