Gene PCC8801_4151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4151 
Symbol 
ID7105991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4349761 
End bp4350795 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content47% 
IMG OID643477140 
Productpyruvate dehydrogenase (acetyl-transferring) E1 component, alpha subunit 
Protein accessionYP_002374239 
Protein GI218248868 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID[TIGR03182] pyruvate dehydrogenase E1 component, alpha subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTTCTG AACGCACTTT ACCCCAATTT AACACCGCTA CGGTCAACAT TACGAAAGAG 
GAAGGACTCC TCCTCTACGA AGATATGATG CTCGGACGCT TGTTTGAAGA CAAGTGTGCT
GAAATGTATT ACCGAGGTCG GATGTTTGGG TTTGTCCATC TCTACAACGG ACAAGAAGCT
ATCTCGACGG GGATCATTAA AGCGTTGCGT TCTGGGGAAG ATTATGTGTC GAGTACCTAT
CGAGATCACG TCCATGCCCT CAGTTGTGGG GTTCCTGCGC GGGAAGTCAT GGCAGAATTG
TTCGGGAAAG AGACTGGATG CAGTAAAGGA CGGGGTGGCT CGATGCACCT GTTTTCTGCT
CAACATCGAC TGTTAGGCGG TTATGCTTTT GTGGCTGAAG GTATCCCCGT AGCCATGGGG
GCAGCCTTTC AAAGTAAATA TCGACGGGAA GCCATGGGTG ATCCCAATGC AGATCAAGTG
ACGGTCTGTT TCTTCGGGGA TGGGGCCAGC AATAACGGTC AATTCTTTGA GTGTCTGAAT
ATGTCGGCTC TGTGGAAATT ACCGATTATT TATGTGGTAG AAAATAATAA ATGGGCGATC
GGCATGGCTC ATGATCGCGC GACTTCTCAA CCAGAAATCT ACAAAAAAGC CAGTGTTTTT
AGTATGGCCG GGGTTGAAGT TGATGGGATG GATGTTTTAG CCGTTCGTTC TGTGGCTCAA
GAAGCGATCG CTAGAGCCCG CGCAGGGGAG GGTCCAACCT TAATTGAAGC CCTTACCTAT
CGGTTTCGGG GTCACTCCTT GGCTGATCCT GATGAACTAC GAGCCCCTGA TGAAAAGCAA
TTTTGGGGAG CGCGTGATCC CATTACTAAG TTAGCGACCT ATTTAGTTGA ACACAATTTG
GCTAATAGTC AAGAACTCAA AGATATCGAA AAACGAGTGC AAGAAACCAT TAATGAAGCG
GTGCAATTTG CTGAAAACAG TCCAGAACCC GATCCTAGTG AACTTTATCG CTATATTTTT
GCTGAGGATG AATAA
 
Protein sequence
MVSERTLPQF NTATVNITKE EGLLLYEDMM LGRLFEDKCA EMYYRGRMFG FVHLYNGQEA 
ISTGIIKALR SGEDYVSSTY RDHVHALSCG VPAREVMAEL FGKETGCSKG RGGSMHLFSA
QHRLLGGYAF VAEGIPVAMG AAFQSKYRRE AMGDPNADQV TVCFFGDGAS NNGQFFECLN
MSALWKLPII YVVENNKWAI GMAHDRATSQ PEIYKKASVF SMAGVEVDGM DVLAVRSVAQ
EAIARARAGE GPTLIEALTY RFRGHSLADP DELRAPDEKQ FWGARDPITK LATYLVEHNL
ANSQELKDIE KRVQETINEA VQFAENSPEP DPSELYRYIF AEDE