Gene P9303_19591 details


Gene Information

Locus tag: P9303_19591
Symbol: acoA
ID: 4777211
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1723691
End bp: 1724782
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 54%
IMG OID: 640087469
Product: pyruvate dehydrogenase E1 alpha subunit
Protein accession: YP_001017966
Protein GI: 124023659
COG category: [C] Energy production and conversion
COG ID: [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit
TIGRFAM ID: [TIGR03182] pyruvate dehydrogenase E1 component, alpha subunit
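
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed present)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1723691, 1724782)
print(length, protein_length_aa(length))  # prints: 1092 363
```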


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.131913
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAGA AAACCTCCGT GAGCTCAGGC CAAACCACGG CAAAACCCCT GGCAGCAGGC 
CGCCATGGCG AGAGAATCTC CACGCTAATC AGCTCCAAAC GCGCCAAGGT TGATCGTCAG
ATCGGCCTAG AACTCTTCCG AGACATGACC CTTGGGCGAC GCTTCGAGGA CAAATGTGCC
GAGATGTACT ACCGGGGAAA AATGTTTGGC TTCGTTCACC TCTACAACGG CCAAGAAGCC
GTTAGCACAG GTGTGATCGG TGCAATGAAA CGCCAGCACG ATTGGTTTTG CAGTACCTAT
CGCGATCACG TTCATGCTCT TAGCGCCGGT GTACCCGCCC GCGAAGTAAT GAGTGAGCTC
TTCGGCAAGG AAACCGGCTG TAGCAAAGGT CGTGGGGGCT CCATGCACCT CTTTTCGCAA
GAGCATCACC TCCTAGGAGG ATTTGCCTTC ATCGGCGAAG GGATTCCCAT CGCCCTGGGG
GCAGCCTTCA CCAGTCGCTA TAAGCGGGAT GCCCTGGGTG ATGCCAGCAG CAATGCTGTA
ACAGCAGCTT TTTTCGGTGA CGGCACCTGC AACAACGGCC AATTCTTTGA GTGCCTCAAC
ATGGCGCAGC TTTGGCAACT GCCGATCCTG TTCGTTGTCG AGAACAACAA ATGGGCCATT
GGCATGGCCC ATGAGCGAGC CACCAGTGAA CCGGAAATCT GGCAAAAAGC TGCTGCCTTC
GGAATGGCTG GCGAAGAGGT TGACGGCATG GATGTTCTCG CCGTAAGAGC TGCCACTCAG
AGAGCAATAA AAAGAGCCAG GGCTGGTGAA GGTCCCACCC TGCTGGAGTG CCTCACCTAT
CGATTCCGTG GCCATTCTCT TGCTGATCCA GATGAACTAC GCGCCGAAGA GGAAAAGCAA
TTCTGGGCAA AACGAGATCC CCTAAAGGCT CTTGAGAAGG ATCTCACCTC TGAATCTTTG
GTGCGTGCTG ATGAACTACG CGCTATTGAA AAAGAGATCG ATGCAGAAGT AAATGACTGC
GTGGAGTTTG CCCTTGCCGC AGCCGAACCA AATGCCAACG AACTCACTCG CTACATCTGG
GCTGAAGATT GA
 
Protein sequence
MSQKTSVSSG QTTAKPLAAG RHGERISTLI SSKRAKVDRQ IGLELFRDMT LGRRFEDKCA 
EMYYRGKMFG FVHLYNGQEA VSTGVIGAMK RQHDWFCSTY RDHVHALSAG VPAREVMSEL
FGKETGCSKG RGGSMHLFSQ EHHLLGGFAF IGEGIPIALG AAFTSRYKRD ALGDASSNAV
TAAFFGDGTC NNGQFFECLN MAQLWQLPIL FVVENNKWAI GMAHERATSE PEIWQKAAAF
GMAGEEVDGM DVLAVRAATQ RAIKRARAGE GPTLLECLTY RFRGHSLADP DELRAEEEKQ
FWAKRDPLKA LEKDLTSESL VRADELRAIE KEIDAEVNDC VEFALAAAEP NANELTRYIW
AED
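
The sequences above are printed in blocks of ten characters separated by spaces and line breaks. The following sketch shows one way to strip that layout and compute a GC percentage for comparison with the GC content field. The helper names are hypothetical, and the record does not state how the reported value was rounded, so no expected figure is given here.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example with the first row of the gene sequence; pass the full
# gene sequence text to compare against the record's GC content.
first_row = "ATGAGCCAGA AAACCTCCGT GAGCTCAGGC CAAACCACGG CAAAACCCCT GGCAGCAGGC"
print(len(clean_sequence(first_row)), round(gc_percent(first_row), 1))
```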