Gene Information |
Locus tag | P9211_13351 |
Symbol | acoA |
ID | 5730131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1202814 |
End bp | 1203896 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641285706 |
Product | pyruvate dehydrogenase E1 alpha subunit |
Protein accession | YP_001551220 |
Protein GI | 159903876 |
COG category | [C] Energy production and conversion |
COG ID | [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit |
TIGRFAM ID | [TIGR03182] pyruvate dehydrogenase E1 component, alpha subunit |
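The coordinate and length fields in this record are tied together by simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is unspliced and ends in a single stop codon; the record's values are consistent with that reading, but the convention is an assumption here and is not stated on the page. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1202814, 1203896)
print(length_bp)                  # 1083
print(protein_length(length_bp))  # 360
```

Both results match the Gene Length and Protein Length fields listed above.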
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0648937 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATCAGA GCACAATTGC AGATCATGAG ATGAGTAATC TCCCCAAGAA AGAGGATCAT GCTGAGCGAC TTTCCTCGTT ATCAGGAGGT GAGTCTGCAG TTATAGATCG CGATACAGGC CTAAGGCTTT TCAAAGACAT GACCTTAGGA AGAAGGTTTG AGGACAAATG TGCTGAGATG TATTACCGAG GGAAAATGTT TGGATTTGTT CATCTTTACA ACGGGCAAGA AGCTGTGAGT TCCGGTGTCA TAGGAGCAAT GAAGCTCAAA CACGATTGGT TTTGCAGCAC CTATCGCGAT CATGTACATG CTCTTAGCGC TGGAGTCCCT GCAAGGGAAG TAATGAGTGA GCTATTTGGC AAAGAGACTG GATGCAGCAA GGGGAGAGGA GGATCCATGC ATCTTTTTTC CAAAGAACAT CATCTTTTGG GAGGCTATGC ATTTATTGGA GAAGGGATTC CAGTCGCTCT AGGTGCAGCC TTTAGCAGTC GTTACAAAAA AGAGGTTTTT AAAGATAAAA ATAGCGATGC TGTAACAGCC GCCTTCTTTG GAGATGGTAC GTGTAATAAC GGTCAATTTT TCGAATGCCT CAACATGGCC CAACTATGGA AGCTGCCAAT AATTTTTGTT GTTGAAAACA ATAAATGGGC TATAGGGATG GCTCATGACA GAGCTACTAG TGATCCTGAA ATTTGGAGAA AAGCTGGAGC TTTTGGCATG GAGGGAGAAG AGGTTGATGG AATGGATGTC CTAGCGGTTA GAGGAGCAGC AGAAAGGGCC CTTGAAAGAG CTAGAGCAGG AGAAGGACCT TCTTTAATTG AATGTCTTAC CTATAGATTT AGAGGTCATT CTCTTGCTGA TCCGGACGAA TTAAGATCTG AACAAGAAAA AGAGTTTTGG GCACAAAGAG ATCCATTAAA GAACCTTGCC AAGGTTCTTG TATCAAAAGA ATTGGCAAAT GAAAATGAAC TTAAAAATAT TGAGAAAGAG ATTGATTCTG AAGTTACTGA TGCAGTTGAA TTTGCACTTG CAGCTAAAGA CCCTGACCCA AGTGAATTAA CTAAATATAT CTGGGCTGAA TAA
Protein sequence | MNQSTIADHE MSNLPKKEDH AERLSSLSGG ESAVIDRDTG LRLFKDMTLG RRFEDKCAEM YYRGKMFGFV HLYNGQEAVS SGVIGAMKLK HDWFCSTYRD HVHALSAGVP AREVMSELFG KETGCSKGRG GSMHLFSKEH HLLGGYAFIG EGIPVALGAA FSSRYKKEVF KDKNSDAVTA AFFGDGTCNN GQFFECLNMA QLWKLPIIFV VENNKWAIGM AHDRATSDPE IWRKAGAFGM EGEEVDGMDV LAVRGAAERA LERARAGEGP SLIECLTYRF RGHSLADPDE LRSEQEKEFW AQRDPLKNLA KVLVSKELAN ENELKNIEKE IDSEVTDAVE FALAAKDPDP SELTKYIWAE
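The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a string could be cleaned up, checked for GC content, and translated is given below. It is not the database's own method. The helper names are hypothetical, and the codon table is the standard set of amino-acid assignments, which translation table 11 shares. Table 11 additionally permits alternative start codons such as GTG and TTG to be read as Met at the initiation position. This sketch does not handle that case, which does not matter for this gene because it starts with ATG.

```python
# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
head = "ATGAATCAGA GCACAATTGC AGATCATGAG"
print(translate(head))    # MNQSTIADHE
print(gc_fraction(head))  # 0.4
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to the same functions is one way to cross-check the GC content and Protein Length fields of the record.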