Gene A9601_14871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_14871 
SymbolacoA 
ID4718208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1268563 
End bp1269636 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content38% 
IMG OID640079208 
Productpyruvate dehydrogenase E1 alpha subunit 
Protein accessionYP_001009877 
Protein GI123969019 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID[TIGR03182] pyruvate dehydrogenase E1 component, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAACAA ATCAAAACTT CGTTAAGGTA TTTTTCTTGG AAACACACGT AGAGAGAATC 
TCAAATCTTC AAGACATAAA AAAAGCCGAA TTAGATCGAG AAACCGGATT ATTCCTTTAT
GAAGACATGA CGCTTGGTCG TAGATTTGAA GATAAGTGTG CAGAAATGTA TTACAGGGGG
AAAATGTTTG GTTTCGTTCA TTTATATAAC GGTCAAGAGG CTATAAGCAC AGGAGTAATT
GGCGCCATGA AAAAGAAACA TGATTGGTTT TGTAGTACCT ATCGCGATCA TGTTCATGCA
CTAAGTGCGG GTGTTCCCTC ATTTGAAGTA ATGAGTGAAC TTTTTGGTAA ATCCACTGGT
TGTAGCAAAG GCCGAGGTGG ATCCATGCAC TTATTTTCAA GAGAGCATCA CCTACTAGGA
GGATATGCAT TTATTGGAGA GGGAATTCCT GTTGCCTTAG GGGCAGCCTT TTCAAGTAAA
TACAAAAAGG AAGTTGCTGG AAATAGTAAT AGTGATGCTG TAACTGCAGC ATTTTTTGGG
GATGGGACTT GCAATAATGG GCAGTTTTTT GAATGTTTAA ATATGGCCCA GTTATGGAAA
TTACCCATAA TTTTTGTTGT TGAGAATAAT AAATGGGCTA TTGGTATGGC TCATGATAGA
GCTACTAGTA ATCCTGAAAT CTGGAGAAAA GCGTCTGCTT TTGGCATGCA CGGTGAGGAA
GTTGATGGAA TGGATGTATT AGCAGTAAGA GGGGCAGCAC AAAGAGCAAT TGAGCGAGCT
AGGGCAGGAG AAGGTCCCAC ACTTTTAGAA TGTTTAACTT ATAGATATAG AGGGCATTCT
CTTGCAGATC CAGATGAATT AAGGTCTGAA AAAGAGAAGG AGTTTTGGGG AAAAAGAGAC
CCTATTAAGA AATTAGCTCA AGAAATTATT GACGGTAAAT TCGCTACGGA AGAAGAATTA
AAAATTATTG AAAAGAAGAT TGATGCTGAA ATAGCGGAGT CAGTTAAAAA TGCAATTGAA
GCTCCTGAAC CTCCTTCAGA AGAATTAACC AAATATATTT GGGCAGAAGA TTAG
 
Protein sequence
MLTNQNFVKV FFLETHVERI SNLQDIKKAE LDRETGLFLY EDMTLGRRFE DKCAEMYYRG 
KMFGFVHLYN GQEAISTGVI GAMKKKHDWF CSTYRDHVHA LSAGVPSFEV MSELFGKSTG
CSKGRGGSMH LFSREHHLLG GYAFIGEGIP VALGAAFSSK YKKEVAGNSN SDAVTAAFFG
DGTCNNGQFF ECLNMAQLWK LPIIFVVENN KWAIGMAHDR ATSNPEIWRK ASAFGMHGEE
VDGMDVLAVR GAAQRAIERA RAGEGPTLLE CLTYRYRGHS LADPDELRSE KEKEFWGKRD
PIKKLAQEII DGKFATEEEL KIIEKKIDAE IAESVKNAIE APEPPSEELT KYIWAED