Gene A9601_16531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_16531 
SymbolatpA 
ID4718383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1397246 
End bp1398763 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content38% 
IMG OID640079379 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_001010043 
Protein GI123969185 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.592214 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTATCTA TACGCCCTGA TGAAATCAGT TCAATCTTAA AACAACAAAT AACTGATTAT 
GACCAATCTG TAAGTGTTAG CAATGTAGGA ACTGTTCTGC AAATCGGTGA TGGCATTGCA
AGAATATATG GCTTAGATCA GGTCATGGCA GGTGAGTTGT TGGAATTTGA GGATGGTACC
GAAGGTATAG CTTTAAATCT TGAAGATGAT AATGTTGGGG CCGTTTTAAT GGGAGAGGCA
CTTGGTGTCC AAGAAGGAAG TAACGTTAAG TCCACAGGTA AAATCGCATC TGTTCCAGTT
GGTGAAGCAA TGCAGGGGAG AGTTGTTAAC CCTCTCGGAC AACCAATAGA TGGGAAAGGG
GAAATTCCAA CAAGTGATAC AAGATTGATT GAAGAAATGG CGCCTGGAAT AATCAAAAGA
AGATCAGTTC ATGAACCAAT GCAAACTGGT ATCACATCTA TTGATGCAAT GATTCCTGTT
GGAAGAGGTC AAAGAGAATT AATTATTGGC GATAGACAAA CTGGAAAATC TGCGATTGCT
ATCGATACAA TTATCAACCA AAAAGGTCAA GATGTAGTTT GTGTATACGT AGCTATTGGT
CAGAAGTCAG CATCAGTAGC AAATATCGTA GAGGTTTTAA GAGAGAGAGG AGCTCTAGAT
TACACCGTTG TAGTTAGTGC AGGAGCTTCA GAACCAGCTG CTTTACAGTA CTTAGCACCT
TATACTGGTG CAGCAATTGC TGAGCATTTT ATGTATCAGG GTAAAGCAAC ACTTGTTATT
TATGATGATC TAACAAAACA AGCTCAGGCT TACAGACAAA TGTCTCTTCT TTTAAAAAGA
CCACCAGGAA GAGAGGCTTA TCCTGGAGAC GTGTTCTACT TGCACAGTAG ATTACTAGAA
AGAGCAGCAA AACTTTCTGA TGCAATGGGC GGGGGTTCTA TGACAGCTCT TCCAATTATT
GAAACTCAGG CAGGAGACGT TTCGGCTTAC ATTCCAACTA ATGTTATTTC AATTACGGAT
GGACAAATAT TCTTGAGTGC AGATTTATTT AACTCAGGAT TAAGACCAGC TATTAATGTT
GGTATATCTG TTAGTCGTGT TGGAGGAGCA GCTCAGACAA AAGCAATTAA AAAAATTGCA
GGAACTTTAA AATTAGAACT CGCACAGTTT GATGAACTAG CTGCTTTTTC TCAATTTGCA
TCTGATCTTG ATGAAGCAAC TCAGCAACAA CTTGAAAGAG GCAAAAGACT AAGAGAGCTA
TTAAAGCAAC CTCAATTCTC TCCTCTAAAC CTTGCAGAAC AAGTTGCAGT TGTTTATGCA
GGAGTAAAAG GTCTTATTGA TGAGGTTCCT GTTGAAGATG TTACTAAATT TGCAACTGAA
CTTAGGGAAT ACCTAAAATT AAATAAATCA GAATTTATAG AAGAGATTCT TAAAGAAAAG
AAACTAAATG ATGGATTAGA AGCGACACTA AAAGAGGTGA TAAATGAAGT TAAATCATCA
ATGCTTGCCA CAGTTTAA
 
Protein sequence
MVSIRPDEIS SILKQQITDY DQSVSVSNVG TVLQIGDGIA RIYGLDQVMA GELLEFEDGT 
EGIALNLEDD NVGAVLMGEA LGVQEGSNVK STGKIASVPV GEAMQGRVVN PLGQPIDGKG
EIPTSDTRLI EEMAPGIIKR RSVHEPMQTG ITSIDAMIPV GRGQRELIIG DRQTGKSAIA
IDTIINQKGQ DVVCVYVAIG QKSASVANIV EVLRERGALD YTVVVSAGAS EPAALQYLAP
YTGAAIAEHF MYQGKATLVI YDDLTKQAQA YRQMSLLLKR PPGREAYPGD VFYLHSRLLE
RAAKLSDAMG GGSMTALPII ETQAGDVSAY IPTNVISITD GQIFLSADLF NSGLRPAINV
GISVSRVGGA AQTKAIKKIA GTLKLELAQF DELAAFSQFA SDLDEATQQQ LERGKRLREL
LKQPQFSPLN LAEQVAVVYA GVKGLIDEVP VEDVTKFATE LREYLKLNKS EFIEEILKEK
KLNDGLEATL KEVINEVKSS MLATV