Gene NATL1_18491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_18491 
SymbolatpA 
ID4779780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1511187 
End bp1512701 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content40% 
IMG OID640085138 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_001015669 
Protein GI124026554 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.983268 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTTCTA TACGTCCCGA CGAGATCAGT TCAATCCTTA AACAGCAGAT TGCTGATTAC 
GATAAGTCAG TATCTGTAAG CAATGTAGGT ACCGTTTTAC AAATCGGTGA TGGTATCGCA
AGAGTTTATG GCCTCGAAAA GGTTATGGCA GGTGAACTAG TTGAATTCGA AGATGGAACA
GAAGGGATTG CTTTAAACCT TGAAGATGAC AATGTTGGTG TCGTTTTGAT GGGTGAAGCT
TTAGGAGTAC AAGAGGGAAG TACAGTTAAA GCAACTGGAA AGATTGCCTC AGTCCCAGTT
GGTGAGGCAA TGCTTGGAAG AGTTGTTAAT CCTCTTGGAC AGCAAATCGA TGGAAAAGGA
GAGATTGCTA CAACTGACAC AAGATTAATT GAGTCAATAG CACCTGGAAT TATTAAGAGA
AAGTCAGTAC ATGAGCCTAT GCAAACTGGG ATCACATCTA TTGATGCGAT GATCCCTATT
GGAAGAGGTC AGAGGGAGTT AATTATTGGA GATCGACAAA CAGGGAAGAC TGCAATAGCA
ATCGATACAA TCATCAACCA AAAAGGTCAA GATGTTGTTT GTGTTTATGT AGCTGTTGGT
CAAAAACAAG CATCAGTGGC AAATGTGGTT GAAGTTCTTA AAGAAAAAGG AGCTTTGGAT
TATACAATTA TTGTTAATGC CGGAGCATCT GAAGCTGCAG CTTTGCAATA TTTAGCTCCT
TATACAGGTG CTGCAATTGC TGAGCACTTT ATGTATCAGG GCAAGGCGAC ACTAGTTATA
TACGATGATC TAACTAAGCA AGCTCAGGCT TACAGGCAAA TGTCATTACT CTTACGTCGT
CCACTAGGCC GTGAAGCATA TCCAGGAGAT GTTTTCTATT GTCATAGTCG TTTGCTTGAG
AGGGCTGCAA AACTTTCTGA TGCTATGGGT GCAGGATCAA TGACATCCTT GCCTATTATT
GAAACGCAGG CTGGTGATGT TTCGGCTTAT ATCCCAACTA ACGTCATATC AATTACTGAT
GGTCAGATTT TCTTGAGCTC AGATTTATTC AACTCTGGAT TAAGACCTGC AATTAACGTT
GGTATATCGG TTAGTCGTGT AGGCGGAGCT GCGCAAACAA AAGCTATTAA GAAAATTGCC
GGTACGTTAA AACTTGAACT GGCTCAATTT GATGAACTTG CGGCATTCTC TCAATTTGCT
TCAGATCTTG ATGAGGCTAC TCAAAAGCAA TTAGGTAGAG GTAAAAGGCT AAGAGAACTT
CTCAAGCAGC CTCAATTTGA TCCTCTAAAT TTAGCTGAAC AAGTAGCTAT TGTTTATGCA
GGAGTTAAAG GATTGATTGA TGAGGTTCCC GAGGAGAAAG TGGTTAACTT TGCTCGTGAA
TTGCGTGATT ATTTAAAAAC AAACAAGGCT GATTTCCTTA AAAATGTTCT TTCCGAAAAA
GTATTAAGTG AAGCATCTGA ATCAATGCTT AAAGACGCTA TTAGCGAGGT TAAGTCCTCC
ATGCTTGCGG CTTAA
 
Protein sequence
MVSIRPDEIS SILKQQIADY DKSVSVSNVG TVLQIGDGIA RVYGLEKVMA GELVEFEDGT 
EGIALNLEDD NVGVVLMGEA LGVQEGSTVK ATGKIASVPV GEAMLGRVVN PLGQQIDGKG
EIATTDTRLI ESIAPGIIKR KSVHEPMQTG ITSIDAMIPI GRGQRELIIG DRQTGKTAIA
IDTIINQKGQ DVVCVYVAVG QKQASVANVV EVLKEKGALD YTIIVNAGAS EAAALQYLAP
YTGAAIAEHF MYQGKATLVI YDDLTKQAQA YRQMSLLLRR PLGREAYPGD VFYCHSRLLE
RAAKLSDAMG AGSMTSLPII ETQAGDVSAY IPTNVISITD GQIFLSSDLF NSGLRPAINV
GISVSRVGGA AQTKAIKKIA GTLKLELAQF DELAAFSQFA SDLDEATQKQ LGRGKRLREL
LKQPQFDPLN LAEQVAIVYA GVKGLIDEVP EEKVVNFARE LRDYLKTNKA DFLKNVLSEK
VLSEASESML KDAISEVKSS MLAA