Gene NATL1_18481 details


Gene Information

Locus tag: NATL1_18481
Symbol:
ID: 4779782
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1510203
End bp: 1511153
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 38%
IMG OID: 640085137
Product: F0F1 ATP synthase subunit gamma
Protein accession: YP_001015668
Protein GI: 124026553
COG category: [C] Energy production and conversion
COG ID: [COG0224] F0F1-type ATP synthase, gamma subunit
TIGRFAM ID: [TIGR01146] ATP synthase, F1 gamma subunit
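
The length fields in this record agree with each other if the start and end coordinates are read as inclusive, 1-based positions: 1511153 - 1510203 + 1 = 951 bp, and 951 / 3 = 317 codons, of which the last is the stop codon, leaving 316 residues. The following is a minimal Python sketch of that check. The function names are illustrative and are not part of the record or of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1510203, 1511153)
print(length, protein_length(length))  # prints: 951 316
```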


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.672263
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAACC TAAAGGACAT AAGAGACAGA ATTGTCTCTG TCAAAAACAC TAGGAAAATC 
ACAGAAGCAA TGAGACTAGT TGCTGCTGCG AAAGTTCGCA GAGCACAGGA TCAGGTCTTA
AGGAGCAGAC CTTTTGCTGA TCGATTGGCA AGGGTTTTAG AAAATATTCA ATCAAGAATG
CAGTTTGAAG CCGCTGACTC TCCTCTTTTA AACAAGCGTG AAGTTAAGAC AATTACTCTC
TTAGCTGTCA CTGGGGATAG AGGATTATGT GGGGGATATA ATGCGAACAT AATCAAACGA
ACAGAGAAGC GATATGCCGA ATTGAAAGGT CAAGGATATA GCCCCGATTT AGTTCTAATT
GGTAAGAAAG CAATAGGATA TTTCGAGAAT AGATCTAGTC TTTATAATAT TAGGGCTACT
TTTAAAGAAT TAGAGCAAGT CCCAACTTCT GAGGATGCTG CCTCTATTAC TAGTGAAGTA
TTAGCAGAAT TTCTCTCTGA AAGTACTGAT CGAGTAGAGG TCATTTTTAC TAAGTTTGTA
AGTTTGGTTA GTTGTAATCC AACAATACAA ACTCTGTTAC CTTTAGATCC TCAAGGAATA
GCAGATTCAG AAGACGAAAT TTTTAGATTA ACTACAAAAG ATAGCCAATT AATTATTGAG
AAAGATGCTG CTCCAACTAA TGAAGAGCCG AAACTACCAT CGGATATTGT GTTTGAACAA
AGTCCCGATC AGCTTTTGAA TGCTCTTTTA CCTCTTTATT TACAGAATCA ATTATTACGT
GCATTACAGG AGTCCGCTGC TTCAGAATTG GCGAGTAGAA TGACTGCAAT GAATAATGCT
AGTGATAATG CTAAGGAATT GGCTAAAAAT CTTACTATTG ATTATAACAA GGCTCGACAA
GCAGCAATTA CGCAAGAAAT TTTAGAAGTT GTCGGTGGAG CTTCAGCATA G
 
Protein sequence
MANLKDIRDR IVSVKNTRKI TEAMRLVAAA KVRRAQDQVL RSRPFADRLA RVLENIQSRM 
QFEAADSPLL NKREVKTITL LAVTGDRGLC GGYNANIIKR TEKRYAELKG QGYSPDLVLI
GKKAIGYFEN RSSLYNIRAT FKELEQVPTS EDAASITSEV LAEFLSESTD RVEVIFTKFV
SLVSCNPTIQ TLLPLDPQGI ADSEDEIFRL TTKDSQLIIE KDAAPTNEEP KLPSDIVFEQ
SPDQLLNALL PLYLQNQLLR ALQESAASEL ASRMTAMNNA SDNAKELAKN LTIDYNKARQ
AAITQEILEV VGGASA
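
The protein sequence is the translation of the gene sequence under translation table 11. As a rough illustration, the sketch below strips the 10-base spacing used in this listing, translates codon by codon until the first stop codon, and computes a GC fraction. It assumes that table 11 assigns the same amino acids to sense codons as the standard code (the two differ in which start codons are permitted), which is sufficient here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments shared by the standard
# code and translation table 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the listing."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("ATGGCTAACC TAAAGGACAT AAGAGACAGA"))  # prints: MANLKDIRDR
```

Passing the full gene sequence to `translate` and `gc_content` in the same way would allow a comparison against the Protein sequence and GC content fields of this record.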