Gene OSTLU_24766 details


Gene Information

Locus tag: OSTLU_24766
Symbol:
ID: 5002680
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009361
Strand:
Start bp: 165391
End bp: 168284
Gene Length: 2894 bp
Protein Length: 365 aa
Translation table:
GC content: 61%
IMG OID: 640418101
Product: F-ATPase family transporter: protons (chloroplast)
Protein accession: XP_001418858
Protein GI: 145348854
COG category: [C] Energy production and conversion
COG ID: [COG0224] F0F1-type ATP synthase, gamma subunit
TIGRFAM ID: [TIGR01146] ATP synthase, F1 gamma subunit
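
The listed gene length agrees with the start and end coordinates if they are read as 1-based and inclusive (an assumption, though it matches the values above). The short Python sketch below shows the arithmetic; the function name gene_length is hypothetical and is not part of any database tool. It also prints the coding length implied by a 365 aa protein plus a stop codon, which is well under the gene length and so consistent with the gene being listed as spliced.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode a protein: three per residue, plus one stop codon."""
    return (protein_aa + 1) * 3


print(gene_length(165391, 168284))  # 2894
print(coding_length(365))           # 1098
```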


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.057416
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.404649
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GAACGATCGA CTCGATGTCC GCCGTCGCCC ACACGTCGCT CGCCATGAGC AAGGCCACCG 
CCGTGCGCGG TGCGTCCGTG AAACGGTCCA CCGCCGCGCA GCGCGCGACG GCGCCGCAGC
GATCCCTCGT GGTGCGTGAA CGACGAACGA CGCGACGCGC GAACGACGCG ATGGCGCGCG
ATGGGACGGG CATCGGGATT ATCGCGCCGC GCGCGAGGAC GCGCGCGCGG GCGAGGATGA
TCGCGCGCGG GATTCGGTGG AGGAAAATAG TGCGCGAGAG ATGGCGTCGC GGGGCGACGC
GCGCGCGCGC GAGGGACGCG CGGCCGGGCC CGGTCGTGGA CGTAGGGTCT GGTGAGGATA
ACGCGGATCG GATCGCGGCG TCATCGGGAG GGCGCGGGCG ATGGCGACGC GCGTCGGGGG
GCGCGCGTGC GCGCGGGGGC GCGCGGGGGT TGTGTGGTGA TCGCGCGCGC GATCGGGGGG
CGCGGCGCGC GGCGTGCGCG CGCGCGTGGG CGCGGGCGGG GGCGGGACGA TCGTATCCAA
AGAGGGTTTT GCGCGCGCGC GAGACCGACG CGAGACGCGA GAGACTGACG GTGATTCGTA
CGCTTTATGA CGATTCGTAG ATTCGCAACG CGAGCCCGAA GGAAATGCGC GACCGCATCG
CGTCGGTCGG TAACACGAAG AAGATCACCG ATGCGATGAA GCTCGTCGCG GCGGCGAAGG
TGCGCAAGGC GCAAGACGCC GTCATTGGCG CGCGCCCGTT CTCTGAGTCT TTGGTCAAGG
TTTTGTTCGC CATCAACAGC CGATTGGCTG GTGAGGATGT GGACGTGCCG TTGACGAAGA
TGCGCCCGGT GAAGACGGCG ATGCTCGTCG TCTGCACGGG TGATCGTGGT TTGTGCGGGG
GGTTCAACAA CTTCATTATT CGCAAGACGG AGCAGCGCGT GGCGGAGCTC AAGGCGCAAG
GCGTTGAATG CAAGCTCATC ACCGTCGGTA AGAAGGGTGG TGTGTACTTC AACCGTCGCA
AGGAGCAATA CAACTTGGTC AAGCGCTTCG ACATGGGTCA AGCGCCGTCC ACGCAAGACG
CGCAAACCAT CGCCGACGAA ATCTTCGCCG AGTTCACCTC GGAGGAAGTC GACAAGGTCG
AGATGATTTA CTCCCGATTC GTTTCCCTCA TCGCTGCGGA GCCGACCGTG CAAACCATTT
TGCCGCTCTC CAAGGAAGGT GAGGTGTGCA ACGTTGACGG TGTTTGCATT GACGCGGCGA
ACGATGAAAT CTTCAAGCTC ACGACTGAAG ATGGCAAGTT CGCCGTCAAG CGCGAAGCGT
CTGACACGGA GGTTTCTGAG TTTGAGGGTG TCATGCAGTT CGAGCAAGAC CCGAACCAAA
TTCTTGATGC CCTCATGCCG CTGTACATGA ACTCCCAAAT CCTCCGTGCG CTCCAAGAGT
CTCTCGCCTC TGAGCTCGCG GCGCGCATGA ACGCGATGTC CACCGCCTCG GACAACGCCA
AGGAGCTCAA GAAGACCTTG TCTTTGGTTT ACAACCGCGC TCGCCAAGCG AAGATTACCT
CGGAAATTAT CGAGCTCGTC GCCGGTGCCT CCGCCGCGTA AGCCACCGAC CATTCATTTC
CTTAGAAGGG ATTGTTTGTT TCGTTCGCCG GTTTCCGCGC GCGCGAACGC ACTAGAAGCG
CCTGTCTGCG CCCGATTTCT CACGCGCGCC GCCGCGGCGC GAGCGTCCAT CGCCTCGATC
ATCGCTCGCG CCCGACGCGT CTGTCGAATC AGTCGCGCGA CACTTAGCAC ATAATCACTA
TTGCAATCAG CAGTTTGAAC CAACTCTGAA TCCTCGCCTT CGCAGTCAAA ATAAGTGCGA
CCGACCGCCG ATCGGCACGT TACGATCGAG CCCTCTCGCT CGATCGCCAC ACGGTGCCTC
GCATCGCTCG ACCGCGCGCG TCGTCGCGAA TCACCAACAA TGCGCGCCAG CGTCGACGCC
GCGCGCGCTC GCGCGCCGAC GCCATCGCGC GCGCGGCGCG GCGGCGACCG CAAACGTCGA
ACACCTCCGG TGACGATTCT CGCCGCCGCG CGCGGTAACG TCCGCGTCGA CGCCATGGCG
ATCGGCGGTG GACACACCCA TCATCATAAT CACCACCACG ACCACGGACA CGGGTGGTTC
AAGTCGCGCG CGGCGCGCGA TTTGGAGAAG TGGCACAAGA CGCAAGAGGG ACGCGCGGCG
TTGGAATTGG CGGATCGAGA GACGCGCGGC GCGGCGTCGA CGAAGGCGCT CCAGATGGTG
AAAGATCACG TCGCGAACGA AGCGCCGAAA CCGGATACGC TGGTGCACCA CGCGCAGCCG
CATGAAATCG ACCAGACGAC GCGATGGGTC ATCGCGACGT TTTTGAACCG GTTGATTCAC
GTGCCGATGA TGAATCCGAT GACGGAACAA ATCATCTGCG TCAAGGCGGT CGACTGCATC
GCTGACGCTA TCGAGCGCGA GCTGCGCCGC ACTGGCGCGG GTCACTTTTT TAACGACGCC
GTGGAGCACG AACGGCGAGG TGACCTCGAG GAGTGGTTGA ACAACGTGTG CGACGATTTG
AACTTGGTCA TAGACGTGCC GATGTTGGAC GAAAGGCAAG AATTTGATTG TATTCACGCC
GTGATGTCGA TCGTGACGCA TCAATACTTG GAAGCAAAGA AGAGGGACGA AAAAGACAAT
CCAATCAAGC ACAAGTGGAT TTTGCTTGTG AACATGCTCG GCTAAACGAT GTGTGTATGT
ATGTGGCTGA CGCACACGAA AGATAGCTGT AAAATAAGTT GAACCCACTA ACCGGCGCGA
GAATTAGCGT GTATTTAACC TATGTGATAT CACATAGGTT ATTATCAATA ATAAGAGTGT
CATAATCTAC ATCC
 
Protein sequence
MSAVAHTSLA MSKATAVRGA SVKRSTAAQR ATAPQRSLVI RNASPKEMRD RIASVGNTKK 
ITDAMKLVAA AKVRKAQDAV IGARPFSESL VKVLFAINSR LAGEDVDVPL TKMRPVKTAM
LVVCTGDRGL CGGFNNFIIR KTEQRVAELK AQGVECKLIT VGKKGGVYFN RRKEQYNLVK
RFDMGQAPST QDAQTIADEI FAEFTSEEVD KVEMIYSRFV SLIAAEPTVQ TILPLSKEGE
VCNVDGVCID AANDEIFKLT TEDGKFAVKR EASDTEVSEF EGVMQFEQDP NQILDALMPL
YMNSQILRAL QESLASELAA RMNAMSTASD NAKELKKTLS LVYNRARQAK ITSEIIELVA
GASAA
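
Both sequences above are printed in space-separated blocks of 10 characters, so the blocks have to be joined before lengths or base composition can be checked. The Python sketch below is one way to do that; the helper names clean_sequence and gc_fraction are hypothetical. The examples use only short excerpts copied from the record. Pasting the full gene and protein sequences should give lengths matching the 2894 bp and 365 aa listed under Gene Information.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# Last two printed lines of the protein sequence: six blocks of 10, then 5 residues.
protein_tail = """YMNSQILRAL QESLASELAA RMNAMSTASD NAKELKKTLS LVYNRARQAK ITSEIIELVA
GASAA"""
print(len(clean_sequence(protein_tail)))  # 65

# First block of the gene sequence: 3 G and 2 C out of 10 bases.
print(gc_fraction(clean_sequence("GAACGATCGA")))  # 0.5
```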