Gene OSTLU_29856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_29856 
Symbol 
ID5000530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp77904 
End bp80980 
Gene Length3077 bp 
Protein Length847 aa 
Translation table 
GC content62% 
IMG OID640415951 
Productpredicted protein 
Protein accessionXP_001416304 
Protein GI145343262 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase 
TIGRFAM ID[TIGR01828] pyruvate, phosphate dikinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0486692 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCGACGCCG CGACACGCGC ACGCGCCGCG CGCGCGACCG ATCGTTCGGG GTACGTCCGA 
ACGACGTTCA CGCGCGCGCC GACCGTCGCG ACAACCGACG ATGGCGCGAT CGATGACGCG
AACGATGACG CAGCCGACGG CGAGAAGGCC AACCGGCGCG AAAACGATGG CGCCGACGTC
GACGACGCGG AAGCGCGCGA ACGGGGCGCG CGCGCGACGC GGCGCGCGCG CGTCGGTGAC
GGACGCGAAC GGGAACGAGG CGACGGACGC GAGCGAACGC GCGATCGCGA ACGCGGCCTC
GGGGGGGCTC GGACGGGTGT TTCGATTCGG AGGGGGACGG GCGGAGGGCG CGCGCGACGC
GAAGGCGCGC CTGGGAGGGA AGGGGGCGAA TCTGGCGGAG ATGTCGCGCG TCGGGCTGAG
CGTGCCGCCG GGATTCACCG TGGACACGGA GACGTGCGCG GACTTTCACG CGAACGGTGG
GGCGCTGCCG GAGGGGGCGT GGGAGGAGAT GGTCGCGGGG CTGGCGCACG TGGAGGCGGC
GCTGGGGAAG ACGCTGGGGG ACGCGGAAAA TCCGCTGTTG GTGTCCGTGA GGAGCGGCGC
GGCGGTGTCG ATGCCGGGGA TGATGGACAC GGTGCTGAAT CTTGGGTTGA ACGATTCCGT
GGTCGAGGGG TTGGCTCGAC GCGCCGGTGG ACGCTTCGCG TTTGACTCGT ATCGCCGATT
TCTCGACATG TACGGCAACG TCGTCATGGG GATCGATCAC GGCAAGTTTG AACACGCGCT
GGAGACGCTG AAGCGCGACG TCGGGGTGGA TTCGGACGAA GGACTGAGCG AGCAAAACTT
GCGCGACTTG GTTGAGATCT ACAAGGGGGT GTACGCCTCG GAAAACGTGT CGTTTCCGCA
AGATCCGCTC GAGCAGTTGC GTCTCGCCAC GTACGCCGTC TTCGACTCGT GGAACAGCGA
CCGGGCGAAG AAGTACATGG CCATCAACAA GATCACCGGT TTGCGAGGCA CGGCTGTGAA
CATTCAAGCC ATGGTGTTCG GGAACATGGG CGAGACTTCC GGAACGGGGG TGTTGTTCAC
GAGAAATCCG AGCACGGGCG AGAATAAGCT CTACGGCGAG TACTTGGTCA ACGCTCAAGG
CGAGGACGTC GTCGCGGGCA TTCGCACGCC GAGCGATATC TCAACCATGA AGGACGCGTT
GCCCGCGGCG TACGAGCAGT TGGTGAAGAA CACGGAACTC TTGGAATTGC ACTTCAAAGA
CATGCAAGAC ATCGAATTCA CCGTCGAGGA CGGCCAACTC TTCATGCTTC AATGTCGCTC
GGGCAAGCGC ACGGGCGCGG GCGCGGTGAA AATGGCGGTC GACTTCGTGA AGGAAGGCTT
GGTGACCAAG GAAGAAGCCG TGCAAATGGT CGAACCCACG CACGTCGATC AACTGCTTCA
TCCGCAGTTT AAAGACGAGG GCGCGTACAA GGCCGACGTC ATCGGCGCCG GCTTGCCCGC
GTCACCGGGC GCCGCCGTCG GACAAATCGT CTTCAGCACC GAAGACGCCG AAGCCGCGAA
AGCCGAAGGT CGCAAGGTCA TCTTGGTTCG CGTCGAAACC TCGCCCGAGG ACGTCGGCGG
CATGGACGCC GCCGAGGGCA TCTTGACCGC GCGCGGCGGC ATGACGTCGC ACGCCGCCGT
CGTCGCTCGC GGCTGGGGTA AGACGTGCGT CTCTGGCTGC GGCGAGTTGT CCATCGACGA
GCACGCGCGC ACGTTCACTT TGGGTGGCGT CACGCTTCAT GAAAACGACT GGTTGAGCTT
GAACGGTACC ACGGGCGAAA TCATTCGCGG TCGCGCCGAT CTCATGCCAC CGACTGTGAG
CGAAGATTTG GGCACGTTCA TGTCTTGGGT CGACGAATTC CGTGACATGA AGGTGCTGAC
AAACGCCGAT ACGCCCGAAG ATGCCGCCGC GGCGCGCGCC AACGGTGCCG AAGGCATCGG
TCTCGTGCGC ACGGAGCACA TGTTCTTTGG CAGCGGCGAG CGCATTCGAA CCGTGCGTCG
GATGATCATG GCCAAGGACA CGCCCTCCCG CGAAGCCGCG CTCGATGCGC TGTTGCCGTT
CCAGCGAGAT GATTTCAAGG GCATCTTCCA CGCCATGAGC GGTTTGCCGG TGACCATTCG
CCTTCTCGAT CCTCCGCTTC ACGAGTTCTT GCCCGACGGC GAGTTGGACG ACGTCGTTCA
ACTTCTCAGC GAGGACACTG GCGAGACTGA AGAAGACATC GTCGAGCGCA TCGAAAAGCT
CGTCGAAGTG AACCCGATGT TGGGTTTCCG CGGCTGCCGA TTGGCCATCA CGTATCCAGA
AATCGCGCGA ATGCAAGTTC GCGCCATTCT CGAAGCCGCG TGCGAAGCCA AAGCCGAGGG
CGCGTCGCCG ATTCCCGACA TCATGGTCCC GCTCGTCGGC ACCGTCGCCG AACTCGAAGA
TCAAGTGGCG TTGATTCGTC AAACCGCGGA CGTCGTCTTC GGCGAGCGTG GGGACTCTGT
GGAATACCGC GTGGGTACCA TGATCGAGAT TCCTCGCGCG GCGCTGTTGT CGAATGAAAT
CGCCAAACAC GCCGAGTTCT TCTCGTTCGG TACGAATGAC TTGACGCAAA TGACGTTCGG
TTACTCTCGC GATGACGTGG GTAAGTTCCT GCCCACGTAC CTCGAAAAAG GTGTGCTGAA
GCACGATCCT TTCCAAGTCA TCGACGTCGA CGGCGTCGGG CAACTCGTCC AAATGTCCGT
TGAGCGCGGT CGCTCCACGC GCCCCGACTT GAAGGTTGGG ATTTGCGGTG AGCACGGTGG
CGAACCGATC TCTGTCGAGT TCTTCGCGAA GAGCGGTCTC GACTACGTCT CGTGTTCGCC
GTTCCGCGTG CCCATCGCAC GTCTCGCCGC GGCGCAAGCC GCGATTCGAG CCAAGAAAGT
GTAAGCCCTC CGACGTCGCG CCGTCGCGAG CGAGCTTTAT CGCCGCCACT ATAATCCCTC
TGTACGATTC GCGAGAGTTC AAGACTACTT AAAGGAAGTT CACTCGCCAA GATGTAATTA
CAAATTCGCA AGCAAGC
 
Protein sequence
MSRVGLSVPP GFTVDTETCA DFHANGGALP EGAWEEMVAG LAHVEAALGK TLGDAENPLL 
VSVRSGAAVS MPGMMDTVLN LGLNDSVVEG LARRAGGRFA FDSYRRFLDM YGNVVMGIDH
GKFEHALETL KRDVGVDSDE GLSEQNLRDL VEIYKGVYAS ENVSFPQDPL EQLRLATYAV
FDSWNSDRAK KYMAINKITG LRGTAVNIQA MVFGNMGETS GTGVLFTRNP STGENKLYGE
YLVNAQGEDV VAGIRTPSDI STMKDALPAA YEQLVKNTEL LELHFKDMQD IEFTVEDGQL
FMLQCRSGKR TGAGAVKMAV DFVKEGLVTK EEAVQMVEPT HVDQLLHPQF KDEGAYKADV
IGAGLPASPG AAVGQIVFST EDAEAAKAEG RKVILVRVET SPEDVGGMDA AEGILTARGG
MTSHAAVVAR GWGKTCVSGC GELSIDEHAR TFTLGGVTLH ENDWLSLNGT TGEIIRGRAD
LMPPTVSEDL GTFMSWVDEF RDMKVLTNAD TPEDAAAARA NGAEGIGLVR TEHMFFGSGE
RIRTVRRMIM AKDTPSREAA LDALLPFQRD DFKGIFHAMS GLPVTIRLLD PPLHEFLPDG
ELDDVVQLLS EDTGETEEDI VERIEKLVEV NPMLGFRGCR LAITYPEIAR MQVRAILEAA
CEAKAEGASP IPDIMVPLVG TVAELEDQVA LIRQTADVVF GERGDSVEYR VGTMIEIPRA
ALLSNEIAKH AEFFSFGTND LTQMTFGYSR DDVGKFLPTY LEKGVLKHDP FQVIDVDGVG
QLVQMSVERG RSTRPDLKVG ICGEHGGEPI SVEFFAKSGL DYVSCSPFRV PIARLAAAQA
AIRAKKV