Gene OSTLU_87084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_87084 
Symbol 
ID5001241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp694677 
End bp697670 
Gene Length2994 bp 
Protein Length997 aa 
Translation table 
GC content60% 
IMG OID640416662 
Productpredicted protein 
Protein accessionXP_001417570 
Protein GI145346178 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.019106 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCGAGCG CGGAAGCGCC GTCGCCGCCG CCGCCGCCGC CACCGCCACC GCCACCGCAA 
GTCGAACCGA TCAAAGTCAC CGAAGACACG AGTTTCGCGG CGCTGAACCA GTGGCGCGGT
CAAGAAGTGA AATTACAAAC GCACAAAGGC GGCGGCGGCG AACGTAACCT TCAGTGGAAC
ACGGATAATC TCGCCGACGC CGCGCGCCGT ATCGTCGAGG GCGATCGCGA CTCTTCTTCA
TGGAGACAGA AACTGCAAAT GGTTGAGAAG CTAATTTGCG ATGACGGCGC CGACGTTGAC
TCTTTGGCGT ACGCTACTGT GTATTTATTT TGGATATCTG TCGGCGCCAT CGCGTGCGTC
GAGGACGGTA CGCACTACCG TCCAAACCAC CACGCCGGTT CGGCTGAACG TATGTACGGC
GCAATCGAGG CTGCGGAACG TTTCGCGAAC GATGTTGCGA GTGGTGGTGA TATTTACCGC
GCGCGTGAGC TTCGTGCGTT GATTCGCCGT CTGCATCCAC GACTCCCCGC GTTCACCGCC
GAGTTCACGC AAAGCGTGCC GCTGACGCGC ATTCGCGACA TCGCGCACGG CAAGGGTGAT
CAACATGGGA AATGTCGCGA AGTTCGACAA GAAATCAAGC ACACGATTCA GAACAAACTC
CATCGCTGCG CTGGTCCCGA GGATCTAGTT GCGACGGAAT CTATGCTCGC CAAACTCACC
GCTCCGGGAA CCGACTACCC CGAAGAGTTT GTCAACGAGT TTAAGATTTT CTATCGCGAG
CTCAAGGAAT TCTTCAACGC CTCCTCTGTG GCCGATCGCA TCGATCGTAT CGCGAACGAA
AATGGCGCCC CTGGCCGTGC GGCCGATAGC TCAAAGAAGT TTCTCTCTGC GAAGGCGACC
GTGGATGCGC TTCCTGCGAG CGACCGCGTG GGCGATCAAA CGACTATGAG CGCGCTCGTC
GCGTGCTTGC GTGCTATTCA CGACGCGCGA ACGGACATCA CCGCCGCCTT GGAATCGGGC
GGGGATTTGG GTCAAGCTGA ATCATCGACG CGTCAACAGT GGCGCTTGGC CGAGGTGAGC
ATGGAAGATT ACGCGTTCGT GTTGTTGAGC CGATTGCTCA ACGCGCTCGG CGCAGAGTCT
GAGCCGCCGC GAAACGACAT CAGCGCGAGC GAAGTAAAAC TCACTCTTGA GGCGCTGGCT
TTGACTTCTC GCACCATGGC GCTGAGTGCA GGAGGCGATA ACGAGCTAGA GGCAATCGCG
TCCGAAGCCG AAGCACTGGC TCGTAATGGT TTGCCCGCGG GCGAGGAAGG CGGTTTGCGC
GTCCAAGCCG TCGCCGAACG CGCTCGACGC GGCGCCGTCG ACTTTTGTTC GCTGTTGGAA
TCTTTATTCG ATGGACGGGC GTCGAGTCTC GGCAATGCGC TCGGTATTGA CCACGGCTCA
ATCAGTGTGT TCACAGAAGG TCAGATTCGC GCGAGCGTAG TGTTCCAATC CGCCAAGATC
GCTTCCTTGT TGCTGCGAGT CAGCCGGCAA ATCACCGGCG CCGCTGGATG GGATTGCGTC
GTGCAAGGCG AGGCTATCGG CGCGCTCAAG TGCGTCGAAA GGCTCACGCC CGAAGAGTGC
GCGCAGTTCA CCGAGCCAGT AATCGTGCTC GTTGCTAGTG CTGATGGTGA TGAAGAAGTG
TCGACGTGCG GCCCGAACGT GCGCGGCGTG GTGCTGTGTC ACGCGTTGCC GCATCTCAGT
CATTTGGCGC TTCGCGCGCG TCAAGCCAAA GTGCCCCTCA TCGCCGTCGA AGACGACAAG
CTTGTCGACT ACGCGCGTTC TTTGGCGAAT GAACCTGCCG TGAAGCTCAG TGCGGAAACC
ACTGGAATTA AGCTCGAGCC AACGACGGCT CCGGCGTCTG TTGCGGCTGC TTCAAGCGAA
GCGGGGCCAC AGGCGACGAA ACCCGTGATC CGCCTCGACA CTGATCTCTC AAGAGCGGGA
ACCGTGTTCG ATCTCGTCGC GCTGGACAAG CGTGGACTCG AAAAGTCGAT TCGCATCGCC
GGTACGAAGT CCGCTATGTG CGCACGATTG AGCACTATCG CCGAAAACTC TTCTGGATCG
GCGGCGTTCG CCGCGCCCGC CGGTGTCGTC ATTCCATTCG GCGCAATGGA ATTCGCGTGC
GCGAGCATCA GCAAGCTCGA ACATCTCGAC AGTTTGCTCC TCGAACTCAA CCAGTACGCG
GACGACCCAG TGAGGATGCG ACACACGTGC GAAGCCATCC AGAACCTCGT TCGTTCGCTC
AAGCCGTCCG CGAGCGCGCT GCAATCCGTC GCTGAAAAGT TTGGCCCGAA TGCGCGCGTC
ATGGTTCGAA GTAGCGCCAA CGTTGAAGAT CTCGAGGGGA TGTCCGCGGC TGGTTTGTAC
GATTCCATCC CAAATGTCGA CCCGAACTCG GAAGACGCAT TCAGTCGCGC TGTTGGCGAG
GTATGGGCGT CCCTGTACAC CACTCGCGCC GTGGCTTCTC GCGCCGCCGC CGGCGTCGAT
CAACTCGAGG CGCACATGTG CGTCCTCGTC CAAGAGATGC TCTCGCCCGA GGTCAGTTTC
GTTCTACACA CGAAGCACCC GCTCACAAAT GATAATAACG AAGCGTACGT CGAGTTTGCG
CTCGGTTTGG GCGAGACTTT GGCGTCGGGC GCGGTTCGAG GATCGCCCTG CCGCGTGAGC
GTCGACAAGC GATCCGGCAA AGCGACGGTG AATGCGTTCG CCTCGTTCGG AACCGCCCTC
GTCCGCGATG ACGACTCGGC AACCGGAATG AAATCTGTCG CCGCGGATTA CGCATCCCAC
TGGCTTCACA ACGACGTCGC GAAGCGCGAC GAAATCGCCA CCAAACTTCT CGCCATCGGC
TCTGAGCTCG AGCGCGAGTT GAGTCCGCGC GGCGAGACGC TCCCGCAAGA CGTCGAAGGC
TGCATCCTTC CCTCTGGGGA AATTTGCATC GTCCAAGCGC GCCCGCAGCC CTAA
 
Protein sequence
VPSAEAPSPP PPPPPPPPPQ VEPIKVTEDT SFAALNQWRG QEVKLQTHKG GGGERNLQWN 
TDNLADAARR IVEGDRDSSS WRQKLQMVEK LICDDGADVD SLAYATVYLF WISVGAIACV
EDGTHYRPNH HAGSAERMYG AIEAAERFAN DVASGGDIYR ARELRALIRR LHPRLPAFTA
EFTQSVPLTR IRDIAHGKGD QHGKCREVRQ EIKHTIQNKL HRCAGPEDLV ATESMLAKLT
APGTDYPEEF VNEFKIFYRE LKEFFNASSV ADRIDRIANE NGAPGRAADS SKKFLSAKAT
VDALPASDRV GDQTTMSALV ACLRAIHDAR TDITAALESG GDLGQAESST RQQWRLAEVS
MEDYAFVLLS RLLNALGAES EPPRNDISAS EVKLTLEALA LTSRTMALSA GGDNELEAIA
SEAEALARNG LPAGEEGGLR VQAVAERARR GAVDFCSLLE SLFDGRASSL GNALGIDHGS
ISVFTEGQIR ASVVFQSAKI ASLLLRVSRQ ITGAAGWDCV VQGEAIGALK CVERLTPEEC
AQFTEPVIVL VASADGDEEV STCGPNVRGV VLCHALPHLS HLALRARQAK VPLIAVEDDK
LVDYARSLAN EPAVKLSAET TGIKLEPTTA PASVAAASSE AGPQATKPVI RLDTDLSRAG
TVFDLVALDK RGLEKSIRIA GTKSAMCARL STIAENSSGS AAFAAPAGVV IPFGAMEFAC
ASISKLEHLD SLLLELNQYA DDPVRMRHTC EAIQNLVRSL KPSASALQSV AEKFGPNARV
MVRSSANVED LEGMSAAGLY DSIPNVDPNS EDAFSRAVGE VWASLYTTRA VASRAAAGVD
QLEAHMCVLV QEMLSPEVSF VLHTKHPLTN DNNEAYVEFA LGLGETLASG AVRGSPCRVS
VDKRSGKATV NAFASFGTAL VRDDDSATGM KSVAADYASH WLHNDVAKRD EIATKLLAIG
SELERELSPR GETLPQDVEG CILPSGEICI VQARPQP