Gene OSTLU_51006 details


Gene Information

Locus tag: OSTLU_51006
Symbol:
ID: 5004725
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009366
Strand:
Start bp: 480016
End bp: 483049
Gene Length: 3034 bp
Protein Length: 1007 aa
Translation table:
GC content: 55%
IMG OID: 640420146
Product: predicted protein
Protein accession: XP_001420862
Protein GI: 145353090
COG category: [C] Energy production and conversion
COG ID: [COG2352] Phosphoenolpyruvate carboxylase
TIGRFAM ID:
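
The start and end coordinates listed above agree with the stated gene length if both positions are read as 1-based and inclusive. That convention is an assumption about this record, and `span_length` is a hypothetical helper name used only for illustration. A minimal sketch of the arithmetic:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(480016, 483049)
print(gene_length)  # 3034
```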


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.381291
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0264006
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGAAGG CGGTGTTCAG TGGGAAGGAT AAGGCGAAGA TAACGTCGCA TAAACGGACG 
GGGTCGTTGT TCGCGAGCGA GGAGGGCGAG GCGCTGGACG CCCTGGCGCG GTCGTCGAGT
TATCTGAGCG GTAGGGGAGA GACGAAGGAG TTCAACGCGC ACGACGTGAT TGAGGAGTGT
GATGAACTGC TTCGGACGAT CTTCTTCGCC GTGGTGAGGG AGACGGCGGG AGATAAGTTT
TTGGGGCAGT TGAAGAGCGT GTACGAGGCG AGCGAAAAGT TTGGCAGCTC GCACGACCCG
AAGGATTTCG ACGCGATGCA GGCGATGTTG GAGACGATGG AAGTGGATGA ATCGTTGCAG
TTCGCGTCGG CGTACTCGAA TCTGTTGAAT CTGCACAATA TTTCCGAGCA AGTGGCGAAC
GCGATGGAGG AAAGACACAG ACGTTTGGAC GATATTCCGC GAGGGCCGGC AAAGACGACG
AACGGCGCGA TTAAAGGTTT GCTTCGAGCC GGGAAGTCGA CGGAGGAAAT CTACAGCGCT
TTAGCGGTGC AGCACGTTGA TTTAGTGCTC ACCGCCCACC CCACGCAAGC CTTGAGACGG
TCGATGTTGA AATCTTTCGG AATAATTCGT GAAAAGCTGT TGCAGCTGCA ACGATTTCGC
CTTTCGCGAT ACGAGCGTGC AGAAGTTTTG GACGAGATTC GATCGAAGGT CGCCTCGGCG
TGGCGCACAG ATGAGATCCG CCGCACGCCA CCGAAGCCTC AAGACGAAAT GCGCGCGGGT
CTGACGTACT TTCAGCAAAC CATCTGGGAT GGCATCCCAA CTTTCATGCG TCGCGTGGAC
ACGTCGCTCT TGGCAAACGG ATGCCCTCGT TTGCCGCTCG ACAGATCCAT CGTTACGTTC
GGGTCGTGGA TGGGTGGTGA TCGTGACGGC AACCCATACG TCACGGCATC ATGTACGCGC
GACGTCGTTC TCTTGGCCCG TGTGCAAGGG GTTAACCTAC TCTTCCGAGC GATTCAGCGT
CTGATTTTCG ATTTGTCCAT GTGGCGATGC AACGACGCCG TCAAGGCGTT GGCAAAGGAC
ATCTTGGAGA ACTCGGAAAC GGACAACTTT ACGATCTTCG AAGAGCGCAA GAAGCGAAAC
TACGATGACT TTTGGAAGGC CATTCCCGAG CACGAGCCTT ACCGCGTCAT CTTGGCTGAA
CTTCGAGACA AGCTTTACAA CACGCGAGAG GCGTTGCAGC GTTGCATCGC GGACAACGAC
GTCAACATCG ACATGAACGA CGAGACTATC ATTCGTTCCA AGGACGAACT GTTCGCACCC
TTGGTGGTGT GCTACGAGTC TCTCATCGAA GTCGGTGACG CTCAAATTGC CAACGCCTAT
CTTCTCGACG TCATTCGCCA AGTGCAGTGC TTTGGGTTGG GGCTGGTCAA ACTCGATATC
CGGCAAGAAT CTGATCGCCA CGCCGAGGCT TTGGACGCGG TGACCAGGTA CATCGGCCTC
GGATCGTACC TCGAATGGAG TGAGGAACAG AAAATTGAGT TCCTGACACG CGAGCTTGAG
AGCAAGCGGC CACTTTTACC TTCCGACCTC GAATGCTCGG ACGACGTTCG AGAAGTCCTG
GACACGTGTA AGATGATTGC GCACTTGCAA CAAACGTGCC CGGGTGCCCT CGGAACGTAC
GTTATATCCA TGGCGACAAG TGCCAGCGAT GTTTTGGCAG TTGTATTATT ACAGCGCGAG
TGTGGCTGTC GCAAACAGGA TCTTCTTCGC GTTGCACCGC TCTTTGAGCG ACTCGACGAT
CTGAACGACG CCCCGCGCGT GTTGCGCCAG CTCTTCTCCG TGAAGTGGTA TCACGACCAC
ATCGCGGGAT TCCAAGAGGT TATGATCGGG TATTCGGATA GTGGAAAAGA CGCTGGTCGC
ATGGCTGCAG CATGGGCGCT GTACGATGGT CAAGAACGCG TCGTGGCGGC AGGGAAGGAG
TTTGACGTTG CGCTCACCCT GTTTCACGGT CGCGGAGGCA CCGTAGGTCG CGGGGGAGGT
CCAGCGCACA TTGCCATGTT GTCTCAGCCT CCAGGCACTG TGAATGGTAG CATCAGAGTC
ACCGTGCAAG GAGAAGTGAT CGAAACCGAC TTCGGTGAAA AAGAAAACTG TTTTCACACT
CTGGATTTGT ACACGGCGTC CGTTCTGGAG CACACGTTGA AGCCGCCGGC GCATCCTCGC
GATGAATGGC GCCGAGTCAT GGATAGAATG AGCGAATACT CATGCGCGCA TTATCGTAAG
ACTGTGTTCG AAACCCCTGA TTTCGTCGGA TACTTTGCAC AGGCCACGCC TGGAGCCGAG
CTTGGATCGC TCAACATCGG TTCTCGACCG GCCAAGCGTA AACCGAGTGC GGGCGTCACC
GCTCTTCGAG CAATTCCGTG GATTTTCGCG TGGACGCAGT CGCGATTCCA CTTGCCAGTC
TGGCTCGGAA TTTCCACATC ATTCAGACGC TTGATCGACG AAGGCGAACT GGAAACGCTG
AGGGACATGT ACAAAAGTTG GCCTTTCTTT GAGGTGACGA TCGATTTAGT GGAGATGGTG
CTCGCCAAGG CGGATCCCGT CGTCGTGGCG TATTACGAAA GAGCCCTTGT CGATCCAAAA
TTGCACGACT TCGGCGCCAG TCTGCGCGGC GAGCTTCAAG AAAGCATCGA TTGCATCCTC
GCCGTCTCCG AGCATATCGG TTTGCTCGCC AAACCAGAAA AAGTCGAGGC GAACGAGGCG
GTTCAGGTGC ATAAAAAGCT CGCTCACAAA CTTCACAAGC GGTCGCTGTA CATCACGCCG
CTCAACGTGT GTCAAGTGCG ATATCTCATC GCCGCTCGCG CGCTTGAAAA TGAAGAAGAT
GGCGATAAGC TAAGCATGCA AAAAGTCAAG ATCACCTTGC TCGAGGGTTA CCCGTTCCAA
GATTACAATT ACAAGGGCGC GGTGAACGAC GTTTTAAAAA TCACGATGAA GGGCATCGCA
GCCGGGATGC AAAACACTGG GTGACAACAT CGAA
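
The gene sequence above is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation on it. The sketch below shows one way to do that and to compute a GC fraction of the kind reported in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs only on the first listed line rather than the full gene.

```python
def clean_sequence(listing: str) -> str:
    """Remove all whitespace from a block-formatted listing and upper-case it."""
    return "".join(listing.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence listing above (not the full gene).
first_line = "ATGTTGAAGG CGGTGTTCAG TGGGAAGGAT AAGGCGAAGA TAACGTCGCA TAAACGGACG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Applying the same two functions to the whole listing gives a value that can be compared against the GC content field.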
 
Protein sequence
MLKAVFSGKD KAKITSHKRT GSLFASEEGE ALDALARSSS YLSGRGETKE FNAHDVIEEC 
DELLRTIFFA VVRETAGDKF LGQLKSVYEA SEKFGSSHDP KDFDAMQAML ETMEVDESLQ
FASAYSNLLN LHNISEQVAN AMEERHRRLD DIPRGPAKTT NGAIKGLLRA GKSTEEIYSA
LAVQHVDLVL TAHPTQALRR SMLKSFGIIR EKLLQLQRFR LSRYERAEVL DEIRSKVASA
WRTDEIRRTP PKPQDEMRAG LTYFQQTIWD GIPTFMRRVD TSLLANGCPR LPLDRSIVTF
GSWMGGDRDG NPYVTASCTR DVVLLARVQG VNLLFRAIQR LIFDLSMWRC NDAVKALAKD
ILENSETDNF TIFEERKKRN YDDFWKAIPE HEPYRVILAE LRDKLYNTRE ALQRCIADND
VNIDMNDETI IRSKDELFAP LVVCYESLIE VGDAQIANAY LLDVIRQVQC FGLGLVKLDI
RQESDRHAEA LDAVTRYIGL GSYLEWSEEQ KIEFLTRELE SKRPLLPSDL ECSDDVREVL
DTCKMIAHLQ QTCPGALGTY VISMATSASD VLAVVLLQRE CGCRKQDLLR VAPLFERLDD
LNDAPRVLRQ LFSVKWYHDH IAGFQEVMIG YSDSGKDAGR MAAAWALYDG QERVVAAGKE
FDVALTLFHG RGGTVGRGGG PAHIAMLSQP PGTVNGSIRV TVQGEVIETD FGEKENCFHT
LDLYTASVLE HTLKPPAHPR DEWRRVMDRM SEYSCAHYRK TVFETPDFVG YFAQATPGAE
LGSLNIGSRP AKRKPSAGVT ALRAIPWIFA WTQSRFHLPV WLGISTSFRR LIDEGELETL
RDMYKSWPFF EVTIDLVEMV LAKADPVVVA YYERALVDPK LHDFGASLRG ELQESIDCIL
AVSEHIGLLA KPEKVEANEA VQVHKKLAHK LHKRSLYITP LNVCQVRYLI AARALENEED
GDKLSMQKVK ITLLEGYPFQ DYNYKGAVND VLKITMKGIA AGMQNTG