Gene OSTLU_32964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_32964 
Symbol 
ID5003368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp154299 
End bp157697 
Gene Length3399 bp 
Protein Length1132 aa 
Translation table 
GC content58% 
IMG OID640418789 
Productpredicted protein 
Protein accessionXP_001419085 
Protein GI145349322 
COG category[C] Energy production and conversion 
COG ID[COG1038] Pyruvate carboxylase 
TIGRFAM ID[TIGR01235] pyruvate carboxylase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0171502 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.395627 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAACCG TGGCGGTGTT CGCCGAAGCC GATCGACAGT CGACGCATAG GTACAAATCC 
GATGAATCGT ACGAGGTCGG TCACGGCAAG GCGCCGGTGG CGGCGTATTT GGATTACGAA
TCCATCATTC GCTGCGCCAA GGAGAACGGG GCGCAGGCTA TTCATCCGGG CTACGGATTT
TTGAGCGAAA ACGCCGCCTT TGCGCGACGA TGCGAGGAAG AAGGGATCGT GATGATCGGG
CCGAAGTCGC AAACGTTGAC GGAGATGGGC GATAAGGTGA TCGCCAAGGC GAAAGCGAAG
GAGTGTGGGT TGCCGCTCGT GCCGGGGACT GAAGAGGCGA CGGCGGACGT CAACGATGCC
TTGGAATTCG CCAAGGAGTT CGGAATGCCA ATCATGCTCA AGGCCGCGAT GGGCGGTGGT
GGTCGCGGTA TGCGTGTTGT CAAGGAGTAC TCCGAGCTCG AAGATGCGTT CAAGCGCGCT
TCGAGCGAGG CACAGACTGC GTTCGGCGAC GGGAGAATGT TTTTGGAACG TTACGTCGAG
GCGCCGCGAC ACATCGAGGT GCAAATTCTA GCCGACAACT ACGGTAACGT CGTTCACTTG
GGCGAGCGCG ATTGTTCCGT GCAACGTCGT CACCAAAAAG TTGTCGAGTT GGCGCCCGCG
CCCAACCTTG ACCCAGTGCT TCGCCAAAGA TTATTTGACG ACGCCGTGGC GCTCGCCAAG
CATGTCAACT ATCGCAACGC CGGCACGGTG GAGTTCATGG TTGATAAGCA AGGGCGTCAC
TATTTTTTGG AAGTGAACCC GCGCATCCAA GTAGAACACA CTGTGACGGA AGAAGTCACT
GGGATCGATT TGGTGCAATC GCAGATATTG ATCGCAGGTG GTCAAAAGTT GTCCGACATC
GGTATCAAGT CCCAAGACGA CATTCAGCTT CGCGGCTTCG CGATGCAGTG CAGAATCACC
ACCGAAGACC CGTCCATGAA CTTCTCTCCG GATTTCGGCA AAGTTGAGGT CTACCGTCCT
CCGGGCGGTA TGGGTGTGCG TTTGGATGGT GAAGTCGTGG TCGGTTCGCG AGTGTCACCC
AACTACGATT CGCTCCTCGT CAAGCTCACG TGCTCGGAGA AGAACTTTGA AGCAACCGTG
CAAAAGATGT ATCGATCGCT CAATGAATTC CGTATTCGCG GCGTGAAGAC AAACATTCCG
TTCCTGATGA ACGTTTTGTC CAGCGAAACG TTCTTGAGCG CGAACTTTGC CACGGACTTT
ATCGATAGCA CTCCGAGCTT GTTCAAGTTG GACTCGTACA TCGATGACAC GCAGAAGCTT
CTCAATTACC TCGGTGACGT TGCCGTGAAC GGTTCGTCGC ACCCTGGTGC GGTCGGTCCC
GCGCCGACGT GCGCGGAACC GCCAGTTCCC GAACCGAAGA AGTCTTTAGC GCAGCTCAAG
GACACCGGGT TCAAGGCTAT TTTGGACAAG GAGGGTCCGG CGGCGTTCGC CAAAGCGGTT
CGCCAGCACA AGGGCTTGCT CATCATGGAT ACGACGTGGC GGGACGCGCA CCAGTCGCTC
TTGGCGACGC GCGTGCGGAC GCACGACCTT CGTCGTTCGG CACCACTCAC GCGAAGCGCC
CTCAGTGGCG CGTATTCGCT TGAGATGTGG GGTGGCGCCA CCTTCGACGT CGCGCTTCGA
TTCTTGCAAG AGGATCCGTG GAGGCGACTC GAGCTTCTTC GTGAGCAAGT GCCAAACATC
CCATTTCAAA TGCTTCTTCG CGGCGCCAAC GCGGTCGGTT ATACTTCGTA CGCGGACAAC
GTCGTGCAAA GCTTCGTGCA CCAAGCGCGC AAGTCTGGTA TCGATGTTTT CCGCGTTTTT
GACTCGCTCA ACTACGAAGA TAACCTCATG TTTGGCATCG ACGCCGTGCG CAACGCCGAC
GGCGTCGTTG AAGCGACAAT TTGCTACACT GGTGACGTGA GCGATCCGAA GAAGACTAAA
TATACGCTGG ACTATTATGT CGACCTCACG ACCAAGCTCG TTGAGCACGG TATTGATGTG
CTCGCCGTGA AGGACATGGC TGGTTTGTTG AAGCCGCGCG CCGCGACGAT GTTGATCTCC
GCCATTCGCG CCAAGTTTCC CGACTTGCCC ATCCATGTGC ACACCCACGA CACCGCTTCA
ACGGGCGTGG CTTCCATGCT CGCTGCTGCC GCGGCAGGTG CGGATGTTGT CGATGTGTGC
ATGGACGCCA TGGCTGGCAC GACGTCCCAA CCCGCCATCG GCGCCGTGAT CAACTCAGTC
GCGGGCACCG AGCTCGACAC CGGATTGGAT ATCGATGAAG TGCTCAAGCT CAACTTGTTC
TGGGAACAAA CGAGAGGTCT GTACTCTCCG TACGAGAGCG GTATTAAAAG CGGAAGCTCA
GATGTGTACA TTCACGAGAT GCCCGGTGGT CAATACACCA ACTTGAAGTT CCAAGCGTAC
GCCAACGGAC TCGGAAGCGA ATGGGATCGC ATCAAGGATT CGTACGCGAC GGCGAATAAG
ATCTTGGGTG ACATCGTGAA GGTGACGCCG TCTTCCAAGG TCGTCGGTGA TTTCGCGCAA
TTTCTCGTCG CGAACAACCT GGATGAGCAC TCCGTGGTGG AAAAAGCTGA CACGTTGTCC
TTCCCGACTT CCGTCGTCGA GTACTTCCAA GGATACCTGG GCCAACCCGT CGGTGGCTTC
CCCGAACCGC TTCGCTCGCG AGTGGTGAAG AACAAGGAAA TCATCGCCGG TCGTCCTGGG
GCGTCGCTTC CCAGCATGGA CATCAACAAG TTACAAAACG AGCTGTCCAT CAAGCACACC
GGCCGTCGAT CGATCACGCA CAAGGATGCG CTCGCGGCGG CTTTGTACCC GAAGGTGTTC
GACGAGTACG TCGTGAAGCG CGACACCGTC GGGCCGGTCG GTTTGCTCCC GACCAAGGCG
TTCTTGAAGG GATTGGACAT CGATGAAGAG ATCGAAGTCA CCACCGATCG CGGTGTGAGC
ACGAACATCA AGCTTAAGGC TGTCGGTGAG CTGTTGCCAA GCGGCAACCG CGAAGTGTTC
TTTGAAGTCA ACGCGATTCC GCGCGTCGTC GAGATCCACG ACAGAAAGAT TCTCGAGAGC
GCCTCCAAGG GCGGCGGCGG CGCAGTCGCT CGCGAGAAGA GCGATCCACT CGACCCGGGC
TCGGTCGGCG CCCCGATGTC AGGGTCCGTC GTCGAAGTCC TCGTCGCTCC CGGTCAAAAG
ATTAAAGCGG GCGAGCCCAT CGTCGTGTTG AGCGCGATGA AGATGGAAAC GACCGTCGCT
TCTCCCGTCG CCGGCACCCT CAAGCACGTC GGCGTCGTCA AAGGCGACAC GTGCGCCGCC
GGCGATTTAA TGTGCGCCAT CGACGTCGAC GCCGCGTAA
 
Protein sequence
MQTVAVFAEA DRQSTHRYKS DESYEVGHGK APVAAYLDYE SIIRCAKENG AQAIHPGYGF 
LSENAAFARR CEEEGIVMIG PKSQTLTEMG DKVIAKAKAK ECGLPLVPGT EEATADVNDA
LEFAKEFGMP IMLKAAMGGG GRGMRVVKEY SELEDAFKRA SSEAQTAFGD GRMFLERYVE
APRHIEVQIL ADNYGNVVHL GERDCSVQRR HQKVVELAPA PNLDPVLRQR LFDDAVALAK
HVNYRNAGTV EFMVDKQGRH YFLEVNPRIQ VEHTVTEEVT GIDLVQSQIL IAGGQKLSDI
GIKSQDDIQL RGFAMQCRIT TEDPSMNFSP DFGKVEVYRP PGGMGVRLDG EVVVGSRVSP
NYDSLLVKLT CSEKNFEATV QKMYRSLNEF RIRGVKTNIP FLMNVLSSET FLSANFATDF
IDSTPSLFKL DSYIDDTQKL LNYLGDVAVN GSSHPGAVGP APTCAEPPVP EPKKSLAQLK
DTGFKAILDK EGPAAFAKAV RQHKGLLIMD TTWRDAHQSL LATRVRTHDL RRSAPLTRSA
LSGAYSLEMW GGATFDVALR FLQEDPWRRL ELLREQVPNI PFQMLLRGAN AVGYTSYADN
VVQSFVHQAR KSGIDVFRVF DSLNYEDNLM FGIDAVRNAD GVVEATICYT GDVSDPKKTK
YTLDYYVDLT TKLVEHGIDV LAVKDMAGLL KPRAATMLIS AIRAKFPDLP IHVHTHDTAS
TGVASMLAAA AAGADVVDVC MDAMAGTTSQ PAIGAVINSV AGTELDTGLD IDEVLKLNLF
WEQTRGLYSP YESGIKSGSS DVYIHEMPGG QYTNLKFQAY ANGLGSEWDR IKDSYATANK
ILGDIVKVTP SSKVVGDFAQ FLVANNLDEH SVVEKADTLS FPTSVVEYFQ GYLGQPVGGF
PEPLRSRVVK NKEIIAGRPG ASLPSMDINK LQNELSIKHT GRRSITHKDA LAAALYPKVF
DEYVVKRDTV GPVGLLPTKA FLKGLDIDEE IEVTTDRGVS TNIKLKAVGE LLPSGNREVF
FEVNAIPRVV EIHDRKILES ASKGGGGAVA REKSDPLDPG SVGAPMSGSV VEVLVAPGQK
IKAGEPIVVL SAMKMETTVA SPVAGTLKHV GVVKGDTCAA GDLMCAIDVD AA