Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_32964 |
Symbol | |
ID | 5003368 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 154299 |
End bp | 157697 |
Gene Length | 3399 bp |
Protein Length | 1132 aa |
Translation table | |
GC content | 58% |
IMG OID | 640418789 |
Product | predicted protein |
Protein accession | XP_001419085 |
Protein GI | 145349322 |
COG category | [C] Energy production and conversion |
COG ID | [COG1038] Pyruvate carboxylase |
TIGRFAM ID | [TIGR01235] pyruvate carboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0171502 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.395627 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAACCG TGGCGGTGTT CGCCGAAGCC GATCGACAGT CGACGCATAG GTACAAATCC GATGAATCGT ACGAGGTCGG TCACGGCAAG GCGCCGGTGG CGGCGTATTT GGATTACGAA TCCATCATTC GCTGCGCCAA GGAGAACGGG GCGCAGGCTA TTCATCCGGG CTACGGATTT TTGAGCGAAA ACGCCGCCTT TGCGCGACGA TGCGAGGAAG AAGGGATCGT GATGATCGGG CCGAAGTCGC AAACGTTGAC GGAGATGGGC GATAAGGTGA TCGCCAAGGC GAAAGCGAAG GAGTGTGGGT TGCCGCTCGT GCCGGGGACT GAAGAGGCGA CGGCGGACGT CAACGATGCC TTGGAATTCG CCAAGGAGTT CGGAATGCCA ATCATGCTCA AGGCCGCGAT GGGCGGTGGT GGTCGCGGTA TGCGTGTTGT CAAGGAGTAC TCCGAGCTCG AAGATGCGTT CAAGCGCGCT TCGAGCGAGG CACAGACTGC GTTCGGCGAC GGGAGAATGT TTTTGGAACG TTACGTCGAG GCGCCGCGAC ACATCGAGGT GCAAATTCTA GCCGACAACT ACGGTAACGT CGTTCACTTG GGCGAGCGCG ATTGTTCCGT GCAACGTCGT CACCAAAAAG TTGTCGAGTT GGCGCCCGCG CCCAACCTTG ACCCAGTGCT TCGCCAAAGA TTATTTGACG ACGCCGTGGC GCTCGCCAAG CATGTCAACT ATCGCAACGC CGGCACGGTG GAGTTCATGG TTGATAAGCA AGGGCGTCAC TATTTTTTGG AAGTGAACCC GCGCATCCAA GTAGAACACA CTGTGACGGA AGAAGTCACT GGGATCGATT TGGTGCAATC GCAGATATTG ATCGCAGGTG GTCAAAAGTT GTCCGACATC GGTATCAAGT CCCAAGACGA CATTCAGCTT CGCGGCTTCG CGATGCAGTG CAGAATCACC ACCGAAGACC CGTCCATGAA CTTCTCTCCG GATTTCGGCA AAGTTGAGGT CTACCGTCCT CCGGGCGGTA TGGGTGTGCG TTTGGATGGT GAAGTCGTGG TCGGTTCGCG AGTGTCACCC AACTACGATT CGCTCCTCGT CAAGCTCACG TGCTCGGAGA AGAACTTTGA AGCAACCGTG CAAAAGATGT ATCGATCGCT CAATGAATTC CGTATTCGCG GCGTGAAGAC AAACATTCCG TTCCTGATGA ACGTTTTGTC CAGCGAAACG TTCTTGAGCG CGAACTTTGC CACGGACTTT ATCGATAGCA CTCCGAGCTT GTTCAAGTTG GACTCGTACA TCGATGACAC GCAGAAGCTT CTCAATTACC TCGGTGACGT TGCCGTGAAC GGTTCGTCGC ACCCTGGTGC GGTCGGTCCC GCGCCGACGT GCGCGGAACC GCCAGTTCCC GAACCGAAGA AGTCTTTAGC GCAGCTCAAG GACACCGGGT TCAAGGCTAT TTTGGACAAG GAGGGTCCGG CGGCGTTCGC CAAAGCGGTT CGCCAGCACA AGGGCTTGCT CATCATGGAT ACGACGTGGC GGGACGCGCA CCAGTCGCTC TTGGCGACGC GCGTGCGGAC GCACGACCTT CGTCGTTCGG CACCACTCAC GCGAAGCGCC CTCAGTGGCG CGTATTCGCT TGAGATGTGG GGTGGCGCCA CCTTCGACGT CGCGCTTCGA TTCTTGCAAG AGGATCCGTG GAGGCGACTC GAGCTTCTTC GTGAGCAAGT GCCAAACATC CCATTTCAAA TGCTTCTTCG CGGCGCCAAC GCGGTCGGTT ATACTTCGTA CGCGGACAAC GTCGTGCAAA GCTTCGTGCA CCAAGCGCGC AAGTCTGGTA TCGATGTTTT CCGCGTTTTT GACTCGCTCA ACTACGAAGA TAACCTCATG TTTGGCATCG ACGCCGTGCG CAACGCCGAC GGCGTCGTTG AAGCGACAAT TTGCTACACT GGTGACGTGA GCGATCCGAA GAAGACTAAA TATACGCTGG ACTATTATGT CGACCTCACG ACCAAGCTCG TTGAGCACGG TATTGATGTG CTCGCCGTGA AGGACATGGC TGGTTTGTTG AAGCCGCGCG CCGCGACGAT GTTGATCTCC GCCATTCGCG CCAAGTTTCC CGACTTGCCC ATCCATGTGC ACACCCACGA CACCGCTTCA ACGGGCGTGG CTTCCATGCT CGCTGCTGCC GCGGCAGGTG CGGATGTTGT CGATGTGTGC ATGGACGCCA TGGCTGGCAC GACGTCCCAA CCCGCCATCG GCGCCGTGAT CAACTCAGTC GCGGGCACCG AGCTCGACAC CGGATTGGAT ATCGATGAAG TGCTCAAGCT CAACTTGTTC TGGGAACAAA CGAGAGGTCT GTACTCTCCG TACGAGAGCG GTATTAAAAG CGGAAGCTCA GATGTGTACA TTCACGAGAT GCCCGGTGGT CAATACACCA ACTTGAAGTT CCAAGCGTAC GCCAACGGAC TCGGAAGCGA ATGGGATCGC ATCAAGGATT CGTACGCGAC GGCGAATAAG ATCTTGGGTG ACATCGTGAA GGTGACGCCG TCTTCCAAGG TCGTCGGTGA TTTCGCGCAA TTTCTCGTCG CGAACAACCT GGATGAGCAC TCCGTGGTGG AAAAAGCTGA CACGTTGTCC TTCCCGACTT CCGTCGTCGA GTACTTCCAA GGATACCTGG GCCAACCCGT CGGTGGCTTC CCCGAACCGC TTCGCTCGCG AGTGGTGAAG AACAAGGAAA TCATCGCCGG TCGTCCTGGG GCGTCGCTTC CCAGCATGGA CATCAACAAG TTACAAAACG AGCTGTCCAT CAAGCACACC GGCCGTCGAT CGATCACGCA CAAGGATGCG CTCGCGGCGG CTTTGTACCC GAAGGTGTTC GACGAGTACG TCGTGAAGCG CGACACCGTC GGGCCGGTCG GTTTGCTCCC GACCAAGGCG TTCTTGAAGG GATTGGACAT CGATGAAGAG ATCGAAGTCA CCACCGATCG CGGTGTGAGC ACGAACATCA AGCTTAAGGC TGTCGGTGAG CTGTTGCCAA GCGGCAACCG CGAAGTGTTC TTTGAAGTCA ACGCGATTCC GCGCGTCGTC GAGATCCACG ACAGAAAGAT TCTCGAGAGC GCCTCCAAGG GCGGCGGCGG CGCAGTCGCT CGCGAGAAGA GCGATCCACT CGACCCGGGC TCGGTCGGCG CCCCGATGTC AGGGTCCGTC GTCGAAGTCC TCGTCGCTCC CGGTCAAAAG ATTAAAGCGG GCGAGCCCAT CGTCGTGTTG AGCGCGATGA AGATGGAAAC GACCGTCGCT TCTCCCGTCG CCGGCACCCT CAAGCACGTC GGCGTCGTCA AAGGCGACAC GTGCGCCGCC GGCGATTTAA TGTGCGCCAT CGACGTCGAC GCCGCGTAA
|
Protein sequence | MQTVAVFAEA DRQSTHRYKS DESYEVGHGK APVAAYLDYE SIIRCAKENG AQAIHPGYGF LSENAAFARR CEEEGIVMIG PKSQTLTEMG DKVIAKAKAK ECGLPLVPGT EEATADVNDA LEFAKEFGMP IMLKAAMGGG GRGMRVVKEY SELEDAFKRA SSEAQTAFGD GRMFLERYVE APRHIEVQIL ADNYGNVVHL GERDCSVQRR HQKVVELAPA PNLDPVLRQR LFDDAVALAK HVNYRNAGTV EFMVDKQGRH YFLEVNPRIQ VEHTVTEEVT GIDLVQSQIL IAGGQKLSDI GIKSQDDIQL RGFAMQCRIT TEDPSMNFSP DFGKVEVYRP PGGMGVRLDG EVVVGSRVSP NYDSLLVKLT CSEKNFEATV QKMYRSLNEF RIRGVKTNIP FLMNVLSSET FLSANFATDF IDSTPSLFKL DSYIDDTQKL LNYLGDVAVN GSSHPGAVGP APTCAEPPVP EPKKSLAQLK DTGFKAILDK EGPAAFAKAV RQHKGLLIMD TTWRDAHQSL LATRVRTHDL RRSAPLTRSA LSGAYSLEMW GGATFDVALR FLQEDPWRRL ELLREQVPNI PFQMLLRGAN AVGYTSYADN VVQSFVHQAR KSGIDVFRVF DSLNYEDNLM FGIDAVRNAD GVVEATICYT GDVSDPKKTK YTLDYYVDLT TKLVEHGIDV LAVKDMAGLL KPRAATMLIS AIRAKFPDLP IHVHTHDTAS TGVASMLAAA AAGADVVDVC MDAMAGTTSQ PAIGAVINSV AGTELDTGLD IDEVLKLNLF WEQTRGLYSP YESGIKSGSS DVYIHEMPGG QYTNLKFQAY ANGLGSEWDR IKDSYATANK ILGDIVKVTP SSKVVGDFAQ FLVANNLDEH SVVEKADTLS FPTSVVEYFQ GYLGQPVGGF PEPLRSRVVK NKEIIAGRPG ASLPSMDINK LQNELSIKHT GRRSITHKDA LAAALYPKVF DEYVVKRDTV GPVGLLPTKA FLKGLDIDEE IEVTTDRGVS TNIKLKAVGE LLPSGNREVF FEVNAIPRVV EIHDRKILES ASKGGGGAVA REKSDPLDPG SVGAPMSGSV VEVLVAPGQK IKAGEPIVVL SAMKMETTVA SPVAGTLKHV GVVKGDTCAA GDLMCAIDVD AA
|
| |