Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_32991 |
Symbol | |
ID | 5003263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 210970 |
End bp | 213726 |
Gene Length | 2757 bp |
Protein Length | 918 aa |
Translation table | |
GC content | 58% |
IMG OID | 640418684 |
Product | predicted protein |
Protein accession | XP_001419310 |
Protein GI | 145349788 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0660064 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.166301 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGTCGA ACGAGCACAC GAGCGATCGG CCGGCGACGA TGCGTTGGGA TGTGGGCAAT TTGGAAGAAG CGAGCGCGGC GCGAGCCGTC GTGCTCGCGG ACGAAAGCGG CGGGTCGTGG TTCGAGAAGC TCGCGGCCGT GGCGAATCAA ATCGGTATCG GAAAAGAGAT CACGCGCGAT AAGTTGGCTG TGAGCGCGAC GTATTTGCGT TGGATAAGCA CCGGGACGAT CAAGTGCGTG GAGAGTGGAG GTCATCGTCG TCCAAACGGA CCGGCGATGG TGGGTCGGGG GATATTTATA AGTATGGAGC AGGTGCAGGG CGCGATGTAT CGTCACGGAA CGGATTTGGG CGAAGTTGAG CGCGTTGTTA TGCGTCAAGT CCACCCTTGG CTACCTTCGT TCAACGAAGA GTTCACGAAC GCCGTGCCGC TGACGCGAAT TCGGGACATA GCGCACAGAA ACGACATTCC GGAGGATTTG AAGAAAACCA TCAAACACAC GATTCAGAAC AAGCTGCATC GGAACGCTGG GCCAGAGGAT TTAATCGCAA CCGAAGTTGT GTTAGAACGA ATCACCGCGC GTGAGGGCGA GTACTCTGAA GACTTCGTTC GCGAGTTTCG CGAGTTCCAC AGAGAACTCA AACGTTTCTT CAACGCCACC GGCGTGTTTG AGCGTCTCGA TAGCCTTAGG GGGACTTTAG ACGAGGAAAC GGAACCGCTC ATCGGCGAAC TTAGCGAACT GCAACACTCG CTCAACAATC TCAACGACGG CTGGCTCAAT CATGAAGGTG CAACGCTGCG TCACACGCTA GACGTCTGCA CACGGCTACG ATCGTACTTT TGCGCCGGTC TCAGCACGGG GATGCGAAAC GACGCTCCGG ATGATTCCGT GCGACAACGG CACGCCTGGC GACAGGTAGA AATGTCGCTG GAGGAGTACG CGTTTGTGTT ATTAGCGCGC TGCAATAATC TCATGGAGCG AGAGAAGTCA GACAACATGC AGCACGCGGC GAACGTTTCT GCGCTCGCTT TGAAACACAT CGGTCTTTCG GGATGGAAGT CCCTCGAGAC TGGAATCATC GCACGTGAGC TCATGTCTTG GACTACGAGC ACACCCGCGA TCGAGGACGG AAATGCGGCC CTTCGCTTGA AGGCAACGCT ACAACGCGCA AGGCGACTCA TCGAATCGCA AACGAGAGCC GTCATGAGTG GTTTCGGCGA CGCTCCCGCA CAACTTGCCG ACGCATTTGG TCTGGAACAC CACGTCGGCG CGACATTTGT CGAGTCTGTC ATTCGTGCGA GCGTGGCTTT CCAACTCTCT AGAACGATCG AGATGATGTC GGAGGTGGTT GAACGCAATG TTGACGGGGA CGGGTTCGAT CCGATCGTCC CCGGAGACGC GCAAGGCGTG CTCGTACTTT TGGACCGTTT GAATCCAGAG AGCGTTCAAG TGCACGGAGA CAAGGACATC ATAGCGTTTG TCTCCGAGGT CGATGGAGAT GAAGAGATTT CGTCGGCGGG AAGCAATGTG AAGGGAGTCA TCTTGCGCGA CGAGTTGGCA CATTTGTCAC ACCTCGCAAT TCGAGCGAGG CAAGAGCGCA CGCCCCTCGT TAGCGCACTA AGTGGGGAAG CTCGATCTAA AGTGAGCACG CGCGTGGGAA AAGACACCGT TTTGAACGTG TCTTCATTAC ATACCGAGTT GCGCGATTTC GATGGAACGC GCGATTCGCG GGAATCCGGT CATGCGGTGT CGCACGCGGC CGTGTCGCCA ACGGCGTGCG CTATGGTGAA CGTGATGACG TGTCTGCCCT TGGCGGAAGC CACGATCTCA AACGCTGGCG CGAAATCGTC GACGTGCAGT CGCTTGGCGA TCATCGCTCG CGATAGTGCG GCGTTTAAAG CGCCCTCGGG ATTCGTCGTG CCTTTCGGAT CCATGGAAGC GTCGATTCGC GACGAAGAGC GATTTGGTCA ACTTTTGCTC GCTCTCGAAA GCGTCAGCGT GCACGACATC GACGACGCGT GCTCGGCGAT CCAAAGTTTT ATCGTCGAAA ACCTACCAGA GCGCGAGATC GTCGAGAGGG CGTGCTCGGC ATTAGACGCG AGCGCTCGCC TGGTCGTGCG ATCGAGCGCA AACGTGGAAG ATTTAAGCGG CATGTCCGCG GCCGGACTGT ACGAAAGTGT CGTGGGCATC GATGCACAAA ACGTCACCGA AGTACAACGT GCTATTGCTG ACGTCTGGGC GTCGCTGTAC TCGCGTCGCG CCGTGCTCGC GCGCCGCGCC GCAGGCGTAA AACAATCCGA AGCGCGCATG GCGGTGCTGG CGCAGGAACT GTCGCCGAAC GCGCTCTCAT TCGTCCTTCA CACACAAAGC CCGATTCGAG GCGCAAAATC TGTACAAGCT GAGGTGTGTG TCGGTTTAGG TGAAACACTC GCGAGCGGTA TCGACGGCAC GCCTTGGCGT TTTGAGATCG ACCGCGCCAC CGGCGCCGTG GATGTGCTCG CGTACGCCAA TCACGCGTCT TCGCTGCGAT GCAGGTACGG CGCACCGACG TTTGGCAAAG TGACGATGGA ATCAGTGGAC TACTCACGAC AAGAACTCAG CACGAACGCA GACGCGAGAG CGCGTCTCGG ACGACGGCTT TTGAAGGCGG CGATTGAACT CGAAACCGCG CTCGGCGCGG CGCAAGACGT TGAAGGCGGC GTGCTCGGCG ACGACGAAGC CGTAATAATC GTGCAGTCGC GCCCGCAACC AATTTAG
|
Protein sequence | MQSNEHTSDR PATMRWDVGN LEEASAARAV VLADESGGSW FEKLAAVANQ IGIGKEITRD KLAVSATYLR WISTGTIKCV ESGGHRRPNG PAMVGRGIFI SMEQVQGAMY RHGTDLGEVE RVVMRQVHPW LPSFNEEFTN AVPLTRIRDI AHRNDIPEDL KKTIKHTIQN KLHRNAGPED LIATEVVLER ITAREGEYSE DFVREFREFH RELKRFFNAT GVFERLDSLR GTLDEETEPL IGELSELQHS LNNLNDGWLN HEGATLRHTL DVCTRLRSYF CAGLSTGMRN DAPDDSVRQR HAWRQVEMSL EEYAFVLLAR CNNLMEREKS DNMQHAANVS ALALKHIGLS GWKSLETGII ARELMSWTTS TPAIEDGNAA LRLKATLQRA RRLIESQTRA VMSGFGDAPA QLADAFGLEH HVGATFVESV IRASVAFQLS RTIEMMSEVV ERNVDGDGFD PIVPGDAQGV LVLLDRLNPE SVQVHGDKDI IAFVSEVDGD EEISSAGSNV KGVILRDELA HLSHLAIRAR QERTPLVSAL SGEARSKVST RVGKDTVLNV SSLHTELRDF DGTRDSRESG HAVSHAAVSP TACAMVNVMT CLPLAEATIS NAGAKSSTCS RLAIIARDSA AFKAPSGFVV PFGSMEASIR DEERFGQLLL ALESVSVHDI DDACSAIQSF IVENLPEREI VERACSALDA SARLVVRSSA NVEDLSGMSA AGLYESVVGI DAQNVTEVQR AIADVWASLY SRRAVLARRA AGVKQSEARM AVLAQELSPN ALSFVLHTQS PIRGAKSVQA EVCVGLGETL ASGIDGTPWR FEIDRATGAV DVLAYANHAS SLRCRYGAPT FGKVTMESVD YSRQELSTNA DARARLGRRL LKAAIELETA LGAAQDVEGG VLGDDEAVII VQSRPQPI
|
| |