Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_50201 |
Symbol | |
ID | 5003343 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 124919 |
End bp | 128131 |
Gene Length | 3213 bp |
Protein Length | 1043 aa |
Translation table | |
GC content | 58% |
IMG OID | 640418764 |
Product | predicted protein |
Protein accession | XP_001419289 |
Protein GI | 145349746 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00844273 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCGCGC CTCGACGTAA AATCACGGGC CGAAGCGTGG TTCTCACCAC CAACGCTCTC GAGCCGCTCA TCTGCCACTG GGGTGTCGCC AAGGAAGAGC CCGGGGAATG GGTATTGGCT CCAAAGCTCG TGCATCCCGC CGGTACGGAG GCGGTGTCAC ACATGAGCTG CGAGACGCCG CTCGAGGAAT TCACCGGTTG CTTTCCGGCG GATCGCGTAT CGTCGACACC GTCCATGGAT GACGCAGATC TGTTCGACGA GTGCGCTTAC ATGCAAAGAC TCGAGTTCAA TCTTCCGGGT GATGGCCCAA ACGAGCTCAT GGGGTTGCAT TTCGTCTTTC GCAACGTTGA CGGGACCGCG TGGTACAAGG ACACGTCCAA CGGCAACGCC AATTATCACG CGTCGTGCAT ACGAGTCAAG GAAGAAGATC GCGTCGCGGA TGAGCTCGTG GATACCATCA CGCGCGCCGA AGCTGGAGGA TCGTGGTGGT CGCTCATGCA TCGTTTTAAT CTCGCGCGAT CGCTTTTGGA AAAATATTGC GTCCAGGGCG AAGACTCGGC CAAGGCGCAA AGCGCGGCGG CTAAAATTTT TGTATGGCTT CGATACAGCT CCATTCGACA GTTGACCTGG CAACGGAACT ACAACGTCAA ACCACGAGAA CTGTCGAGCG CGCAATCTCG TCTGACGCAC ATGCTCGCAG AAATTTACGT CACAAAGCCG CATTTACGCG ATATGGCGCG TCTCATGCTC GGCACCGTGG GCAAGGGCGG CGAGGGTGGA CAAGGCCAAC AAATTCGCGA CGAAATCTTG AACATTATGC ACCGAAACAA CATTAAGGAG GTTAAGGGCA TTTGGATGGA GGAGTGGCAT CAAAAGCTAC ACAATAATAC TACTCCAGAC GATATCATTA TCTGTCAAGC CTACTTGGAC TTTTTACGAT CCGACGGTAA TTTGGGGGCG TACTGGCATA CTCTCTCCGA AGGCGGCGTC ACCAAGGAAC GGTTGGAATC TTTCGAGCGT CCCGTGAGAA GTGAACCGAT CTGGCGACCA AATATTAAGG ATAATCTCAT TCGAGACTTC GAAAACTATT TAAAGATTCT CAAATCCGTG CATAGCGGTG CAGACTTGGC GGAGAGTTAC GACGCGTGCC GCAGTCGTTT ATCCGACGTC ACGCGAGGCG CAGTCGAGTA CGTCATCGCG CAACAATCGA GCGGTGATAT CTTTCCCGTC GTGAACGCGT GCTTGGAGGC TCGACACGGT TTACGTGATG CTGGTTTGGG CGATCCTTCG GATGCGCCAT GGTGTCGCGA GTTGCTGTAT CTCGACTTGT CCATCGCCGA CATCAGCAAT CGCGCGATTC AGCGCGGCTC GGACGGCGTG ACGGATACGG AGGGTTTGTT AGAGCTGACG GACATGGTCC TCGAAGATTT GTGTCTGTCT CTACCATCTA CAAACGATGA CTTGCTGTAT AGTTTGATGA ACTGGCGACG CATCCGCGAC TTACAGCGCG CTGGCGACGC CGCCTGGGCG CTTCGAGCAA AGGCGACGGT CGACCGCGTT CGTCTCGCCG TCACCGAACA CGCCGTGGCG ATTTCGGATA GCATGCAACC GGCGGCGCAT ACCATCGGCA CTCGATGCGA TTGCGATAAA TGGGTGGTCG ATCTCTTTTC AGAAGAAGTC ATTCGCGGCG GTCCGGCGTT CGCGCTTTCG CTGATGCTTA CAAGACTCGA CCCGTACTTG CGCCGCGAGG CGAACATGGG CGATTGGCAA ATTATCTCTC CCGCGACGTG CGCCGGAGTT GTTGCGCACG TGAAAACCCT CGCAGAGGTG ATGAACGAAA CATTCAAAAC CCCAACCGTG TTGGTTTGCG ATCACGTAGG CGGAGGCGAG GAGATACCTT CCGGCGCAAT CGCCGTGCTC ACGGGATCCT CTGTCGACGT GCTGTCGCAC AGCGCCGTTC GAGCGCGCAA CGGCGGCGTT CTGTTCGCCA CTTGCTACGA TCCGACTCTC TTGGACAAGT TTTCTGGGAT GAACAAAAAG GCAGTCAAGT TACACGTCAC CGCGGACGAG TGCGTGGCGT TCGATGAGAT TGCATTTGAG AACATTGGCA AGGAAAACTC GGCAGACGGC GCCTCGCACA ACGGTGACGC GCAGCGAATC AACATTAAAG CCATCGATTT CGCGGGCGAT TTCGCCGTTT CAATGGAAGA CTTCCGCGAG GATCTCGTTG GCGCGAAAGC GCGCAATACG AAGGCGCTGC GCGACGCCTT GAAGAATGGC GGGATTCCCG ATTGGATCAA CCTCCCGGTG TCCGTGGCGA TTCCGTTCGG CACGTTTGAG CACGTCCTCG CGAGACCGGA GAATGAAAAG CAAGCCGAGA CACTCAACAA GCTTTTGAGT GAAATCGATG ACACTACTGG TGTGACGCTC TCCGCGTCAC TACGATCGTG CCGTCGTTGC GTTCGCACCA TCGTACCCCC CGCTGGGATG CTCGAAAAAC TCGCCGCCGT CATGCGAAGC GGCGGGTTAA CCCCTCCGGA GGACGACGAC GCGTGGGAGC TCGCGTGGAA AGCCATCTGT GACGTGTGGG CATCAAAGTG GAACGAGCGC GCGTTTGTCA GCATGCGTAA TCGCGGTCTC GATCACAATA ATCTGCGCAT GTCTGTGCTC GTGCAACCCG TCATCAACGC CGATCACGCC TTCGTGATTC ACACCGTGAA TCCGAGCACG AACGCCGCCG ACGAGCTCTA CGCCGAAGTC GTCCAAGGCA TGGGCGAAAC GCTCGTCGGG AACTATCCTG GGCGCGCGCT GAGTTTTACG GTGAAGAAGA CGCCCGACGG CGACGTCTCA CCGCCGGAAA TCGCTGGATT CCCATCCAAG AACACCGTCC TTCGCGTCCC CCGCGAGACG CTCATCTTCC GCTCCGACTC CAACGGCGAA GACCTCGAAG GCTACGCCGG CGCTGGCCTG TACGAATCCG TCCCCATGCA CGCCACCGTG GAACACCACG CCGATTACGC GTCTGATCCG ATGATATGGG ACCACGACGC GAGCCAGGCC ACGCTCGCCG CCGTCGCCCG CGCCGGCGCC GCCATCGAGC GCGCGCTCGG CGCTCCCCAA GACGTCGAGG GCGTCGTCCG CGACGGCCGC GTCTTCGTCG TCCAAACCCG TCCGCAGGTC TAGCCTCCTG TATTTCATCG CCTCGCGCCG TCC
|
Protein sequence | MSAPRRKITG RSVVLTTNAL EPLICHWGVA KEEPGEWVLA PKLVHPAGTE AVSHMSCETP LEEFTDLFDE CAYMQRLEFN LPGDGPNELM GLHFVFRNVD GTAWYKDTSN GNANYHASCI RVKEEDRVAD ELVDTITRAE AGGSWWSLMH RFNLARSLLE KYCVQGEDSA KAQSAAAKIF VWLRYSSIRQ LTWQRNYNVK PRELSSAQSR LTHMLAEIYV TKPHLRDMAR LMLGTVGKGG EGGQGQQIRD EILNIMHRNN IKEVKGIWME EWHQKLHNNT TPDDIIICQA YLDFLRSDGN LGAYWHTLSE GGVTKERLES FERPVRSEPI WRPNIKDNLI RDFENYLKIL KSVHSGADLA ESYDACRSRL SDVTRGAVEY VIAQQSSGDI FPVVNACLEA RHGLRDAGLG DPSDAPWCRE LLYLDLSIAD ISNRAIQRGS DGVTDTEGLL ELTDMVLEDL CLSLPSTNDD LLYSLMNWRR IRDLQRAGDA AWALRAKATV DRVRLAVTEH AVAISDSMQP AAHTIGTRCD CDKWVVDLFS EEVIRGGPAF ALSLMLTRLD PYLRREANMG DWQIISPATC AGVVAHVKTL AEVMNETFKT PTVLVCDHVG GGEEIPSGAI AVLTGSSVDV LSHSAVRARN GGVLFATCYD PTLLDKFSGM NKKAVKLHVT ADECVAFDEI AFENIGKENS ADGASHNGDA QRINIKAIDF AGDFAVSMED FREDLVGAKA RNTKALRDAL KNGGIPDWIN LPVSVAIPFG TFEHVLARPE NEKQAETLNK LLSEIDDTTG VTLSASLRSC RRCVRTIVPP AGMLEKLAAV MRSGGLTPPE DDDAWELAWK AICDVWASKW NERAFVSMRN RGLDHNNLRM SVLVQPVINA DHAFVIHTVN PSTNAADELY AEVVQGMGET LVGNYPGRAL SFTVKKTPDG DVSPPEIAGF PSKNTVLRVP RETLIFRSDS NGEDLEGYAG AGLYESVPMH ATVEHHADYA SDPMIWDHDA SQATLAAVAR AGAAIERALG APQDVEGVVR DGRVFVVQTR PQV
|
| |