Gene Information |
Locus tag | OSTLU_13918 |
Symbol | |
ID | 4999954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 828465 |
End bp | 831794 |
Gene Length | 3330 bp |
Protein Length | 1109 aa |
Translation table | |
GC content | 56% |
IMG OID | 640415375 |
Product | predicted protein |
Protein accession | XP_001415613 |
Protein GI | 145341018 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG5537] Cohesin |
TIGRFAM ID | |
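The coordinate and length fields above are mutually consistent, and a short sketch can make the arithmetic explicit. It assumes 1-based inclusive coordinates (which matches the reported 3330 bp) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of 3 and, by default, that the
    final codon is a stop codon that does not contribute a residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length = gene_length(828465, 831794)
print(length)                  # 3330
print(protein_length(length))  # 1109
```

Both values agree with the Gene Length and Protein Length rows of the record.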
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0169683 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGCGC GGTCGAGGAA GAGCGAGGCG ACGGCGACGC GAGGAGGGAA CGAAAACGCG GACGAGAGCG GGGCGCCGGC GAAGAAAAAG TCGAGGAGCG GCGCGCGTAA GAAGAAACCG ACGACGGGTG GGGAGGCGTT GAAAAATAGA AACGAGAACG CGTCGGTGCA GAACGCGTCG GTGAGCGGAG ATGGGGACGA AGACGCGGAC GAGGGCTCGG GGGACTTGTT TGATCTGTTG CGCGTGGAAA ACGCGGCGAC GATTTTACAC GCGAGTGATT GGCGAGCGCG GTACAACGCG AGCGAGATCT CGGCGGTGGC GGAGATTTAT TCGTTATTGT CGAAAGCGGC CGGTTGCTCG TCTGGGGTGA CTGCGATTGA GTTGCAGAGA AGCGACTGCT TGATGATTAT GAATCGCGTC GTCGAAGACA TGGCGGCGGG GAATTTGTAC GGCGACGATC CTTTGGCAAA ACGTTCGCGA GATTTCAAGG GATTCCGTGA GAACTTCTTG GACTTTATCG ATAAGTGCAT TCGCGACGCG AGCGAAGGCG GGGAGCTGTA CGATGGTACG CTCTTCGCGA CGCTGGCGGA GATCGTGAGC ACGTGCGCCG GGAGCAAGGC GCGTCCGTTG CGCATGGCTG CCACGATGAT GGGTTTACAG ATGATTTCCA GTTTGATTAC GGTGGTGAAT AACTTGCAGA AGGCTCGCGA TTTGAAACAA AACCAGGTGG ACATCGAATT GAAGAAGAAA AAGTCTGGCG GCGAAGTGGT CAAGAGTTTG AAGCGCCAAA TCGAAAGCGC GCAGGAACAC ATCGAGTTGG TGGAAGGTTA CATGAACGAC ATCTTCACTC ACGTTTTCAC GCATCGCTTC CGTGATTGCG ACGAAAACAT TCGCGCCGCG TGCATGACCG CGCTTGGGAA GTGGATGATG AAGCATCAGC TCGTGTTCCT CACCGACTTT TACTTGAAAT ATCTCGGTTG GAGCTTGAAC GATAAGAGCG CCGCTGTTCG ACTCGAAGTT TTGCTCGCGC TGAAGACTTT GGCGTCCAGC CAGTCTCACT TAGCTATGAT GGATACGTTT ATTGCTCGAT TCCGTGGTCG CATGGCGGAA ATGTTGCGCG ACGTCGACGC CCACGTCGTC GTGGAGGCGG TGCGACTCGC CGCGGTTTTG CACGAGCACA CCGAATTGGA TCCTGAGCAC ATGAACTTTG TCACCGCGCT CATCATGGAC AAGACTCCAT CGATCCGTAC CGCCGCGGCG AAAGCGACGA AGACACTGAT GCACACGCTT ACCGAGACGT ACCGCAAGGC GCGAGGGATT TCCTATGATG ATTCGACAAA CCCGGCGCTC GAGAAGGAAC TTCACGGCAT CGTGCAGCTC TTGAACGATC TCGGCGATGA AAATGGCGGG CACGGTAAAG TCATTGAAGG TTTGTCGGGG GTATATCCCG TCTTGGCACA ACCTGGCTTT ATCGCGGGCA TCTTGAAGCA CGACATGGAA ATGGCAGACG CGGCCGTGAT CGCGAACGTC TTGGTTCTCA CCATGCGCAA GGCGATGGGC GAAGACGTCT CCAACTCGTA CACCAAGACG GTGTCTCGAC AAAGCGCAAA AATACGAAAC GCGATCGAAG CTGCGCACGA ACAAATGACG AAAGATATCG GTAGCTTGAT TCCGCAACTT CTGAGCAAGT ATCAAGCCGA AGCTAACGTC ATCGGTCCGC TTGTCGAAGT TGTTCGTTTT GTGAAACTCG AGCACTACTC GTTGCGTCAC GAGGAGGATC AATTTACCGC TCTTGCCGAG 
CAAATAAAAG ACATCTTCTT CAAGCACAGC GACAAGCGTA CTTTGGAAGC CTGCGGTGAA GCGTTCAATT ACTTCTGCAA CGAGGGCTTC GAAGCTACGG CGCCTTTCGC ACAACCGGTG CTCGACAGCA CGGTGAACGA CTTGTCGGCT CGCCTATCTC CGGCGTTGAA GAAGGTTCGA GCTTTGATGG CCAAGAGCGA TGAAAGCGTG CTGAACGAAA ACGAGGGTTA CGCGTTTGAG CTTCGCATGT GCTTGTATCG GGTGCGTGCT TTGATTTCAA AGTGCAATAT CTCCAGCGGC GTGCGCGTGA TTAACGACTT GTCTCAATAC GTCGCCGACG TCTCGCGCGC CAACGTGCCA GTGGGTAAAG AATCTGTCGC CATGGCGTCC TCGTCTGTTT CTTTCGCTCT CATTTGGCAA GGCCTTGAAC TAATGGATAG TGATTCGGCG ACGAGCGTCG AGGTGAACGA GCACTTGACC GAACGCGACG CCTTCTTGTC CAACGTGATG CACATCTTGC GTCGCGCCGA GGACTCGATT GCGGATTCGG ACGATCTTCG GCGTTCTTTG ATTTCCACCG TTTGCGATAT GGTGCTGTAT TATTACAACG CGAGCACACT GCCCGCGGCG CATCCTGCGA AAGTGTTGCA GCTGAAGCTC AACTCGGCGG ACAGTGAAGC TGTGTGGCAG CAGTGCACCG CGTTGATCAC GCCGGATGAC GTCAAGCAAG ACGCCGACTT GGACTCGGCA CGCCTCGCGT ACCGCATGGC AGTGCACGAA CAGCGTATCG CCTCCAACGG TGCGATCGGT GCAGACTTTT TGTCCAACTT CAAACTTACC GGCCCCTGGA TCGATGCCGC CATCCGCACG TACTGCAGTG ATTTACGCCG AACCGGTCCA CAAGTGCTCG TGCGCGCTGT CTTGACAGCG TTGCACAGCG CGTATGCAGA AGTCTTACAG ACGGATCTCG GAAACCGACA AGTGCTCATC GAGGCGTTCA CCGATCTCGC GACCCGATTG AGTGATATCT TTATGCTATC ATCCAAACGC GATCGCTTGG TCATGCGCAT CATGTTCGAC GAGTCGCTCA AGTCTGTCCT TCTCCCCGAA CCGTCGTACG ACCGATTTTC CTTCTTGGCG TACGGACTGG GTCCCTTCCT CTCAAGACTG TCCGCCGTCG ACGCCAAGGC TTTGACGACG TTCGTCGACG ATGCGCTCGC CAAAGTCGAT AGTGAGGACA CGAGATGTAC GCCATTGATT GATTTCGCCG ATCAGCTCAA CAAGCGTTCC AAGGGCGCGT CGGAGCGCCG CGCGCGTTGG GCGAGAAAAC GCTCCGCGGA ACAACAAGAT GTAGACGTCA AGAAGTCCAA GGCCGACGAC GGAGACGAGG ACGAAGGCGA TGATGATGCC ATGGAGCAAG ACGACACCAT CGAAACCGCC AACGATGACG CCGAAACCAC CGCGGATGAT CACACTTTGG ACGACGGTCC GGTGGAGACC GCCGAGCGCA GTCGCTCGCG CCGCAAGTGA
|
Protein sequence | MPARSRKSEA TATRGGNENA DESGAPAKKK SRSGARKKKP TTGGEALKNR NENASVQNAS VSGDGDEDAD EGSGDLFDLL RVENAATILH ASDWRARYNA SEISAVAEIY SLLSKAAGCS SGVTAIELQR SDCLMIMNRV VEDMAAGNLY GDDPLAKRSR DFKGFRENFL DFIDKCIRDA SEGGELYDGT LFATLAEIVS TCAGSKARPL RMAATMMGLQ MISSLITVVN NLQKARDLKQ NQVDIELKKK KSGGEVVKSL KRQIESAQEH IELVEGYMND IFTHVFTHRF RDCDENIRAA CMTALGKWMM KHQLVFLTDF YLKYLGWSLN DKSAAVRLEV LLALKTLASS QSHLAMMDTF IARFRGRMAE MLRDVDAHVV VEAVRLAAVL HEHTELDPEH MNFVTALIMD KTPSIRTAAA KATKTLMHTL TETYRKARGI SYDDSTNPAL EKELHGIVQL LNDLGDENGG HGKVIEGLSG VYPVLAQPGF IAGILKHDME MADAAVIANV LVLTMRKAMG EDVSNSYTKT VSRQSAKIRN AIEAAHEQMT KDIGSLIPQL LSKYQAEANV IGPLVEVVRF VKLEHYSLRH EEDQFTALAE QIKDIFFKHS DKRTLEACGE AFNYFCNEGF EATAPFAQPV LDSTVNDLSA RLSPALKKVR ALMAKSDESV LNENEGYAFE LRMCLYRVRA LISKCNISSG VRVINDLSQY VADVSRANVP VGKESVAMAS SSVSFALIWQ GLELMDSDSA TSVEVNEHLT ERDAFLSNVM HILRRAEDSI ADSDDLRRSL ISTVCDMVLY YYNASTLPAA HPAKVLQLKL NSADSEAVWQ QCTALITPDD VKQDADLDSA RLAYRMAVHE QRIASNGAIG ADFLSNFKLT GPWIDAAIRT YCSDLRRTGP QVLVRAVLTA LHSAYAEVLQ TDLGNRQVLI EAFTDLATRL SDIFMLSSKR DRLVMRIMFD ESLKSVLLPE PSYDRFSFLA YGLGPFLSRL SAVDAKALTT FVDDALAKVD SEDTRCTPLI DFADQLNKRS KGASERRARW ARKRSAEQQD VDVKKSKADD GDEDEGDDDA MEQDDTIETA NDDAETTADD HTLDDGPVET AERSRSRRK
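As a sanity check on the two sequence fields, the sketch below translates a nucleotide sequence and computes its GC fraction. The record leaves the Translation table field empty, so the use of the standard genetic code here is an assumption. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. Only the first 30 bases of the gene sequence are used as input, so the GC fraction printed for that prefix is not expected to match the 56% reported for the full gene.

```python
from itertools import product

# Standard genetic code (assumed; the record does not state a translation table).
# Codons are enumerated in TCAG order, which matches the amino acid string below.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
prefix = "ATGCCGGCGC GGTCGAGGAA GAGCGAGGCG"
print(translate(prefix))  # MPARSRKSEA
print(f"{gc_content(prefix):.1%}")
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to the same functions would allow the whole record to be checked in the same way.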