Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_27420 |
Symbol | |
ID | 5005360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | + |
Start bp | 232557 |
End bp | 237273 |
Gene Length | 4717 bp |
Protein Length | 1270 aa |
Translation table | |
GC content | 56% |
IMG OID | 640420781 |
Product | predicted protein |
Protein accession | XP_001421242 |
Protein GI | 145353911 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.503853 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCGCGCCGAC GCGCGTGCAT GGATCCCGCG AGACCGTTTT CGACGTGCGT GAGAGAAACT CGCGCGACGC GCGCGATGCG ACGTCGCGCG CGACGCGGCG GCGCGGGCGC GCGACGCGGC GGCGCGATGG ATGACGATCG ATTTGAGATT TCGCGAGCTG TCGAGTGATT AAAAATAAAA CGACGACGCG AAGACGCGGA CGCGCGACTG ACGAGGGCGA GGGCGAGGCG AACAGGACGG TGGTCGTGCC GCTCGAGGAC GAGAGACGCG CGAGGGCCGT GGGAGGGGCG CTCGGGGTGG ATAAGGAGTT GCAACCGAGG GTGGTGCGAA AGACGGTGAC GACGCGCGAG AGGGAGAACG GGGACGGTTG GGAGGTCGTC GCGACGTTCG AGGCGAGCGA GTTGAAGGCG CTGAGGAGTG CGGTGAGTGG GTATTACGAT TTGTTGACGG TGAGCGTGCG GACGGTGGAG GCGTTCGAGG AGGAGATTTG AATCGGCGGT ACGGGCGAGA ACGGGCGAGA TGCCGACGCG CGAGTATCAG ATGAGCGATT ATCACACGTA CGAGGCGATC GGGAAAGGGA AGCACAGTAC GGTGTATAAG GGGCGACGGA AGAAGTCGAT TCAGTACTTT GCGATGAAGA GCGTGGAGAA AGGACAGAGA AATCGAGTGA TGCACGAGGT GCAAGTGATG AAGGCGTTTT CCCACGAAAA CGTGTTGAAA TTCATCGCGT GTTACGAGAC GCAAAATCAT CTGTGGTTGA TTCTGGAATA CTGCGTCGGA GGTGATTTGT TGACGCTGCT GTCGCAAGAT CTCAAGCTTC CCGAACCTTC CATCATGACG TTCGCGCGCG ATATGTTCGT CGCCGCGCGA GAGTTGCACA GTAAAGGACT GGTGCACTGC GATTTAAAGC CAAGTAACAT GTTGCTCGAC GAAGAAGGGC GGATTAAGAT TTGCGGATTC GGACTCAGTC GCAAGGTTTC CACTGTCGTG GCGTCTTCGG GTACGCCCAC GCAGCTGTCG CGTCGAGGGA CACCGTGTTA TATGGCTCCG GAGATGTTTA CACAAGAGGG CGTGCACTCG TACGCGACGG ACTTGTGGGC GTTGGGGTGC GTGCTGTATG AGTGCGCCAC CGGACGACCC CCGTTCACGA GTACGTCGTT GACATCATTG ATCGAACAGA TTTTAGAGCA CGAGCCCGCA CCGTTGCCCG CGGCGTATGG TACGACGTTT AAGAATTTGG TCTCTGGGTT GCTCATCAAG CGCCCGCACG CTCGTTTGAC GTGGGACCAA GTCATTAACC ATGAATTTTG GACCGAGCGC GGAGGGCAAT ACTTGGACGA TCTTGGTGAC TTAGTGACGC TTAGACTCCC ACCGCAGCCT TCATTCGACG CGCTCGCGGC TAAACTGAAG AGCGAGGATT TGGCGGCGAG ACTGCGAATG CTTGGAAACG CAAGAGCGTC GGGGGCGGTG GACGTGCGCG ATTCTTTCAT GCCCGTCTTG ACGCAAAGTT TACGAGATAG CGTAAACGTC TCGCGACTGT CGAGGATTGC GCGTTCAAAT TTGGAGCGAG AAGAAGTCGG TTCGTACGCG CCTAGCGGTG CTGCAGCTAT CGAAGGCGAC CCCTTGCAAG TCGCTGCCAT TCACGAAGAC GTGGAGTTGG AGAATCCAGA CGCTGAGCTC AACTTCACCG CTCCGAGCGG GAACGACGAT TTGGACGTGG ACATGGAAAC CGAAGACGTT ATCATACCTT CAGCTCCCAA AACGCCTCCT AGGGGAGGTC GTCGTGGTGA CGATGATTAC GCAGATGAAA ACAGAGCAGA GAACTTTGCC GATCCCGCTG AGTCGTTTGG TGACGAGATG CAGTCGGACA GCGATGACGC AACTCGCATC GTGCTCACAC CTCCGACCGC GGACGCTATG TTAGTAGACA CGCGTTCGCC GGCTTCGACG AGTCGCTCGT TACAGACGGC AGATGATGAG TTCGAAGAAA CGTCGTCTTC GATGCGTCGC TTGGCGCCGC TTCCGGCGGA GACGCCGGCG GTGATCGGGA GGACAGAAAG TAGGATGCCT TCGCTGGCGG CAAGGAATGA TGTGCTGGCA AAAAGTGTCG ATTACGACAA GTTCAAGAAG CTGTGCACGC ATTCGACCGA TTTGACCGTG CGACCCATCG TGTCCAATCA CAGAATTGAA GTGGTGAAAG ACGCGCCTCA CGATCCAGAA AAGTTGCCCT TCAAAGAATT GAGCGTGGAC AAAATGTTGG CGACTTCTCA GAGCGAGCTC GAAGCGTTTC TGACTCGTAT TTACCGCTCC GTGGCGCACC CTTCAAACGT TCAAGAAAAA GTTAACACTT TGGCTTATTT CGAGACGTTG TGCACCGAGG CGTCGAGCGC GAACGTCTTG GCGAATTCAT CCTTGATGTC AATGTTCGTC CGAGTGCTCG GTAGCAGCAA ATCTCCACCG TTGAAGATCA AGCTTTGCAG CATCATGGGC TTGTTGATGC GACATGCGAC GCATATTAGC GATGAGTTCG CGAAGAGTGA GGCTGCAAAA GTACTGGCGG GCACTATGAG CGACGAAAAC TCTCGCGTGA GACGCCGAGC TATGGCGGCG CTCGGGGAAC TGTCGTTTTA CGTCGCGACG CAGCAGAGGC CAGACGCATC ACAAGTATGG GGTATCACCG ACAGTGTGAT TGAGGTCTTC ATCAAGACGT TAGAAGAGGA AGAAGATGAA ATTACTAAGC ACTACGCGTG CAAGAGCATC GAGAACATAG TCTCTCACGA GGGCTCAACG TGGATACAAG ACGCATTCGC GGAGTGTCGC GTCGTAAGTG TTTTGTTTTC AATCGCGACG AATGACGCCG TAGACGACCA GCTTCGTGGT ACTTCGGCAA GTGCGCTCGC AAGAATCGTT CGGCTCAAGT CGGAAACGCT TCACCCACTG GTCGGCGAAG AGGATTGCGA TGGTTCAAAT GCACGCGATG GCCTGGTGAA AATGCTTAGA GATCGTGAGA GAAAAGTACA ACAGGCGGCA CTCAACGTGT TGTGTAGAGC GCTGGTGGAT GAAAAGTCGC GCTTGATTGC CACTATCATG GAAGTTCAAA CGCTTTTGCC CATTCTTGCT GGATTGTACG AGCGAAGTCA AACGCCCATC ATCAAAGCCA AGTCGTTGCT CGCGATGGCG CTTTTGATTC GCGCGCATCC AAAATGGTTG CAATCGCTTT GCAAGGCGCG CGTTTTGCCG ATGTTGGATC GTCAACCGAC GCAAATGGAC CCTTATTTAC AAGAATGTTT CGATACGTTC GTCACCACCG TTGTGGCCCT CGTGCCGGTC GTGAACACTG GTATCTTGCA GTACGTCGAG CGATCGCTCT CCACGGGATT GTCATCGGAA AACGCCATGC GACCGCAAGC GTTTGAGTTA TTTCCTGTGC TCATTCATTT GCTGCAATCG ACTGTCATGC GACCAAAAGT ATTGACCAGT GCGTTTGTGA TCGACATTGC GAAGTACATT CGCGCCGCTG AGGCTGAAGA GGATTACCCT GGTCGCGACG AGCTTCGCGT GGGAGTTATG GCGTTGCTCG AGACGTGCGC ACAGTGTACG AATGAGTTGA TTCATAAGGT TGATGCGTTG ACGCGTCACT TATTACCGGC GCTGTGCGAC TTACTCGAAG GAAGTGCGCA CGTTGAAACT CGTTTCTTGG CGCTGAAGCT CATCTACGAA CTCTTGCTGC CATTGCGTTT GGATTTGGAT ATCGCCACGC AAAGCGGTCT CGATCGAAAA GTCGTTGCGC ACCACTTGGA CCAGCTCTTG ATGCAAGACT TGTTCCCGAT GTGCCCTGCG TTGATCGATA GTGAAGACCC AATCCCATTG TACGCTTTGA AACTTCTCAG CGGCACGCTG GAGATTGAGC CCGCGCTTTG TCGCGAAATC ATCGCCTTGG GTTTGGCGCC AAGATTCTTT GAGTTTCTCT CGCTCGAGCA CACGAACAAC AACGAGCACA ACATACATCT GTGTCTCGCC CTCGCTCGAT GCAAAGCGCT CAGCACGCAA TCATTGTGGG ACTTCGACGC CCCTGCGAAA GTCGCGCAAG TATGTGCGTA TAGCTACGAG CGAAACGTCA CGCAGTTTGT CGAGCCCGCG CTCGGCATCG CCGCCACCCT CCTTCGCCGC GCCGCCGCCG ACGCGCCTCG CGCCGCCGGA GACGCCGTCC CATCCGAAAT CACCCCTCTC CTCGAGGTCG CGCCGACGCT TCGTCGGCTT GCAGCGACCA TACCCGACGG TGCGCTCGCG CGCGATATCA CTCATTCCCT CGACACCTTC AACGCGCATT AATTGTAACA CAGAACTCGC ACGCGAGTCA ACAGCACGCT GTCAACTCAC ACACAGATGC TCATTCGGTC AACAATCGAC CGCACACCAC GCGCGCGTCG CCGCACGACG TCGAGCGACA CAGGGAGCCG TAACGGAACC GCACGGAACG TCACCTTCCG CCGCGCGCGA ACGCTCGCGA TAGACAACCG ACGCGCGCGT TGTCGCAACG TCGACCGGGA CACGAAGCGT TTTTTCAACC GTCGCGCGCC TTCGCGTCAT CATGCCTCGA GATCACACCC GCGACGCCAA GCCGTGGACC GCGGAGGAAG ATGCAGCGTT GTTGACGCTA CAAGGCGCGC ATGGGAACAG ATGGAGCGAG ATCGCGCGCG CGATGGGCAC TCGGAGT
|
Protein sequence | MPTREYQMSD YHTYEAIGKG KHSTVYKGRR KKSIQYFAMK SVEKGQRNRV MHEVQVMKAF SHENVLKFIA CYETQNHLWL ILEYCVGGDL LTLLSQDLKL PEPSIMTFAR DMFVAARELH SKGLVHCDLK PSNMLLDEEG RIKICGFGLS RKVSTVVASS GTPTQLSRRG TPCYMAPEMF TQEGVHSYAT DLWALGCVLY ECATGRPPFT STSLTSLIEQ ILEHEPAPLP AAYGTTFKNL VSGLLIKRPH ARLTWDQVIN HEFWTERGGQ YLDDLGDLVT LRLPPQPSFD ALAAKLKSED LAARLRMLGN ARASGAVDVR DSFMPVLTQS LRDSVNVSRL SRIARSNLER EEVGSYAPSG AAAIEGDPLQ VAAIHEDVEL ENPDAELNFT APSGNDDLDV DMETEDVIIP SAPKTPPRGG RRGDDDYADE NRAENFADPA ESFGDEMQSD SDDATRIVLT PPTADAMLVD TRSPASTSRS LQTADDEFEE TSSSMRRLAP LPAETPAVIG RTESRMPSLA ARNDVLAKSV DYDKFKKLCT HSTDLTVRPI VSNHRIEVVK DAPHDPEKLP FKELSVDKML ATSQSELEAF LTRIYRSVAH PSNVQEKVNT LAYFETLCTE ASSANVLANS SLMSMFVRVL GSSKSPPLKI KLCSIMGLLM RHATHISDEF AKSEAAKVLA GTMSDENSRV RRRAMAALGE LSFYVATQQR PDASQVWGIT DSVIEVFIKT LEEEEDEITK HYACKSIENI VSHEGSTWIQ DAFAECRVVS VLFSIATNDA VDDQLRGTSA SALARIVRLK SETLHPLVGE EDCDGSNARD GLVKMLRDRE RKVQQAALNV LCRALVDEKS RLIATIMEVQ TLLPILAGLY ERSQTPIIKA KSLLAMALLI RAHPKWLQSL CKARVLPMLD RQPTQMDPYL QECFDTFVTT VVALVPVVNT GILQYVERSL STGLSSENAM RPQAFELFPV LIHLLQSTVM RPKVLTSAFV IDIAKYIRAA EAEEDYPGRD ELRVGVMALL ETCAQCTNEL IHKVDALTRH LLPALCDLLE GSAHVETRFL ALKLIYELLL PLRLDLDIAT QSGLDRKVVA HHLDQLLMQD LFPMCPALID SEDPIPLYAL KLLSGTLEIE PALCREIIAL GLAPRFFEFL SLEHTNNNEH NIHLCLALAR CKALSTQSLW DFDAPAKVAQ VCAYSYERNV TQFVEPALGI AATLLRRAAA DAPRAAGDAV PSEITPLLEV APTLRRLAAT IPDGALARDI THSLDTFNAH
|
| |