Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31513 |
Symbol | |
ID | 5001761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 73997 |
End bp | 76395 |
Gene Length | 2399 bp |
Protein Length | 756 aa |
Translation table | |
GC content | 57% |
IMG OID | 640417182 |
Product | predicted protein |
Protein accession | XP_001417909 |
Protein GI | 145346879 |
COG category | [A] RNA processing and modification [D] Cell cycle control, cell division, chromosome partitioning [K] Transcription |
COG ID | [COG5147] Myb superfamily proteins, including transcription factors and mRNA splicing factors |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGCGCGCGTC ACGACCGACC GACCGACGCG CGCGGTCTCG CGACGAAGAC GAAGGTAAAG ACGCTTTCAG CGCCGCGCCC TTCGTGCGAA CGACGACGCC GACGACGACG GAAGAGAGAC GCGAACCATG CGCATCCAAA TCAAAGGCGG CGTGTGGAAA AACACCGAAG ATGAGATCTT AAAGGCGGCG GTGATGAAGT ACGGCAAGAA CCAGTGGCCG AGAATCGCGT CGCTGTTGAA TCGGAAGTCG GCGAAGCAGT GCAAAGCGCG ATGGTTCGAA TGGCTCGATC CGAGCATCAA AAAGACGGAG TGGACGCGGG AGGAGGACGA AAAGCTGCTG CACCTGGCGA AATTGATGCC GACGCAGTGG AGAACGATCG CGCCCGTGGT CGGACGGACG CCGAGTCAGT GCTTGGAGAG ATATGAGAAG TTACTCGATG CGGCGTGCGC GAAGGATGAC GATTACGACG CCGGAGATGA TCCGAGAAGA CTGCGTCCGG GGGAAATTGA CCCTAACCCA GAGACGAAGC CGGCGAAACC GGATGCGGTG GATATGGATG AAGACGAGAA GGAGATGCTC GCTGAGGCTC GAGCGAGACT TGCGAACACG AAGGGTAAAA AAGCGAAGCG AAAAGCACGG GAGAAGCAGT TGGAGGAGGC GAGAAGGTTG GCTGAGTTGC AAAAGAAGCG CGAACTGAAG GCGGCAGGAA TCGCACACGT GCGGCGCGCA AAGCGCGTTC GAGGCGTGGA TTACAACGCC GAAATCGCGT TTGAACGAAA GCCGGACGCA GTGATGTACG ATACGCGCGA AGAAGACGAA GCATTTGCAA AGCAGCAATC TGCAAAGGTG TTTAAACCAA TTTCGCTCGC CGAGCTCGAA GGGAAGAAGA GCGCAAAGCA ATTAGACGAA GAGAGCAAGA AGCGTGAGGC GGCGAAACAG AAAATGCAAG AGCGTCGCGA TATGCCCGGT GCAGTACAAC AAGCGCTCAA AGTGAACGAC GCGTCGTTTT TTCGACGATC GAAGCTCATG TTACCGACGC CGCAAGTGTC TGACCGAGAG CTGGAGGACA TCGCAAAGAT TGGCAAAGGA GGCGTCGGCT TGCTCGACGA CGGCAGCGCG ACGCCTGCAT CTGGGCTATT AGGATCATAC GGGCAAACGC CGGCGACCTC GTCCGGATTA GCCGGGCGAA CGCCGATGCG AACGCCTCAA GTCGGGGGCG ACGCGATTTT GATCGAGGCC CAGCAGCAAG CCGCCCGACG TCAACAACAG TCGACTTTAT TCGGTGGTGC TGAAGAGGCG GCGGCCGTCA TGCCCACTGA CTTTGCTGGT GCGACGCCAA GCCACGCGAA AGCGGCACCG ACGCCGTCGC GAAGTGATGT GAGCTCGCAC ATGGGCGCGA CGCCGTCACT GCACGGGCAA ACGCCCATTC GCGACGGATT GAACATCAAC GACCAGTATG CCTCGCACTT CGGCGATCTC TCGGCGCGAG AGCGACGTGC ACACACTGCG TCGACTGCTG CCTCATTGAA GAGTGCGTTC ATGTCGCTTC CGAAGCCACA AAACGAATAT CAAATTGATC TCCCAGACGA GCCGATGGAA GACGAGCCGA TGGAGGACGC CGTCGTGGAA GATGAAGCCG ACGTTCGTGC GCGCGAAGCC GCGGCCTTGG CGGAGTATGA AGCGATTCAA CGCCGTAAGC GCTCCCAAGC TGTCCAGCGC GATTTACCTC GACCGACTGA GTTGACGCCA GTCGCGCCGC TCGCCGAGGA TTCGATTTCG AAGCTCGTCA ACGAAGAGGC GCACGCCTTG CTTGAGAACG ACATCGCCAA ATATGGCAAG AAGTCCAAGT CCGCGCCCGC ACTCGAAGAT TTTGACGAAT CTTTGCTCGT GGCGGCCCGT CGACTCGTCG ATACGGAGGC TGATGAGATG TTGCGAGAGC AAAACGTGTC GAGAGAAGAT TTCGCCGAAG CCTTCTCCGC TGCGCTCGTC GCGGAGCGGA AGAAACTTAT TTTCGTACCG AGTCTCAATG CGCAGATCTC TGTCGATGAA GCGTCGAAAG AACAACAGCT CGAAGCGGCA AAAGCGACTT TCGAACTCGT TAGAGGGGAA ATGGAGAAGG ACGCAAAACG CGCAGCCAAG TTGGAGCAAA AATGTATCCT CCTCACGGCC GGGCTTCAGA AGCGGAATGG AGAATTGTGC AACAAACTGA AGAAGACTGT GGAGGAAGTC AAGGCTTTGT CCACTGAGGC GGCGTCTTAC GCCGTTTTGC ACGTGCAAGA AGAACGCGCA GCGCCTAATA GAATCGAATA CTGGCTCGAG CTTGTGGAAG CCGCGCGAAC GCGCGAAAAG CTTCTTCAAG AGAAGTTTGA GACCCTCACG CGACAGTTGA ACGCGTAAC
|
Protein sequence | MRIQIKGGVW KNTEDEILKA AVMKYGKNQW PRIASLLNRK SAKQCKARWF EWLDPSIKKT EWTREEDEKL LHLAKLMPTQ WRTIAPVVGR TPSQCLERYE KLLDAACAKD DDYDAGDDPR RLRPGEIDPN PETKPAKPDA VDMDEDEKEM LAEARARLAN TKGKKAKRKA REKQLEEARR LAELQKKREL KAAGIAHVRR AKRVRGVDYN AEIAFERKPD AVMYDTREED EAFAKQQSAK VFKPISLAEL EGKKSAKQLD EESKKREAAK QKMQERRDMP GAVQQALKVN DASFFRRSKL MLPTPQVSDR ELEDIAKIGK GGVGLLDDGS ATPASGLLGS YGQTPATSSG LAGRTPMRTP QVGGDAILIE AQQQAARRQQ QSTLFGGAEE AAAVMPTDFA GATPSHAKAA PTPSRSDVSS HMGATPSLHG QTPIRDGLNI NDQYASHFGD LSARERRAHT ASTAASLKSA FMSLPKPQNE YQIDLPDEPM EDEPMEDAVV EDEADVRARE AAALAEYEAI QRRKRSQAVQ RDLPRPTELT PVAPLAEDSI SKLVNEEAHA LLENDIAKYG KKSKSAPALE DFDESLLVAA RRLVDTEADE MLREQNVSRE DFAEAFSAAL VAERKKLIFV PSLNAQISVD EASKEQQLEA AKATFELVRG EMEKDAKRAA KLEQKCILLT AGLQKRNGEL CNKLKKTVEE VKALSTEAAS YAVLHVQEER AAPNRIEYWL ELVEAARTRE KLLQEKFETL TRQLNA
|
| |