Gene Information |
Locus tag | OSTLU_47583 |
Symbol | |
ID | 5005510 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | + |
Start bp | 237238 |
End bp | 239284 |
Gene Length | 2047 bp |
Protein Length | 328 aa |
Translation table | |
GC content | 63% |
IMG OID | 640420931 |
Product | predicted protein |
Protein accession | XP_001421243 |
Protein GI | 145353913 |
COG category | [A] RNA processing and modification [D] Cell cycle control, cell division, chromosome partitioning [K] Transcription |
COG ID | [COG5147] Myb superfamily proteins, including transcription factors and mRNA splicing factors |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.164071 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TGGAGCGAGA TCGCGCGCGC GATGGGCACT CGGAGTGGAC AACAGTGCGC ACAGCGGTGG CGACACAAGG TCAACCCAGG GATCCGACGG GAGCGGTGGA GCGAGGAGGA GGATGAAAAG GTGCGAATGC GGTCGATAAC GTGAGCAAAT GATATTGGTT ACATGCGTTG TGTGCGCAGA ACACAATTTT GGTCGCGTGG CGGGAGAGAT ATCGTCGAGT GGTCCCGCGC GGTCCCGCGT CGTCTCAAAC GATTTGACTG ACGAAGAGAG CGATTTGATT TGACGTGACA GTTGAAGACA CTGAAAGAGC GCTACGGGTC GAGATGGGCA ACGATCGCGC GTGAAATGGG TGGTCGCACG GATCAGCAAT GTATGGGACG GTGGAGACGA CATCTCGATC CGACAGTGAC TCGCGGCGCG TGGGCTCGGG ACGAAGACGA GCTCTTGTGC GGGTTGTACG ACGAGTACGG TCCGCGATGG TCATTCATCT GTCAGAGCGT TCCGGGTCGC ACCGCGCAAC AATGTCGCGC GCGATGGTTT CAAGTCGACG GTAAGCCCAG GGAAGAGCGC GATCATCGCC CGTCCAGGCG GCAATCGCCC GTCGAGACTT CATCTCATGG ACCCGCGGAG GATCATCCAG CGCGGCGATT TTCCGATCAT GAAGGGTTGG TGACGCGTAC TTCGATCGAT TCGATGCCCA TGGCGAGTCT GTCGCCAACA CCAGTCGCTG AAAAACGACC ATTCAGTCAC ATTCTGGCCG AAGTCAGCGC GCGCACGACG CAAAAGACTG GATCGTTAAT AGAATCCGCC TCCATAGCGA CTGCGCTCGG GCGCGCTTCG CCGGCGACTC TCTCAAAGCG CAAACAAGTA TCGACAGCGC TGGATCCGAT GTCGCTTTGG CGCGACGTCG GCGGTACGAA AGCCGTCGGT AGCTTGGCGC CGACACTCGT GGAAGATGGT AAGCGCAAAC GCGGACGCCC GTCGAAGACG GTGGACGCGC CGCGGTCGCC GATGGCGAGG CTTTCGGTAC AATCTCCTCG CGCATCATCG GTGTATGCAC CGCGAACGCG CGCGCGAGCG TCGACGGCAT CTGCATCATC CAAATCTACG CCTCATTCGT CAGTGATTGG TCGATCGACA GAAGAGGATA AACTCTCCGT TTTGCTCGGC GTCGCGCTCG GTCGAAGCGA CGGCGCGGCG CCGCGTTGAA TCGGCTAGGG AAAAATAATG TTGGTTTGCT GCTCGTGTTG TAACGACAAA CTACGAAGAG AAAAACGTTT TTATGAGCGC TGGATTCGGG AGTCGATAGC TACGCCGCGA CGCACTCGAG GAGCGCTTGA AAGTCCGCCT CCTCGTTCGC GTCGGGGAGG GCGTCAAAGC GCGCGTCGTC GAGCCACGCG AAATCGTCGA TGTCAATCCC ATCGAGGTCA TCTTGACTTT CTGAATGAAT CCCCGATTCT GATTCCGCGC CGCGGGCGAG GTCGCCGTCC AGGCTCCCGA GCGCGCCCTC GACGTCGTCC GAGTTCGCGC TCGGCGAGTT CGCCCGTCGT CGTCGTTTGC GGTTGGCATA TTCATCGCGC TTTTCTCGCC GGCGCGTGGC TCGTCCCCGC GCGTCGCTCC AAACGGGCCC TCTCCGACGC GTCGACGAGA CGAATCCCTT CGTCTCCTCT TCGCGTCTTC GATAGCCCTC GCGACGCGCG CGAGTTCCGA GCGCGTAGTC CGGCTTCGCG ACCGCGTATA ATCCGCCCCC GACGACGACG GCGTCCGCGC CGGCGTCCGC TCCCCATCGC TCGCGCGCGA 
CGACGCCACC CTTCCGTCCC ATCCCGCGCG CCGCTCGCGC GTCAGCTCCG ACAGCGCGTC CTTCAGCACC GCGTTCTCCG TCGCGAGCGC GCGCGTCAAC CGCTCGAGCG CGTCCACGCG TCGCCGCGCG TCCCGCTCGA GCGCCCGCTT CCGCGCCCGC GACGCCGCGC TCGCCGCGCG ATTCGCCTCG AGTCGACGCG CGCGTCGCGT CGCGTCGTCC GCGTCGCGCG TCGCCATCGC GCGCGCG
|
Protein sequence | MGTRSGQQCA QRWRHKVNPG IRRERWSEEE DEKLKTLKER YGSRWATIAR EMGGRTDQQC MGRWRRHLDP TVTRGAWARD EDELLCGLYD EYGPRWSFIC QSVPGRTAQQ CRARWFQVDG KPREERDHRP SRRQSPVETS SHGPAEDHPA RRFSDHEGLV TRTSIDSMPM ASLSPTPVAE KRPFSHILAE VSARTTQKTG SLIESASIAT ALGRASPATL SKRKQVSTAL DPMSLWRDVG GTKAVGKRKR GRPSKTVDAP RSPMARLSVQ SPRASSVYAP RTRARASTAS ASSKSTPHSS VIGRSTEEDK LSVLLGVALG RSDGAAPR