Gene OSTLU_47583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_47583 
Symbol 
ID5005510 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp237238 
End bp239284 
Gene Length2047 bp 
Protein Length328 aa 
Translation table 
GC content63% 
IMG OID640420931 
Productpredicted protein 
Protein accessionXP_001421243 
Protein GI145353913 
COG category[A] RNA processing and modification
[D] Cell cycle control, cell division, chromosome partitioning
[K] Transcription 
COG ID[COG5147] Myb superfamily proteins, including transcription factors and mRNA splicing factors 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.164071 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TGGAGCGAGA TCGCGCGCGC GATGGGCACT CGGAGTGGAC AACAGTGCGC ACAGCGGTGG 
CGACACAAGG TCAACCCAGG GATCCGACGG GAGCGGTGGA GCGAGGAGGA GGATGAAAAG
GTGCGAATGC GGTCGATAAC GTGAGCAAAT GATATTGGTT ACATGCGTTG TGTGCGCAGA
ACACAATTTT GGTCGCGTGG CGGGAGAGAT ATCGTCGAGT GGTCCCGCGC GGTCCCGCGT
CGTCTCAAAC GATTTGACTG ACGAAGAGAG CGATTTGATT TGACGTGACA GTTGAAGACA
CTGAAAGAGC GCTACGGGTC GAGATGGGCA ACGATCGCGC GTGAAATGGG TGGTCGCACG
GATCAGCAAT GTATGGGACG GTGGAGACGA CATCTCGATC CGACAGTGAC TCGCGGCGCG
TGGGCTCGGG ACGAAGACGA GCTCTTGTGC GGGTTGTACG ACGAGTACGG TCCGCGATGG
TCATTCATCT GTCAGAGCGT TCCGGGTCGC ACCGCGCAAC AATGTCGCGC GCGATGGTTT
CAAGTCGACG GTAAGCCCAG GGAAGAGCGC GATCATCGCC CGTCCAGGCG GCAATCGCCC
GTCGAGACTT CATCTCATGG ACCCGCGGAG GATCATCCAG CGCGGCGATT TTCCGATCAT
GAAGGGTTGG TGACGCGTAC TTCGATCGAT TCGATGCCCA TGGCGAGTCT GTCGCCAACA
CCAGTCGCTG AAAAACGACC ATTCAGTCAC ATTCTGGCCG AAGTCAGCGC GCGCACGACG
CAAAAGACTG GATCGTTAAT AGAATCCGCC TCCATAGCGA CTGCGCTCGG GCGCGCTTCG
CCGGCGACTC TCTCAAAGCG CAAACAAGTA TCGACAGCGC TGGATCCGAT GTCGCTTTGG
CGCGACGTCG GCGGTACGAA AGCCGTCGGT AGCTTGGCGC CGACACTCGT GGAAGATGGT
AAGCGCAAAC GCGGACGCCC GTCGAAGACG GTGGACGCGC CGCGGTCGCC GATGGCGAGG
CTTTCGGTAC AATCTCCTCG CGCATCATCG GTGTATGCAC CGCGAACGCG CGCGCGAGCG
TCGACGGCAT CTGCATCATC CAAATCTACG CCTCATTCGT CAGTGATTGG TCGATCGACA
GAAGAGGATA AACTCTCCGT TTTGCTCGGC GTCGCGCTCG GTCGAAGCGA CGGCGCGGCG
CCGCGTTGAA TCGGCTAGGG AAAAATAATG TTGGTTTGCT GCTCGTGTTG TAACGACAAA
CTACGAAGAG AAAAACGTTT TTATGAGCGC TGGATTCGGG AGTCGATAGC TACGCCGCGA
CGCACTCGAG GAGCGCTTGA AAGTCCGCCT CCTCGTTCGC GTCGGGGAGG GCGTCAAAGC
GCGCGTCGTC GAGCCACGCG AAATCGTCGA TGTCAATCCC ATCGAGGTCA TCTTGACTTT
CTGAATGAAT CCCCGATTCT GATTCCGCGC CGCGGGCGAG GTCGCCGTCC AGGCTCCCGA
GCGCGCCCTC GACGTCGTCC GAGTTCGCGC TCGGCGAGTT CGCCCGTCGT CGTCGTTTGC
GGTTGGCATA TTCATCGCGC TTTTCTCGCC GGCGCGTGGC TCGTCCCCGC GCGTCGCTCC
AAACGGGCCC TCTCCGACGC GTCGACGAGA CGAATCCCTT CGTCTCCTCT TCGCGTCTTC
GATAGCCCTC GCGACGCGCG CGAGTTCCGA GCGCGTAGTC CGGCTTCGCG ACCGCGTATA
ATCCGCCCCC GACGACGACG GCGTCCGCGC CGGCGTCCGC TCCCCATCGC TCGCGCGCGA
CGACGCCACC CTTCCGTCCC ATCCCGCGCG CCGCTCGCGC GTCAGCTCCG ACAGCGCGTC
CTTCAGCACC GCGTTCTCCG TCGCGAGCGC GCGCGTCAAC CGCTCGAGCG CGTCCACGCG
TCGCCGCGCG TCCCGCTCGA GCGCCCGCTT CCGCGCCCGC GACGCCGCGC TCGCCGCGCG
ATTCGCCTCG AGTCGACGCG CGCGTCGCGT CGCGTCGTCC GCGTCGCGCG TCGCCATCGC
GCGCGCG
 
Protein sequence
MGTRSGQQCA QRWRHKVNPG IRRERWSEEE DEKLKTLKER YGSRWATIAR EMGGRTDQQC 
MGRWRRHLDP TVTRGAWARD EDELLCGLYD EYGPRWSFIC QSVPGRTAQQ CRARWFQVDG
KPREERDHRP SRRQSPVETS SHGPAEDHPA RRFSDHEGLV TRTSIDSMPM ASLSPTPVAE
KRPFSHILAE VSARTTQKTG SLIESASIAT ALGRASPATL SKRKQVSTAL DPMSLWRDVG
GTKAVGKRKR GRPSKTVDAP RSPMARLSVQ SPRASSVYAP RTRARASTAS ASSKSTPHSS
VIGRSTEEDK LSVLLGVALG RSDGAAPR