Gene OSTLU_41266 details


Gene Information

Locus tag: OSTLU_41266
Symbol:
ID: 5002224
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009360
Strand:
Start bp: 283110
End bp: 284270
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table:
GC content: 60%
IMG OID: 640417645
Product: predicted protein
Protein accession: XP_001418431
Protein GI: 145347969
COG category: [R] General function prediction only
COG ID: [COG0724] RNA-binding proteins (RRM domain)
TIGRFAM ID:
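
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the unspliced CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 283110
end_bp = 284270

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS ending in one stop codon,
# which is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1161
print(protein_length)  # 386
```

Under these assumptions both results agree with the listed Gene Length (1161 bp) and Protein Length (386 aa).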


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0574303
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0646612
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGACGCGTTC GAGGCGTCAT GTCGGACGAA TCCGAGGCCA GGGAATTCCC GGTGGTGTGC 
GAGGACTGCC TCGGCCCCAA CCCGTACGTG CGCGTGCAAA AGATGCCGCT CGGCGGCGAG
TGCGCGATCA GCGGTCGCCC CTTCACCGTG TTTCGCTGGC GACCCGGGAA CGAGGCGAGG
TACAAGAAGA CGGTGGTGTG CAAAGAGATC GCGCAGGCGA AGAACGTGTG CCAGGTGTGC
CTGCTGGATT TAGATTACGG GATACCCGTC GCCGCGCGGG ACGCCGCGCT GGGACGCGCG
GGAGGGAGCG CGCTGCCGTC GAGCTCGGTG AACCGGGATT TCGCGGTGAA TGAGATCGCG
AAAAAGCTGG ACGAGGGCGA GGACGCGTAC GAGAAGGATG GGAAGGAGAA AAATAACGAA
CTGTTGATGC GGTTGGCGAG GAAGAAGCCG TATTATAACA AGAATAAAAC GCCGATATGC
ACGTTTTGGT TGAGAAACGC GTGCACGAGG AACGATTGTC CGTATCGACC TTGCAACGGG
GATACGCACA TGCCGGAACT GAGCGCGGCG CCAGAGTTGA GAAAGCAAAA TATTAAGGAT
AGATACTTCG GGACGAACGA TCCGGTGGCG GAACAAATGC TCAAACGCGC GAAAGAGCGA
CCGAGCCAAA AGTTGACGCC ACCCGAAGAT GCGAGCATCA CCACGTTGTT TGTAGGCGGC
GTCGACCCGG AAAAGGTCAC CGAGGACGAC ATCAACTCGC GCTTCTATCA GTACGGCGAA
ATCAAGGGCA TTCGCGTGAT TGGGACGAAG AAATGTGCGT TCATCACTTT CGCCACGCGC
GAAGGTGCGG AGAAGGCGGC GGAAGATGCG GCGATAAATC TCGAAATCAA CGGAGAGCGA
TGCCGACTCC AGTGGGGCAA GTCGGCGGCG AAAAAAGCGA GCGGCAACCA AGGGTCTGCG
CCGGCACCGC CACCAACCGT GATGATGATG GCTCCAGGTG TGGAAGCTCC AGCGAATGGG
CAGGCTTTAC CGCCAGATAT GCCGGCGCAT GTGGCGATCC CCATGCCTGC GCCGGCCGCG
GTGGGGCACG CGACCAAGTA CCCGTCGATG GATCCTTCGC AGATGGGAGC GGTTTCAAAG
AAGCAGGAAG CCGAGAAGTA G
 
Protein sequence
RRVRGVMSDE SEAREFPVVC EDCLGPNPYV RVQKMPLGGE CAISGRPFTV FRWRPGNEAR 
YKKTVVCKEI AQAKNVCQVC LLDLDYGIPV AARDAALGRA GGSALPSSSV NRDFAVNEIA
KKLDEGEDAY EKDGKEKNNE LLMRLARKKP YYNKNKTPIC TFWLRNACTR NDCPYRPCNG
DTHMPELSAA PELRKQNIKD RYFGTNDPVA EQMLKRAKER PSQKLTPPED ASITTLFVGG
VDPEKVTEDD INSRFYQYGE IKGIRVIGTK KCAFITFATR EGAEKAAEDA AINLEINGER
CRLQWGKSAA KKASGNQGSA PAPPPTVMMM APGVEAPANG QALPPDMPAH VAIPMPAPAA
VGHATKYPSM DPSQMGAVSK KQEAEK
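
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. The Translation table field of this record is empty, so the code below assumes the standard genetic code; `translate` and `CODON_TABLE` are hypothetical helper names introduced only for this example. It translates the first and last lines of the gene sequence as listed above.

```python
from itertools import product

# Assumption: standard genetic code (the record leaves "Translation table" blank).
# Codons are enumerated in T, C, A, G order to match the amino-acid string.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
first_line_start = "CGACGCGTTC GAGGCGTCAT GTCGGACGAA"
last_line = "AAGCAGGAAG CCGAGAAGTA G"

print(translate(first_line_start))  # RRVRGVMSDE
print(translate(last_line))         # KQEAEK
```

Both fragments reproduce the corresponding ends of the listed protein sequence, and the final `TAG` codon is read as a stop. The last line can be translated on its own because the 1140 bases preceding it are a multiple of three, so it starts in frame.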