Gene OSTLU_50098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_50098 
Symbol 
ID5002784 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp634425 
End bp636786 
Gene Length2362 bp 
Protein Length572 aa 
Translation table 
GC content59% 
IMG OID640418205 
Productpredicted protein 
Protein accessionXP_001418998 
Protein GI145349140 
COG category[R] General function prediction only 
COG ID[COG0724] RNA-binding proteins (RRM domain) 
TIGRFAM ID[TIGR01628] polyadenylate binding protein, human types 1, 2, 3, 4 family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.442075 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.11137 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGCCG CGACGGCGAC GAACGTGACG AACGATGCGA AAGCGTCGAA CGCGACGGCG 
AACGCGGATG GGACGACGCC CGCGGCGCAA CAACCGGGGG CGGGGACGAG CTCGCTGTAC
GTCGGGGACC TGGAGACCTC GGTGACCGAG GCGCAGCTGT ACGAAAAGTT CTCATCCATC
GGACCGGTGG TGTCGATTCG GGTGTGCCGG GATTTGATCA CGCGACGATC GCTCGGGTAC
GCGTACGTGA ACTTTCAATC GCCGAACGAT GCGGCGCACG CGATTGATGT GTTGAACTTT
CAAGTCATCA ACGGGAAGCC GATTCGCGTG TTGTACTCGC AGCGAGATCC GGCGGTGCGA
AGATCTGGCG TGGGGAACAT CTTCATCAAG AATCTCGATA AGGCGATCGA CAACAAGGCG
TTGCTCGATA CCTTTGCGCA GTTCGGGACG ATCACGAGCG CCAAGGTGGC GATGGATGGC
CAAGGGAACT CGAAGGGCTA CGGCTTCGTG CAATTCGAGA CCCAAGAGGC GGCGCAGGCG
GCGATTGACA ACGTCAACGG TATGGAACTG AACGACAAGC AAGTGTACGT CGGTCCGTTC
CAACGCCGTG CCGAGCGCTC GAACACCGGC GAAGCCAAGT TTAACAACGT CTATGTTAAG
AACTTGAGCG AAAACTTGAG CGACGAAAAG TTGCGTGAAA AGTTTGCCGA GCACGGCGCG
GTGACGAGCT GCGTCATCAT GCGCGACGAA GAAGGCAAGT CTAAGGGCTT CGGCTTCGTG
TGCTACGAAG AACCTGAAGG CGCGGCGGCC GCGGTCGAAA AGCTCGACGG GTACACCGAG
GATGAAAAGA CTTGGGTTGT CTGCCGAGCG CAAAAGAAGG CTGAGCGCGA AGCCGAATTG
AAGGCCAAGT TCGACCAAGA ACGCCGTGAA CGCATGGAAA AGATGGCGGG CGCCAACCTC
TACATCAAAA ATCTTGAGGA CGGCACCGAC GATGAAAAGC TTCGCGAATT GTTCAAGGAG
TTTGGCACCA TCACCTCTTG CCGCGTCATG CGTGACGCCT CGGGCGTTTC TCGCGGTTCC
GCGTTCGTCG CCTTCTCTTC TCCCGACGAA GCCACCCGCG CGGTGACGGA GATGAACGGT
AAGATGGTCG GTGCCAAGCC GCTCTATGTC GCTCTCGCGC AACGCAAGGA AGAACGCCGC
ATGCGTCTCC AAGCGCAATT CGCGCAGCGC ATGCCGGGCG CCGGTATGCC GGGTGGCATG
GCTCCGTACA TGCCGCCGCC GGGCGTGCCA GGTGCTCCTA TGTACTACGG CCAACCGCCT
CCGGGTATGA TGCCGCCGCA ACCGCAACCA GGCTTCGGTT TCCAACCGGT GATGCCGGGC
GGACCGCGAC CGGGCATGCC GGGCATGCCG GGTTACGGCA TGCCGATGCC GCAACGCCAA
GGCGTTCCGG GCGCCCAGCG TGGCCGTGGT GGCCGAGGTG GCCCGGCCGG TGGCCGTGGC
CAGCGTCAAA ACATGCGATA CAACGCCGCC GCGGCTATGC CGATGCCGCC GATGCCGGCG
GAGGCGGCTA ACCCGATGGC TATTCTCGCT TCTCAGCTCT CCGCCGCCGC TCCGGATCAA
CAGCGCATGA TTCTTGGTGA AGCCCTCTAC CCGCTCATTG AGAGCAAGGA CGCCGCCAAC
GCCGCAAAGA TCACGGGTAT GTTGCTCGAG ATGGACCAGT CGGAGGTCCT TCACTTGATC
GAGTCTCCGG ACGCGCTCAC CTCCAAGGTT CAAGAAGCCC TCGCTGTGCT GAAGGCTGCC
GCCGAGGAAG GTGCGTAATC GCTCCGTCGA TGAAGTATGC ACTTCAAACT CAGTGAGTGA
GGATACGTTT GGTATGAGAA TTCTTTGGCG AGTCTTCTCG TGCGCGCGCA CGAGAAGAGG
CACAACATTT TTAGCGAGCT GCGGGCCGCG CGAGCGCGAG CGAGCTGTCG CGATATTTCA
TCGCGACGTC TCGTCGACGC TTTGACGCGG CCCGCGGCAA CGATACGATA GCACCAATAG
GCGGGCGCCA TTGGGGGGTG CGAGACGGTA TTTCTTTTGT CTATGCGCTC AACCGCGACC
ATGGTGTCCA CAGCCGCCGC GAGAATCACG AATGCTCTCG CCGCGAGCCG ACTGCGGACC
AGAAACTAGC GTGATGGACT CCATCACGTG CATGAAAAAA ACGAAAAAAA AAGGGCGACG
CTGTTGCGCT TTTGAGCCAA GAACAGGTCG CAGTATCAAG ACTTTAAGAC GCATACGTTG
CTGTCTGCAC TTTTGACTCT CGAAATCCAC CAATACGAGC AGCAAATTTT CAAAATATCG
ATACAAAAAA TACAACTCTC TA
 
Protein sequence
MSAATATNVT NDAKASNATA NADGTTPAAQ QPGAGTSSLY VGDLETSVTE AQLYEKFSSI 
GPVVSIRVCR DLITRRSLGY AYVNFQSPND AAHAIDVLNF QVINGKPIRV LYSQRDPAVR
RSGVGNIFIK NLDKAIDNKA LLDTFAQFGT ITSAKVAMDG QGNSKGYGFV QFETQEAAQA
AIDNVNGMEL NDKQVYVGPF QRRAERSNTG EAKFNNVYVK NLSENLSDEK LREKFAEHGA
VTSCVIMRDE EGKSKGFGFV CYEEPEGAAA AVEKLDGYTE DEKTWVVCRA QKKAEREAEL
KAKFDQERRE RMEKMAGANL YIKNLEDGTD DEKLRELFKE FGTITSCRVM RDASGVSRGS
AFVAFSSPDE ATRAVTEMNG KMVGAKPLYV ALAQRKEERR MRLQAQFAQR MPGAGMPGGM
APYMPPPGVP GAPMYYGQPP PAGHAGQGVP GAQRGRGGRG GPAGGRGQRQ NMRYNAAAAM
PMPPMPAEAA NPMAILASQL SAAAPDQQRM ILGEALYPLI ESKDAANAAK ITGMLLEMDQ
SEVLHLIESP DALTSKVQEA LAVLKAAAEE GA