Gene OSTLU_34381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_34381 
Symbol 
ID5001112 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp392208 
End bp393338 
Gene Length1131 bp 
Protein Length319 aa 
Translation table 
GC content56% 
IMG OID640416533 
Productpredicted protein 
Protein accessionXP_001416933 
Protein GI145344842 
COG category[R] General function prediction only 
COG ID[COG0724] RNA-binding proteins (RRM domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.693095 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.168449 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACTCTG ATGGTCAAGG TTACGCGGCG ACGACGCCGT CGTCGTCGAG GAAACACGCG 
AATTTGTACG TGAAGAACAT CTCCGAACGC GTCGACGAGT TGACGCTTCG AAGGCTCTTC
GAGGCGTGCG GGGAGGTGCA ATCGTGCTGC GTCATTCGCG ATGTGTCGAC GAATAAGAGT
CGAGGATTTG GGTTCGTTAA GTTTGTCAGC ACGGCGCGCG CTGAGGACGC GATCGAGCGA
TTCAACGGTA AGGAATACGC GGGAAAGATG CTCGAGGTGA AATTCGCGAA CACCGACGGC
GAGAGCGACG GCGCGGGGGG CGCGGCAAAC GCGCCGCCGA GTGATAACGT CTACGTCAAA
GGTCTGCCCC CTTCTTGGAC GCACGATGAT CTGAAGGCAT TTTTTACGCA TTTTGGGCAC
ATTGTTGAGT GCCGTTTGCT CCACGCAAAC AGGAGCACGT CGAGCGGGGC GTTGATTCGA
TTTTTGCGCG AGTCCGAAGC CACGGCGGCG GTCACGCGCG CCAACGGGCG CTTGCTCGTC
CCCAATGGGC CGCCACTCGT CGTGCGCTAC GCCGAGGCGC AAGGAAAAAA TAATAAACGT
TCAACGCAAG TTGTACCCGT GAATACTCAG CGCCTGTCGA ACAACACGGA CGACTCCGCG
CACGACGGGG GGGAGTTGAT CGACGTCCTC GGCTCGAGTA TGAATCTCAA CCGCATATCT
TCCCAGAGCG GTCTCACTGA ATTACTCAGT GTTGGCCCGC AAGACGGAGA CGACTTTGCC
GCGCTTGCGG TGCTCGGTTC GTCCCCTTCG CATAAATTCG ATGCTCCTAT GCAATCCCAT
CAAAAGTTCG CGTCGGCGAC TTCGATGGCG CAAGGTGGCG CTACGATGTG TATCCAAAAT
CTTCCACCCG CTGCGGATGA ATTATTCCTG TACAAAACAT TTGCTCCGTT CGGAGCTATC
AACTCCGTCC AAATTGTCCG CGACGATTGG ACTGGTCTTT GTTCTGGCGT CGCTGTGATA
AACTTTCGAA GTTACTCCGA CGCTTGCGAC GCTCAAAGAG CGTCTCAAAA CGGAAAGAGC
AGGCTGAGCA TTTCTGTTCA GCTTCAGACG GCGAATCTGG CGAACTTTTA A
 
Protein sequence
MYSDGQGYAA TTPSSSRKHA NLYVKNISER VDELTLRRLF EACGEVQSCC VIRDVSTNKS 
RGFGFVKFVS TARAEDAIER FNGKEYAGKM LEVKFANTDG ESDGAGGAAN APPSDNVYVK
GLPPSWTHDD LKAFFTHFGH IVECRLLHAN RSTSSGALIR FLRESEATAA VTRANGRLLV
PNGPPLVVRY AEAQGKNNKH DFAALAVLGS SPSHKFDAPM QSHQKFASAT SMAQGGATMC
IQNLPPAADE LFLYKTFAPF GAINSVQIVR DDWTGLCSGV AVINFRSYSD ACDAQRASQN
GKSRLSISVQ LQTANLANF