Gene OSTLU_38809 details


Gene Information

Locus tag: OSTLU_38809
Symbol:
ID: 5001797
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 695771
End bp: 696954
Gene Length: 1184 bp
Protein Length: 319 aa
Translation table:
GC content: 62%
IMG OID: 640417218
Product: predicted protein
Protein accession: XP_001418087
Protein GI: 145347251
COG category: [R] General function prediction only
COG ID: [COG0724] RNA-binding proteins (RRM domain)
TIGRFAM ID:
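The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the span runs from the start codon through the stop codon. The function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(695771, 696954)

# Coding bases implied by the protein length: 319 residues plus one stop codon.
coding_bp = (319 + 1) * 3

# Bases in the genomic span that are not accounted for by the coding sequence.
non_coding_bp = gene_length - coding_bp

print(gene_length, coding_bp, non_coding_bp)
```

Under these assumptions the genomic span is longer than the coding length implied by the protein, which is consistent with the record marking the gene as spliced.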


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.000594517
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCCGA TCGCAGGCGG GCGAATCACC GCGGGCGCGG GCGTGAACTT GCTCGGGCAG 
CACGGCGCGG ATAGGAACGC CGAGGCGACG GTGTACGTCG GGAATCTCGA CCCGCAAGTC
ACGGAGGAGG TGCTGTGGGA GCTGTTCCTG CAGGCGGGAC CGGTGACGAA CGTGTACGTG
CCGAAGGATA GGGTGACGAG CACGCACCAG GGGTACGGGT TCGTGGAGTT CAGAAACGAG
GAAGACGCGG AATACGTGCG TCGAGCGACG CGACGCGACG CGACGCGACG CGATGGGATG
CGAGATCGGT GACGATGATG ATAAAAATAT CCGCGCTTCT GATGAGTTCA CGCTCGTGAG
GTTAGACGAT ATTATGGTTG AATTTCTTCG GCTGAGTCAC CGCGCGATGG GAGACGCGAC
GCGATCGAAG CTCGACGTGG AGACTGACGA GACGGTGCGA TGGCGATGTG CGCGTGTAGG
GTATTAAAAT TTTGAACATG GTGAAACTGT TCGGGAAACC GATCAAGGTG AACAAGTCGG
TGGGCGACAG ACGCGACGAA GTCGGCGCCA ACTTGTTCAT CGGTAACCTC GATCCAGACA
TCGACGAAAA GCTGTTGTAC GACACCTTCA GCGCGTTCGG GGTGGTGATT AACACTCCTA
AAATCATGCG CGACCCGGAT AACGGGGCGA GCAAGGGATT CGGATTCGTG GCGTACGATT
CGTTTGAGGC CTCGGACGCC GCCATCGAGG CGATGAACGG ACAGTTCTTG TGCAACAAGC
AAATCAACGT TCAGTACGCG TACAAGAAGG ATAGCAAAGG CGAGCGTCAT GGGTCACAAG
CCGAGCGTTT GCTCGCGCAA TCGATCGAAC GACCGACGAT GGTTCGCCCG CACACGTTGT
TCAGCGCCGG CCCCTCGAGC ACGCCCGCGG CGATGGGCGG CATGATGGGT TTGCCGCCGC
CTCCGCCCGG AATGGTCGGC ATGCCGCCGC CGCCGCCGGG AATGATGGGT GGAATACCAC
CACCACCCAT GATGGGCGGT TTCGCCCCCG TCGGTGGGTT CGCGATGCCG CCGCCCGGCG
TCGCGCCGCC GCCCGGGACG ATGAACGCGC CGCCGCCGCC CGGGATGATG AACGTGCCAC
CGCCGCCCGG GATGATGAAC GTCCCGCCGC CGCCGCCGCA ATAA
 
Protein sequence
MAPIAGGRIT AGAGVNLLGQ HGADRNAEAT VYVGNLDPQV TEEVLWELFL QAGPVTNVYV 
PKDRVTSTHQ GYGFVEFRNE EDAEYGIKIL NMVKLFGKPI KVNKSVGDRR DEVGANLFIG
NLDPDIDEKL LYDTFSAFGV VINTPKIMRD PDNGASKGFG FVAYDSFEAS DAAIEAMNGQ
FLCNKQINVQ YAYKKDSKGE RHGSQAERLL AQSIERPTMV RPHTLFSAGP SSTPAAMGGM
MGLPPPPPGM VGMPPPPPGM MGGIPPPPMM GGFAPVGGFA MPPPGVAPPP GTMNAPPPPG
MMNVPPPPGM MNVPPPPPQ
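
The sequences above are printed in groups of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the database itself derives its GC content figure is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence block, copied from the record.
first_line = "ATGGCGCCGA TCGCAGGCGG GCGAATCACC GCGGGCGCGG GCGTGAACTT GCTCGGGCAG"
seq = clean_sequence(first_line)

# Length of the joined line and its GC content as a rounded percentage.
print(len(seq), round(100 * gc_fraction(seq)))
```

Passing the full gene sequence block through `clean_sequence` in the same way gives a string whose length can be compared with the Gene Length field, and whose GC fraction can be compared with the GC content field.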