Gene OSTLU_32864 details

Gene Information

Locus tag: OSTLU_32864
Symbol:
ID: 5002735
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009361
Strand:
Start bp: 769779
End bp: 771026
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table:
GC content: 56%
IMG OID: 640418156
Product: predicted protein
Protein accession: XP_001418799
Protein GI: 145348735
COG category: [R] General function prediction only
COG ID: [COG1094] Predicted RNA-binding protein (contains KH domains)
TIGRFAM ID:
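
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes a single stop codon. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 769779
end_bp = 771026

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")     # 1248 bp
print(protein_length_aa, "aa")  # 415 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.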


Plasmid Coverage information

Num covering plasmid clones: 51
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCACC TCGTCGCCGC GTCCAACGCG CGTCCGTCGA CGCGAAATGT TTCGGCGGCG 
GCGGCGGACG AGCCGGACGC CGACGCCGAC GCGGCGGCGA CGACGTCTTC GGGCGCGAAA
AAGGGCCGCT ATCGAAGAGA TAAACCCTGG GATCACGATG GCATCGATCA CTGGAGCGTG
ACGCCGTTCA CGGCGGAGGA TAACCCGAAC GGCGTGCTCG AGGAGAGCTC GTTCGCGGTG
CTGTTTCCAA AGTACCGAGA AAAATATTTA CGCGAGACGT GGCCGAGCGT GACGAAAGCG
CTGAAGGAGC AAGGGGTGAG TTGTGAATTA AACCTCGTCG AGGGTTCGAT GACGGTGCGC
ACGACGCGAA AGACTTTCGA TCCGTACATC ATCATGAAGG CGAGGGACTT GATAAAGCTG
CTCAGTCGGT CGGTGCCGGC GCCGCAGGCG TTAAAGGTGC TCGAAGACGA GACGAATTGC
GACGTGATTA AGATTGGTGG GATGGTGAGG AACAAAGAAC GATTCGTGAA GAGGCGGCAG
CGATTGATTG GGCCCAACGG CTCGACGCTC AAGGCGATCG AGATGCTCAC AGGGTGCTAC
GTGCTGGTTC AAGGGAACAC GGTGAGCGTC ATGGGCGGAT GGAAAGGTTT GAAGATGGTT
CGCAAAATCG TCGAGGACGC GATGAAAAAC ACGCACCCGA TTTATCACAT TAAAGAACTC
ATGATCAAAC GGGAACTGGA AAAAGATCCC GAGCTCGCCA CGCAAAGCTG GGACCGATTC
TTGCCGAAAT TCAAGAAGAA GAATGTCCAA CGCAAGAAGC CCGCCAAAAT CGGCAAGAAG
GAACGCGCGG TTTTCCCGCC GACCCAACCG ATGAGCAAGA TAGACAAACA AATCGAATCC
GGGGAGTACT TTTTGTCCAA AGAAGCCAAG GAACGCAAGG CAGCGTACGA CAAGTTGCAA
AAGCAAAAAG ACACGTCGAC GGACAACCAC AAGAAGCGAC AAGCCGCCTT CGTCGCGCCG
AAGGAGGACG ACAAACCGGC TCGCTCAAAG TCGTCAAAGG CGAAGGAAGA GGACGTCGAC
GCCATCACGG CATCACTCAA GGCGAAGGCG AAGGCGAAAA AGGAGGAAGA CAAGCGCTCG
AAGGCGTCAG CTTCATCCTT CGTCATGGGT GGTGAAGCGA AATCGTCCAA GCGCGATCGA
GAGGACAAGA CGGAGAAGAA GTCAAAGAAG TCGAAGAAGG ACAAGTGA
 
Protein sequence
MAHLVAASNA RPSTRNVSAA AADEPDADAD AAATTSSGAK KGRYRRDKPW DHDGIDHWSV 
TPFTAEDNPN GVLEESSFAV LFPKYREKYL RETWPSVTKA LKEQGVSCEL NLVEGSMTVR
TTRKTFDPYI IMKARDLIKL LSRSVPAPQA LKVLEDETNC DVIKIGGMVR NKERFVKRRQ
RLIGPNGSTL KAIEMLTGCY VLVQGNTVSV MGGWKGLKMV RKIVEDAMKN THPIYHIKEL
MIKRELEKDP ELATQSWDRF LPKFKKKNVQ RKKPAKIGKK ERAVFPPTQP MSKIDKQIES
GEYFLSKEAK ERKAAYDKLQ KQKDTSTDNH KKRQAAFVAP KEDDKPARSK SSKAKEEDVD
AITASLKAKA KAKKEEDKRS KASASSFVMG GEAKSSKRDR EDKTEKKSKK SKKDK
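
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a listing could be joined back into one string and checked against the record's fields is given below. The helper names are hypothetical, and the stop codon set assumes the standard genetic code, since the Translation table field is empty.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


def looks_like_complete_cds(dna: str) -> bool:
    """Check for a whole number of codons, an ATG start and a stop codon.

    Assumption: standard genetic code stop codons (TAA, TAG, TGA).
    """
    return (
        len(dna) % 3 == 0
        and dna.startswith("ATG")
        and dna[-3:] in {"TAA", "TAG", "TGA"}
    )


# Toy input in the same block layout; paste the full gene sequence
# from the listing above to check the real record.
toy = clean_sequence("ATGGCT\nCACTGA")
print(len(toy), gc_percent(toy), looks_like_complete_cds(toy))
```

Applied to the full gene sequence, the length and GC percentage can be compared with the Gene Length and GC content fields.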