Gene OSTLU_31221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31221 
Symbol 
ID5001336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp513616 
End bp516029 
Gene Length2414 bp 
Protein Length727 aa 
Translation table 
GC content58% 
IMG OID640416757 
Productpredicted protein 
Protein accessionXP_001417267 
Protein GI145345544 
COG category[R] General function prediction only 
COG ID[COG0724] RNA-binding proteins (RRM domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.005519 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTCGGATGGA TGTCGACGAC GCGCGCGACG GCGGCGACGA GGACCGTTCC GACGACGACG 
ATGACGCGCG CGAGGACGCG ATCGACGCGA GCGAAGGCTC GAGCGAGGAC GACGAGGACG
AGGACGACGA CGAGGACGAC GACGAGGACG ACGATGATGT AGAATTGGAT GATGCGCAAT
TGGACACGAT AATGACGCTC GAACAAGAGC TCGAGAGCGA CTCCTGGCGC GATTACGCGA
AAGCGGGGAG ATTGATTGAT TTGTTGCGCG CGGCGAACCT GCGCGAGCGG TGCAGAACGG
CGAGGGAACG ATATAACGAA GCGTTCGCGA TGGACGAGGC GCGCTGGTCG GCGTGGATTC
AAGACGAATT GGCGAATAAG ACGACGGGGA CGAAACGCGA ACGGCACACG CGATGCGATG
AGTTGTTCGC GAGGGCGATC GAAGAGTGTG GACGGACGAG CGTGCGGTTG CACATGGGAA
GGTCGAGGAA CGCGATGGAG CTGGAGGCGG ATGAGAGCGA CCGACGCGCG CTTTACGAGA
CGGCGACGGC GGGACCGGGG ATGAACTTTA ACGATGGGCA TTTGATTTGG CAGGCGTATC
GAGCGTTTGA ACTGTCGTGG GCGGCGTCGG AAACGCAGAA AGTGCGGGTG AAGGCGCTGT
ATTTGCGACA GTTGAAGATT CCACAAGCGC AGTCTCAGGC GACGCTCGAG GCGGCAAAGA
CGTGGGCCGC CGACGCGGGT CTTCATAACG CAGACGCCGC GTTCCACGAA GCGTACGCGG
TTGGTAATGC TGCAAAAGTA CTGCGCGAGC CTTACGAGGC GCGTCTGCTC GCCGTGACGA
GTTCGCATGA AGGCGACGCG AAGCTGCTTC GAGCGTACGC AACCTACATT GATTTTGAAA
TGGCGAGCGG ATCTCCGGAC AGAGTCGTTC ACTTGTACGA ACGCGCGCTT TCGTCGCTGC
CGTATGTCGC AGAGCTCTGG CGAGACTACG TGTTGTACGT GTGGTCGATA TCGTTCAAAT
CAGCAGAAGC AGCTTCGCGC ACTTTGATGC TTCGCGCCGT TCGCATGTGC CCGTCTAGCG
TTCTGTGGAA GTCGGTGCTC GAGCTGGAGT CGTCGTACGA CTTGTACACG CATGCGCTGA
GAACGAAGTT CAGAGATCCG AACGACTACG GTGCGGTTCT TACAAAAGTT CTCACACAGT
GCGTACGTTT GGACGATTGG GCAAAAGCGT CCGGGTGCGT GACTTTTGGT TTCGAGCAGA
TGGCCAAGGA TTATTCGCCC AACATCATGG CGGCGGCGGC GATCCATGTC TTGAAAGAAC
TAGTAGACTA CGCGTTGATG AAAGATCACA AGACGACGGT CGCGTCATTC ATAGACGCTG
TTTTTGCGCA ACTCGCGGAG CGGGCACCGT TCAAGACCGC CGCTGAATTT GTCATCCTTC
GAACTGATAC GTCGCGCCTT ATAAAGAAGA CGCAAAAGGA AGTACTAGAC CTGTACGATG
TAGCGTTACA GCGAGGAACC GTGGAACCCG TCGGCCTGTC GAAGAAGACG GTAGATGAAG
AGACCCGTCG ATTGGCTGGA ACGGCTATGC TACTGAAAGC AAAGGCGCGA TATCTGTCGA
CATTTGCACC AAAAAAGTAC GACGATTTCA ACGAAAAGAC CAACGCTATG GCCGCTGTGC
GGCGGTACGA GTTAAAACTC GCGTGTGAGC GATTGGCTGC GAACGCGGCG GCGAAAGAGC
GCATGAAAAG CGGTGGAAGC CGCCGCGAAC GCGCCGCCGC GCGTCGCGCC GGAGCCGAAC
CCATTCCACG CAAACGAACT CGCGAAATTC CAGAGGGAGG CGACAAGACT GATATGAATC
TTGCCGGAAT GGATCATGAC GCTCGTGTCA AGACGCTTTT CCCAACAAGA GACACGCAAA
CAGCCTTTGT CAAAAATTTG TCCTGGGACG TTACGGACGC CGAGCTCATG GAGTTCTTCA
CCGGCGCGGT GAGCTGTCGA ATCGTCAAGG ACAAAGCCAC TGGTCGTTCG CGAGGAATCG
CGTACGTTGA CTTTGGAGAA GAAGCTGCTC TGAACGCTGC AATCATGCGA TCCGGTGAGG
CGCTCAAGGG AAGACTGGTT GATATCGCGA AGAGTCGGCC TCCCGGTGAC GACGGACCCG
ATGGTCGCGG TGGACGTGGC GGTGGACGTG GCAGTCGTGG CGGTGGTCGT GGTGGCGGGC
GCGCCGCACC GTCTGTGGCT TCTGGACGTG GTCGCGGAGG CTTGGGCCTT ATGCCTCGCG
CGATAACGGT GACGCGTACG GACAACGGCG AAGGCGCACA AGCAAAAACT AACGCAGACT
TCAGGGCAAT GTTTGTGAAG GGATCATCGC AGTAGAGACG CTCGTAGTAG TGATGAAAAA
AACGTTGTTG TATA
 
Protein sequence
MTLEQELESD SWRDYAKAGR LIDLLRAANL RERCRTARER YNEAFAMDEA RWSAWIQDEL 
ANKTTGTKRE RHTRCDELFA RAIEECGRTS VRLHMGRSRN AMELEADESD RRALYETATA
GPGMNFNDGH LIWQAYRAFE LSWAASETQK VRVKALYLRQ LKIPQAQSQA TLEAAKTWAA
DAGLHNADAA FHEAYAVGNA AKVLREPYEA RLLAVTSSHE GDAKLLRAYA TYIDFEMASG
SPDRVVHLYE RALSSLPYVA ELWRDYVLYV WSISFKSAEA ASRTLMLRAV RMCPSSVLWK
SVLELESSYD LYTHALRTKF RDPNDYGAVL TKVLTQCVRL DDWAKASGCV TFGFEQMAKD
YSPNIMAAAA IHVLKELVDY ALMKDHKTTV ASFIDAVFAQ LAERAPFKTA AEFVILRTDT
SRLIKKTQKE VLDLYDVALQ RGTVEPVGLS KKTVDEETRR LAGTAMLLKA KARYLSTFAP
KKYDDFNEKT NAMAAVRRYE LKLACERLAA NAAAKERMKS GGSRRERAAA RRAGAEPIPR
KRTREIPEGG DKTDMNLAGM DHDARVKTLF PTRDTQTAFV KNLSWDVTDA ELMEFFTGAV
SCRIVKDKAT GRSRGIAYVD FGEEAALNAA IMRSGEALKG RLVDIAKSRP PGDDGPDGRG
GRGGGRGSRG GGRGGGRAAP SVASGRGRGG LGLMPRAITV TRTDNGEGAQ AKTNADFRAM
FVKGSSQ