Gene OSTLU_42368 details


Gene Information

Locus tag: OSTLU_42368
Symbol:
ID: 5003289
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009362
Strand:
Start bp: 302538
End bp: 304439
Gene Length: 1902 bp
Protein Length: 563 aa
Translation table:
GC content: 57%
IMG OID: 640418710
Product: predicted protein
Protein accession: XP_001419339
Protein GI: 145349849
COG category: [A] RNA processing and modification
COG ID: [COG5107] Pre-mRNA 3'-end processing (cleavage and polyadenylation) factor
TIGRFAM ID:
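
The gene length in this record equals the inclusive span between the start and end coordinates. The following is a minimal Python sketch of that arithmetic, assuming 1-based inclusive coordinates; the helper name `span_length` is hypothetical and not part of any database API. It also compares the span with the bases needed to encode the 563 aa protein. The surplus is consistent with the record marking the gene as spliced, although the record itself does not list intron positions.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(302538, 304439)

# Bases needed to encode the reported protein (stop codon not counted).
coding_bases = 563 * 3

print(gene_length)                 # 1902
print(coding_bases)                # 1689
print(gene_length - coding_bases)  # 213 bases not accounted for by the protein
```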


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.0945405
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.143131
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGATTGA AGGCTGTCGA TCTCAAGAAG CGCTGCGCCG ACGCTGGGCT CGACGATAGC 
GGCACGAAAG CCGATCTCGT GGGAAGATTA CTCAATCAAG CGTCAAACGA AGATTCTTCG
CCCCCGGACG CCGGCGATGC GCCCGAACGC GTCGTCGCGA CGGACGCCGA CGACGCAAAG
TTAGAAGATG CGGACGTGGA TGCGGTCGAA ACCGACGGGA GCGGCGATCA CGATGATTTG
CTGACGCATT TAAGAGAGAA TCCGTCAGAC GCCGCCGGGT GGGAACGGTG CGCGAGACTG
GCGTTGGCGA CGACGATTCC GAGCGCTCGA CCGCTGTTCG ACGCCATCAC CGAACAGTTC
CCTCGATCAT CGTTGGCGTG GTGCTGGTAC GTCGACGCGG AGCTTTCGAA GAACGATGCG
GGGACGCCGG ACGACGAAGC GATCCGCGCC ATTTTCGGGA AGTGCTTGAT ACCGTGCCCG
AGCGCGTTGC TCTGGCGAAG ATACGCGTCG TACATGGCGA GCACGAACGA TGTGACGACA
GAAGAGGGGG TGAACACAAT GAAATCGGTG TACGAGTACT CCGTGGACGT CGTCGGCGAG
GACGCGGACG CTGGTGATCT TTGGATGGAC TATTGTCAGT TCTTGCGCAG CACCGAGGCG
ACGCTGATTG TCACCGACGT CGCTGTGGAG CAAGCGCCGA GTGCGAGAGA TATGATCGTT
CGCAGGACGT ACCAGAAGGC GATTTCAGTG CCGATGCACA AATTAGATGC AGTTTACAAG
GTTTACGAAG CGTTTGAGCT TGAGAAGAAC AAGGCGCTCG CGAGGGCGTT GTTGCAGGAA
ATAGCGCCAA AGTTGTTGCT CACACGCACG GCGCTCGGTA AGAGGAAGAA GGTGCTCGCC
GACGTCGTCG TGGGCGCGGT GTGTGTTGAT CCAGTGCAGC GCGGTGCGGA TGGGTTGACT
TCGATTATTT GTCCCGCGTT CAATGCCGCT GGGTGTAGAG GTTGTGCGCG CGCGCACGAA
TGTAAATTTT GCGGCTCGCC AGCGCACGGG GCGAAGGCCT GCAAGAGTGG ACGATACGCG
CTTCTTGCGG CATCACCCAC GGCGTGCGCA GCTCAATGGG CAGAGATTAT CGATTTTGAG
AAGTCAAACG TGCAAAAGCT TGAAGGTGCA ACGCCGAATG AACCGTCCCC ACAACTCTAT
GCGCGCGTGA AGCACGCCTA CGAATTAGCA GGTCTGTCTC TCGGTGAGAC ACCTGAGTTT
TGGCTAGAGT ACGCGCACTG GCACGAGAGC GAGAATCGTT CAGACGAGGC GGTGGAAGTT
TTACAGCGAG CGCGGGAGGC GTTACCGTAT TGCACGCTTA TCACTTTTGC GTCGGCGGAT
ATCGAAGAGA CTCGCGGCGA CGCCGACGCG TGTAGAGCCA TCTACGAGTC CGTGCTTGAC
GCCTATGAAG AGAGCGCTGA CGAAGCCATC GAGCGAGGTG AGGAGATAAT GATGCCCTCG
GATATAATTC TAACGTACTG CGAATACGTT CGCGCGTCGC GTCGTGTTGG AGATCAAGAT
TCGAGTCGAA AAGCTTTCAT GCGAGCCCGT AAGGCGCCCG GGGCGACGTG GGAAATTTAC
GCAAACTCCG CGATGATTGA ATGGCAGTAC GATAAGAGCG ATAAACCGGC GAGAAACATC
TTTGAGCTCG GGCTTAAGAA ATTTTTGACG TCGCCAGACT ACGTCGAGCG TTATGCCGAG
TTCTTGATTG GCGTGAACGA TGTTGCCAAT GCCCGAGTTT TGTTTGAACG TTCCCTGAGT
GAATCGCCGT CGATGAAGAT TTGGGATATG TTTGTCGATT TTGAGCGTTC GCATGGCACA
GTGGATACGA TTTTAGATGC GGAGGCACGG CGTAATGCCG CG
 
Protein sequence
MRLKAVDLKK RCADAGLDDS GTKADLVGRL LNQASNEDSS PPDAGDAPER VVATDADDAK 
LEDADVDAVE TDGSGDHDDL LTHLRENPSD AAGWERCARL ALATTIPSAR PLFDAITEQF
PRSSLAWCWY VDAELSKNDA GTPDDEAIRA IFGKCLIPCP SALLWRRYAS YMASTNDVTT
EEGVNTMKSV YEYSVDVVGE DADAGDLWMD YCQFLRSTEA TLIVTDVAVE QAPSARDMIV
RRTYQKAISV PMHKLDAVYK VYEAFELEKN KALARALLQE IAPKLLLTRT ALGKRKKVLA
DVSGRYALLA ASPTACAAQW AEIIDFEKSN VQKLEGATPN EPSPQLYARV KHAYELAGLS
LGETPEFWLE YAHWHESENR SDEAVEVLQR AREALPYCTL ITFASADIEE TRGDADACRA
IYESIMMPSD IILTYCEYVR ASRRVGDQDS SRKAFMRARK APGATWEIYA NSAMIEWQYD
KSDKPARNIF ELGLKKFLTS PDYVERYAEF LIGVNDVANA RVLFERSLSE SPSMKIWDMF
VDFERSHGTV DTILDAEARR NAA
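
The sequences above are printed in blocks of ten characters, so the spaces and line breaks must be removed before the lengths or GC content in the Gene Information section can be checked. The following is a minimal Python sketch of that clean-up; the function names `clean_sequence` and `gc_percent` are hypothetical helpers, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = clean_sequence(
    "ATGCGATTGA AGGCTGTCGA TCTCAAGAAG CGCTGCGCCG ACGCTGGGCT CGACGATAGC"
)

print(len(first_line))                  # 60 bases per printed line
print(round(gc_percent(first_line), 1))  # GC percentage of this line only
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `len` and `gc_percent` gives values that can be compared with the Gene Length and GC content fields.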