Gene OSTLU_41331 details


Gene Information

Locus tag: OSTLU_41331
Symbol:
ID: 5002464
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009360
Strand:
Start bp: 168648
End bp: 169682
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table:
GC content: 56%
IMG OID: 640417885
Product: predicted protein
Protein accession: XP_001418397
Protein GI: 145347899
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
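
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the gene length includes one terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    # The stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(168648, 169682)
print(length)                              # 1035
print(expected_protein_length_aa(length))  # 344
```

Both values agree with the Gene Length (1035 bp) and Protein Length (344 aa) fields of this record.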


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.060749
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0231462
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAAGC AGTTGACGCG CACGAGCAAG CGCCTGGCGA CGGATTCCAT GCGATCGTAT 
CTCAAGGATA TCGGTTCCGT CACGCTCTTA AACGCCGGTC AAGAGGTCGA ACTCGCCAAG
CGCATTCAAG ATTTGATGCA TTTGGAGAGC ATTCGCGAAA ACCTCGTCGA GGAGACGGGT
CCTGGCGCCG AAGTCACCGA CTACCAGTGG GCGTCGGCGG CGGGTTTAAA CGTGCAGGCG
CTCCATCAGC GTTTGCGCGA CGGTAAGTCG GCGAAGAACG AGATGATTCA AGCCAACTTG
CGCTTAGTGG TCTCCATCGC GAAGAAGTAC GCCAACAGTA ACATGAGCTT CCAGGATTTA
ATCCAAGAAG GGTGCGTCGG TTTGATTCGC GGGGCGGAGA AGTTTGATTT CCAACGCGGG
TACAAGTTTA GTACGTACGC GCACTGGTGG ATTCGCCAGG CGGTGACGCG TTCGATTAGC
GACCAAAGTC GCACGATTCG CTTGCCCGTG CACTTGTTTG AAATCATCTC CCGCATCTCG
AAGATGGAGC AAAAGTTTGC GTTGCATAAC GGTCGCAACC CGACGACGGA GGAAATCGCC
GCAGAGATGG ATATGTCGGC GGAGAAGATT ACTCAGATTA AAAAGGCTGC GCAAGCGCCC
GTGTCGCTGG CTCAGACCAT GGGTGGAGAT AACAAAGGAC GCACCGTCGA AGACACCCTC
GTGGACGTCA CCGCGGAGGG CCCAGAGAAG GTGAGCGGCA AGTCCCTGTT GAAGGAGGAT
TTGGAAAACG TACTGAACAC GCTCAATCCG CGCGAGCGGG ACGTGTTGCG ACTTCGGTAC
GGATTAGATG ACGGTCGCGT GAAGACCCTT GAAGAGATCG GGACGGTGTT CTCCGTCACT
CGCGAGCGCA TTCGACAAAT CGAAGCCAAG GCTCTTCGAA AGTTGAAGCA ACCGTCGAGG
AATTCGATTT TGCAAGAGTA CTTCGCCGAC AGCGACGCGT CCTCGTTACC GAAGCCGCCG
CCGATGAACC CGTAG
 
Protein sequence
MSKQLTRTSK RLATDSMRSY LKDIGSVTLL NAGQEVELAK RIQDLMHLES IRENLVEETG 
PGAEVTDYQW ASAAGLNVQA LHQRLRDGKS AKNEMIQANL RLVVSIAKKY ANSNMSFQDL
IQEGCVGLIR GAEKFDFQRG YKFSTYAHWW IRQAVTRSIS DQSRTIRLPV HLFEIISRIS
KMEQKFALHN GRNPTTEEIA AEMDMSAEKI TQIKKAAQAP VSLAQTMGGD NKGRTVEDTL
VDVTAEGPEK VSGKSLLKED LENVLNTLNP RERDVLRLRY GLDDGRVKTL EEIGTVFSVT
RERIRQIEAK ALRKLKQPSR NSILQEYFAD SDASSLPKPP PMNP
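
The sequences above are printed in blocks of ten characters, so they need to be cleaned before any analysis. The following is a rough sketch of how one might strip the spacing, compute GC content, and translate the gene sequence. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field of this record is empty. All function names are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
print(translate(clean_sequence("ATGTCCAAGC AGTTGACGCG CACGAGCAAG")))  # MSKQLTRTSK
print(translate(clean_sequence("CCGATGAACC CGTAG")))                  # PMNP
```

The two translated fragments match the first ten and last four residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_percent` should give a value comparable to the GC content field, though the record's rounding convention is not stated.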