Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_41331 |
Symbol | |
ID | 5002464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | - |
Start bp | 168648 |
End bp | 169682 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | |
GC content | 56% |
IMG OID | 640417885 |
Product | predicted protein |
Protein accession | XP_001418397 |
Protein GI | 145347899 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.060749 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0231462 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAAGC AGTTGACGCG CACGAGCAAG CGCCTGGCGA CGGATTCCAT GCGATCGTAT CTCAAGGATA TCGGTTCCGT CACGCTCTTA AACGCCGGTC AAGAGGTCGA ACTCGCCAAG CGCATTCAAG ATTTGATGCA TTTGGAGAGC ATTCGCGAAA ACCTCGTCGA GGAGACGGGT CCTGGCGCCG AAGTCACCGA CTACCAGTGG GCGTCGGCGG CGGGTTTAAA CGTGCAGGCG CTCCATCAGC GTTTGCGCGA CGGTAAGTCG GCGAAGAACG AGATGATTCA AGCCAACTTG CGCTTAGTGG TCTCCATCGC GAAGAAGTAC GCCAACAGTA ACATGAGCTT CCAGGATTTA ATCCAAGAAG GGTGCGTCGG TTTGATTCGC GGGGCGGAGA AGTTTGATTT CCAACGCGGG TACAAGTTTA GTACGTACGC GCACTGGTGG ATTCGCCAGG CGGTGACGCG TTCGATTAGC GACCAAAGTC GCACGATTCG CTTGCCCGTG CACTTGTTTG AAATCATCTC CCGCATCTCG AAGATGGAGC AAAAGTTTGC GTTGCATAAC GGTCGCAACC CGACGACGGA GGAAATCGCC GCAGAGATGG ATATGTCGGC GGAGAAGATT ACTCAGATTA AAAAGGCTGC GCAAGCGCCC GTGTCGCTGG CTCAGACCAT GGGTGGAGAT AACAAAGGAC GCACCGTCGA AGACACCCTC GTGGACGTCA CCGCGGAGGG CCCAGAGAAG GTGAGCGGCA AGTCCCTGTT GAAGGAGGAT TTGGAAAACG TACTGAACAC GCTCAATCCG CGCGAGCGGG ACGTGTTGCG ACTTCGGTAC GGATTAGATG ACGGTCGCGT GAAGACCCTT GAAGAGATCG GGACGGTGTT CTCCGTCACT CGCGAGCGCA TTCGACAAAT CGAAGCCAAG GCTCTTCGAA AGTTGAAGCA ACCGTCGAGG AATTCGATTT TGCAAGAGTA CTTCGCCGAC AGCGACGCGT CCTCGTTACC GAAGCCGCCG CCGATGAACC CGTAG
|
Protein sequence | MSKQLTRTSK RLATDSMRSY LKDIGSVTLL NAGQEVELAK RIQDLMHLES IRENLVEETG PGAEVTDYQW ASAAGLNVQA LHQRLRDGKS AKNEMIQANL RLVVSIAKKY ANSNMSFQDL IQEGCVGLIR GAEKFDFQRG YKFSTYAHWW IRQAVTRSIS DQSRTIRLPV HLFEIISRIS KMEQKFALHN GRNPTTEEIA AEMDMSAEKI TQIKKAAQAP VSLAQTMGGD NKGRTVEDTL VDVTAEGPEK VSGKSLLKED LENVLNTLNP RERDVLRLRY GLDDGRVKTL EEIGTVFSVT RERIRQIEAK ALRKLKQPSR NSILQEYFAD SDASSLPKPP PMNP
|
| |