Gene Information |
Locus tag | OSTLU_34749 |
Symbol | |
ID | 5003770 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | + |
Start bp | 343510 |
End bp | 344631 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | |
GC content | 56% |
IMG OID | 640419191 |
Product | predicted protein |
Protein accession | XP_001419564 |
Protein GI | 145350332 |
COG category | [S] Function unknown |
COG ID | [COG2850] Uncharacterized conserved protein |
TIGRFAM ID | |
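
The length fields in this record follow from the coordinates by simple arithmetic. The sketch below is a consistency check, not part of the source database. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 343510
end_bp = 344631

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends in one stop codon,
# which is not counted as an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1122 373
```

Both values agree with the Gene Length and Protein Length fields reported above.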
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.533757 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCCGA AAAGCGCTGT CTTGGCGAGA CAACCTGCGA GGACGTCGGG AACGAATGTG AGACGCGTTC ATCGTCCGTT GCCCGCGGAC TTCAAGAACA CGATCGCCGC GCAGCGCGTC CCGGCTGTCA TCAGTGGACT CGATATCGGC CAAGCGCCGT GGACGTGGAC GCCAAGTTAT CTGGCTTCCC TCGACGGCGT TCCAGAGAAG CTTGTGAGCG TTCACGTCAG TCGTGATCCC AAGCTGGACT TCGTGCGCAA AAACTTCAAA TACGTCGTGA TGCCTTTTGG TGAACTACTG GCGAAAGTGA ACGATGCGAG CGATGACAAT TTCTACTATT TGCGAAGCAT TGGGGAGAAT CCGCGAAAAG AGCCGGCGCA CGCGCTTCTA CAATTCCCAT CGTTTGCGCG CGATTTGAAA CTTCCGAGCG AGTTTTGGGG ATCCGAAGAC AACTACTTCA GCGCCGTCGT TCGCGTGAGC AGTGGCGATT TACAGCTCTG GACGCATTAT GACGCCATGG ATAACATGTT GATTCAGCTT CATGGCGAGA AGCGTGTGCT TCTGTTCCCA CCGTCCGTGT CAGGCGACTT ATATCTTGAA GGTTCGTCAT CCGTCGTCCG CGACGTGGAC GATCACGATC GAGAATCGTT CCCACGATTC GCGCGCGCTC GAAAAGCGGC GTTGGAAGTC ATCTTACAAC CAGGTGACGT ATTGTACATC CCCGCGCTTT GGGCGCACCA CGTCACCGCC TTGCACGGCC CGTCGATTGC GCTCAACGTA TTTTTCCGAC ACCTCCCCAC GAGTGGATAC CCATCGAAAG ATTTGTACGG GAACGCCGAC CCAATCGCGG CTGCGAGTGC GCTCAAATCA ATAAACTCCG CGATCGAATC CTTGAAAGAG TTGCCGCTAG ATTACCGTGT ATTTTACGCT GGCGTCGCAG CGGCGAGACT GGAGAGTGAG CTTGGCGTCG AATCCGCTCG AAGAGCGCTT GCAACGGTGA ACGACGACAC GCCGAAATCG CGCGGGATGA ATTCGCGCGC AACAAAAGGT ACAGGCGTGG TCGGCACAGT CTTATCTGCT CTTGCATGCC TGCTCATTAC ACGCCGCGCT TCGCGGAAGT GA
|
Protein sequence | MAPKSAVLAR QPARTSGTNV RRVHRPLPAD FKNTIAAQRV PAVISGLDIG QAPWTWTPSY LASLDGVPEK LVSVHVSRDP KLDFVRKNFK YVVMPFGELL AKVNDASDDN FYYLRSIGEN PRKEPAHALL QFPSFARDLK LPSEFWGSED NYFSAVVRVS SGDLQLWTHY DAMDNMLIQL HGEKRVLLFP PSVSGDLYLE GSSSVVRDVD DHDRESFPRF ARARKAALEV ILQPGDVLYI PALWAHHVTA LHGPSIALNV FFRHLPTSGY PSKDLYGNAD PIAAASALKS INSAIESLKE LPLDYRVFYA GVAAARLESE LGVESARRAL ATVNDDTPKS RGMNSRATKG TGVVGTVLSA LACLLITRRA SRK
|
| |
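
The sequences above are printed in space-separated groups of ten characters. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. Because the Translation table field in this record is blank, the sketch assumes the standard genetic code (NCBI translation table 1). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database API.

```python
from itertools import product

# Standard genetic code (assumed; the record leaves "Translation table" blank).
# Codons are enumerated in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character groups and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three groups of the gene sequence listed above.
prefix = "ATGGCCCCGA AAAGCGCTGT CTTGGCGAGA"
print(translate(prefix))  # prints: MAPKSAVLAR
```

The translated prefix matches the first group of the protein sequence above. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein sequence fields.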