Gene Information |
Locus tag | OSTLU_31869 |
Symbol | |
ID | 5001995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 670087 |
End bp | 671679 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | |
GC content | 58% |
IMG OID | 640417416 |
Product | predicted protein |
Protein accession | XP_001418081 |
Protein GI | 145347239 |
COG category | |
COG ID | |
TIGRFAM ID | |
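The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the database record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the reported Gene Length includes one terminal stop codon. The helper names `gene_length_bp` and `protein_length_aa` are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information table for OSTLU_31869.
length = gene_length_bp(670087, 671679)
print(length)                     # 1593
print(protein_length_aa(length))  # 530
```

Under these assumptions both results agree with the Gene Length (1593 bp) and Protein Length (530 aa) fields in the record.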
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.736523 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.605369 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAGAGG CGACGCTCGA TGATTCGCTG TGGAAGTCGC TGTGTGAAGC TCGATGGCGG TGGCGGGCGA GCGGGACGTG GGCTCGGAGC GCGCGGTTGC CTTTCGAGTC GGTGTTTGGC ACGAGCGCCA ACGCCGAAAA GCGCCGAGAT ATTTTCCGCG GGGGGGAGGG CTGGCGACGG GCTTATAAGG AGCGTTTGCA AACCCGAGGA ATCGCACGAG TGGCGGGTAA GCGCGTGATT CGAGTTGAAT GCGACGGAGG AGTCATGCAG GCGAGCGCGC TGCAGCGCGT TCTCAACGCC ACCTTGCCGG GCGACGTCGT TTGCTTGGGC AAAGGCACGT ACGAAGGATC GTTAACCATT CCTCGCGGGA TCGAGATTGT GGGTGTCGAC AAACGAGAGA ACGTGCTCAT CGTGAGCGAC GAAACTCCAG CGATGATGAC ATCGACGGCG ATGAACGCTT CCCGTTCCGT CGCGTCCGTG GTTACCAACG TCACGCTGTT GCGACGGGGC TCTGCGAAAC GCAGCAGCTC GTCTGGATAC GGCCACCAGG CGTGCGTGTA CGTTTCCGAC GGCTCGCGAT TGCGACTTGA CAGTTGCGAT ATCGTGAGCG CTGGTGAAGG CGTGGTGGCG ACGGCGCAAG ACTCCGCGGT GCACGTGCAC GCGTGTAACA TTCACTCAGT GCTGTCGTCG TTTTTAAGCA CGTCGCGACG CGGCAGCTCG CTCACGGCGT GCAGAATCAC CGCCGCCAAG TCTAGCGTCG AAGACGCGCA CGAAGAGGAA GTCATCGATG AGATCGAAAG CTTACCTTCG TCTTTGGGGT ACGATCGTTT GTTTGCCGCC GTCACGGCTT TGTCGGGCCC GGTGGAGATT TGCAACAATC GCATCGTAAA CGGGTTCGCG CATGGCGTCG TCTTGTTTGA TTGCGCGCAC GGCAATATCC ACGACAACCT CATCGCGAAC AACGTTGGCG CGGGTATTTC CGTGGGTGTA TCTTCCACGG CAAATATTTC TAACACGATA GTCGCGAACA ACTCCAGTGT CGGTATCGCA ATGTGCGGTC GGGGCACGAT TCGGCACTCT GAAGTCCGAG GCAACGCATT CAACGGGATC GATATCGCGC AGCGGTACAC GAACCGCGAC TACCTTACTG CGAGATTTGA CGCAGGTACT GACGAAGAGC TCGATCTTGA AGAAGAATTT TCCGCATTTC TCATGGATTT AGACTCGGAT GACTTCGAAA ACGAAAAATC TTCCGAGGAG ATTGACGTCC TCGTCGAGGG GTGCCATGTT TCCAAAAACG CGAACGACGG CGTGTGCGTG TCTGGTGGCG CGAATGTTGA CGTGATTCAT TGTGAGATCA ACGGAAATCT GTGCAACATC GCGATAGATC GTGGAAACGT GCGATGGAGC CGCGTGCTTG TGGAGGGAGA AAGTATGCAC GCCGACGCGC CAAACGTGCG CGTCGCCGAG TCGCACTCGA CGTTGATCCC GATGCCAACA AGCATCGAAG GACCGCAGTT CGTCGATGCG ACAGTCATGC CTAAGCTCCG ACGATTCATT CCGAATCCGT CTCCGCTTAC TGTCGTGCTG TGA
Protein sequence | MREATLDDSL WKSLCEARWR WRASGTWARS ARLPFESVFG TSANAEKRRD IFRGGEGWRR AYKERLQTRG IARVAGKRVI RVECDGGVMQ ASALQRVLNA TLPGDVVCLG KGTYEGSLTI PRGIEIVGVD KRENVLIVSD ETPAMMTSTA MNASRSVASV VTNVTLLRRG SAKRSSSSGY GHQACVYVSD GSRLRLDSCD IVSAGEGVVA TAQDSAVHVH ACNIHSVLSS FLSTSRRGSS LTACRITAAK SSVEDAHEEE VIDEIESLPS SLGYDRLFAA VTALSGPVEI CNNRIVNGFA HGVVLFDCAH GNIHDNLIAN NVGAGISVGV SSTANISNTI VANNSSVGIA MCGRGTIRHS EVRGNAFNGI DIAQRYTNRD YLTARFDAGT DEELDLEEEF SAFLMDLDSD DFENEKSSEE IDVLVEGCHV SKNANDGVCV SGGANVDVIH CEINGNLCNI AIDRGNVRWS RVLVEGESMH ADAPNVRVAE SHSTLIPMPT SIEGPQFVDA TVMPKLRRFI PNPSPLTVVL
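The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a display string could be turned back into a contiguous sequence is shown here. The function name `clean_sequence` is hypothetical, and the inputs are only the first and last few blocks of the gene sequence from this record, not the full sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


# First three blocks and last two blocks of the gene sequence shown above.
head = clean_sequence("ATGCGAGAGG CGACGCTCGA TGATTCGCTG")
tail = clean_sequence("TGTCGTGCTG TGA")

print(len(head))               # 30
print(head.startswith("ATG"))  # True
print(tail.endswith("TGA"))    # True
```

The listed gene sequence begins with ATG and ends with TGA, which suggests it is shown in coding orientation even though the Strand field is "-".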