Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_29324 |
Symbol | |
ID | 5006523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | + |
Start bp | 55296 |
End bp | 56903 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | |
GC content | 68% |
IMG OID | 640421944 |
Product | predicted protein |
Protein accession | XP_001422465 |
Protein GI | 145356497 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.00930438 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.914315 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCGCC CGCGCGCCGC GCGCCTCGCC GCGTTCGACG TCGCCGGCGT CGTCGCTCTC GTCGCCGTCG CCGCGGCGTG GTGCGTCGTC CTGTCCACGG CGTACGGCGG CGAAAACCGC TGCGACATGA GCTACATGTA CCCGAGTTTC TTCCCGGTGA CGAACGTCGC GCTGGCGTCG CCCGCGCACG CGCTGTATCG CTACCGCGAC GCGCGCGCGC GCGATGACGA CGTCGACGAT CGCGCGCGCG CGAGCGGCGA CGCGAGCGCG TGCGTGGTGA TCAACGTCTT CGTCCCGGGG ACGGGGGGCG GTTACGGGCA AGCGCGAAGC GCGGCGAGCG GGACGCTCGA GAGCGAAGGA TGCCGCGTCG AGCACTACGC CCTGGATTTC GCGGAAGAAC TGAGCGCGTA TAATGGAATG CTACTCGCGA GACAGGCGAA AAGCGTGGCG CGGGCGCTGG ATGGGTTGAG GACGCTCGAG CGCGCGAGGA CGGAAAGACG CGTCGTGGCG GGATGGAGCG TCGTGGGACA CAGCGCGGGA GGATTGGCGG CGATGGAAGC CGCGGCGAGC GCTCTGGGAA GTGAGTTAGG GCTGACGATA GTGAATCTAG CGACGCCGAG CGCGTGGAAT CCGGTGAGTC TGACGTTGAC GCAGCGCGCG CGAGAGGCGG GCGTGCGGCG ACGTTGGCGA ACGAGCGACG CGAGACGGAG CGTGGCGCTC GTCAGCGTCG TCGGTGGCGA GCGCGATCGG CAGGTGAGCT CGAGCCCGAT GGGTTTTGTC GATGATTGGG TAGAAGAAGG GCTCGGTGTG AGCGCGAGCG CGAGCACGGC GTCGCGGGTG GGCGTGTCGG CGGATCATCG ATGCGCGGCT TGGTGCAAAC AGTTAATCGT CGCGATCTCG CGTGGATTCG CCGAGGCGTT TCCACGATCG AAGGTTCTCA CCGCGCACCA GCGCGCGCGC GCGTTTCGCG ACGCACTTGG TGATAAAGAC GCGGCGCGCG CGCCGTCACC GACTTTCGGG GTGCCCGCGC GAGCGTACGA GGCGACGCAA GAGCGCACGC CGAGCGCTCT GGCGGGCGTC GTCGCCAACG TATTAGCGCC GCGGTCGGTG ATGAGCGCCT TCGCGTTCGC GCTCATCGCA CACGCAGCAT CACCGAGCAT ACTTCGTGAG CGCGTCGTGG AATCCGTGAT CGCGTCGCTC GCGGTCTACG TCGCGAGTGG GTGGATAGCG AATCGCGCGT TCGACGTTCG CGGCGACGTC AGACGCGCGT TGACGACGTG CATGGTGCTC GCGCACGTCC CATCTCTCAT CGCGTGGGCG CACCACGTTC TTCGCACCTT ACCGACGTAT GGTTTTCACG CATTGGTGCC GCCGTTCGGT GCGATGGACG CCGTCGCGCT CTGCTTCGCC GCGGCGTCGA CGCGTTTCGG CGCCATCGCT CGCGATCAGT CGTCGGGCGC GTCGCGCGCA CGTCAAGTCG CGTGGTGTCT CGCCGCCGGC GCCGCCGCAC CGGGCAACGT CGGTCTCATC CACATCGCCG GCGCCGTCGT TCACTGCGCG GAGACGCTCC GTCTTCCTCG CGCGTCGCTG CCGAAGAAAG TCGCGTGA
|
Protein sequence | MPRPRAARLA AFDVAGVVAL VAVAAAWCVV LSTAYGGENR CDMSYMYPSF FPVTNVALAS PAHALYRYRD ARARDDDVDD RARASGDASA CVVINVFVPG TGGGYGQARS AASGTLESEG CRVEHYALDF AEELSAYNGM LLARQAKSVA RALDGLRTLE RARTERRVVA GWSVVGHSAG GLAAMEAAAS ALGSELGLTI VNLATPSAWN PVSLTLTQRA REAGVRRRWR TSDARRSVAL VSVVGGERDR QVSSSPMGFV DDWVEEGLGV SASASTASRV GVSADHRCAA WCKQLIVAIS RGFAEAFPRS KVLTAHQRAR AFRDALGDKD AARAPSPTFG VPARAYEATQ ERTPSALAGV VANVLAPRSV MSAFAFALIA HAASPSILRE RVVESVIASL AVYVASGWIA NRAFDVRGDV RRALTTCMVL AHVPSLIAWA HHVLRTLPTY GFHALVPPFG AMDAVALCFA AASTRFGAIA RDQSSGASRA RQVAWCLAAG AAAPGNVGLI HIAGAVVHCA ETLRLPRASL PKKVA
|
| |