Gene Information |
Locus tag | OSTLU_17033 |
Symbol | |
ID | 5004066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | + |
Start bp | 207092 |
End bp | 208942 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | |
GC content | 68% |
IMG OID | 640419487 |
Product | predicted protein |
Protein accession | XP_001419936 |
Protein GI | 145351126 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0241463 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCCGG CGACGCGGTC GCCGCTCGAC GCGATCGCGC GAAACGTCGA CATGCGCGCC TCGGCGCTGA GCGCGGTGGA GAAGTTGGCG CTCGAGGCCG ACGCGACGGC GCTGTCGCCG ATCGACGTCG CGCGCGCGCC GCGCGTCGAG ACGACTCGCG CGCGCGAGAG CGACGCGACG GCGCGCGAGA GCGACGCGAA CGCGCGCGAG GACGACGCGC GCGAGGACGA CGCGGACGAG GCGTCGACGG CGACGACGGA GTATCGAGAG ACGGTGGCGC TGCTGCGCGC GCAGACGACG GCGCTGCGCG AGGCGAACGA AGCGCGCGAG GAGCTCGAGG AAGCGCTTCG CGAGGCGGAG ACGACGAAGG CGACGCTCGA GGGCGACGCG CTGGAGCGAA TGCGCGCGGT GCGGGGAGAG ATGGAATCGC TTCGCGCGAC GCTGGAGGAG GTGGTGGAGA GTCGGCGACG CGCGCGAGAG GGGGAGGCGA ACGCGGTGGC GCGAGCGAAG GCGTCGGAGG AGACGCTCGC GCGCGAGAGG GAATCGCACG AGCGCGCGGT GGAGGAAATG CGCGCGGAGC TGGCGCTCGC GCGGTCGAGC GCGGAAATCG CGGCTCGAGG CGTGCGAACG AAGGACGGCG TCGAAGTCAC CGCGGAGCTG TACGAACAGT CGCACTCGAT GGAAAAGAAG GCGTTGAAGG ATGCGCAGCG GGCCAAAAAC GCGCTCGTGG AGCAGAAGAA AACGCTCGAG GCGCTGGAGA AGCGATTGCG CGACACGAGG TCGCGCGCGG ATTTAGCCGA AGCCAAGTGC CGAGAATTGA AATCGGCGCA AAAACGATGG GATCGATTGG AAGCCGGTGG TGATAACGCG TTCGCACAGT CCCCTGCCTC GGCAGCTACG CCGCGGACGC TGCGACAGAG CCATCGCGAC GTGCGGGCGT ACGCCGCCCA GCGCGTGGAG GCGTTGGAGA AGGATGTCAA GACGACGCGG CGCGAAGCTA CGCTAGCGAC TGCGGATTTA GATTCAAAGT TAGCCGCCGA GTCAGGCACG CGTAGCGCTG TGCAAGGAAG CTTGGAGAGC GCACGCGCGC GTATCCGCGT GCTCGAAGGT GAGCGTGATG CGTTGCGAGC GCGCGCCGAG GCGGCAGAGA CTGAGGTCGC AGATTTGAAG ATGACGCTCA AGTTGACGGA GGAACGCGCG GCGCAAGCGG AGGAAACGGC GGCGAAAGAA ATCGCGGCGG CTATGGACGC GATGCACCGA GTTCATTCGG CAGAAGACGC CCTGATGTCC GCCTCGCACC AGCAAGAGGA GGCGGTCAAG GCGAAAAGTG CAATGAAGCG GCGCGAATAC GGAGAGAAGA CGCGTGAGCT CGAGGAACGC ATCGAAGCCG CTGAACGGGC GCGCGATGCC GCAGCGCAAC GCGCACAAGT GGCGGAGGAT GCACGCAGAC GCGCCGAGTC CGAACTTGCG CGCGCGCAGA CGAGCACAGC GCGCGACGGC GACGGCGCCG ACGTCCGCGA ACGAATTTGC GCCGCCGAGG ACAGAGCGCT TCGCGCCGAG GAAGAGCTCT CTCAAGTCCG TAAAGCCGCG CGCGATGCCG CGCGCAACGC AAACGAATCC ATCTCGCCGT CGCCCGAACG CGTGGCGCAC CGACCACCGT TCTCGCAATC GCGACCGACT CGTCTTGCGC GCGAGCGCGA CGGCCAACGC GCCGCCGCCG CCGAGCTCCT GCACAGCCTT CGCGCCAAAC TCGAGGCCAG TTCCCCGTCG ATCGGGCAGA AGTCGAAAAG TAGATTCACG 
GATGCCGATC ACGACGACGC CTCGTACGCT ACGCCCGTGA AACGGGCTTG A
|
Protein sequence | MPPATRSPLD AIARNVDMRA SALSAVEKLA LEADATALSP IDVARAPRVE TTRARESDAT ARESDANARE DDAREDDADE ASTATTEYRE TVALLRAQTT ALREANEARE ELEEALREAE TTKATLEGDA LERMRAVRGE MESLRATLEE VVESRRRARE GEANAVARAK ASEETLARER ESHERAVEEM RAELALARSS AEIAARGVRT KDGVEVTAEL YEQSHSMEKK ALKDAQRAKN ALVEQKKTLE ALEKRLRDTR SRADLAEAKC RELKSAQKRW DRLEAGGDNA FAQSPASAAT PRTLRQSHRD VRAYAAQRVE ALEKDVKTTR REATLATADL DSKLAAESGT RSAVQGSLES ARARIRVLEG ERDALRARAE AAETEVADLK MTLKLTEERA AQAEETAAKE IAAAMDAMHR VHSAEDALMS ASHQQEEAVK AKSAMKRREY GEKTRELEER IEAAERARDA AAQRAQVAED ARRRAESELA RAQTSTARDG DGADVRERIC AAEDRALRAE EELSQVRKAA RDAARNANES ISPSPERVAH RPPFSQSRPT RLARERDGQR AAAAELLHSL RAKLEASSPS IGQKSKSRFT DADHDDASYA TPVKRA
|
| |
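The reported lengths can be cross-checked against the coordinates in the record. The following is a minimal sketch, not part of the source database: the helper names `gene_length`, `expected_protein_length`, and `gc_percent` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon. Under those assumptions, the coordinates give 1851 bp and 616 aa, matching the Gene Length and Protein Length fields.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """Percent G+C in a sequence; spaces and other non-ACGT characters are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# Values taken from the record above.
length = gene_length(207092, 208942)
protein = expected_protein_length(length)
print(length, protein)

# First 10-base block of the gene sequence, as a small usage example.
print(gc_percent("ATGCCGCCGG"))
```

Passing the full Gene sequence field to `gc_percent` (the spaces between 10-base blocks are skipped) would give a value to compare with the reported GC content.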