Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_33632 |
Symbol | |
ID | 5003897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | + |
Start bp | 543340 |
End bp | 544850 |
Gene Length | 1511 bp |
Protein Length | 497 aa |
Translation table | |
GC content | 66% |
IMG OID | 640419318 |
Product | predicted protein |
Protein accession | XP_001419630 |
Protein GI | 145350475 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.00684373 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.434583 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TCGCGCGCGC TCTCCCCATG CGCGACCGCG CGCGCGAGAT TCGCCTCGTC ACCCTCGGTC TGTCGCCCCG GGACACGCAC GCGAGATGCG TCGACCTCGC GCGCGAGCTC GACGACGCGG TGCGCGACGT GACGGCGCGC ATCGACGCCC GCGCGACCGC ACACGCGAAT GCGCTCGCGA CGCTTCGACG ACGCATCGAA CACCTGCGCG CGCGCGTGCG CGACGTCGGG TCGAGCTCGA GCGCGCGGAC GGTGATGCAC GCGGCGCGGT ACCCGATGGG CGCGCGAAAC GGAGACGTGG AACGGATGTT CGGGACGAGC GCGCCGACGA CGGCGGCGAG CGCGGCGGAG GACGAAGGCG ACGCGGAACG GAAAATGATG GAGGAGTTAC TGGCGATCGA ACGACAGACG GTACATCGAG AGATGACGGC GGAGATGTTT TATGCGAGCG AGACGCGCGA GAGCGCGAGC GGGCGCGCGC GGGTGGTGAG CGGGCGGGAC GGATTGGGGA CGTTGCCGGC GAGTTTGGAG AGCTTGAACG AGTGTTTGTT GTTTAATTCC GACGTGAATC CGTACGTGGA GTACGCGCTG GTGGATAATT TGATGGACGG TGAGGACGAT GGTGGTCGGG GCGATGAAGA CGGCGAAGAC GAGGGAACGC GCGAGGGAGG GGAGTACGAG GACGATTGGA AAAATTTGGC CGAGGCGCCA AAGTCGATGC AAAACGCGTT GTATGATTCC ATGCAAGCCA CGCAGTTTGG ATTTCGACCG ACGATGGGCG CGATGCCGTC GCTGGATTTA CCCACATCGC TGCCTTCGTT ACGCTATGCG GCGGACATCA GCTGGTCCGG GGCGACGAAG ACGGCGGCCC CCATAGCGCC TTCGTCCACG GCGACGCCGC TACCTGACGT CGTCGTGACG CCCTCGGTGC TGGCGCCGCC GCCACCTTTG CCACCGACGC CGCCACCACC GCCACCACTG CCTCGGGAAA ATTTGCACGC GTCCGCGCCG CCGGCGCAGC TCCCACCGGG CGCGAACGTC GGCGACGGCG GGCGGAGCGC GCTCATGGAC GCGATTCGCA ACAAGGACAA CAAATCGAGG TTGCGCAAAC GTGGGTCAGC GGCGCCGAGC GCGGACGCCG CACCGCCGCC ACCTCGCGGC GCGCCGATGG AGCCCGACTT ACGAGGCGCG CTCATGGACG CTATTCGCGC GAAACCGATG CTGAAATCGT CGAGTCGCCC CGAGTTGAGC GACGGCGATC CGCCCGCCGC GGCGCCGCCC GCGCCGCCGC GACAAATGTC CATGATGGAG GAGTTAGCGA ACAGTTTAGG ACGACGTCGC TCCGCCATCG TGGCATCCGG CGACCACGAC GTGCGCGAGA AGGTCATCGA GACGACGACC CAACGCCCGA GTGGCATCGT CGGCTTGGGC GATTTCATCA AGGCGAAGGA GAGTGCGACG AACTGCGACG AAGACCAAGA CGACGACGTG AGCGATTGGG ATAGCGACTA A
|
Protein sequence | MRDRAREIRL VTLGLSPRDT HARCVDLARE LDDAVRDVTA RIDARATAHA NALATLRRRI EHLRARVRDV GSSSSARTVM HAARYPMGAR NGDVERMFGT SAPTTAASAA EDEGDAERKM MEELLAIERQ TVHREMTAEM FYASETRESA SGRARVVSGR DGLGTLPASL ESLNECLLFN SDVNPYVEYA LVDNLMDGED DGGRGDEDGE DEGTREGGEY EDDWKNLAEA PKSMQNALYD SMQATQFGFR PTMGAMPSLD LPTSLPSLRY AADISWSGAT KTAAPIAPSS TATPLPDVVV TPSVLAPPPP LPPTPPPPPP LPRENLHASA PPAQLPPGAN VGDGGRSALM DAIRNKDNKS RLRKRGSAAP SADAAPPPPR GAPMEPDLRG ALMDAIRAKP MLKSSSRPEL SDGDPPAAAP PAPPRQMSMM EELANSLGRR RSAIVASGDH DVREKVIETT TQRPSGIVGL GDFIKAKESA TNCDEDQDDD VSDWDSD
|
| |