Gene Information |
Locus tag | OSTLU_33567 |
Symbol | |
ID | 5003808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | + |
Start bp | 428426 |
End bp | 429616 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | |
GC content | 57% |
IMG OID | 640419229 |
Product | predicted protein |
Protein accession | XP_001419594 |
Protein GI | 145350398 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.416798 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0211717 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGCCG AGGTGGGGTT GATGGCGGCG TACACGCTGT ACGCGATGAT GACGCTCGAT GGGAATTGGG TGCTGTTGTT GGTTTTCACG CTGCTCTCGG CGCTCGTGGC GTACACGTGG AGGGACGAGT TGAACTTGGT GGCGTCGATG ATTTCTGTGT CGACGATTAG CTTGTCGGAT AACCCGCACC TGGTGACGGT GACGATCGGT TTGCAGTGTC TCGTGATGGC GTTCGTGGCG CCGATGGCGT GGTTCGCCGT CCAGGCGTCG CAGCACGGCT CGGCGATTAT CAACCAATAC GCCACCGAAT TCAGTAACGA CGCGTGCACT GGGTATTACG GTCAATCCGT CGATTGTTGC AAGTGGAACA TCGACAGTTG GGTGGGGCCT TATTACGCGC TCGTCGTCAT AGCGTGCGTG TGGTTCACGT CGTGCGCGCT CGAAGCGCGC ATGTACGTCA TCGGAGGCGT CGTCTCGCAG TGGTACTTTG CCCCGGCCGG GACAAAGAGT TTCAAGGGCA CGACGAGAAC GTCCGTGAGT AACGCGTACG GACCGTCGTT TGGGACGATT GCGTACGGCG GCTTCGTGAT CACCGTCGTC GAAATAATTC GAAGCATGGC GAACAAGTCT CGCCGGGAAC GCAACAATTA CGGCAACCCG CTTTGTTGCC TCTTTTACGC GATGCTGGAC TGCATCTTTG CCGTTATCGA GTACCTCAGT CGATTTGCCA TGATTCAGGC TTCGATCACC GGCGAAGCGT TTTGCGATGC CGCGAGGAGC ATCAACGATC TCCTCAAAAG AAACTTTCTC TTGGCGTACG GCGCGTACGC CTTTCCGAAA CATATTTTGG GCTTCCTCGT CTTCGTCTTG GCCGCCCTTC TCGGCTACTG CGTCAACATT TTGAGCAAGC ACGTCTTCGC CGCGAACTCC CTCGGCGCGA TCGTCAACGG AATCGGCTCC TTCTTCATCG CTTACATCGT CCTCAGCTTC TTCGTCATGA TTTTGCTCAA CTGCGTCGAC GCCGTCTTCG TCTGTTACGC CTTGGACAAG GATCGCGCCG CGGTGCACCA TCCAGACTTG CACAAAGTCT TCGACGAGGT CACGCGCAAG CAGCGCGCGA TCGAAGAGTC CGATGCGGAG GGTATGGAAG AGCCATTGAT CTCGGGCAAG CCCAAGTACG CGTCCATGTA G
|
Protein sequence | MIAEVGLMAA YTLYAMMTLD GNWVLLLVFT LLSALVAYTW RDELNLVASM ISVSTISLSD NPHLVTVTIG LQCLVMAFVA PMAWFAVQAS QHGSAIINQY ATEFSNDACT GYYGQSVDCC KWNIDSWVGP YYALVVIACV WFTSCALEAR MYVIGGVVSQ WYFAPAGTKS FKGTTRTSVS NAYGPSFGTI AYGGFVITVV EIIRSMANKS RRERNNYGNP LCCLFYAMLD CIFAVIEYLS RFAMIQASIT GEAFCDAARS INDLLKRNFL LAYGAYAFPK HILGFLVFVL AALLGYCVNI LSKHVFAANS LGAIVNGIGS FFIAYIVLSF FVMILLNCVD AVFVCYALDK DRAAVHHPDL HKVFDEVTRK QRAIEESDAE GMEEPLISGK PKYASM
|
| |
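The coordinate and length fields in the record above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the function name `cds_lengths` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS includes its terminal stop codon (the listed gene sequence ends in TAG).

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends with a
    stop codon, which is not translated into an amino acid.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # drop the stop codon
    return gene_len, protein_len


# Values taken from the record: Start bp 428426, End bp 429616
gene_len, protein_len = cds_lengths(428426, 429616)
print(gene_len, protein_len)  # 1191 396
```

Under these assumptions the result agrees with the Gene Length (1191 bp) and Protein Length (396 aa) fields listed in the record.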