Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_33818 |
Symbol | |
ID | 5001014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 809609 |
End bp | 810814 |
Gene Length | 1206 bp |
Protein Length | 402 aa |
Translation table | |
GC content | 58% |
IMG OID | 640416435 |
Product | predicted protein |
Protein accession | XP_001417064 |
Protein GI | 145345107 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.458743 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGCGT TAACGCTACC GCGAGGCGCG CACGCGCGCG CGACGTCAGC AATGCTCGCA CACGTTACGT GTCCGATATG TCTCGAACCG TTCACGGACG CGCACTGCGT GGCGACGTGC GGACACACGT TTTGTCACGG GTGCGCGAGC GCGGCGCTCG ACGCGGCGGC GGCGGCGCCC CGAGACGGCG GCGCGGACGC CGGGGTGAGC GCTCGGTGCC CGACATGCTC GTCGATGTTT ACCAGCGCAC AGTTAGTGCC AAATGCGGCG GTGAACGCGA TGGTTGCGGC TATGAAGTCA GGGGCGAGCG AAGCGGCGGC GGCTTCGACG AGCGCCGAGG ACGGCGAAGA CGATTTGGAA CGATTGACAC CTTTGGTGAA GACTTTGAGC GAGAAACATC GAAGTTTGGT GCTGGAATCA CGCGCAGTGT CGCGAGAGGT GTTGAAGGAG TTCTTAGTAG AGAGCCGAGC GAGGAAGCAG GCGAGCGCGG TGGCGCTCGA GCGAGAGTTG AGGTGTTTAG ACGCCGACAT CGACGCGGTG CGAAGGGAAA TTGAGGCCTT GGGTGGCGGT GCACGAGTGT CGCACGAACG TAGTGATTTG CACGATAAAG AAGTGATCGC GCACGCGATG GAAGCGCTGG GTTTGACGCG GCCGGGTGAG TCGCAAATCG TCATCGACGA GTCGAAGCGA CGCAGAGTGT TGAGGCAGTT TAGCGAATTG CAGAGCTGGT ATTCCAAGAG AAGAAGCGCA GAACGAGACG ATGTCACCAG CGATGGTGGT AAGTCGAGTG GTAGCGCGCT CAATGGTGGT CGAGGGTACG CCCCGGACTC GACGACTATG GAGGAGTTTT CGACGATTAT TGACACGTTT AAACGATACT CGAACATTTC CATCGCGGCG GAGATTCGCG GCGAAGAAGA CGCGTCAAAC CCAGGCGCGC CCGTCAGCAG CATTGAGTTC GATTCGACGC AAGAGTACTT CGCTACCGCA GGTGTGTCCA AACGAATTCA GTTTTACAAC CTCGAGCACG TGCTCGAGGG GTCACAGCAA CCCGCGGATG AAATCAACAC TCGGTCCAAG CTGACGTGCT TGTCTTACAA CAAGTTTGTT AAGCATCACA TAGCGGCGAG TGATTACGAA GGTGTCGTTT CGGTGTGGGA TGTCGAGAAA AAGTGCAGCA TCATAGATTT TGAGGAGCAT GAAAAG
|
Protein sequence | MSALTLPRGA HARATSAMLA HVTCPICLEP FTDAHCVATC GHTFCHGCAS AALDAAAAAP RDGGADAGVS ARCPTCSSMF TSAQLVPNAA VNAMVAAMKS GASEAAAAST SAEDGEDDLE RLTPLVKTLS EKHRSLVLES RAVSREVLKE FLVESRARKQ ASAVALEREL RCLDADIDAV RREIEALGGG ARVSHERSDL HDKEVIAHAM EALGLTRPGE SQIVIDESKR RRVLRQFSEL QSWYSKRRSA ERDDVTSDGG KSSGSALNGG RGYAPDSTTM EEFSTIIDTF KRYSNISIAA EIRGEEDASN PGAPVSSIEF DSTQEYFATA GVSKRIQFYN LEHVLEGSQQ PADEINTRSK LTCLSYNKFV KHHIAASDYE GVVSVWDVEK KCSIIDFEEH EK
|
| |