Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_89323 |
Symbol | |
ID | 5005720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009369 |
Strand | + |
Start bp | 372530 |
End bp | 374176 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | |
GC content | 63% |
IMG OID | 640421141 |
Product | predicted protein |
Protein accession | XP_001421640 |
Protein GI | 145354750 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 0.883237 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0403088 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCGG TCGCGCGGTC GCTCCTGCTC TGGCTGACGC TCGCGCCGAC CCTGGCGCGC GCGTCGGCGC ACGCGTACGC GCGCGACTCG CTGTACGCGA CGAACGACGC GGCGATCCTC GTCGCGGGCG CGGAGGGCGT CTTCGCGTCG CGCGTGCGCC TCGAGCGCGA GACGACGGAC GTCCGAAAGC GATGGCTCGG CGCGAATCCG CGCGTCGCGG ACGGGCGCGC GTACGTCCGA CTGGACGCGC TGACGTTCGA GCGGCCGCGG GAGACGGCGC GGGCGAGGGG CGCGACGGGG GGGGCGAGCG GCTTGGTCGA GGCGGCGACG TTTAAACGCG AGGACGTCGA TCGGATCGGC GTGTGGGACG ACGAGGCGGG CGCGAGGAAG TTTTGTTGCA CGGGTGACAT GGCGAAGCGA GGGCTGTGCG AGAAGGGAGA GGTGGGACGA TTGGTGGTGC GAGGGCGGGG CGACGGCGGC GCGACGGCGC CGTGGAAGAC GGAGATTTGG TTCGAGGGCG ACGACGTCGA GGCGAGGAGC GATGTGCAGG CGGTGAGCGT GCGGGAGACG GGGATGTATT ACATGTGGTT CGTGGTGTGC GATCCGGAGC ACGCGGGGGT GACGGTGAGC GGGAGGACGC TTTGGAAGAA TCCGGATGGA TATCTGCCGG GGGCGAAGAC GGCGCTGTTG CCGTTTTACG GCTTCGCGGC GATGGCGTAC CTCGGGTTGG GGTTCGCGTG GGCGATGGCG TACGTGGGGA ATTGGCGACA CGTTTTAGAG CTGCATAATT GCATCACCGT CGTGCTGGCG CTGTCGATGT GCGAGACGGC GGTGTGGTAT TTCGATTACG CCAACTGGAA CGCCACGGGC TATCGCCCGT ACGTGTTCAC CGTGGTCGCC GTCTTGCTCG GCAGTCTTCG CACGACGCTC AGTCGCACGC TCGTGCTCAT GATGTCCATG GGGTACGGCG TCGTTCGCCC CACCCTCGGC GGGTTGAACG CCAAAGTGGT GTCGTTGAGC GTTTGCTATC TCTTCTCCAC CGCCGTCAAG GACGTCGTCG AGCACGTCGG ATCGGTGGAT GACTTGAAAC CCGGCGCGAG GTTGTTTTTG GTGCTGCCGG TGTCGGTGTT TGATTCCGTG TTCTTGATTT GGATCTTCAA CTCGCTGTCG AGGACGCTCA CGCAGCTCGT GTTGAGACAA CAAAAGCAAA AGCTGTCGCT CTACCGCGCG TTCACCAATC TCTTGGCGGC GAACGTCGTG CTCTCGGTCG GTTGGCTCGC GTACGAGATG TGGTTCAAGA GCACGGACAT GATTGAAGAG AAGTGGGAGT CGGTGTGGAT GTTGACTGCG TTTTGGCAAG CGTTATCCTT CGGTTTACTC GCCGGCATTT GCTTTTTATG GCGCCCCGCG AGCGAGTCGA CGCAGTACGC CTACAGCGAG CTCGCGAACG ACATCTCCGA AGACGCGTGG TGGGGCGAGC TCATCACCAA CGACGACATC GAGCAATTCG CGGGGTCGTC GAAAATGTCC AAGTCACCGC GCGTGATGAA TAGTGCGAAG AAGACGCGAG CGATGAACGA CTTTTCGCTC GACGCCGACG ACGACTCGGC GGCGGAAATC GAAATGGAAA TGGGAAAGAT TGACTGA
|
Protein sequence | MRAVARSLLL WLTLAPTLAR ASAHAYARDS LYATNDAAIL VAGAEGVFAS RVRLERETTD VRKRWLGANP RVADGRAYVR LDALTFERPR ETARARGATG GASGLVEAAT FKREDVDRIG VWDDEAGARK FCCTGDMAKR GLCEKGEVGR LVVRGRGDGG ATAPWKTEIW FEGDDVEARS DVQAVSVRET GMYYMWFVVC DPEHAGVTVS GRTLWKNPDG YLPGAKTALL PFYGFAAMAY LGLGFAWAMA YVGNWRHVLE LHNCITVVLA LSMCETAVWY FDYANWNATG YRPYVFTVVA VLLGSLRTTL SRTLVLMMSM GYGVVRPTLG GLNAKVVSLS VCYLFSTAVK DVVEHVGSVD DLKPGARLFL VLPVSVFDSV FLIWIFNSLS RTLTQLVLRQ QKQKLSLYRA FTNLLAANVV LSVGWLAYEM WFKSTDMIEE KWESVWMLTA FWQALSFGLL AGICFLWRPA SESTQYAYSE LANDISEDAW WGELITNDDI EQFAGSSKMS KSPRVMNSAK KTRAMNDFSL DADDDSAAEI EMEMGKID
|
| |