Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_37056 |
Symbol | |
ID | 5001373 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | - |
Start bp | 107551 |
End bp | 108861 |
Gene Length | 1311 bp |
Protein Length | 415 aa |
Translation table | |
GC content | 68% |
IMG OID | 640416794 |
Product | predicted protein |
Protein accession | XP_001417428 |
Protein GI | 145345882 |
COG category | [S] Function unknown |
COG ID | [COG5505] Predicted integral membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.0190175 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCGCG ACGCGCTCGC GTCGGTCGCG CGCGCGCGCG CGCCCGCGCC GGCGCGCGAA CGCCGCGCGA AGTCGCGCGT TCGGGACGGC GGTGCGTCCC GAACGCGCCG AACGCGCGAC GCCGCGACGC GCGCGCTCAT CGCGCCGACC GACGCGCTCG GTGTTTGGAC CGCGGTGCTC GCGTGCGGTG CGTTCGGGCT TTGGGCGGAG AAGCGCCCGT GGGGAGCGAA CGCGGGCGGC GCGCCGCTGG TGAGCACGCT CGCGGCGCTC GCGCTCGCGA ACGCGGGAAT CATGCCGACG GACGCGCCGA CGTACGGGAT GATTAATGGG TTCTTGTTAC CGCTCGCGGT GCCCATGTTG CTGTTCACGG CGGACGTGCG ACGAGTGCTG CGCGGATCGG CGCGCTTGTT GCCGTGTTTC GTCGTCGGCG CGCTCGGGAC GACGCTCGGG ACGCTGGGGG CGTTCGCGGC GGTGCCGATG ACGGCGCTAG GAGCGGAGGG GTGGAAGATG GCGAGCGCGT TGATGGCGCG ACACATCGGG GGGGCGGTGA ATTTCGTCGC CGTGGCGAAC GCGTTGGAGA TGACACCGAA TATCATGGCG GCGGGTTTGG CGGCGGATAA TTTGATGAAC GCGTTGTACT TTATGGGATT GTTCGCGCTG GCGAAGGGCG TGATGCCGAA AACGCGCGAC GGCGGAGACG TCGACGCGCG CGAGAGCCAG GGCGATGGCG ACGTCGCGCT CGACACGGTC GCGCTCGTTG AGGGCGAGGG CGCGGGTGAG CCTTTCAGCG CGTTGCGCGC GTCGTACGCG CTCGCCGTGG CGGCCTCCGT CGGATACGCG GCGAAGCTCA TCTCCGCCGC GCTCAATCTG CGTGGCATGG ACATCCCGAT CATCACGTTG ATAACGGTCG TTCTCGCGAC GGCGATACCT AGACGTCTCG GAGCGCTCGC GGGCTCGGGC GAGGCGCTCG CGACGCTCGT CATGCAAGCC TTCTTCGTCG CCGTCGGCGC GTCGGGATCC ATCACCCACA TGCTCACCAC TGCGCCGTCC TTATTCGTCT TCTCCTGCCT ACAGGTCGCC ATTCACTTAG CATTCCTCCT CGCCGTCGGC AAAGCGCTCA GATTCGACAA AGCGAACGCT TTACTAGCCT CCAACGCGTG CGTCGGCGGT CCCACCACAG CCGCCGCGAT GGCTTCGGCG AAAAATTGGA AATCCCTCGT CGTCCCCGCC ATGCTCGTAG GCGTTCTCGG GTACACCGTC GCCACCTTCC TCGGAATCGC GTTCGGCAAA ACTGTCTTAG CTCGCATGTA G
|
Protein sequence | MSRDALASVA RARAPAPARE RRAKSRVRDG GASRTRRTRD AATRALIAPT DALGVWTAVL ACGAFGLWAE KRPWGANAGG APLVSTLAAL ALANAGIMPT DAPTYGMING FLLPLAVPML LFTADVRRVL RGSARLLPCF VVGALGTTLG TLGAFAAVPM TALGAEGWKM ASALMARHIG GAVNFVAVAN ALEMTPNIMA AGLAADNLMN ALYFMGLFAL AKGVMPKTRD GGDGEGAGEP FSALRASYAL AVAASVGYAA KLISAALNLR GMDIPIITLI TVVLATAIPR RLGALAGSGE ALATLVMQAF FVAVGASGSI THMLTTAPSL FVFSCLQVAI HLAFLLAVGK ALRFDKANAL LASNACVGGP TTAAAMASAK NWKSLVVPAM LVGVLGYTVA TFLGIAFGKT VLARM
|
| |