Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_27364 |
Symbol | |
ID | 5005403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | + |
Start bp | 142900 |
End bp | 144528 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | |
GC content | 61% |
IMG OID | 640420824 |
Product | predicted protein |
Protein accession | XP_001421217 |
Protein GI | 145353859 |
COG category | [A] RNA processing and modification |
COG ID | [COG5104] Splicing factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0285818 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGAGG CGCAGACCGA GCGCGCGGAC GCGCGAACGG CGACGCGGGA GGCGCAAACG CTTCGTCATC CAGAGGGACA TCTTGAACGA GAGGCGACGA CGGCGGCGAC GACGGCGTGG GAAACGCACG TCGCGCCGGA TGGGCGAACG TATTACTATC ACCCGGAGAC GCGGCGGAGC ACGTACGCGA AGCCGGAGGA GATGATGACG ACGATGGAGC GCGCGGAGGC GGCGACGCGG TGGCGAAAGT TTACGGCGCC GGCGGCGGAC TCGACGGGAG CGATGAAGAC GTATTGGGCA CACGAGGACA CGGGAGTGAC GACGTGGGAG ACGCCGAAGG AGATTGAGGA GGTGAGGGAG ATCGTGCGGC GAGCGGAGGC GCGGGCGACG GGGCCGGGAG GTCGAGGAAC GGGAGCGGGG CGCGCGGAGA GTCGAAACGA GAGCGGACGA GCGAGCGGCG AGGATGCGGC GCCGTTTGCG AAGGAATACG CGAGTATGGA GGAAGCGAAA CAGGCGTTTA AGAAGATGCT CGCCGAGTAC GGCGTTCGGG GATCGACGAA GTGGGATGAA GTGGTGAATC GAGCTGGGGC GGATGCGCGG TTTTCGGCGC TGCGGTCGAC GGGGGAGAAG AAGCAGTGTT TGAATGAGTA TCAGATGGCG CAGGCGAAGA TCGAGCGCGA GGCGAAGAGA ATGGCGGAAA AGAAGGCGCG CGAGGCGTTT AGAGCGATGC TCGAGGAGCA CGGAGAAGCG TTGGGCTTGA CGTCGAATTC CAGATTGTCG CGAGACGGAT CGCTGGAGCA GGCGCTGCGG GACGACGCGC GGTGGCGCGC GGTGACTGAT CAACGCGAAC GTGCGGAAAT ATTTGAAGAC TACACGCGAG ATTTACGTGT GCGCGAAAAA CACGAGCGCG AGCACACGAA AACGAAGCGT GCTTCGGAGT TTAGGGAGTG TTTGATCGAA GCTGGCGCGA CGTCAGAGAT GACGTGGCGA AAGATTTATG AAGTCGTCAA GGACGACCCG CGATGCGAGC GATGCGAACC TTTGGCTCGT TTGGATGTGT TCGAAAGCAT AGTGCGTGAT TTGGCTCGCG CAGAGAGGGC CAAACTCGAA GTCGAGCGCA AGGCCAAAGC GCGCGAAGAG CGCAAGCGTC GCGAAGACTT TGTTGCGCTG CTCGCCGAGT CGCAAGCCGA CGGCATCATA ACGCCACGGA TGCCGTGGAA GTCGTTCGTG AAGCGCATCG AGAACGACGA GCGATACGTC AGATTGTGTC AAAACCTCGA TGGATCGCGA CCGCGCGAGT TATTTGAGGA TTTAATCGAC GAGATCGAGG GCGAGATTGA TCGAAAGTTG GACGATTTTG AAGACTTGCT GCGCGACGGA TACAAGGCGC GCGAGTTGCA CGGCAACACG ACGTGGGAAA AGGCGGAGAA GCTGTACAGA CACGACAAGG CTTGGAAGCA AGCGCCTCGG GACGAGGCGC GTAAGCTGTT TGTGAAGTTT ATCGCGAAAG TTTTCCGGCG TGAGCAAGAG AAAGAGCGCC GTAAGCGCGA GGGCATTCGC AGCGAGGACG ACGCGGACAG GCCGTCGTCG CGCAAAAAGT CTCGTCGCGA TCGTAGCGCG AGTCGATGA
|
Protein sequence | MSEAQTERAD ARTATREAQT LRHPEGHLER EATTAATTAW ETHVAPDGRT YYYHPETRRS TYAKPEEMMT TMERAEAATR WRKFTAPAAD STGAMKTYWA HEDTGVTTWE TPKEIEEVRE IVRRAEARAT GPGGRGTGAG RAESRNESGR ASGEDAAPFA KEYASMEEAK QAFKKMLAEY GVRGSTKWDE VVNRAGADAR FSALRSTGEK KQCLNEYQMA QAKIEREAKR MAEKKAREAF RAMLEEHGEA LGLTSNSRLS RDGSLEQALR DDARWRAVTD QRERAEIFED YTRDLRVREK HEREHTKTKR ASEFRECLIE AGATSEMTWR KIYEVVKDDP RCERCEPLAR LDVFESIVRD LARAERAKLE VERKAKAREE RKRREDFVAL LAESQADGII TPRMPWKSFV KRIENDERYV RLCQNLDGSR PRELFEDLID EIEGEIDRKL DDFEDLLRDG YKARELHGNT TWEKAEKLYR HDKAWKQAPR DEARKLFVKF IAKVFRREQE KERRKREGIR SEDDADRPSS RKKSRRDRSA SR
|
| |