Gene Information |
Locus tag | OSTLU_41444 |
Symbol | |
ID | 5002517 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | - |
Start bp | 699525 |
End bp | 701075 |
Gene Length | 1551 bp |
Protein Length | 311 aa |
Translation table | |
GC content | 62% |
IMG OID | 640417938 |
Product | predicted protein |
Protein accession | XP_001418551 |
Protein GI | 145348215 |
COG category | [R] General function prediction only |
COG ID | [COG1439] Predicted nucleic acid-binding protein, consists of a PIN domain and a Zn-ribbon module |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCGT GGTCCAAAAT CGTGCGCGAC GATCCCGCAC CGGCGCCGGT CGACGCCGCC GAAGCGTCCG CGGCGGCGGC GGCTGAGGCG ACGCTCGAGA GCAAACTGCG CGACGCCGCG ACGCTGAAGG CGGTCGTCGA CGCCAACGCC GTGTTCAAGG GCTACGCGCT GACCGACCCG AACGTGCTGT GCGTGACGAT CGCGGAGGTG CTGGATGAGA TTCGGGACGC CAAGGGACGA GACGCGGTGG CGGCGAGCGC GGGCGCGCTC GAAGTCGCCG AACCGAGCGA AGAAGCGATC GAAGCGGTGA AACGGTTCGC GAGTAAGACC GGAGACGTGC ACGCGCTGTC GAGGGTAGAC ATGAAGCTCA TCGCGCTGGC GTATGACTTG GAGGGAAGAT GTCACGGGGT GGAACATTTG AGGACGGAGC CGGCGCCGCC GAGGACGCAC GCTAAAAAGA CGAATCGGTT CGAGAAGCAG CCGGGATGGG ATTACGTGCC GAACGCGGAC GATTGGGCGG AGTTGGACGA TATGAATAAG CTCCAGGAGG AAGCCGAACG CGAGATGCGG GAAAAGATGG CCAAGGTTTC GATCGAGCAG GCGGCGGAGG AGGAATTGCG AAAGGAGCGC GAGGCGGAGA CGGCGGTGGC GAGGGAACGA CGCGCCGCCG AGGAGGAACG CGTGCGAGCG TTGAAGGAGA AGGCTGCGGA GGCGTTGGTG GCGCAGGAAC ATATAGTTAA GGACGTCGAG GGCGATACCG ACGAATGGGC GCCGGTCATT TGTCGAACGA CGCGCGTGCG TCGCCAAAAG CGAGAGGAGC GCGCGCGTTT AGCGGCGGAG GAAGCCGAAC GTCGAGCGAC GGCGAACGCG GAAGTTGAGG GAGCCACGCC CGAGGAACTC GAAGAGCAAA GCAAACGAGC GACGGACTTC TTCACCTCTC GTGGTGAAAT CGAGGCGAAT GTGGAGGAGG AAGAGGACGA CGACGCGTCC ACTCGTAGCG ACGACGACGA GGAAGTCGAA CTCGAATCGT GCGTCTCCTC CGTAACGGCT GATTACGCCA TGCAAAACGT CATTCTGCAG ATGGGCCTTA AACTCGTCGC GCCCGACGGC ATGCGCATCG AGCACCTACG GCGATGGGTT CTTCGCTGCC ACGCGTGCAA CGAAATCACC CGCAATCTCA CTCGTATGTT TTGTCCCAAG TGCGGCAATC AAACGTTGCA AAAAGTCGAG CACACCGTCA CTCGCGACGG CGTCGAACAA TTCGGCGTTC GTAAAAAGTT TGTTTTACGC GGCAGCAAGT ACACCTTGCC CGCGCCCAAG GGTGGTCGCA ACGCAAAGAA AATAATCTTA CGCGAAGACC AACTCATGAG CGTGCGGCTG ACTAAGAAAC AAGTAGGCGA AGACGTCTTC GCCGCCGAGT ACAACGAGGA ATCGTACGCC GACGCCAAGC ACTTCGCCAG TCAGAAGACG GCGTACGAAA TCGGCGGCGG CGACGTCCGT CGCAACCCGA ACGAACGTCG TCACGTCGCC ACGAACAGGC GTCGAAAGTA G
|
Protein sequence | MSAWSKIVRD DPAPAPVDAA EASAAAAAEA TLESKLRDAA TLKAVVDANA VFKGYALTDP NVLCVTIAEV LDEIRDAKGR DAVAASAGAL EVAEPSEEAI EAVKRFASKT GDVHALSRVD MKLIALAYDL EGRFELESCV SSVTADYAMQ NVILQMGLKL VAPDGMRIEH LRRWVLRCHA CNEITRNLTR MFCPKCGNQT LQKVEHTVTR DGVEQFGVRK KFVLRGSKYT LPAPKGGRNA KKIILREDQL MSVRLTKKQV GEDVFAAEYN EESYADAKHF ASQKTAYEIG GGDVRRNPNE RRHVATNRRR K
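The fields above can be cross-checked with a few lines of Python. This is only a minimal sketch, not part of the original record: the function names (`clean_sequence`, `gene_length`, `gc_fraction`) are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive, which is consistent with the reported gene length (701075 - 699525 + 1 = 1551).

```python
def clean_sequence(raw: str) -> str:
    """Strip the whitespace used to display a sequence in blocks of ten."""
    return "".join(raw.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp fields of the record.
    print(gene_length(699525, 701075))  # 1551

    # First two blocks of the gene sequence, used here only as a short sample.
    sample = clean_sequence("ATGTCCGCGT GGTCCAAAAT")
    print(len(sample))          # 20
    print(gc_fraction(sample))  # 0.5
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_fraction` should give a value that can be compared with the GC content field, and `len()` of the cleaned string can be compared with the Gene Length field.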