Gene Information |
Locus tag | OSTLU_88476 |
Symbol | |
ID | 5003965 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | + |
Start bp | 452667 |
End bp | 455876 |
Gene Length | 3210 bp |
Protein Length | 1069 aa |
Translation table | |
GC content | 62% |
IMG OID | 640419386 |
Product | predicted protein |
Protein accession | XP_001420008 |
Protein GI | 145351275 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
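The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes 1-based inclusive coordinates, an unspliced CDS (the record lists "Is gene spliced | No"), and a gene length that includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    last codon is a stop codon that contributes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(452667, 455876)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the computed lengths agree with the listed "Gene Length" (3210 bp) and "Protein Length" (1069 aa).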
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.289922 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.626674 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCCAAGC AAAAGTACAC CGCGTTCGAC GTCGCCGCCG TCGTCGCCGC GCTGCGCCGC GCCGCGCTCG GGTGCTGGCT CGCGAACGCG TACGACGTCG ACGCGACGTC GGGAAACAAA AAGTTTCTGT TAAAGCTGAA TAAACCGTCC GGCGCGGTGG CGCGAGACGC GCGCGCGGAC GCGACGACGG CGGAGAGCGA GAAAATACTG GTGTTCATCG AGTCGGGGAC GCGGGTGCAC ACGACGAGGT ACGAGCGAGG AAAGACGACG GCGCCGACGG CGTTCACGGC GAAGCTGCGA GCGCGCGCGA AGGGGAAACG CTTGACGGAC GCGAGGCAGC TGGGAAGGGA TCGAGCGATC GATTTCACGT TCGGGGGGGG AGGGGAGAAT GAGTGTCATC TGATCGTGGA GTTGTACTCG CAAGGGAACG TGATTTTGTG CGATGGGAAT TACACCGTGG TGGCGCTGTT GCGATCGTAT CGAGACGGCG GCGACGTCAA CATTTTGCCG AATCATCAGT ACCCGTTGGA GCGGCTGAAG GGATTTCAGC TCGGTGGGTA CACCCGGGAG GACGTGGTGA GCGCGTTGGC GCGCGGGGTG TTGGCGACAG AGGAGGAGAC GATGGGTGGG GACGCGCGGC GGGCGCCGGC GACGCTTCGC GAGGCGTTGT GTCGAGCGTT CGGGTACTCA CCCGCCATCG CGGATCACGT GGCGTTGACG GCGTCGATCG AGCACGGCTC GAACGCATCG CTACCGCTGA GTGAGGCGTG CGTCGATCGG CTGACGGCGG CTGTGCGAGA TTTAGAGAGT TGGTTCGAGG GCGTCACGAC GGGCGACGTC GTCGCCGTGC CCAACGTGTG CACGAAGATG GACGCCAACG CCGACGGCAC GGACGAGATC GAGATCTTCG ATGATTTCTC CCCGTTCTCG TTGAAACAAA ATGAAGGCCG ACCGACGAGG AAGTTCGAGC TTCCCAAGGG GTTAGACCCG GTGTGTGCGT TCGACCACGC AGTCGACGAG TACTTCATCG CTCTCGAGGC GCAGTCGCAA ATCTTAGCGC GACGTAAAGC TGAGGCGCAA GCGTTGGCTA AATTAGAAAA ATCACTCAAG GATCAGAAAA GTCGCGTCGA GCAGCTCGAA CGCGAGCGCG AGAAAGAGGA GCAACGCGCG GTTCTCATCG AGTACAATCA CGAAGCCGTG GACACGGCGA TTGACGCCGT GAATTCGGCG TTGGCGAGCG GAATGTCCTG GCCCGAGCTC GAGGCCATGA TCAACGAAGA GCGCCGGCTC GGGAATCCGG TGGCGGGGAT GATCAAGTCA CTCGATCTGG CGAATAATCA AATCACTATC ACGCTCGCGA ATCATCTCGA CGAAGTCGAC GAAGTCGACG CGGCGAGCGG TAAGCGCAAA CGAGTCGCCG TGGGCGTGGA TTTGGGATTG AGCGCGCACG CCAACGCGTC CATGCGCTTC GCAGCGAAGA AGAAACACGC GGAAAAGTTT AGCAAGACAG TGGATGCGCA GTCCAAAGCC GTGGCCGCGG CTGAAGCCAA GGCGAAGGCG GCGATGGAAA AGGCTGCGAA CGGATCGTCC ATCGCGCGCG CCAGACAACC GCTTTGGTTT GAAAAGTTCA ACTGGTTCAT CACGAGCGAA AATTGTTTGG TGCTTCAGGC GAAAGACGCG ACGCAGGCGG AGATGCTCAT CACACGGTAC ATGCTCCCAG GCGACGCGTT CGTGCACGCA GAGGTACCGC AGGCGCCCGT CACCTTGGTG AAACCGCCGC CTGGCGTCGA CGTGCGAGCG 
GTGCCGGCGT ATTCGCTCGT ACAGGCGGGC GCAGCGGTGA TGTGTCGTAG CAGCGCTTGG AATTCACGCG CGGTTAAATC GGCGTGGTGG ACGAGCTCTG AGCGCGTGAG CAAGATTTCC CCGGTCGCCG GCGACGCGCT TCCGCCAGGC GTCACGCACG TCGCGCACGC GGACAAGCAA TTCCTGCCGC ACGCGCAACT CGTCATGGGG TTCGGATTAA TGTTCGTCGT GAGTGAGAAG AACGCGGAAG CGCACAAAAA CGAACGATTG GTGCGAAGCG ATTTCAACAT TTTAGAAGAA GAGGGCGACG AGACGTTCGA CGAGGACGAC GAGCAAGAAG ACGAGCGAGA CGACGCGAAT GGCGGCGCCG AGATTAGAGG AGAAGTCAAC AAACTCGCGG CGTTCTTAGA TGGTGCCGTG GGTTTCGCAG GAGAAGATGA GCGATCGAGC GTAGACGACG GCGACGAGAA CGACGATGAC GACGGCGACA CCGTCGCCGC CGCGCCGTCG CCCGCGAAGC CGGCGACTCC GCGCATGTCG GCCAAGGAGC GCAAGACGCT GAAGAAGAAG AAGGGCGGCA AGGGAGGGAA TGATTCGGAC GGCGACGGCG ACGACGATTT CATCGACCCA TTAGCCGAGA TGAAGAAGAA ATCATCGTCG AAAACCGTGC CGATCGAGAC GAAGAAGGCA CCTCGCGGCA AAGCGGCGAA ACTCAAGCGC GCCAAGGCCA AGTACGCGGA TCAAGACGAG GAGGATCGGG CGCTTGCGAT GGAATTTCTC GGTGCGAGCG GTGGCAGCGG CGGGAAGAAA GTCACGGGCG CGTCCAAGAA AGCGGCCAAG GCGGCGGAAA AGTTTGAAGA GCGCAAACAG GAGAAACCGA CGGCGCCGAG CGCGCCCGAA CCGGCACCTG CGCCGTTCGT CAAGCGTCGC GAAGCCGCGG CGGCCGCGTT CGCGCGCAAG GCGGCCGCTG ACGACGATTT TCCGGCAAAA TATCAGGACG ACGAGGACGA CGACGAGTCA AAGGCGTCTC TCGTCCCAGA CGAAGCGTCC ATCGAAGAGC GCCTAAAGCT CGACGCCGAA CGTCTCGAAA TCGTCAACCG AATCGTCTCT GCGCCGTTCA AAGACGACGA CATCGAGTAC TGCCTTCCCG TGTGCGCCCC GATCACCGCC ACCAACGCGC TCAAATACCG CATGAAAGTC ACCCCTGGCT CGCAGAAGAA AGGTAAAGCC GCGAAACTCG CGATGGAAAT CCTTTCCCGC GCGCCCTTCG CCACGCCTCG CGAGCTCGCG TGCGTCAAAG CCGTCGCCGA CGTCGACGCC GCCGTCGCGC TCCCCGCGGG GTGTAAGATT AGCTTACCAC CCGGGGCGGC GAAATCCATG TCCAAGGGCG GTAAGAAGAA GCGCCGTTGA
|
Protein sequence | MPKQKYTAFD VAAVVAALRR AALGCWLANA YDVDATSGNK KFLLKLNKPS GAVARDARAD ATTAESEKIL VFIESGTRVH TTRYERGKTT APTAFTAKLR ARAKGKRLTD ARQLGRDRAI DFTFGGGGEN ECHLIVELYS QGNVILCDGN YTVVALLRSY RDGGDVNILP NHQYPLERLK GFQLGGYTRE DVVSALARGV LATEEETMGG DARRAPATLR EALCRAFGYS PAIADHVALT ASIEHGSNAS LPLSEACVDR LTAAVRDLES WFEGVTTGDV VAVPNVCTKM DANADGTDEI EIFDDFSPFS LKQNEGRPTR KFELPKGLDP VCAFDHAVDE YFIALEAQSQ ILARRKAEAQ ALAKLEKSLK DQKSRVEQLE REREKEEQRA VLIEYNHEAV DTAIDAVNSA LASGMSWPEL EAMINEERRL GNPVAGMIKS LDLANNQITI TLANHLDEVD EVDAASGKRK RVAVGVDLGL SAHANASMRF AAKKKHAEKF SKTVDAQSKA VAAAEAKAKA AMEKAANGSS IARARQPLWF EKFNWFITSE NCLVLQAKDA TQAEMLITRY MLPGDAFVHA EVPQAPVTLV KPPPGVDVRA VPAYSLVQAG AAVMCRSSAW NSRAVKSAWW TSSERVSKIS PVAGDALPPG VTHVAHADKQ FLPHAQLVMG FGLMFVVSEK NAEAHKNERL VRSDFNILEE EGDETFDEDD EQEDERDDAN GGAEIRGEVN KLAAFLDGAV GFAGEDERSS VDDGDENDDD DGDTVAAAPS PAKPATPRMS AKERKTLKKK KGGKGGNDSD GDGDDDFIDP LAEMKKKSSS KTVPIETKKA PRGKAAKLKR AKAKYADQDE EDRALAMEFL GASGGSGGKK VTGASKKAAK AAEKFEERKQ EKPTAPSAPE PAPAPFVKRR EAAAAAFARK AAADDDFPAK YQDDEDDDES KASLVPDEAS IEERLKLDAE RLEIVNRIVS APFKDDDIEY CLPVCAPITA TNALKYRMKV TPGSQKKGKA AKLAMEILSR APFATPRELA CVKAVADVDA AVALPAGCKI SLPPGAAKSM SKGGKKKRR