Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25087 |
Symbol | |
ID | 5003863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | + |
Start bp | 618585 |
End bp | 620278 |
Gene Length | 1694 bp |
Protein Length | 562 aa |
Translation table | |
GC content | 59% |
IMG OID | 640419284 |
Product | predicted protein |
Protein accession | XP_001419652 |
Protein GI | 145350520 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0599879 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGCGT GCGTGCGAAA ACAGAATGCG TTGGATGTGA TGCCGACGAG CGCGAGGACG GGGTGCCCGA TAGCGATTCG ACGCGCGACG TTGGACGACG AAGATGAAAA ACAGGCGGCG AACGGCGTGT TCATCGCGCG AGTCGTCGTG CGATACCGAG AGGAGCATGA ACTGGAGCAG GCGTTGCTAG ATGTGGCGAT TCAGTCGCGC ACGGCGGGAG AGTTGGCGGG GAGACTGTGC GCCGGAGGGA CGCCGCCAAG AAGAAATCGC GCCGACGAGG ACGAGGCGGA GGCGAACGAC GGATCGATCG AAGAACTCGC GAGCGTGCTC GGTGAAGACC CAAAAGACGT GAAAGCGCGT GTGTTGGAGT GTTACGATCG ATGTGAGCCA TTTCTTCCAG CGCGCGTTCG AGATTTGCTC GGACGCGAAC GCGTGTACGA AGTCCGAAGC GCTTGCGCCG GCGCCGCACT CACGGCTATT CGTTCTTTTA TTTTACGCGT GGCGACGGCG GGTAAGTGGA CGCACTGGGG CATCATAGAA GGTACGGTCG AGGTCGAGCT TCCGTTAGCG TACGACGTCA AGTTCGTGGA TACGCCCGGG ATTGATGCGT CGCAATTCTC TCTCAAGCGA TTGCGCCGAG TAATTGGCGC TGAAGCCGAG GAGCGATTCG GGATGTTCAT ATACGTGTGC GGTAAAGATG CACCGAAAGA GTGGGAGTGG AGCGCTCTGC ATGCAATGGG TACGCTAAAG AAGATCGTCA CCGAGGGTCC GCGGCTGTGT TTGTGCTGGC CCGTGGAGCT CGTGGTGAAC GCCAACGGAC GTCCAGATGG CTTTGTCACG AATCTCCAAA TCAAAAAGTT CCAGGATCGC ACGTTGAAGA TGCTGCGATC GGACTCTAAT CATTGGCGCG AACGTCTTGA TCAATGTATC GTCGCATCCA AGGTCGATGG CTTGCGCGGT GATTTAAGAC GCGGCATGGC GCTGGCGTTT GTTCGGGCGT TTAAGCCCGA CGCCTCGACG ATTTTGAAGT ATTCCATGTA TCAGCTGGCG TTTTCGGTGG AAGAAGAGCT CAAACCATCA CCGCCGAGCG ATAACGGCAG TGGAGTCTTA CCGAGCACCG CGGTGGTCGC TGACGCAAAA CCTTCGACAG ACCCTTGTCC GAAGACCAAG CGAAAGATTG ACGAAGAAGC CGCGACGAAA GCGTCGCCAC CAAAGAAACG CCTTAAGAAG CAACCACGCC AGTGTACCAT CGCTGATCCA TTCGTGTTCA ACGAAGACGA TGGTCTTTTT GCCGACGAAG ACGCCGCAAC ACCTCCAATG TTCGGCAAGC ACATCGCAAA AACCAAATCA ACGCGCTCGC CGCTCGTCGA CGCCAACGAA AAGTCTCGCG CGACGGCGGC GGCGCGACGC AAAACCAAAC CGTCGAACAC CGCACCGAAG CGACCGCTCG TCTTGACGTC GACGACCGTT TCTCCGCCGA AGTCGAAGCA TCGAGGTGCT GAAGCACCGC AGCCGAAGCT TTCCCCTCAA CTCTCCGCGA CGCCCGCGTC CAAGCCTGCG CGAAATCGGC CAAAGTTCGT CGTCGCCTCC GTCGACGACG GTCCCACCCC ATCCCCACCC CCCGCGCGTC CGAAACGCCC GCGCGAATGC ACGAAACAAC CTTGGTGGGT CGTCCAATCG CATCGATAGC TCCA
|
Protein sequence | MEACVRKQNA LDVMPTSART GCPIAIRRAT LDDEDEKQAA NGVFIARVVV RYREEHELEQ ALLDVAIQSR TAGELAGRLC AGGTPPRRNR ADEDEAEAND GSIEELASVL GEDPKDVKAR VLECYDRCEP FLPARVRDLL GRERVYEVRS ACAGAALTAI RSFILRVATA GKWTHWGIIE GTVEVELPLA YDVKFVDTPG IDASQFSLKR LRRVIGAEAE ERFGMFIYVC GKDAPKEWEW SALHAMGTLK KIVTEGPRLC LCWPVELVVN ANGRPDGFVT NLQIKKFQDR TLKMLRSDSN HWRERLDQCI VASKVDGLRG DLRRGMALAF VRAFKPDAST ILKYSMYQLA FSVEEELKPS PPSDNGSGVL PSTAVVADAK PSTDPCPKTK RKIDEEAATK ASPPKKRLKK QPRQCTIADP FVFNEDDGLF ADEDAATPPM FGKHIAKTKS TRSPLVDANE KSRATAAARR KTKPSNTAPK RPLVLTSTTV SPPKSKHRGA EAPQPKLSPQ LSATPASKPA RNRPKFVVAS VDDGPTPSPP PARPKRPREC TKQPWWVVQS HR
|
| |