Gene Information |
Locus tag | OSTLU_526 |
Symbol | |
ID | 5002514 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | + |
Start bp | 685235 |
End bp | 688012 |
Gene Length | 2778 bp |
Protein Length | 802 aa |
Translation table | |
GC content | 58% |
IMG OID | 640417935 |
Product | predicted protein |
Protein accession | XP_001418304 |
Protein GI | 145347709 |
COG category | [A] RNA processing and modification [D] Cell cycle control, cell division, chromosome partitioning [L] Replication, recombination and repair |
COG ID | [COG5049] 5'-3' exonuclease |
TIGRFAM ID | |
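The Gene Length field agrees with the Start bp and End bp fields if the coordinates are read as a 1-based, inclusive span. A minimal sketch of that arithmetic follows. The function name `span_length` is hypothetical, and the inclusive-coordinate reading is an assumption that the record does not state explicitly.

```python
# Values copied from the record above.
START_BP = 685235
END_BP = 688012
PROTEIN_LENGTH_AA = 802


def span_length(start: int, end: int) -> int:
    """Length of a span whose start and end are both included (assumed 1-based)."""
    return end - start + 1


gene_length_bp = span_length(START_BP, END_BP)

# Bases needed to encode the listed protein, three per residue.
coding_bp = PROTEIN_LENGTH_AA * 3

# Bases in the span that do not encode a residue of the listed protein.
extra_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, extra_bp)
```

The span is longer than three bases per residue would require, which is consistent with the record marking the gene as spliced. How the extra bases divide between intron sequence and any stop codon is not stated in the record.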
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.306945 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.90792 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGCGTCC CGAGCTTCTA CCGCTGGATC GCGCAGAAGT ACCCGAAGAT CGTCGCCGAC GTCGTCGAGG ACGAGCCCGT CGACGCGCTC GGACGCCGGG TGGAGCTGAA CAGCGCCGAG GCGAACCCGA ACGGCATCGA GTTCGACAAC CTGTACCTGG ACATGAATGG CATCATACAC CCGTGCTTTC ACCCGGAGGA CCGACCGGCG CCGACGACGG AGGAGGAGGT GTTTGAGTGC ATATTCGATT ACATCGATCG GTTGTTCTTG ATGATACGAC CGAGGAAGGT GCTGTACATG GCGATCGACG GGGTGGCGCC GCGAGCGAAG ATGAATCAAC AGCGAAGTCG ACGGTTTCGA AGCGCGCAGG AGGCGAGAGA GAAAGCGGAG GAGGAAGAAA AGTTGAGGGA GAAGTTGATC AGGGAGGGGG TGAAGGTGCA GCCGAAACAA GAGTCGGGGG TGTTTGATTC GAACGTAATC ACGCCGGGGA CGCCGTTCAT GGGACGGCTG AGCGAGGCGT TAAAGTACTA CGTGCACGAT AAGTTGAACA ACGATCCGGG ATGGCGGGGA ATTGAGGTTA TCTTTAGCGA TGCGAGCGTA CCGGGGGAGG GAGAGCACAA GGCGATGCAC TACATACGGC AACAAAGGGG GCTGCCGGGG GCGAATCCGA ACACGCGGCA CGTCGTCTAC GGTTTGGACG CGGATTTGAT CATGTTGGCG CTCGCGACGC ACGAGCCCCA CTTTTGGATC TTGCGCGAGA TTGTGTTTCA GAAGAAGGAT AACGAGGCGC CGCAGACGCT GGGGCTCGGG ACGGAGGAGA CGAAGAAAAA GGTGGCCATC GCGCGTAAGC CGTACCAGTT GCTCAGCGTG AGCGTCTTGC GCGAATACTT GGCGTTGGAC ATGCGCCCTG TCGCACCGAC GCCGTTCAAG CTCGACCCAG AGCGGATGTT TGACGACTTT ATCTTCATGT GTTTCTTCGT CGGAAACGAT TTCTTACCGC ATTCGCCGAC GTTGGAAATC CGTGAGGGCG CCATCGACTT GTTGATGACG CTCTACCGAA ACTGTCTTCC GACGCTCGGT GGATACCTGT GCGCGGACGG CCGTCCCAAT CTCTCCATCG TCGAAAAGTT TGTGCGGCTC GTGAGTGAGC ACGAGGACGC CATCTTTCAG AAGCGTGCCA AGAAGGAGGC GCGAATGCGG AGTTCAAGAC AACGCGATAA ACAAAAGGCG AAGGATTACT ACGAGCGCCA GCGCAAAGGG AGCGGTACCA ACGTGCCGCA ACACCGCGTG CTCGGTGGAT CGAGAAACGC GAGTGATCGC GCGCCCGCAG CGCCGACAGA ACAGCTCGTC GCGCTCGGTC GCGGAAAACC GACGCCACCT CCGAGCGCGC CGAAGACGGC GGCGGAGAAC AAGAGCGCCG CGGATGCGCT GCGCGAACGA TTAAAGACGC GAGGTAAGCG CGCCGCCGAA GCGACGGCTG ACGTCGCGCC GGAAGCCGAT ACCGACGCCA AGAAGGCTAA AGTATCGGAC GCGGATAAGG CTGAAAAGGA CGAAAAAGCA AAGGAGTTTT GGAATCAGCT CGCGGAGAAG GCTGAATCTG AATCAGCCAC GACGGTTGAA ACCATCGACA CCGTCGAAAG CGATGAAGTG CCGTCGCACC CTTTTGTACA ATCGACGGCA CCCGACGCGC AGCCCGGCGA TTGGATGTGT CCCACAGGGT GCGGAAGCAT GTACGCATCT AAAGGTTCGT GCTTCAGGTG CGGATGCCCA CGTCCGAGCG AGGTGCGAGA GTTCAAAGCG 
GGCGAGGTGA TGGACAGTAA ATCCTTCTTG AAGCAACTTG AAGGCATCGT CAAGGCCATG GGTGAGCGCG AGGAAGAAAC GGACAACATT CGGCTCGGAG AAAGTGGCTG GAAAGATCGC TATTACGAAG CAAAGATGCA GGCAACGCCA CAAACGCGAG ATGAGATCAT TCGGGGCATG GTCATTGAGT ACGTGCGCGG CTTAATTTGG GTGTGTCGAT ATTACTTCGA AGGGTGTTGC TCTTGGAGTT GGTTCTACCC ATACCATTAC GCCCCGTTCG CGAGTGATTT ATATGACTTG AGTACGATCT CTACCGATTT TGATCTCGGC AAACCGTTCA AGCCGTTTTC GCAGCTCATG GGCGTGTTGC CGGCGGCGTC TTCGCACGCG TTGCCCGCAG CGTTCGCGCC GTTGATGTCG GACAAAGACT CTCCTATCAT CGACTTTTAT CCCGAAGACT TCGCGCTCGA TATGAACGGG AAGCGTTTTA CGTGGCAGGC CGTGGCACTG CTCCCTTGGA TCGACGCCAA CCGTCTGCTA GAACAAACGG AGATGCTCGA ATACACGCTC ACCGCCGAGG AAAAGCGCCG AAACTCCATC AACGAGGAGG AAATATACGT CAATGCCGCG CATCCGCTCG CAAAACAGTT TCTCGAGCTC GAAGAGCGGG AAGACGAAGA CGTGGAAAAA ACGCTCAAGA TGGATCCGAA GCTTAGTAAA GGCATGAACG CCACGCTCGT GTCGGTGAAA CGCGACGCGC AGCCCACGAT GATACCGTCG CCGATGTCGA GTCGCCAAGA CATCTCGAAC AACAAAGTAG TCGTGGCGTC GATGCGTTTG CCGACCGATA GATTTGTGCC GCCGGTGCTG ATGCCCGGTG CGGTGTTACC GACGCCTATG GTGACGGAGG CTGACTTGCC GCCGCCGCCG CAGTTGTTTC ATCAGACGGA CAACTACGGG CGCAACAACA ACAACAAC
Protein sequence | MGVPSFYRWI AQKYPKIVAD VVEDEPVDAL GRRVELNSAE ANPNGIEFDN LYLDMNGIIH PCFHPEDRPA PTTEEEVFEC IFDYIDRLFL MIRPRKVLYM AIDGVAPRAK MNQQRSRRFR SAQEAREKAE EEEKLREKLI REGVKVQPKQ ESGVFDSNVI TPGTPFMGRL SEALKYYVHD KLNNDPGWRG IEVIFSDASV PGEGEHKAMH YIRQQRGLPG ANPNTRHVVY GLDADLIMLA LATHEPHFWI LREIVFQKKD NEAPQTLGLG TEETKKKVAI ARKPYQLLSV SVLREYLALD MRPVAPTPFK LDPERMFDDF IFMCFFVGND FLPHSPTLEI REGAIDLLMT LYRNCLPTLG GYLCADGRPN LSIVEKFVRL VSEHEDAIFQ KRAKKEARMR SSRQRDKQKA KDYYERQRKG SGTNVPQHRV LGGSRNASDR APAAPTEQLV ALGRGKPTPP PSAPKTAAEN KSAADALRER LKTRGKRAAE ATADAMGERE EETDNIRLGE SGWKDRYYEA KMQATPQTRD EIIRGMVIEY VRGLIWVCRY YFEGCCSWSW FYPYHYAPFA SDLYDLSTIS TDFDLGKPFK PFSQLMGVLP AASSHALPAA FAPLMSDKDS PIIDFYPEDF ALDMNGKRFT WQAVALLPWI DANRLLEQTE MLEYTLTAEE KRRNSINEEE IYVNAAHPLA KQFLELEERE DEDVEKTLKM DPKLSKGMNA TLVSVKRDAQ PTMIPSPMSS RQDISNNKVV VASMRLPTDR FVPPVLMPGA VLPTPMVTEA DLPPPPQLFH QTDNYGRNNN NN
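The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of joining the blocks and computing a GC fraction, for comparison with the GC content field, might look like the following. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first three blocks of the gene sequence are used here for illustration.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three blocks of the gene sequence, copied from the record above.
sample = join_blocks("ATGGGCGTCC CGAGCTTCTA CCGCTGGATC")

print(len(sample), round(gc_fraction(sample), 3))
```

Applying the same two helpers to the full gene sequence field gives a value that can be checked against the reported GC content. Note that the field wraps across lines in this record, so all of its lines need to be passed in together.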