Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_34085 |
Symbol | |
ID | 5000650 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 842715 |
End bp | 845831 |
Gene Length | 3117 bp |
Protein Length | 1007 aa |
Translation table | |
GC content | 59% |
IMG OID | 640416071 |
Product | predicted protein |
Protein accession | XP_001416776 |
Protein GI | 145344514 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.297969 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.623648 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGCGA GACTCGATCG AGACGATCCG CTGGGCGTGG ATCTGACGCT GCGAGGGGCG TCGACGCGGC ATGGGGCGGG GAAACGGATG ACGCTGTATG ATTACGCGAG GACGGTGAAG GCGGCGCATC CGCGGAAAAT ATCGTTGATT CGAGTGGGAG ATTTTTACGA GTGCTTGGGA TACGACGCGG TGATGCTGGT GATGCACGCG GGGTTGAATC CGATGGGGAT TTCGGGGGTG CCCAAGGCGG GGTGCCCGGT GGTGAAGATT CAGGAGACGC TCGATCGGTT GACGTCGAGA GGGTACTCGT GCGTCGTTTG CGAAGAGGTG CCGCAGATGA ATAGATACGG CCAACCGACG CCGCCTAAGG ATCGGTACAT CGCCGCGATC GTGACGCCGG CGTCGCCGAA TTACGTCAAA GGCGCGGCAT CGCAGGGCGA AGACGTCGAT TTCGGGGATG GGAGCGCGCC ACCGGTGATT GGGCTCGCGA GCAGCGCGCT CGGGTACACC GTAGTCACCG TCGAACCGGA TCTGCGTCGA GTATCCGTCC TCGAGGGATT GACGGCGGAA GCCGCCGCGG CGAAGCTGTC GGCGGGCGGC ATCGCGCCGC CGCTTTACAG GCACGCAAGT CTGGGAAGTG GTCACGCACA GAGCGGTCGG CACGCGGCCG GCACGAGCGC GCCGAGTCGG CGTTTACGCT GGGAGGTGCA ATCGATTTTG AGCGCCGCCG AAGGCGTCGA CGCGGTGAAA TACAACGGCG ACGTCGTGGA GAAGCTCCTA GATCTCGTGC GTTTGGATCA CGGACTCGGG CCCGAAGATG CTTTCCATCG CGTGACTGTG CCTAACAAAG GCCGCCCAGC GCCGCTCTCA CTCGCGACCG CGCAACAGCT CGGAATCTTG CCGTCGCGAT CCGTGCCTCC GCTCTTGACG CATCTGTTGC CCGAGAGGTC GGCACCGGCG GCGTGTCGCT CGTACTTACA AGAGCTATTG CTTCATCCAC CGCCACCTCA AACTGCGCTC GCGATTCAAG AGGCTTCGGT GCTTTTCACG AAGTCGACGT CGGCCATGCC GCAACTCGAA GTCCTGCCGC CGAGCAAAGT GGCGAAACTT TTGGCCCAAC GCGAGGCGAG CCATACATTT TTCGCCGATC TCGCGTCCAT GGCGCGCGGT GTCGAGGCAC TGCTAACGAA CAGCAGCGCG GACATTCGTC GTGCCGGACA TTTGTTGATC GATCCCACGT CTTTGAAGCT CGGCGCCAAG CTGAACGGCG ACGCGCTCGC CAAGGTGTGC TCGGAGGCGA GCGCGATGAT CGAAAACGTG GTGAGCGAAG ACGTTCTGAA TGGAATCGCA GTTGTTCGTC AGAAGAGCGA TGAAGACGAA AGCGACGACG AGGATGAGCG CGATGACGCA TCTTTCGTCA TGGACGGTTT AGATCAACCA CTCAAGCTCC TCAACATACC GAACAGATTT ATGTTTGAAA ACGAGCGTTG GCGAGGCAGA GTTCGTCGCG AGCACATTCC CGAAGCGATC GATCGGGTTG AATCCGCCGC ACGCGCGCTT GAAACGGCAG TGAACGAAGA TTTCATGCCA ATCATCGAAA AAGCCGCGGC CGAAAAGAAG ACAAAAGCGA GGAAATGTGA GTTGGAACAC GACATGCGAA ACAACGCGCT TTGGATGCGT CACGCTCCAA AAGATGCGAT GAAAATGGAC GATTTTATTC ATCCGCGCGA TAGATTCGGA AAAGAGGTCG CAGATAGATG GACGACGGCA CGAGTCGAGC TTGCTTTGGA CGATTACAGA GTTGCGACGC AAAAAGCCGC GGTCGCAGTC TCTGACACTC TCATCAACCT CGCCGACGAT TTGCAGGAGC ACATATCGTC GCTCGTGGGC GCGGCGACTC TTTCAACGGT GACGATTGCG ATTCTCGCCC ACGCAAGCAA CGCCATCAAC AAACGTTGGA CGCCGCCAAC GCTTCTTCCC GAAGGCGACA CGGCGAGTCC GCTTGCGGTC GAGGGCTTGG TACCGTTTTG GATGCGACTT GATGGCGCGG AGACGGTGCC GAATAGCTTT GACATCGATG GTTTGGTTTT ACTCACCGGT CCAAATATGG CGGGAAAGAG TACGGTTTTA CGCTCAGTGG CCGCGCTAGC GCTTCTTGCG CAGTGTGGAC TACACGCACC CGCGATTTCC GCCCAAGTAC CTCGCTTAGA CTCGCTCATA GTTCGAATGG CGAGCACGGA TTCCCCAGTC GAAGGTTTGA GCTCGTTTGC GGTTGAGATG CTCGAAATTA AATCCATGCT TAGCTCTTGC ACCGCTGGTA GTCTGATCAT GGTTGACGAA CTTGGTCGAG GCACGGAGGC TTCGCACGGC ACTGCAATAG GCGGCGCAGT TGTCGAGGCG CTCGATGAGT GCGGCGCGCG TGGAATTTTC GCCACGCACC TGCACGGCAT TCTGGATTTG CCGTTGCGCG TCTCGCCGTG GACGCGTCGA GCGCGCATGG AGACGGCAAA GTCGGATGAT GGAAGCACGC GCCCGACGTG GAGAATGGTC CCAGGCGAAT GCCGTGAGTC ACTCGCGCTC CAAACCGCCC TTGATTGCGG TATTTCACAC GCCATCGTCG CTCGAGCCAA TGCATTGCTC GAGGAACAAA CGAGCATCCC GCTCGTAAAG TTGAGCGACT CAGAGCAGGC GACGTTGATC GAGAAGCAAG ACACCAGCCC GGAAAGACCG CGTGTCGATG GTGAATATTT GAAACTTCTC CTCGCTGAAT CAACTGCGCG AGCGCTTCAA CTTGAAAATG CGCAAGTGAT TCACGTGGGT CCGAATCAAA CGCCGCCGAT TGGCTCCGCC GGACAAACGT GCGTGTACAT TCTTCGCCGC GGAGACGGCT GGTGCTATTG CGGCGAGAGC GACCATCTTC CTACGCGACT CGCGACGCAT CGTCAAAGTT CCCAGCGTCT CATCGAGCTA GTGTACGTCG CGGTGCCGAA AGAAGCGGGA GGAAAGAGCG CCGCGCGCGC CCTCGAGAGC CGCGTCATCC AAGCGTTACA GCGAGCGCGC GTGCCGTTGT GGTCGGATCA AGACGCCGCA CACAAACATT TTGGCGCCGC AGGGTGA
|
Protein sequence | MIARLDRDDP LGVDLTLRGA STRHGAGKRM TLYDYARTVK AAHPRKISLI RVGDFYECLG YDAVMLVMHA GLNPMGISGV PKAGCPVVKI QETLDRLTSR GYSCVVCEEV PQMNRYGQPT PPKDRYIAAI VTPASPNYVK GAASQGEDVD FGDGSAPPVI GLASSALGYT VVTVEPDLRR VSVLEGLTAE AAAAKLSAGG IAPPLYRHAS LGSGHAQSGR HAAGTSAPSR RLRWEVQSIL SAAEGVDAVK YNGDVVEKLL DLVRLDHGLG PEDAFHRVTV PNKGRPAPLS LATAQQLGIL PSRSVPPLLT HLLPERSAPA ACRSYLQELL LHPPPPQTAL AIQEASVLFT KSTSAMPQLE VLPPSKVAKL LAQREASHTF FADLASMARG VEALLTNSSA DIRRAGHLLI DPTSLKLGAK LNGDALAKVC SEASAMIENV VSEDVLNGIA VLLNIPNRFM FENERWRGRV RREHIPEAID RVESAARALE TAVNEDFMPI IEKAAAEKKT KARKCELEHD MRNNALWMRH APKDAMKMDD FIHPRDRFGK EVADRWTTAR VELALDDYRV ATQKAAVAVS DTLINLADDL QEHISSLVGA ATLSTVTIAI LAHASNAINK RWTPPTLLPE GDTASPLAVE GLVPFWMRLD GAETVPNSFD IDGLVLLTGP NMAGKSTVLR SVAALALLAQ CGLHAPAISA QVPRLDSLIV RMASTDSPVE GLSSFAVEML EIKSMLSSCT AGSLIMVDEL GRGTEASHGT AIGGAVVEAL DECGARGIFA THLHGILDLP LRVSPWTRRA RMETAKSDDG STRPTWRMVP GECRESLALQ TALDCGISHA IVARANALLE EQTSIPLVKL SDSEQATLIE KQDTSPERPR VDGEYLKLLL AESTARALQL ENAQVIHVGP NQTPPIGSAG QTCVYILRRG DGWCYCGESD HLPTRLATHR QSSQRLIELV YVAVPKEAGG KSAARALESR VIQALQRARV PLWSDQDAAH KHFGAAG
|
| |