Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_40782 |
Symbol | |
ID | 5002498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | - |
Start bp | 425163 |
End bp | 428504 |
Gene Length | 3342 bp |
Protein Length | 1113 aa |
Translation table | |
GC content | 60% |
IMG OID | 640417919 |
Product | predicted protein |
Protein accession | XP_001418476 |
Protein GI | 145348063 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.119207 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.227387 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGATCG AGTCGAGCGA TGACGATGGT GGGAGCGATT TCGACGCGGA GGCGGAGGAA GAGGACGACG ACGACGAAGC GTTTGAGGCG GATTCGAGCG AGGCGTCGAG CGAAGACGAG GAAGAGGAGG ACGACGACGA TTTCTCAGAG AGCGAGTCAA CGCCGGTGAA GAAGAAGAAG ACGACCAAAG TGGCGGTGAA GAAGGTGGTG CCGACGTCGA CGACGAATGG ACGAAAGAAG ACGACGGTGA ATACGACGTC GGCGCCGACG AAGGCGACAG AGGCAAAGGA TTTGAACGCG CAGGACGTTC TCGCAGGCGT CGGTCCGGAG CAATACGACG CGCGAGATCG GTTGAAATTT CCATTTATGC AACCGGAGAA GATTAAGGAT GCCGATGGTC GGCGTCCTGA CGATCCGGAC TACGATCCGT CCACGTTGCT GTTGCCGTCG ACGTTTCCAA AAATGCGCGA CGCCAGCGGA GTGCAGTGGA CGGTTTCCCC CGGGCAGGCG CAGTGGTGGA AGTTCAAGGC GGCGAATTTC GATTCTGTAT TGTTGTTCAA GATGGGCAAG TTTTATGAAA TGTTCGAAAT GGACGCGCAC ATCGGTGTGC GAGACTTAGG ATTGATGTAT ATGCGCGGCG AGCAACCGCA CGCTGGATTC CCAGAGAAGA ATTACGCGAT GCACGCCGAG CAGCTCGCTC GCAACGGTCA CCGCGTCGTG TGTATTGAAC AGACAGAGAC ACCAGCGCAG CTTGCCGAGC GAAAAAAGAA AGACAAGACG TGCAAGGACA CTGTGGTTCG CCGCGAAATG GTTCAAGTGT TGACAAAAGG TACGATGGTT GACACCGGTA TGTTGAACTC TTCGCCGGAT GCTGCCTTCG TTTGCTCTAT AATAGACGGC TGCGAAGAGG AGGACGGCGA AGGTTGGGTC GGTTTGTGCG CTGCAGACTG CGGTACGGGT CGTTTCCTCG TCGGTGCGTG GCGCGACGAC GAAGGTGCAA GCTGTTTGCG CACAGCACTT GCGGAGCTGC GCCCGGTTGA AATCTTAGTT CCGCCCACTG GCTTGAGCGC GCGAGCGAAG ATGGCAGTTT TGGATATGTG CTCACACGCT CAGCAGCGTA CGTTTAAGTC GACGAGTGCG AACGAAGCGC TGGAGGACGC CGAGGCGGAG GGATACTTCA AGACGCTTAA GACTGGTTTG CCAGAAGCCA TTAAAGAAAT GCGAGACACT GCATGCCATC CTGCTCGCGA ATGCGGTATC GGAGCGTGGG GCACGGTCGT TGCGTACTTA CGCGCCGCGC TGATAGACGC CGATTTAGTG CCACAAGGGC GAGTAGAGTC CCTGCACACA ACAGACGCCG GCGCGCGGGA GCACTTGGCG CGATGGGCAC ACTCTACGCA TGTCGCCATG GATGCCGCTG CACTTTCTGG GCTTGAGGTG CTCGAGAATA CCGCAGGTGG CTCCGCGGGT ACGCTTTTGG CGTCGCTCGA TCGGTGTGTG AGCGGGCCCG GTCGTCGTTT GTTGCGCCGT TGGGTGTGTC GCCCTTTGAC AAGCGCGTCA GCTATTCGAG CTCGACAAGT CGCCGTCTCT ATGATGCGGG GATGCGGGAT CGAGGCCACA GGGATTGCGC GCAAGCTTTT GCGCGCCGCG CCAGACGCGG AACGCGCCAT CTCGCGCGTT GTCGGATCGA GCGGCGAGAA AGGTCGATCC GCCTCGCACG TCGTCTTGTA TGAAGATGCG GCGCGAGCAA AGTTGAACGA TTTCCTCGCA GCGCTCGAGG GTATTCGTGC GGTTCGCGAC GCTACAAAAG CGATCGCGGC GTGCGTTGAT GCATGTGAGA AATCTGACGT GCTTCGCGCA TTGTGCATCG TAAACGACGC CGCGGCGACG CGCGAGGACG TGTTCACCGC AGTCGGCGGC GTCGCGATGC CGGATCTGAG CGCGCTGGAC GAAATGGAGT CTGCGTTCGA TTGGAACGCC GCCAAATCGA GTGGGAGAAT CGAGCCGGCG CAAGGCGTCG ACGCCGATCT GGACGCTGCC GAGGAGCAAC TTACAGCAGC GGACGCGGAT CTGGCGTCGT GGTTGGAAGA GGCGCGCGGC GAGCTCGGTG GTCACAAAAC GGAGGTTTGT TTCGTGAATG CAAACAAAGA TACGCATCTT GTCGAAGTTC CTGACCGCCT CGCGTCCAAG GTCCCTCATC ATTGGGTGCG TGAAGGTAAG CGCAAAGGAT ACGAACGGTT CACGTGCGAT GATTTGGTAC CCTTGCGCGC GAAGCGAGTT GCGGCGGAGG AAGCGCGCGA GGACGCGCTC GCCGGCGTGT TCCGTCGAAT CGTCGCAAAG TTTTGCGAGA ACGCTTCGGA GTGGCAAGCG GCGGCAAGTG TTGGTGCAAT CATAGATGTT CTAGCCTCTT TGGCTGTCGT GAGCGAGGAA ATGTACGCTT CTTGTGGAGC AGTTTGCACC CCGAAGGTGC ACCCGCAACC GCGAGACGGC GAGCCCGCGA CGCTTGAGTC CGTCGGCCTG TCGCACCCGT GCGCGTCTTC TCTCGCGCGC GCGTTCGTTC CGAACGACGC CAGGCTTGGT GGTAAACATC CGGGTTTCTG TCTCATCACT GGTCCGAACA TGGGTGGCAA GTCCACGTAC CTCAGGCAAG TCTGCTTGGC CGCGATCATG GCGCACGTGG GCGCCGACGT TCCCGCGGCC AAGTTTGAGA TGACAGCTAT GGATGCGGTT TTTGTGCGCA TGGGCGCCAA AGACAATTTA GCGGGTGGGC AGTCGACATT CATGGTCGAG CTCAGCGAAA CTGGAGCGAT GCTTCGTCGA GCGACGACGA ATTCCCTAGT CGCGCTCGAC GAACTCGGCA GAGGAACCGC CACCGCGGAC GGTACAGCCA TCGCGTGTGC GGTTGCGTCG CATCTCATCG ACAAGCGCTG CCGTACGCTC TTCAGCACGC ATTACCATAG ACTCGCCGAC GACCACGCGC GCGACCCTCA CGTGGCGTTG GCACACATGG CGTGTCGAGT CGAGACGCCG AGCGGAGCAG GTGTCGAGAC GCTCGGACGC GAAACGGTCA CCTTTCTGTA CACCCTCGCG AGCGGCAACT GTCCGCGAAG CTACGGTGTC AACGTCGCGC GTCTCGCTGG TTTACCCGAG AGCGTTTGCC TCGCCGCCGC GCGTCGCGCG GCACACCTCG AGGCTGGCCA ACTGGAGAAG CTCGTCGGCG ACGAGCGCAG TCGCGAGCGC GTCGTCGACG CGTGCCGCGA AGTCCTCCGC GACGTCTCCG CCGCAGACTT GACCGAACCA AACGTTCGTC GCGCGCAAGA TAAAGCCCGA ACCGCCCTGT GA
|
Protein sequence | MVIESSDDDG GSDFDAEAEE EDDDDEAFEA DSSEASSEDE EEEDDDDFSE SESTPVKKKK TTKVAVKKVV PTSTTNGRKK TTVNTTSAPT KATEAKDLNA QDVLAGVGPE QYDARDRLKF PFMQPEKIKD ADGRRPDDPD YDPSTLLLPS TFPKMRDASG VQWTVSPGQA QWWKFKAANF DSVLLFKMGK FYEMFEMDAH IGVRDLGLMY MRGEQPHAGF PEKNYAMHAE QLARNGHRVV CIEQTETPAQ LAERKKKDKT CKDTVVRREM VQVLTKGTMV DTGMLNSSPD AAFVCSIIDG CEEEDGEGWV GLCAADCGTG RFLVGAWRDD EGASCLRTAL AELRPVEILV PPTGLSARAK MAVLDMCSHA QQRTFKSTSA NEALEDAEAE GYFKTLKTGL PEAIKEMRDT ACHPARECGI GAWGTVVAYL RAALIDADLV PQGRVESLHT TDAGAREHLA RWAHSTHVAM DAAALSGLEV LENTAGGSAG TLLASLDRCV SGPGRRLLRR WVCRPLTSAS AIRARQVAVS MMRGCGIEAT GIARKLLRAA PDAERAISRV VGSSGEKGRS ASHVVLYEDA ARAKLNDFLA ALEGIRAVRD ATKAIAACVD ACEKSDVLRA LCIVNDAAAT REDVFTAVGG VAMPDLSALD EMESAFDWNA AKSSGRIEPA QGVDADLDAA EEQLTAADAD LASWLEEARG ELGGHKTEVC FVNANKDTHL VEVPDRLASK VPHHWVREGK RKGYERFTCD DLVPLRAKRV AAEEAREDAL AGVFRRIVAK FCENASEWQA AASVGAIIDV LASLAVVSEE MYASCGAVCT PKVHPQPRDG EPATLESVGL SHPCASSLAR AFVPNDARLG GKHPGFCLIT GPNMGGKSTY LRQVCLAAIM AHVGADVPAA KFEMTAMDAV FVRMGAKDNL AGGQSTFMVE LSETGAMLRR ATTNSLVALD ELGRGTATAD GTAIACAVAS HLIDKRCRTL FSTHYHRLAD DHARDPHVAL AHMACRVETP SGAGVETLGR ETVTFLYTLA SGNCPRSYGV NVARLAGLPE SVCLAAARRA AHLEAGQLEK LVGDERSRER VVDACREVLR DVSAADLTEP NVRRAQDKAR TAL
|
| |