Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_37894 |
Symbol | |
ID | 5003976 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 476054 |
End bp | 480632 |
Gene Length | 4579 bp |
Protein Length | 502 aa |
Translation table | |
GC content | 64% |
IMG OID | 640419397 |
Product | predicted protein |
Protein accession | XP_001420191 |
Protein GI | 145351670 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.525832 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAACGC CGTTGGAGGT CGTGGAGGCG ATCGAGCGGT GCGTGAGCGT GCCGGGGGGG GAGGTGCTGG ACGCCGCGAG CGAGACGCTG AGGGCGATTC GCGTCGAGCA ACGGCGGATT CGAGAGGAGT TACGAACGAT GCTGAACGCG ACGAGCAAGG AGATGGCGAG GAAGAATTTT GCGGAGCGCG CGCAAATCGT CACCAGACTT GGCAGACAGT GCATACCGAT GAAACTCGGG AGCGCGGGCG AGCTCCCGGG AGTCGTGCTC GACGTCTCCG GCACCGGGAA CACGGTGTTC AAAGAGCCGC AAATCGCGGT GCCATTGAAT AATGCTCTGG CCACGCTTTC CGCGAGCGAA GATGCGGAAA TAGAACGAAT TCTAGTCGAG CTCACCTCAA TCGTGCAAAC GCACGCGGAT GTTCTGCTCG ACGCGAACGA GGCGTTGACA GAGTTGGACG TGGCGAACGC GCGCGCCCGG CACGCGGAGT GGTTCGACGG CGCGGAGCCG ACGATCGTGG ACGCCAATCA AGGCATGTGC GTGCGTGAGC TGCAGCATCC GTTGTTGTTA GAGCGGCATC TGACGCCGTT GCCCAAAAAA GCCGCCATTG GAGAAGAGGA ACAGGTATCG GCGTTCGGCG AGAACGACGC CTCCGAAGAC GATTCGGCGC ATTCGCAGCG ACACGTCAAT CGTCGAGATG TGCGCGATGT GGTCGTCCCG ATTGATTTCA ACGTCGATTC GTCTATCAAA TGCGTCACTA TCACCGGTCC GAATACGGGC GGTAAAACCG CGTCGTTGAA AGCGATCGGC GTCGCGTGTT TGATGGCGCG CGCTGGTTTA TATCTGCCAT GCGAATCCGG TTGCGAGATT CCATTCTTTC GTCACGTCAT CGCCGATTTA GGAGATTCGC AAACCCTCGA GCTCGACGGC GGCTTGTCCA CCTTTGGCGC GCACCTCAAA GGTCTGCAAC GCATTTTGGA CGCCGCGACC GACGATACAT TGGTCTTGCT GGACGAGCCC GGGAGCGGCA CCGACCCGGC GGAGGGCGCG TCTCTCGCCG TCGCCGTCCT GAACAAACTC TCACGCACGT CTCGTTTGAC GATCGCGACG TCGCACTACG AAGAAGTCAA GGAAGCCACA CTCGCATCCG ACACAGCTCA AGTCGCGGCG GTGGAGTTCG ATCTTCAGTC GCTGCAGCCG ACGTATCGGT TGCTGTGGGG CGAAACCGGC AAGAGCAACG CGTTGCACAT CGCAGCGGGG CTCGGATTAG AACCGTGGAT ACTCGCCGAG GCGCGCATCG CGCTAGCCAA GGCGGATGCC AACGCAGAGG TAGACGCCAG TGGCGCGATC GCGCGGGAAA ATCGCGCGAA GTTGGCCAGC GCACTTGATG AAGAATGCGA CGTGCAGCTC GCCCGCCGAG CCGCGGCGGC GGCGACGCTC GAAGAGACGC GCGCGCTGTT CGACGAAGTC AGGAGTAAAT CTGCGCACTT GGATCTGAGG AAACAAATCA TCAGGGATGA CGCCAACAAT GAGATCGAGC GAAAGATTGA AGAAGCTCGC GAGTTATTGG CGGCGTGCGA CACGCGAGAG GACATCGACG ATGTCGTCGG CGCGTCGCTT CCCGCCGGAT GGGTCGTCGA CGCGAGCGGC GAAGCCGTCC CGGGCGACAG CCTTGACAGC GCGTCCTCGC GATGGATTCC CAAAATTGGC ACGCTCGTCG TCGTCCGTCA GCTAGGGAGC GCAGAGGCAG AAGTCATCGA AGTCCACCCA GATGCGAACG AAATCACCGT CAAGCTCGGT CGGATCAGTA CTCGGGTTTC GCTCGCCAGC GGTGTGAGTA AGGTTGACGT GAGCAAAACG AGTTGGCGAC GATAGCTGTA CTGTAGTAGA AGTATATTTT ACATTCATTG CAAACACACA ACGCTGCGAC AGTGAAGCGG TCGCGGTTAG TCGGATAGTC GGCGCGAACG GAAAGCCAAC AGCGCGCCAC GGATTCCTCA GTCGAGTTGT CATGAATCAT CTTCATGAGG ATGGTCAGCA AAGATTAAGC TATTTGAAAC ATCATCGGAA ACCACGAGGG CGAGAACAGG TTCGATCATC GCGTGTACGC ACCTAGCACT AAAACGTCGG CGCGATAGTG GACTCTTCCG TGAACACGCC CTGCTCTACT GCGACGTCCA ATTCTCGAAA CGCCAACACG CGTCTGGAAG ACTCCTTCAA CGTGCGCGCG ACCCAGTCGT CCTTGTCTTC CCTCAGATCG AACATAATTT TTTCATGAAG CTTTAAGAGC AAATACACGT CTAGCGCGGC ATACCGTCGA ACGTCATCTG TTAACGGACG ATCCGCCCAA AGCGTCGATT CCTCGACGGC GTACAACTTC TTCACGCGAA CCTTCAAGTC ATCAGCCACC GCGGTCTCCG CGCTCGTCAG ATGCGTTCCC GCGCACTTCA CGATCCCCGC GACGCGATCG ATCATCTTCC CGAGAGCCCG ACGCGCGGCG ACGTCGAGCA CCTGCACGTC CATCACGTTT TCCAATCGCA CGTCGAATTG ATGAAACAAC GCGTCCGAAT CCATGCGACA ATCAAACATG AGCTTCATCG GCGCCTCACT CGACTCCAGG ACATCCCGCA ACCCTCCCGC GTTTCTGTCT CCGAACGCTC GTCCCCCGAG CGACTGCACG TCGATCAAGT AAATCGCGTC GCGCGTCGCG CACTGCACCA CCGTCACCGG ACCCGTGCGC GACATTCGCA CTCCCTCGCA ATCAACCGCG ACGACGCTCG ACGCCTTGAT CGTCTCCACG CATGCCGGCA TTTTGATTTC ATAATTCGTC GCGTCGATGA TTTCGCACCG TTGATCCAGC GTGTTCACGA TCTCGAGGCA CATCGGATCC GACTTCGCGT TCGTCGGCGC CGCCGGCTTT TTCCGTCGCT TCTTCTTCTT CTTCGCCGCG CCGCCGGACC CACCGGCTTC CGCGCGCTTA CCGCCGCCCT CGAGCGCGCC GCGTTCGAGG CCAGACGAAT CATCGCCGTT CGCGCGCTTG AATCCGAGCA CGACGCCGCC GTACGCGTCG TCGTCGTCGT CGAGCGTTTC GCCCTTCGCC GAGGCGGCGG CTCGGCACGG CTTACAAAAC CGCGTCGACT GCGCGGTCTG ACGCTTGAAC GTCGACGCGC ACTTGGCGCA CACGAGCCTG GGTTTGTTGC CCATCCGCGC GACGCCGCGC GCGCGCGCGA GCGCGAGAAC GCGCGCGGGC GGGCGCGCGC GCGAGAGCGC GGCGCGCGAC CGGGACGCGA CCGCGACCGC GACCGACGAC GCGCGACCGC GCGCGGACGC AGAGGACGAC GCGCGCGCTG AAACGCGCGC GGGGATGAGC ATACACCGAA ACGACGCGCG ATCGGCGCGC GAGCGCCGGT GCGCGCGACG GTGTGACGCC GCGACTCGCG CCGACGCGCG TTCGGCGTCA ACCGACGCCG GCGCCGAGCG GCGCGCGACA TCGCGTCGAC GTCAAGTCGC AACCGGGATA CGCGCGGCGC GCGTCGGAGA CACCCAAATC GCCGTAGCAC GCGCATCGAC TGCTCGCGCG CCCTTCGGCG TCCGTCGTGG TTTCGTTGCG CGGGTATCGC GGTCCCGAGT CGTCCTCGCG GCAGCGCACG TCGACGAAGT CGCCGTCCGA CGTGGAATCG CAATCCGGGA ACATGCGCGC GCGCTGTTCG CGCGCCGCCG CACCCGCGTC GTACCGCTCC AACCGCGCTC GAATCTCGAG CGTCAACGCC GTCGCCGCCC CCGTCTCGGC GTCGTAAAAC TCGCCGTTCG CCAAGACGCC CGCGGCGCGA TACTTGTCGC GCATGAAATC CCTCCACCCT CGGATCGCCG CCAACTCCCC GTCCGTCAGC CCCGAGACGT CGCTCACGCA CCCACTCTCG CTGAAGTCTC CCGTCGCGAA CGCGCGGCTC GCGTCGCGTC CCGCGAAGCA CGTCGCGTAG TCGCCGTCGC CCGCGTAAAA CCGTCGCCCC GCCGACACGT CGAAGCACTC GCCCAGCACC GCCAGCCACA CGCGCGGCCG TCGTCGCAGC GCCGCGCGCG CGCGCGTCCA CGTCGACGCG TCGCTTCGCC GCGCGCGATC GACCCTGCGC GCGAGCGCGT CGGGCGCGAC CATCGCGACG CGCGCGTTCG CGTTCGCGTT CGCGTTCGCG TCGCGCCGAC GCGCGACGAC CGCGACGACC GCGACGACGA CGGCGAGCGC GACGCGCGCG AGTCGCGCGC GCGCGTCGTC TCGCGCCATC GCGCGCGCCG TCGCGGCCGC CGGCGCCGTC TGGCGTCGCG CGTCTGGCGC CGCGCGTCCG ACCGACCGCG TCCGACCGAC CGCGTGTATG CGTAAACAGT GAGACGTACG ATACCACGAC ATTCGGCATC GAATTCAAAC GACCGCCCCG TTCGCCGCGG CGCGCGACAC CGCGACGCGC GCGCGACGTC CGACGCGGTC GTCATGCTCC GACTGAAATC GCGAAGCGTT CTGTGCACGC CGTCCAGCGG GCTGTCCAGA GCGCTGCGCA CGCTGAACGT CGCGACGCCG AACGCGCGC
|
Protein sequence | MRTPLEVVEA IERCVSVPGG EVLDAASETL RAIRVEQRRI REELRTMLNA TSKEMARKNF AERAQIVTRL GRQCIPMKLG SAGELPGVVL DVSGTGNTVF KEPQIAVPLN NALATLSASE DAEIERILVE LTSIVQTHAD VLLDANEALT ELDVANARAR HAEWFDGAEP TIVDANQGMC VRELQHPLFR HWRRGTDVRD VVVPIDFNVD SSIKCVTITG PNTGGKTASL KAIGVACLMA RAGLYLPCES GCEIPFFRHV IADLGDSQTL ELDGGLSTFG AHLKGLQRIL DAATDDTLVL LDEPGSGTDP AEGASLAVAV LNKLSRTSRL TIATSHYEEV KEATLASDTA QVAAVEFDLQ SLQPTYRLLW GETGKSNALH IAAGLGLEPW ILAEARIALA KADANAEVDA SGAIARENRA KLASALDEEC DVQLARRAAA AATLEETRAL FDEVRSKSAH LDLRKQIIRD DANNEIERKI EEARDGLSRA LRTLNVATPN AR
|
| |