Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_43144 |
Symbol | |
ID | 5005428 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | - |
Start bp | 629151 |
End bp | 630843 |
Gene Length | 1693 bp |
Protein Length | 542 aa |
Translation table | |
GC content | 56% |
IMG OID | 640420849 |
Product | predicted protein |
Protein accession | XP_001421519 |
Protein GI | 145354496 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0420] DNA repair exonuclease |
TIGRFAM ID | [TIGR00583] DNA repair protein (mre11) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.275751 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.938333 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACCGG CCGCGGGGGT GAAACCGCCC GATGCGAACA CGCTGCGCGT GCTCATCGCC ACGGACACGC ACCTGGGCGC GCACGAGCGC GATCCGATTC GAAAAGATGA CGCGTTTTTA GCGTTTGAAG AAATCTTCGA TCACGCGAGA AAACAACTCT GTGATTGCGT GTTTCTCGCG GGAGACGTGT TCGACGTGAA TAAACCGAGC CGAGAGACGC TGGTGCGGTG CATGGACGCG CTGCGGGAGG CGACGCGAGG GAATAAAGGG ATCGAGATTG AAGTTTTGAG CGATGGGAAG GAAAACTTTC CGAGTCGAGG TGCGCGCGAA GCGAGGCGAA GACGAGGTCG AAACGATGAC TGACGGCGCG GATGCGGGAA AATCAGGCAT GGCGAATTAT GAGGATCCGA ATTGTAACGT GTCGTTGCCG GTGTTTAGCA TACACGGGAA TCACGACGAT CCCGCGGGGG AGGCGAATTT GAGCGCGATG GACGTGCTCG CGAGCGCGGG ATTGGTGAAT TATTTCGGCA AGCACGCGCT CGGTGGGGGG GGAGCCGGTC GCGTGGACTT GAAGCCGGTA TTGTTACGTA AAGGACAGAC TAAGGTGGCG CTGTACGGGT TGGGATACAT TCGCGATAAT CGTTTACATC AAATGTTTAG CGTCAAGGGA TGCGTGCGAT GGCATCGACC GGCGGAGACG GAGGATTGCT CGTCGAGTTC GTGGTTTAAC GTGATGTTGA TTCATCAAAA TCGAGCCGCG CATTCGAAGA ATGCGATTTC CGATCGTTAC TTGCCGAGTT GGTTGGATTA CGTCGTTTGG GGGCACGAGC ACGAGTGTTT AGTGGAGCCA ACCGAGAGCG CGCAGGGATT TCACGTGTCG CAACCGGGAT CGAGCGTGGT GACGTCTTTG ATTGAAGGCG AGGCGAAGGA GAAGAAGATT TGCGTGCTCG AAGTGCGAAG TGATCCGGAG AATCCGAATA GCGCGCCATT CTGGCGCACG ACGCCCATCA CCTTGCGAAC GACGCGACCG TTCGAGTTTG AGCAAATGAC GTTGGCGAAC ACGCCCGAGC TCGAAGGCGC GGATGCCCAA GGCGTGGCGA CGTATCTGGA GAACCGCGTG AACGCCATGA TAGTCCGGGC GGGGCGCAAG CATAGAGAAC GACACGCGAA AAATGGGAGA GACGATGTCG ACATGCTCGA CCGCTTGAAT TTGCCTTTGA TTCGTCTGCG CGTGGATTAC TCGGGCGGCT TTAGCACCAT CAATCCGCAG CGCTTCGGTC AAAAGTTTGT CGGCAAGGTG GCGAATCCGC ACGATGTTTT GCTCTTTCAT AAATCTCAAA AGAAGCAACG TCGCGATGGC GTGGACGTGG ATGAAGACAT GATCGATGAG GAGGCGGCGG CGTTGGAGGA GGAAGACGCC CTCGCCGATG GCATGCTCGA GAATCAACGA CGAATCGATC GACTCGTGCG CGAACACTTG TCGACGAGCG ACGGTTTACA ACTTCTCACA CCTAACGATC TCTCCGCCGC GCTCGACGAT TTCGTCAATC GCGACGAAAA GGCGGCGATT TCCAAGCTTT GTCAAACGCG CTTAAAGGCG GTGCAAACAT CGGTGAATGC GGATGATCAA GAAAACACCG ACGACGTCGA TCGATTGACT TCGAAGATTT ACGAAGCCGT GAAGGTGCAG TTA
|
Protein sequence | MRPAAGVKPP DANTLRVLIA TDTHLGAHER DPIRKDDAFL AFEEIFDHAR KQLCDCVFLA GDVFDVNKPS RETLVRCMDA LREATRGNKG IEIEVLSDGK ENFPSRGMAN YEDPNCNVSL PVFSIHGNHD DPAGEANLSA MDVLASAGLV NYFGKHALGG GGAGRVDLKP VLLRKGQTKV ALYGLGYIRD NRLHQMFSVK GCVRWHRPAE TEDCSSSSWF NVMLIHQNRA AHSKNAISDR YLPSWLDYVV WGHEHECLVE PTESAQGFHV SQPGSSVVTS LIEGEAKEKK ICVLEVRSDP ENPNSAPFWR TTPITLRTTR PFEFEQMTLA NTPELEGADA QGVATYLENR VNAMIVRAGR KHRERHAKNG RDDVDMLDRL NLPLIRLRVD YSGGFSTINP QRFGQKFVGK VANPHDVLLF HKSQKKQRRD GVDVDEDMID EEAAALEEED ALADGMLENQ RRIDRLVREH LSTSDGLQLL TPNDLSAALD DFVNRDEKAA ISKLCQTRLK AVQTSVNADD QENTDDVDRL TSKIYEAVKV QL
|
| |