Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31440 |
Symbol | |
ID | 5001581 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | - |
Start bp | 876828 |
End bp | 878252 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | |
GC content | 60% |
IMG OID | 640417002 |
Product | predicted protein |
Protein accession | XP_001417629 |
Protein GI | 145346298 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0708] Exonuclease III |
TIGRFAM ID | [TIGR00633] exodeoxyribonuclease III (xth) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGCGC TGTTCGACGC GCTCGGAAGC GATTTGAAGG TGATTTGTTT GCAGGAAACA AAGCTGAGCT CGAGCGGCGA CGTCGAGCGG CTGGAACGCG TCGAAGGTTG GGACGGCGCC CACGCGGTGT GCGAATCCTC CAATCACCGC GTGGGATACA GCGGCGTGGC GGTGTACTGG CGCTCGAACG AGATTTGCCC GACGTCGATC GAGCGGGGAG TGTGCGCGAA AGGCGACGCG GGCGAGTCGA CCATGTGGGC GGGAGAAATC GCGCCGTTCG CGGACGACGC GACGCGCGCG AAAGAGATCG ACGGCGAGGG CAGGGCGTTG TGGGTCGATT TCGGTGAGTT CGTTCTGTGC ACGGTGTACG TGCCGGCCGT TTTCGGCGAT CCAGCGGTCG ATGAGAAAAC AGCCGAGCGC GCGGCGTTCA AGCGCGATTT TTTGAGCGCG CTCGAGGCGA GATACAAGAG TCTGCGCGAA CGCGGTCGAA ATGTGATTTT ATGCGGTGAT TGGAACATCG CACCGTCGTG GAAACTCGAT CGCGCGGATG AAGACCCGAA CGCGGTGGAA CCTCGAAATC CATCGCGCGA TTGGCTCGCG GCACAACTTG CCGGGGACGC GATGGTGGAC GTGTTTCGCG AATTTTTTCC GACGCTCGGC GATGCGTTCA CGTGTTGGAA CGTTGCGAGT GGAGCGCAGT TGTCCAATTA TGGATCACGA ATCGATTATT TCTTGTGCGA TCGAGCGGTG ACGTTGAAGC GCGTCCGAGG CGTCGGTGTG GCACAAAAAT TTGAAGGGAG CGATCACGCG CCAGTCTATC TTGAGCTCGA GGAATCGATG TGGAGGCGGA GAGATTCGCA ACAAACGCCT CCGTCTTTGG CGATTTCGAT GCTTTACCCA GGTCGACAAA CCACGGTAGA TTCAATATTC GCACGCGCAT CGTCGACGAG CAACGCGACG CCGGAATTTC TTAACGCGGC GTCTCAGTCG CGAGCGAAGC CGACGCGCCC AAGCGCCCGC GCCCAATCCC GCGCGGGCGT CTCAGACGCG CCCAAGCGCA AGCCCGAAGC GACGTTGAAA GATTTCTTCG TCGTCAAGTC CAAAAAGAAA GAGCCGGATG ACCGAAACGA ACGCCAACTA GACACAGTAG AACAACCTAT AGCACCCACG GCGAACGCAT TCGAATCGCG CGAGACGAAA GTGAGCTCCG AAGAAGCGCG AGGCGCGTGG ATGAACACGT TCGCCAAAAT GGCGCCTCCC AAGTGCAAGC ACGGCGAAAC GTGCAAAGTA CGCACGGTGA AGAAGAAGGA GAGCCCACAC TGTGGACGCG TGTTCTTTTG CTGCCCGCGC CCGGCCGGCG CGCGCACCAA TCCCGATTGT GACTGCGGTT TCTTCCTCTG GCGAGAGCAT CGCGCGCCGA AGTAG
|
Protein sequence | MKALFDALGS DLKVICLQET KLSSSGDVER LERVEGWDGA HAVCESSNHR VGYSGVAVYW RSNEICPTSI ERGVCAKGDA GESTMWAGEI APFADDATRA KEIDGEGRAL WVDFGEFVLC TVYVPAVFGD PAVDEKTAER AAFKRDFLSA LEARYKSLRE RGRNVILCGD WNIAPSWKLD RADEDPNAVE PRNPSRDWLA AQLAGDAMVD VFREFFPTLG DAFTCWNVAS GAQLSNYGSR IDYFLCDRAV TLKRVRGVGV AQKFEGSDHA PVYLELEESM WRRRDSQQTP PSLAISMLYP GRQTTVDSIF ARASSTSNAT PEFLNAASQS RAKPTRPSAR AQSRAGVSDA PKRKPEATLK DFFVVKSKKK EPDDRNERQL DTVEQPIAPT ANAFESRETK VSSEEARGAW MNTFAKMAPP KCKHGETCKV RTVKKKESPH CGRVFFCCPR PAGARTNPDC DCGFFLWREH RAPK
|
| |