Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_35638 |
Symbol | |
ID | 5002876 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | + |
Start bp | 542907 |
End bp | 544595 |
Gene Length | 1689 bp |
Protein Length | 551 aa |
Translation table | |
GC content | 54% |
IMG OID | 640418297 |
Product | predicted protein |
Protein accession | XP_001418738 |
Protein GI | 145348608 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1111] ERCC4-like helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0251063 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.525211 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCCGG AAGCGATTCA AACGTACGTC TATCCGGCGC AGATTTCGCG GCGTGATTAT CAGTTTGACA TGGCGCGTAA CGCCCTGCTG ACGAACTCAC TGGTATGCCT GCCCACGGGT TTGGGCAAGA CGCTCATCGC GGCGGTGGTG ATGTACAACT ATTACCGATG GTTTCCCACG GGGAAGATTA TTTTTATGGC GCCCACGCGA CCTTTAGTGG ATCAACAGAT GTCAGCGTGT CACACCGTGG TGGGGATTCC CGCCTCTGAC ACCATCGTAT TGATGGGAAG CACGAAGAAG GACGACAGCG GATCGAGACG AGACTTTTGG CGAGAGAAAC GTCTCTTCTT TTGCACCCCG CACACGGTAG CGAACGATTT GGAAAGCGGC GATTTGGACG CCAAACAAAT CGTGTGCGTC GTCGTCGATG AAGCGCACCA CGCGCGAGGA CAGTACGCTT CCGCGGAAGT GCTTCGTTTA CTTCACGAAC GAAAAGTGAG ATTTCGTTTA TTGGCGCTCA CGGCGACGCC GGGGCAAGGA TTGGACGAAG TTCAAAAAGT CGTCGAGACG CTGCGCATCG GACGTATCGA TTTCAGAAGC GACCAAGACC CCGATGTTTC GCGTTACACG CACAAGCGCG AGATGACTGT GGAAAAGGTC AAGCCCGATC AGGCGATGTC TCACGTGCAG GACATGCTGT GCGAGCTACT GCGCCCGTGC TGCGCGCAAC TCGTCAACAT GGGCGCGCTC GGCGAAGCGG GCTTTCGAAT GTTGACGTTT ATTAAAAATA AAGCCACAAA CGGGGCATCG CGCGTCGAAC CACCGGCGTG GTTCACGCTG CAATCCGCAC AGCAGGCGAT ATATAGAAAT CAGCACCAAG TGCGCGCCAG AGGGCAAGCG TTAGGCTTGC TCGAGACCGC GATGGAACTC AGTAAGGCGT ACGAGCTTTT GCTCAAGTAC GGCGCCAAGA GCGCGTACGA TTACATCGAT AAGCGCGGTC GAGACAAGAG CAACACGCTC GTTCATCGCA GCGACCCAGT ATCAGTAGAG TTGGTGGATT TGATTCGTTC GATGTCGTCA AACGGCGCAC ATCACTCGCC AAAACTCGAC AGATTGACGT CGATTCTAAA GCAACACTTC AGAGACGCTA CGGCGGATAC GCGAGTGATC ATTTTCACAT CGTACCGTGA AAGCGTCAAG GACATCGTTC AAGCCCTTCG CGAGGTCCCC GCTGGCGAGG ACACCGCGTG CAAGATTAAA GTTGCAGAGT TCGTCGGTCA GGGCGATACG GGCGCGACGG GTAAGAAACG GGCGCCTGGT GCTACCTCGC GCGGTACTAA GGGACAGACG CAAAAAGAAC AAAAGCAGAC GCTGGACGAT TTCAGAGCCG GAACGCTGAA CACACTCGTG GCGACATCCA TCGGAGAAGA GGGATTGGAC ATTCCGAGCG TAGATTTGAT TTTTTTCTTT GACGTCGTTG ATACCATTCG TGCCATTCAA CGAATGGGTC GAACTGGTCG AGCGCGCGAT GGCAAAGTCG TCATTCTCGC GACGGAGGGT AAAGAATACG CCAAGTTTAC GAGCGAGCAG AAAAAGTACG AAACTTTGAT GACGTGCCTA CGAATGCCAG AAACGCACTT TCGGCTCGAC AAAAAGTGTC CTCGCATCGT TCCAGATGGC GTGACACCG
|
Protein sequence | MDPEAIQTYV YPAQISRRDY QFDMARNALL TNSLVCLPTG LGKTLIAAVV MYNYYRWFPT GKIIFMAPTR PLVDQQMSAC HTVVGIPASD TIVLMGSTKK DDSGSRRDFW REKRLFFCTP HTVANDLESG DLDAKQIVCV VVDEAHHARG QYASAEVLRL LHERKVRFRL LALTATPGQG LDEVQKVVET LRIGRIDFRS DQDPDVSRYT HKREMTVEKV KPDQAMSHVQ DMLCELLRPC CAQLVNMGAL GEAGFRMLTF IKNKATNGAS RVEPPAWFTL QSAQQAIYRN QHQTAMELSK AYELLLKYGA KSAYDYIDKR GRDKSNTLVH RSDPVSVELV DLIRSMSSNG AHHSPKLDRL TSILKQHFRD ATADTRVIIF TSYRESVKDI VQALREVPAG EDTACKIKVA EFVGQGDTGA TGKKRAPGAT SRGTKGQTQK EQKQTLDDFR AGTLNTLVAT SIGEEGLDIP SVDLIFFFDV VDTIRAIQRM GRTGRARDGK VVILATEGKE YAKFTSEQKK YETLMTCLRM PETHFRLDKK CPRIVPDGVT P
|
| |