Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_52010 |
Symbol | |
ID | 5006599 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | - |
Start bp | 276424 |
End bp | 279394 |
Gene Length | 2971 bp |
Protein Length | 848 aa |
Translation table | |
GC content | 57% |
IMG OID | 640422020 |
Product | predicted protein |
Protein accession | XP_001422701 |
Protein GI | 145356981 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0480] Translation elongation factors (GTPases) |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR00490] translation elongation factor aEF-2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0299674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00863878 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | CGCTCGCGCA CGCCGACGAC GAAGAGACGC CGCCCGATCG CTCGCCATGG TGAAGGTGCG CGAATGTTTC GACGCGATGC GCGCGTCGCG GGCGCCGCGC GCGCGTCGCG GATGCGCGCG CGCGCGCGAA CGCGCGCGAG GATGATTACG CGATCGAGCG ACGCGACGGA CGGGCGACTG ACGAACGTTT TCGCGCGATA GTTTACCATC GATGAGCTGC GCAAGCAGAT GGATCACAAC AAGAATATCC GTAATATGTC TGTGATCGCG CACGTCGATC ACGTACGTCG GCGAGGCGAG GCGACGCGCG AATCGATGCG ACGGATTGAT GATTGATTGG GGTTACGACA CCGCGGGTGA CGGGACGAGG CTGAACCCTA ATGCGAGGCG ACCGCACGCG CACGCGAACG CGGACGCGTG AGACTGACGA AAATACGCGC GTTCGGACGG TGAAATGAAT AGGGTAAATC GACGCTCACG GATTCTCTCG TCGCCGCGGC GGGGATCATC GCGCAGGAAA ACGCCGGGGA CGCGCGCTTG ACGGATACGC GTCAAGACGA GCAAGATCGG TGCATTACGA TCAAGTCCAC GGGTATTTCG TTGTTCTACA CCGTGTCTGA CGAGGATTTG GCGCGCTTGC CGAAAGATGT GCCGCGTGAT GGTAACAACT ACCTGATCAA CTTGATTGAT TCTCCGGGTC ACGTCGATTT CTCGTCCGAG GTGACGGCGG CGTTGCGCAT CACCGACGGC GCCTTGGTCG TCGTTGATTG CGTCGAAGGT GTTTGCGTAC AAACCGAAAC CGTGCTTCGT CAAGCGCTCG GTGAACGCAT TAAGCCTGTG ATGACGGTGA ACAAGCTCGA TCGCTGCTTC CTCGAACTCA TGCTCGACGG CGAAGAGGCG TACCAAAACT TCTGCCGCGT CATCGAAAAT GCGAACGTCA TCATGGCTAC CTACACGGAT GAGGCGCTCG GTGACGTGCA AGTCGCGCCG GAGAAGGGTA CCGTGTGCTT CTCCGCTGGT CTTCACAACT GGGCGTTCAC CTTGACCGTT TTCGCGAAGA TGTACGCCGC CAAGTTCGGC ATTGACCAAG ACGCCATGAT GGGCAAGCTT TGGGGCGACA ACTTCTTCGA TCCGAAGGAG CGTAAGTGGA CCAAGAAGAA CACTGGCTCG AAGACGTGCA TGCGTGCTTT CGTCCAGTTC TGCTACGAGC CGATCCGTCG CGTCATCGAT GCCGCGATGA ACGACAACAA GGACAAGCTC TGGCCGATGC TCGAAAAGCT TCAAGTGAAG GATAGGCTCA AGCCCGCTGA CTTGGACTTG ATGGGCAAGC CGTTGATGAA GCGTATCATG CAAACCTGGT TGCCGGCCGA TGTTGCTCTT CTTGAGATGA TCATCTACCA CTTGCCTTCC CCGGCTACGG CGCAAAAGTA CCGTGCGGAT ACCCTCTACG AGGGTCCGCT CGATGACGCC TACGCCAACG CGATTCGCGA GTGCGATGCC AACGGTCCGC TCATGTTGTA CGTGTCCAAG ATGATCCCGA CCGCCGATAA GGGTCGTTTC TTGGCCTTCG GTCGTGTGTT CTCTGGTACC GTGCAAACTG GCCAAAAGGT GCGCATCATG GGTCCGAACT ACGTTCCGGG TGAGAAGAAG GATTTGTACA TCAAGTCCAT CCAGCGCACC GTGTTGTGCA TGGGCCGTCG TCAAGACGCT ATCGACAACG TGCCGTGCGG TAACACCGTC GCCATGGTTG GTCTCGATCA GTTCATCCAA AAGAATGCGA CCATCACTGG TGAGAAGGAT GTCGATGCGC ACACCATCAA GGCGATGAAG TTCTCCGTCT CTCCGGTTGT CCGCGTCGCT GTTGAGTGCA AGAACTCGCA AGATCTCCCC AAGCTCGTCG AAGGTCTCAA GCGCCTTTCC AAGTCCGATC CGATGGTTCA GTGCCAAATC GAAGAGACTG GTGAGCACAT CGTCGCCGGT GCTGGTGAGC TTCACCTCGA AATCTGCTTG AAGGATCTCC AAGAAGATTT CATGGGTGGT GCCGAAATTC GCATCTCCGA TCCGGTCGTG TCCTTCCGCG AAACCGTCAA CGGTACCTCG GACCACATTT GCATGTCCAA GTCCCCGAAC AAGCACAACC GTTTGTACTT CCAAGCCGTC GCCATGGACG AAGGTCTTGC CGAAGCCATT GACAACGGTG AAGTCACCCC GCGCGATGAC CCGAAGACTC GCGGTCGTTT CTTGGCTGAC AAGTACGGCT GGGACAAGGA TCTCGGCGCC AAGAAGATTT GGTGCTTCGG TCCGGACACC ACCGGCCCGA ACCTCATCGT CGATATGTGT AAGGGTGTCC AGTACCTCAA CGAAATCAAG GATTCGTGCG TTGCGGCGTT CCAGTGGGCC ACCAAGGAAG GTGTCCTCGC CGAAGAAAAC ATGCGCGGCA TCAAGTTCGA GATCCACGAT GTCGTTCTCC ACACCGATGC CATTCACCGT GGTGGTGGTC AAATCATCCC GACGTGCCGC CGTGTCTTGT ACGCGTCTGC ACTCACCGCG GAACCGCGTC TCCTCGAGCC GGTGTACTTG GTTGAAATCC AAGCTCCGGA GCAAGCGCTC GGCGGTATCT ACTCCACCGT TACGCAAAAG CGTGGTATGG TTATCGAAGA GACCCAGCGC CCGGGTACCC CGATTTACAA CATCAAGGCG TACTTGCCGG TCATGGAATC TTTCGGTTTC ACGGGTACTC TCCGTGCAGC GACTTCCGGC CAAGCGTTCC CGCAGTGTGT GTTTGATCAC TGGGATATGC TCAACAGCGA TCCGCTCAAC CCGGATTCGC AGTCTGGTAA GCTCGTCAAG GACATCCGTA AGCGTAAGGG TAGCAAGGAG AACGTCCCGC CGCTCAACGA ATACGAAGAC AAGCTCTAAT TGTAGGGTCT CAGGAGCAGC TTTTGAAACG TACATCTCTA TAATACACAA A
|
Protein sequence | MVKFTIDELR KQMDHNKNIR NMSVIAHVDH GKSTLTDSLV AAAGIIAQEN AGDARLTDTR QDEQDRCITI KSTGISLFYT VSDEDLARLP KDVPRDGNNY LINLIDSPGH VDFSSEVTAA LRITDGALVV VDCVEGVCVQ TETVLRQALG ERIKPVMTVN KLDRCFLELM LDGEEAYQNF CRVIENANVI MATYTDEALG DVQVAPEKGT VCFSAGLHNW AFTLTVFAKM YAAKFGIDQD AMMGKLWGDN FFDPKERKWT KKNTGSKTCM RAFVQFCYEP IRRVIDAAMN DNKDKLWPML EKLQVKDRLK PADLDLMGKP LMKRIMQTWL PADVALLEMI IYHLPSPATA QKYRADTLYE GPLDDAYANA IRECDANGPL MLYVSKMIPT ADKGRFLAFG RVFSGTVQTG QKVRIMGPNY VPGEKKDLYI KSIQRTVLCM GRRQDAIDNV PCGNTVAMVG LDQFIQKNAT ITGEKDVDAH TIKAMKFSVS PVVRVAVECK NSQDLPKLVE GLKRLSKSDP MVQCQIEETG EHIVAGAGEL HLEICLKDLQ EDFMGGAEIR ISDPVVSFRE TVNGTSDHIC MSKSPNKHNR LYFQAVAMDE GLAEAIDNGE VTPRDDPKTR GRFLADKYGW DKDLGAKKIW CFGPDTTGPN LIVDMCKGVQ YLNEIKDSCV AAFQWATKEG VLAEENMRGI KFEIHDVVLH TDAIHRGGGQ IIPTCRRVLY ASALTAEPRL LEPVYLVEIQ APEQALGGIY STVTQKRGMV IEETQRPGTP IYNIKAYLPV MESFGFTGTL RAATSGQAFP QCVFDHWDML NSDPLNPDSQ SGKLVKDIRK RKGSKENVPP LNEYEDKL
|
| |