Gene OSTLU_52010 details

Gene Information

Locus tag: OSTLU_52010
Symbol:
ID: 5006599
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009374
Strand:
Start bp: 276424
End bp: 279394
Gene Length: 2971 bp
Protein Length: 848 aa
Translation table:
GC content: 57%
IMG OID: 640422020
Product: predicted protein
Protein accession: XP_001422701
Protein GI: 145356981
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0480] Translation elongation factors (GTPases)
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR00490] translation elongation factor aEF-2
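
The length fields in this table can be cross-checked with a little arithmetic. The sketch below assumes the start and end coordinates are both inclusive, which matches the reported gene length, and that the coding region ends with a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 276424
END_BP = 279394
PROTEIN_LENGTH_AA = 848


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_length_aa + (1 if include_stop else 0))


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    cds = coding_length(PROTEIN_LENGTH_AA)
    print(f"gene length: {length} bp")               # 2971 bp
    print(f"coding length incl. stop: {cds} bp")     # 2547 bp
    print(f"non-coding remainder: {length - cds} bp")  # 424 bp
```

The 424 bp remainder is consistent with the gene being marked as spliced: the genomic span is longer than the coding sequence alone requires.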


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0299674
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00863878
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
CGCTCGCGCA CGCCGACGAC GAAGAGACGC CGCCCGATCG CTCGCCATGG TGAAGGTGCG 
CGAATGTTTC GACGCGATGC GCGCGTCGCG GGCGCCGCGC GCGCGTCGCG GATGCGCGCG
CGCGCGCGAA CGCGCGCGAG GATGATTACG CGATCGAGCG ACGCGACGGA CGGGCGACTG
ACGAACGTTT TCGCGCGATA GTTTACCATC GATGAGCTGC GCAAGCAGAT GGATCACAAC
AAGAATATCC GTAATATGTC TGTGATCGCG CACGTCGATC ACGTACGTCG GCGAGGCGAG
GCGACGCGCG AATCGATGCG ACGGATTGAT GATTGATTGG GGTTACGACA CCGCGGGTGA
CGGGACGAGG CTGAACCCTA ATGCGAGGCG ACCGCACGCG CACGCGAACG CGGACGCGTG
AGACTGACGA AAATACGCGC GTTCGGACGG TGAAATGAAT AGGGTAAATC GACGCTCACG
GATTCTCTCG TCGCCGCGGC GGGGATCATC GCGCAGGAAA ACGCCGGGGA CGCGCGCTTG
ACGGATACGC GTCAAGACGA GCAAGATCGG TGCATTACGA TCAAGTCCAC GGGTATTTCG
TTGTTCTACA CCGTGTCTGA CGAGGATTTG GCGCGCTTGC CGAAAGATGT GCCGCGTGAT
GGTAACAACT ACCTGATCAA CTTGATTGAT TCTCCGGGTC ACGTCGATTT CTCGTCCGAG
GTGACGGCGG CGTTGCGCAT CACCGACGGC GCCTTGGTCG TCGTTGATTG CGTCGAAGGT
GTTTGCGTAC AAACCGAAAC CGTGCTTCGT CAAGCGCTCG GTGAACGCAT TAAGCCTGTG
ATGACGGTGA ACAAGCTCGA TCGCTGCTTC CTCGAACTCA TGCTCGACGG CGAAGAGGCG
TACCAAAACT TCTGCCGCGT CATCGAAAAT GCGAACGTCA TCATGGCTAC CTACACGGAT
GAGGCGCTCG GTGACGTGCA AGTCGCGCCG GAGAAGGGTA CCGTGTGCTT CTCCGCTGGT
CTTCACAACT GGGCGTTCAC CTTGACCGTT TTCGCGAAGA TGTACGCCGC CAAGTTCGGC
ATTGACCAAG ACGCCATGAT GGGCAAGCTT TGGGGCGACA ACTTCTTCGA TCCGAAGGAG
CGTAAGTGGA CCAAGAAGAA CACTGGCTCG AAGACGTGCA TGCGTGCTTT CGTCCAGTTC
TGCTACGAGC CGATCCGTCG CGTCATCGAT GCCGCGATGA ACGACAACAA GGACAAGCTC
TGGCCGATGC TCGAAAAGCT TCAAGTGAAG GATAGGCTCA AGCCCGCTGA CTTGGACTTG
ATGGGCAAGC CGTTGATGAA GCGTATCATG CAAACCTGGT TGCCGGCCGA TGTTGCTCTT
CTTGAGATGA TCATCTACCA CTTGCCTTCC CCGGCTACGG CGCAAAAGTA CCGTGCGGAT
ACCCTCTACG AGGGTCCGCT CGATGACGCC TACGCCAACG CGATTCGCGA GTGCGATGCC
AACGGTCCGC TCATGTTGTA CGTGTCCAAG ATGATCCCGA CCGCCGATAA GGGTCGTTTC
TTGGCCTTCG GTCGTGTGTT CTCTGGTACC GTGCAAACTG GCCAAAAGGT GCGCATCATG
GGTCCGAACT ACGTTCCGGG TGAGAAGAAG GATTTGTACA TCAAGTCCAT CCAGCGCACC
GTGTTGTGCA TGGGCCGTCG TCAAGACGCT ATCGACAACG TGCCGTGCGG TAACACCGTC
GCCATGGTTG GTCTCGATCA GTTCATCCAA AAGAATGCGA CCATCACTGG TGAGAAGGAT
GTCGATGCGC ACACCATCAA GGCGATGAAG TTCTCCGTCT CTCCGGTTGT CCGCGTCGCT
GTTGAGTGCA AGAACTCGCA AGATCTCCCC AAGCTCGTCG AAGGTCTCAA GCGCCTTTCC
AAGTCCGATC CGATGGTTCA GTGCCAAATC GAAGAGACTG GTGAGCACAT CGTCGCCGGT
GCTGGTGAGC TTCACCTCGA AATCTGCTTG AAGGATCTCC AAGAAGATTT CATGGGTGGT
GCCGAAATTC GCATCTCCGA TCCGGTCGTG TCCTTCCGCG AAACCGTCAA CGGTACCTCG
GACCACATTT GCATGTCCAA GTCCCCGAAC AAGCACAACC GTTTGTACTT CCAAGCCGTC
GCCATGGACG AAGGTCTTGC CGAAGCCATT GACAACGGTG AAGTCACCCC GCGCGATGAC
CCGAAGACTC GCGGTCGTTT CTTGGCTGAC AAGTACGGCT GGGACAAGGA TCTCGGCGCC
AAGAAGATTT GGTGCTTCGG TCCGGACACC ACCGGCCCGA ACCTCATCGT CGATATGTGT
AAGGGTGTCC AGTACCTCAA CGAAATCAAG GATTCGTGCG TTGCGGCGTT CCAGTGGGCC
ACCAAGGAAG GTGTCCTCGC CGAAGAAAAC ATGCGCGGCA TCAAGTTCGA GATCCACGAT
GTCGTTCTCC ACACCGATGC CATTCACCGT GGTGGTGGTC AAATCATCCC GACGTGCCGC
CGTGTCTTGT ACGCGTCTGC ACTCACCGCG GAACCGCGTC TCCTCGAGCC GGTGTACTTG
GTTGAAATCC AAGCTCCGGA GCAAGCGCTC GGCGGTATCT ACTCCACCGT TACGCAAAAG
CGTGGTATGG TTATCGAAGA GACCCAGCGC CCGGGTACCC CGATTTACAA CATCAAGGCG
TACTTGCCGG TCATGGAATC TTTCGGTTTC ACGGGTACTC TCCGTGCAGC GACTTCCGGC
CAAGCGTTCC CGCAGTGTGT GTTTGATCAC TGGGATATGC TCAACAGCGA TCCGCTCAAC
CCGGATTCGC AGTCTGGTAA GCTCGTCAAG GACATCCGTA AGCGTAAGGG TAGCAAGGAG
AACGTCCCGC CGCTCAACGA ATACGAAGAC AAGCTCTAAT TGTAGGGTCT CAGGAGCAGC
TTTTGAAACG TACATCTCTA TAATACACAA A
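
The GC content field could be recomputed from a sequence listed in this layout. The sketch below is a minimal illustration: it assumes the spaces and line breaks are only display grouping (blocks of 10 bases), and the helper names `clean_sequence` and `gc_content` are hypothetical, not taken from any library.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used to group the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty one)."""
    if not sequence:
        return 0.0
    gc = sum(1 for base in sequence if base in "GC")
    return gc / len(sequence)


if __name__ == "__main__":
    # First two blocks of the gene sequence, as a short demonstration;
    # paste the full listing into `raw` to compare against the reported value.
    raw = "CGCTCGCGCA CGCCGACGAC"
    seq = clean_sequence(raw)
    print(f"{len(seq)} bp, GC = {100 * gc_content(seq):.0f}%")
```

Running the same functions on the full listing gives a figure to compare with the GC content reported in the Gene Information table.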
 
Protein sequence
MVKFTIDELR KQMDHNKNIR NMSVIAHVDH GKSTLTDSLV AAAGIIAQEN AGDARLTDTR 
QDEQDRCITI KSTGISLFYT VSDEDLARLP KDVPRDGNNY LINLIDSPGH VDFSSEVTAA
LRITDGALVV VDCVEGVCVQ TETVLRQALG ERIKPVMTVN KLDRCFLELM LDGEEAYQNF
CRVIENANVI MATYTDEALG DVQVAPEKGT VCFSAGLHNW AFTLTVFAKM YAAKFGIDQD
AMMGKLWGDN FFDPKERKWT KKNTGSKTCM RAFVQFCYEP IRRVIDAAMN DNKDKLWPML
EKLQVKDRLK PADLDLMGKP LMKRIMQTWL PADVALLEMI IYHLPSPATA QKYRADTLYE
GPLDDAYANA IRECDANGPL MLYVSKMIPT ADKGRFLAFG RVFSGTVQTG QKVRIMGPNY
VPGEKKDLYI KSIQRTVLCM GRRQDAIDNV PCGNTVAMVG LDQFIQKNAT ITGEKDVDAH
TIKAMKFSVS PVVRVAVECK NSQDLPKLVE GLKRLSKSDP MVQCQIEETG EHIVAGAGEL
HLEICLKDLQ EDFMGGAEIR ISDPVVSFRE TVNGTSDHIC MSKSPNKHNR LYFQAVAMDE
GLAEAIDNGE VTPRDDPKTR GRFLADKYGW DKDLGAKKIW CFGPDTTGPN LIVDMCKGVQ
YLNEIKDSCV AAFQWATKEG VLAEENMRGI KFEIHDVVLH TDAIHRGGGQ IIPTCRRVLY
ASALTAEPRL LEPVYLVEIQ APEQALGGIY STVTQKRGMV IEETQRPGTP IYNIKAYLPV
MESFGFTGTL RAATSGQAFP QCVFDHWDML NSDPLNPDSQ SGKLVKDIRK RKGSKENVPP
LNEYEDKL