Gene OSTLU_50879 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_50879 
Symbol 
ID5004474 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009365 
Strand
Start bp201526 
End bp203421 
Gene Length1896 bp 
Protein Length607 aa 
Translation table 
GC content62% 
IMG OID640419895 
Productpredicted protein 
Protein accessionXP_001420444 
Protein GI145352203 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1190] Lysyl-tRNA synthetase (class II) 
TIGRFAM ID[TIGR00499] lysyl-tRNA synthetase, eukaryotic and non-spirochete bacterial 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0733722 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCTCG CGCGCGCGTT CGCGCGAAGG ACGGTCTCGA CGCCGCGCGC GTCGCTCGCG 
GCGCGCCCGC GGGCGTCGAC GCGCGCGAGC GAGCGCGAGA GCGCGCGCGC GACGGCGACG
GCGGCGACGA CGCGACGACG AGGCGCGACG ACGAGGACGC GCGCGCGCGC GAGCGACGCG
GGAACGAATA AAAAGCGCGA CGGCGGCGCG GGGAATCGAA AGCGCGGCGA GGGCGGCGGA
CGAGACGCGA GCGGGGTGAC GAGCTCGCCC GAGGAGGTCA AGGCGCTGCG CGCGCAAAAG
CTGGACGCGC TCGGCGCGCT CGGACAGCGG GGATTCGATT ACCGGTTCGA TCGCACGAAA
TATTGCGATG CGCTGCAGAG GGAACACGAA GGATTGGAGA ACGGGGTCGA GATCGAGGGC
TCGAGCGAGG CGGTGTGCGG AAGGGTGATG GCGAAGCGGT CGTTCGGGAA GTTGGCGTTT
TTGTCGCTGG TGGACGAACG GGGAAGCGTG CAGTTGTTTT GCGATAAGAA ACGGTTGGAT
GAGACGAGCC CGGGGGCGTT TGAGATGATC ACGGAACTCG TGGACGTCGG GGACATCATC
GGCGTGCACG GGAGCGTGAA AAGGAGCGAT AAGGGGGAGC TGTCCATCGT GCCGGCCAAG
GTGCAAATGT TGACAAAGGC GCTGCTGCCG TTGCCGGATA AGTGGCACGG ATTGCAGGAC
GTGGAGAAGC GATACCGGCA GCGATACGTG GACTTGATCG TGTCGCCCGA GGTGCGAAAC
ACGTTCAAGG CGCGCTCGAA CATCATCTCC ACCATTCGTC GAATGCTCGA CGACGACGGG
TTTTTGGAGA TGGAGACGCC CGTGCTGCAC ACGCAAGCGG GCGGCGCGGA CGCGAAGCCC
TTCAACACCT TCCACAACGC GCTCGGCATG CAGCTCACGC TTCGTATCGC CACCGAGTTG
CATCTCAAGC GACTCGTCGT CGGCGGTTTC GAACGCGTGT ACGAGCTCGG ACGCGTGTTT
CGCAACGAAG GCTTGAGCAC GCGACACAAC CCGGAGTTTA CGTCCATAGA AGTGTATCAG
GCGTACGCCG ACGTCACGGA CATGTTGGAG CTCACGGAAG AGATGATTTG TCGATGCGCG
ATGAAGGCGT GCGGGACGCT GACGATTCCT TACGGCGACG TGACGATTGA TTTGAGCCAA
CGCCCGTGGC GCCGGGCGCC GATGAACGAT CTAGTCAAGG AAGCCACGGG CGTAGACGTC
ATGGCGTTCG GGGATGACTT GGAAGGAGCC AAGGCAGCGG CAATTCCCGC GCTCAAGGCG
CACTCTAAGA AAGCTGGGGA AGGCATCAAA GGTGTCAAGT TGGCGGCGAG CGTCGGTCAC
GTGCTGAACG AAATGTTTGA AGCCGCGTGC GAAAGCGATT TGATTCAGCC CACGTTCGTT
CTCGACCACC CGCTCGAGAT TTCTCCTCTC GCGAAACCGC ACCGTAGTAA ACCGGGAGTC
ACCGAACGGT TCGAGCTCTT CGTCGTCGGT CGCGAGCTCG CGAACTCGTT CAGCGAGTTG
ACCGATCCCA TCGATCAACG TAAACGCCTC GAGGCGCAGA TGGTGACGCA CGCGAAAACG
AGCGCGGCGC AGCGCGAAGC GGCGGCGGCG AGTGGTAAAG ACGAACTCAA AGCGCTCGAA
GACGAGGCGT ACGACGTCGA AATGGATGAA GATTTCGTCG CCGCGCTCGA GTACGGCATG
CCCCCGACCG CGGGTATGGG TCTCGGCGTC GATCGTCTCG TCATGCTTCT CACGAATTCG
CCGTCGATTC GCGACGTCAT CGCGTTCCCG CTTCTTAAGA AGCAAGACTC GTAAGTCATA
CGCGAGAGTA TATATATTCT TAGCGGCGCT ACGACA
 
Protein sequence
MLLARAFARR TVSTPRASLA ARPRASTRAS ERESARATAT AATTRRRGAT TRTRARASDA 
GTNKKRDGGA GNRKRGEGGG RDASGVTSSP EEVKALRAQK LDALGALGQR GFDYRFDRTK
YCDALQREHE GLENGVEIEG SSEAVCGRVM AKRSFGKLAF LSLVDERGSV QLFCDKKRLD
ETSPGAFEMI TELVDVGDII GVHGSVKRSD KGELSIVPAK VQMLTKALLP LPDKWHGLQD
VEKRYRQRYV DLIVSPEVRN TFKARSNIIS TIRRMLDDDG FLEMETPVLH TQAGGADAKP
FNTFHNALGM QLTLRIATEL HLKRLVVGGF ERVYELGRVF RNEGLSTRHN PEFTSIEVYQ
AYADVTDMLE LTEEMICRCA MKACGTLTIP YGDVTIDLSQ RPWRRAPMND LVKEATGVDV
MAFGDDLEGA KAAAIPALKA HSKKAGEGIK GVKLAASVGH VLNEMFEAAC ESDLIQPTFV
LDHPLEISPL AKPHRSKPGV TERFELFVVG RELANSFSEL TDPIDQRKRL EAQMVTHAKT
SAAQREAAAA NEAYDVEMDE DFVAALEYGM PPTAGMGLGV DRLVMLLTNS PSIRDVIAFP
LLKKQDS