Gene OSTLU_37953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_37953 
Symbol 
ID5004117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp25191 
End bp26684 
Gene Length1494 bp 
Protein Length497 aa 
Translation table 
GC content56% 
IMG OID640419538 
Productpredicted protein 
Protein accessionXP_001420057 
Protein GI145351377 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0215] Cysteinyl-tRNA synthetase 
TIGRFAM ID[TIGR00435] cysteinyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.150647 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGAA AGAAGGAGAT TTTTACGCCG CGAGACCCGG CGGGGAAAAA GGTGCAAATG 
TACGTGTGCG GCGTCACGGT GTACGACTAT TCACACATCG GTCACGCGCG CGTGTACGTC
GCGTTCGACG TATTATATCG ACAATTGATG CGTTTAGGGT ACGACGTGAC GTATTGCCGA
AATTTCACCG ACATTGACGA CAAGATTATC AAGCGCTCAA ATGAGAGCGG GGAGACGTGC
GAGGCGCTCA CGGATAAATT CATAGAGGCA TTCCACGAAG ACATGGCGGC GCTCGGATGC
CTGCGTCCGA CGCTCGAGCC TCGTGCGACG GAGTGTGTGG ATGACATCAT CGCGTTCATC
GAGCGTTTAA TCGCCAAAGG TAACGCGTAC GAGACGGAAG GGGACGTCTA CTTTTCCGTC
GACACCTTGC CCGCATACGG GGCGTTGTCA GGGAGAAATC AAGAAGACAA TCGCGCGGGT
GAGCGCGTGG CCGTGGACGG GCGCAAGAAA AATCCAGCCG ACTTTGCGCT ATGGAAGACT
GCAAAACCGG GTGAGCCAAC GTGGACGAGT CCGTGGGGCG AGGGACGACC GGGCTGGCAC
ATTGAGTGCA GCGCAATGAT TGAAAAAATG CTAGGACCGA CGATTGATAT CCACGGTGGA
GGCCAAGACT TAGTTTTTCC GCATCACGAA AACGAGCTGG CGCAGTCTTC GGCGGCGTGC
GGTTGTGGAG CGCACGCGGA TGAGAATCCG TTTGTGCGTT ACTGGGTGCA TAACGGCTTC
GTCAAGGTGG ATTCTGAGAA GATGTCCAAG TCGCTCGGCA ACTTTTTCAC TATTCGCGAA
GTGTTGGACA AGTACCATCC GTTCGTGCTA CGTTTCATGC TTCTTGGCGC GCACTACAGA
GCGCCCATCA ACTACACACA GCGCGCGCTG GAGGAGGCTT CCGATCGCGT TTACTATTTG
TACCAAACAG TTCACGATGT ACGAGCAATT CTTCGCGATG CCGCGGCGGA AGAGCCAGCT
AAAAAGCCGG TACCGCTCGT TGCGGATGCG CTGAAGCTCG CGAGTGAGGC TGAGAAGCAA
GTGTCCGAGG CTTTGAATGA CGACATGAAC ACGCCCGGAG TGATCGCGAC GCTCTCCGCG
CCGCTCAAAT CAATGAATGA TTTCATGACC ACCAAGGCTG GAAAGAAAGC AGTCGGTCGT
GTTGGGGCGC TTCAGTCGTT GTTGAGCACT GTCGAGGGTT TAATGGAGGC GGTTGGCATG
CCCAAGGATG AAGAAAACGT CATTCTTGCG GAGCTTCGCG CGCGCGCGTT GCACCGCGCG
GGCTTGACTG AGGACGATCT CTTAGCTAAA ATAGAAGAAC GCAATAAGGC GCGCGATGCG
AAAGACTTTG CGGAGTCCGA CCGCTTACGC GACGAACTTT CCGCGCGCGG CGTTGGTCTC
ATGGACGGTT CTGCGGTGCC GTGGCGTCCA GTTCCAGTCA TCGACGCGAC GTAG
 
Protein sequence
MTRKKEIFTP RDPAGKKVQM YVCGVTVYDY SHIGHARVYV AFDVLYRQLM RLGYDVTYCR 
NFTDIDDKII KRSNESGETC EALTDKFIEA FHEDMAALGC LRPTLEPRAT ECVDDIIAFI
ERLIAKGNAY ETEGDVYFSV DTLPAYGALS GRNQEDNRAG ERVAVDGRKK NPADFALWKT
AKPGEPTWTS PWGEGRPGWH IECSAMIEKM LGPTIDIHGG GQDLVFPHHE NELAQSSAAC
GCGAHADENP FVRYWVHNGF VKVDSEKMSK SLGNFFTIRE VLDKYHPFVL RFMLLGAHYR
APINYTQRAL EEASDRVYYL YQTVHDVRAI LRDAAAEEPA KKPVPLVADA LKLASEAEKQ
VSEALNDDMN TPGVIATLSA PLKSMNDFMT TKAGKKAVGR VGALQSLLST VEGLMEAVGM
PKDEENVILA ELRARALHRA GLTEDDLLAK IEERNKARDA KDFAESDRLR DELSARGVGL
MDGSAVPWRP VPVIDAT