Gene OSTLU_19561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_19561 
Symbol 
ID5002491 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009360 
Strand
Start bp405746 
End bp407607 
Gene Length1862 bp 
Protein Length524 aa 
Translation table 
GC content55% 
IMG OID640417912 
Productpredicted protein 
Protein accessionXP_001418241 
Protein GI145347579 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0442] Prolyl-tRNA synthetase 
TIGRFAM ID[TIGR00408] prolyl-tRNA synthetase, family I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000368339 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.864068 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAAGG CAAAAGCCGC GAAAGCACCG AAGGAGAAGA AGGCGGACGA TCCGAACAAG 
TCGGCGCCCG GGGCTGGGAA AGCTGAGAAG AAGAAGGAGA CGGGTCTCGG TTTATCCACC
AAGCGCGACG AGGATTTCGG CGCGTGGTAT TCGCAAGTGG TCGTCGCGGG AGATCTCATC
GATTATTACG ATATTTCTGG TTGCTACATC TTGAAGCCGT GGGCGTACGC GCAATGGGAG
TACTTGAAAG AATTCTTCGA TCGCGAGATC AAAGAGCTCG AGGTGGAAAA CTGCTACTTC
CCCATGTTCG TCTCGGCGAG CCGATTAGAA GCGGAGAAGG ACCACATCGA AGACTTCGCC
CCGGAAGTTG CGTGGGTTAC TCGAAGCGGA AACACCGATC TCGAGGTCCC GATCGCGGTT
CGCCCGACAT CAGAGACGGT GATGTACCCG CATTACGCGC AATGGATTCG TTCGCACAGA
GACTTGCCTT TGCGATTAAA CCAGTGGTGC AACGTCGTGC GCTGGGAATT TAAGCATCCA
ACTCCGTTCA TTCGTTCGCG CGAGTTCTTG TGGCAAGAGG GACACACCGC TTACAGTAGC
AAAGCGGAAT GCGACGTCGA GGTGCGTCAA ATCTTGGAGC TCTACCGTCG AGTGTATGAA
GAATATCTGG CCGTGCCCGT CGTTCCGGGT AAGAAATCTG AAAAGGAAAA GTTTGCCGGT
GGAGATTACA CTACTACGGT CGAAGCGTAC GTGCCGGGAT CTGGTAGAGG CGTGCAGGGT
GCGACGTCAC ACTGCTTGGG CCAAAACTTC GCGAAAATGT TCAACATCGA GTACGAAGAC
GCGAAGGGTG GGCGCTCTTT GGTGTGGCAA AACTCGTGGG GCTTCACCAC GAGAACGCTT
GGTGTCATGT ACATGGTGCA CGGCGATGAC GACGGCCTCG TGCTGCCGCC AAAGGTCGCG
CCGGTGCAGG CGATCGTCAT TCCAATCCCT AATAGTAAGC TTTCGGACGA GGCCAAGCAG
AAGATGGACG GTACGTTTTA TTTCACTCGC GCAATCGCGT TTCCGACAGT TTTTCTCGCA
ACGACTTTCT TACCTGCGAA GTGCTCGATT TCGATGAGCA TAATCGCCAT TTAAAGGGTC
GGATAAAAAA TGCCAAGACG CCTTTACCAA TATCTGGTCG AAACCTTCGG GTCATGGCTT
AGTTATAATG GCTTGGGGTC TAAATTCCTT TGCTTCACGC GGAGGCGCGA TGAGTTCACT
CTCGGCGTAC TAACCTCGTA CTTTTCGCTC TATAGAAATC GCGACTGGCA TGTGCAAGTC
GCTCAAGGCC GCTGGCGTGC GATCCAAGCT CGATAACCGC GACAACTACA CGCCAGGTTG
GAAGTATAAC CACTGGGAAC TGAAGGGGGT GCCGATGCGC GTCGAATTCG GCGCGCGCGA
CTTAGAAACT GGCACGTGCG TGATCGCCAG ACGTGACACT CGCGAGAAGG AAACGGTCAA
AATCGAAGAT TTGACTAAGC GATGCTCCGA GTTGTGCGAA CAAATCCAAA AGGACATGTT
CGAGCGCGCT AGGAAGATTC GCGACGAAAA CATCGTCTCT CTCACGTCTT GGGACGGCTT
CATCGAAGCC TTGGACGCCA AGAAGCTCAT CATGACACCG TGGTGCAACA CCAAGGACAG
CGAAGAGCTC GTCAAGAAGA AGTCTACCGC CGAGTCCACG GGCGGTGCGG CGAAAACGCT
GTGTATCCCG TTCGAGCAAC CCGCGCTCGA AGCCGGCACC AAGTGCTTCA TCACGGGCGA
ACCCGCCACA TGCTGGGTTC TCTGGGGCCG ATCCTACTAG ATAATCGCCG CGCGAGTGCG
CG
 
Protein sequence
MPKAKAAKAP KEKKADDPNK SAPGAGKAEK KKETGLGLST KRDEDFGAWY SQVVVAGDLI 
DYYDISGCYI LKPWAYAQWE YLKEFFDREI KELEVENCYF PMFVSASRLE AEKDHIEDFA
PEVAWVTRSG NTDLEVPIAV RPTSETVMYP HYAQWIRSHR DLPLRLNQWC NVVRWEFKHP
TPFIRSREFL WQEGHTAYSS KAECDVEVRQ ILELYRRVYE EYLAVPVVPG KKSEKEKFAG
GDYTTTVEAY VPGSGRGVQG ATSHCLGQNF AKMFNIEYED AKGGRSLVWQ NSWGFTTRTL
GVMYMVHGDD DGLVLPPKVA PVQAIVIPIP NSKLSDEAKQ KMDEIATGMC KSLKAAGVRS
KLDNRDNYTP GWKYNHWELK GVPMRVEFGA RDLETGTCVI ARRDTREKET VKIEDLTKRC
SELCEQIQKD MFERARKIRD ENIVSLTSWD GFIEALDAKK LIMTPWCNTK DSEELVKKKS
TAESTGGAAK TLCIPFEQPA LEAGTKCFIT GEPATCWVLW GRSY