Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_19561 |
Symbol | |
ID | 5002491 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | + |
Start bp | 405746 |
End bp | 407607 |
Gene Length | 1862 bp |
Protein Length | 524 aa |
Translation table | |
GC content | 55% |
IMG OID | 640417912 |
Product | predicted protein |
Protein accession | XP_001418241 |
Protein GI | 145347579 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0442] Prolyl-tRNA synthetase |
TIGRFAM ID | [TIGR00408] prolyl-tRNA synthetase, family I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000000368339 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.864068 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAAGG CAAAAGCCGC GAAAGCACCG AAGGAGAAGA AGGCGGACGA TCCGAACAAG TCGGCGCCCG GGGCTGGGAA AGCTGAGAAG AAGAAGGAGA CGGGTCTCGG TTTATCCACC AAGCGCGACG AGGATTTCGG CGCGTGGTAT TCGCAAGTGG TCGTCGCGGG AGATCTCATC GATTATTACG ATATTTCTGG TTGCTACATC TTGAAGCCGT GGGCGTACGC GCAATGGGAG TACTTGAAAG AATTCTTCGA TCGCGAGATC AAAGAGCTCG AGGTGGAAAA CTGCTACTTC CCCATGTTCG TCTCGGCGAG CCGATTAGAA GCGGAGAAGG ACCACATCGA AGACTTCGCC CCGGAAGTTG CGTGGGTTAC TCGAAGCGGA AACACCGATC TCGAGGTCCC GATCGCGGTT CGCCCGACAT CAGAGACGGT GATGTACCCG CATTACGCGC AATGGATTCG TTCGCACAGA GACTTGCCTT TGCGATTAAA CCAGTGGTGC AACGTCGTGC GCTGGGAATT TAAGCATCCA ACTCCGTTCA TTCGTTCGCG CGAGTTCTTG TGGCAAGAGG GACACACCGC TTACAGTAGC AAAGCGGAAT GCGACGTCGA GGTGCGTCAA ATCTTGGAGC TCTACCGTCG AGTGTATGAA GAATATCTGG CCGTGCCCGT CGTTCCGGGT AAGAAATCTG AAAAGGAAAA GTTTGCCGGT GGAGATTACA CTACTACGGT CGAAGCGTAC GTGCCGGGAT CTGGTAGAGG CGTGCAGGGT GCGACGTCAC ACTGCTTGGG CCAAAACTTC GCGAAAATGT TCAACATCGA GTACGAAGAC GCGAAGGGTG GGCGCTCTTT GGTGTGGCAA AACTCGTGGG GCTTCACCAC GAGAACGCTT GGTGTCATGT ACATGGTGCA CGGCGATGAC GACGGCCTCG TGCTGCCGCC AAAGGTCGCG CCGGTGCAGG CGATCGTCAT TCCAATCCCT AATAGTAAGC TTTCGGACGA GGCCAAGCAG AAGATGGACG GTACGTTTTA TTTCACTCGC GCAATCGCGT TTCCGACAGT TTTTCTCGCA ACGACTTTCT TACCTGCGAA GTGCTCGATT TCGATGAGCA TAATCGCCAT TTAAAGGGTC GGATAAAAAA TGCCAAGACG CCTTTACCAA TATCTGGTCG AAACCTTCGG GTCATGGCTT AGTTATAATG GCTTGGGGTC TAAATTCCTT TGCTTCACGC GGAGGCGCGA TGAGTTCACT CTCGGCGTAC TAACCTCGTA CTTTTCGCTC TATAGAAATC GCGACTGGCA TGTGCAAGTC GCTCAAGGCC GCTGGCGTGC GATCCAAGCT CGATAACCGC GACAACTACA CGCCAGGTTG GAAGTATAAC CACTGGGAAC TGAAGGGGGT GCCGATGCGC GTCGAATTCG GCGCGCGCGA CTTAGAAACT GGCACGTGCG TGATCGCCAG ACGTGACACT CGCGAGAAGG AAACGGTCAA AATCGAAGAT TTGACTAAGC GATGCTCCGA GTTGTGCGAA CAAATCCAAA AGGACATGTT CGAGCGCGCT AGGAAGATTC GCGACGAAAA CATCGTCTCT CTCACGTCTT GGGACGGCTT CATCGAAGCC TTGGACGCCA AGAAGCTCAT CATGACACCG TGGTGCAACA CCAAGGACAG CGAAGAGCTC GTCAAGAAGA AGTCTACCGC CGAGTCCACG GGCGGTGCGG CGAAAACGCT GTGTATCCCG TTCGAGCAAC CCGCGCTCGA AGCCGGCACC AAGTGCTTCA TCACGGGCGA ACCCGCCACA TGCTGGGTTC TCTGGGGCCG ATCCTACTAG ATAATCGCCG CGCGAGTGCG CG
|
Protein sequence | MPKAKAAKAP KEKKADDPNK SAPGAGKAEK KKETGLGLST KRDEDFGAWY SQVVVAGDLI DYYDISGCYI LKPWAYAQWE YLKEFFDREI KELEVENCYF PMFVSASRLE AEKDHIEDFA PEVAWVTRSG NTDLEVPIAV RPTSETVMYP HYAQWIRSHR DLPLRLNQWC NVVRWEFKHP TPFIRSREFL WQEGHTAYSS KAECDVEVRQ ILELYRRVYE EYLAVPVVPG KKSEKEKFAG GDYTTTVEAY VPGSGRGVQG ATSHCLGQNF AKMFNIEYED AKGGRSLVWQ NSWGFTTRTL GVMYMVHGDD DGLVLPPKVA PVQAIVIPIP NSKLSDEAKQ KMDEIATGMC KSLKAAGVRS KLDNRDNYTP GWKYNHWELK GVPMRVEFGA RDLETGTCVI ARRDTREKET VKIEDLTKRC SELCEQIQKD MFERARKIRD ENIVSLTSWD GFIEALDAKK LIMTPWCNTK DSEELVKKKS TAESTGGAAK TLCIPFEQPA LEAGTKCFIT GEPATCWVLW GRSY
|
| |