Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31093 |
Symbol | |
ID | 5001593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | - |
Start bp | 288359 |
End bp | 291643 |
Gene Length | 3285 bp |
Protein Length | 1094 aa |
Translation table | |
GC content | 54% |
IMG OID | 640417014 |
Product | predicted protein |
Protein accession | XP_001417466 |
Protein GI | 145345960 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0495] Leucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00395] leucyl-tRNA synthetase, archaeal and cytosolic family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00463574 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGAGA CGGGAAAGAA CACGGCGCGA CGCGACCTGT TGCTGGAGCT GCAGCGTCGC GCGCAGGGGA AGTGGGCGCG GGAGAAAACG TTCGAGGTGG ACGCGCCGAA GGCGAGCGAT GGCGAGGGAG GGAGGGATAA GTTTTTTGGA AACTTTCCGT ATCCGTACAT GAACGGGCTG CTGCATCTCG GACACGCGTT CTCGCTGAGC AAGCTGGAGT TTGCGAGCGC GTATCACAGG CTGAAGGGCG ATCGGACGCT GTTTCCGTTT GCGTTTCACT GCACGGGGAT GCCGATCAAG GCGTGCGCGG ATAAGATTGC GAAGGAGATT GCGGCGTATG GGAATCCACC CGTGTTTCCC GACGCGAGCG TGATGGAGGC GGAGGCGGAG GCAAAGGCGA AGGCGGAGGC GGCGAACGCG GGGCCGGCGG ACCCGACAAA GTTTGTGGCG AAGAAATCGA AGGCGACGGC GAAGAAGGGG ACGCAGGCGA CGCAGTGGGC GATCATGCAA GCGAGTGGGA TCCCAGATGA GGAAATTCCG AGCTTCGCGG AATCTATGCA TTGGTTGAAT TATTTTCCGC CGCTGGCGAA GCGCGACGTC ATCGCCATGG GATGTCAAGT CGACTGGAGA CGTTCGTTCA TCACCACCGA CGCGAATCCG TTCTACGATG CGTTCGTTCG CTGGCAGTTC AATACGCTTA AGAAGATTGG TAAGATTGTG AAGGCGAAGC GTTTCGCCGT GTACTCGCCG ATCGATGGAC AGCCGTGCGC CGACCACGAC AGAGCTTCGG GTGAAGGCGT CGGACCGCAA GAATATTTGC TCATTAAGAT GGCTGTGTAC GATGAATGCC TCACTGGTGA TCTTGCACCT TTGGCCGGTA AGAAGGTATT TTTGGCTGCC GCGACTTTGC GTCCGGAAAC GATGTACGGT CAGACAAACT GCTGGATTTT GCCCGACGGC GACTATGGCG CGTACGAGCT CGCCAACGGT GAAGTCGTAG TCATGTGTGA GCGCGCTGCT TTGAACCTGT CTTACCAAGA ACAGTTCGCG GAAGAAGGTA AACCCAAGTG CCTGCTTACG TTCAAGGGTC AATCCTTGAT CGGTTGCGCG GTCAAATCTC CGCGAGCTGA GCTCGAGAAG ATTTACTGCT TGCCCATGAT GACAATTCTC ATGAACAAGG GCACGGGCGT TGTTACCTCC GTTCCGTCTG ATTCTCCGGA TGATTTCATG GCGCTCAGTG ATTTGAAGGC CAAGCCTGCG TTGCGAGAAA AGTTTGGTGT CAAAGACGAA TGGGTAATGC CTTTTGAAGT CGTCCCTTGC GTGCACATTC CCGAATTCGG CGACGCGTGC GCGCCGATGG TTTGCGCTGA ACTCAAGATT CAGTCACAAA ATGACCGTGT GAAGCTTGAT GAAGCCAAGC ATCGGACGTA CTTGAAGGGT TTCACGGAAG GCGTTATGAT TCTTGGCAAT CATAAGGGTA AGCCTGTGAA GGAAGCCAAG CCATTGATCA GACAAGAGAT GATCGACGAC AACACGGGTA TGGTGTATAG CGAACCTGAG CGCACCGTCA TGTCGCGCTC TGGTGGCGAG TGTGTCGTTG CCCTCACCGA TCAGTGGTAC CTCGAGTATG GCGAAGAGGC TTGGAAGGCA AAGGCCGAGA AGTGCCTCGA GAACATGAAC TGCTACCACG ACGAAGCTCG ACACTCATTT GAGCACACTC TCGGTTGGTT ACGGCAGTGG GCTTGCAGCC GATCCTTTGG CCTGGGTACG CGCATGCCGT GGGATGAGCA ATATTTGATT GAATCGTTGT CGGACTCTAC CATTTATATG GCGTACTACA CCGTTGCTCA CTTGCTTCAA GGCGGTGACA TGTACGGTGA AGCCCGCCCG TCAGTGGATC CGAGCAAATT GACGGATGAA GTTTGGGACG CTATCTTCTT GGGCACCGCG AAGCCTTCTG AGGATGACTT CCCGCGTGAC TTGTTAGATC GCATGATCAA CGAATTCAAC TTCTGGTATC CGTTCGATCT TCGCGTTTCG GGCAAGGATC TGATCCAAAA TCACCTGACT TTTGCGATTT ACAATCACAC CGCGATTTGG GAAGACGAAA AGATGTGGCC GCGTTCGTTC AGAACGAACG GCCACTTGCT GCTGAACAAC GAAAAGATGA GTAAGTCTAC TGGTAACTTT AAGACGCTTA AGCAAGCCAT CGAAGAATTC AGTGCTGATG CGATGCGTTT CACATTGGCT GATGCGGGAG ACACGGTTGA AGACGCGAAC TATGTCGACG ACACCGCGAA CGCTGCCATT TTACGTTTGA CCAAGGAAAT CACGTGGTAC GAAGAGCAAA TGGCTGAAAT CGAAGCAGGT AATCTGCGCA CGACCGAGCC GAATAAATTT ATCGACCGTG TCTTCACGAA CGCGATGAAC ACGGCCATCG CACAGACTCA AGAGCATTAC GAGAATATGA TGTTCCGCGA AGCGCTCAAG AGTGGTTTCT ATGACTTGCA GTCGGCGAGA GACGCGTACC GTCTTATGAG CGCTGAAGAG GGTGGGATGC ACGCAGATCT AACTAAGCGA TTCATTGAAG TTCAAACGCT CTTGCTCGCG CCGATTTGCC CGCATACGTG CGAGCATATC TACGGCACTA TTCTCAAGAA GGAGGGCAGC GTCACCAGCG CAGGTTTCCC GAGCGGCGAG GTAGAGGATG TCGCTCTCAC CGCGGCGAAC AAGTACTTGG CTGATCTCAT CACGAACATG CGTAAGGGCA TCGCAAAGTG CACGGCACCC CCGAAGAAAG GCCCGAAGGG CCCTCCTAAG GTTGCCAAAG AGGGAACTAT TGTCGTGGCG TCTGAGTTTG TGGGTTGGCG AGCGGTGTGT CTGAGCATCT TAGCGGAATC GTACGACACC AAGTCCAAGA CTTTTCCTCC CGTGCCGGAT ATTTTGGCCA AGGTCAAAAG CAGCGAGCTT TCCGCAGACG CTAATTTCAA GAACGTGATG AAAATGGTCA TGCCGTTCAT CAAGTTCAAA ATGGATGAGG CCAACGTTGC CGGCGCATCA GCGTTAAACA CGAAGATCAT CTTTGACGAA ATGGACGTCT TGAAGGAGAA CATTGACTTT ATCAAGCGTG CATTAAGTCT ATCGACTCTT ACCATTTGCT ACACAACTGG TGAAAACGCG GGTTCAAAGG CGGACGACGC CACGCCTGGC GCCCCAGCTT TTGAATTTGT TGTCACGTCT GATGAGGACC TCGCCGCCGG AGTCGCAAAT ATGCTCTTGG GATGA
|
Protein sequence | MAETGKNTAR RDLLLELQRR AQGKWAREKT FEVDAPKASD GEGGRDKFFG NFPYPYMNGL LHLGHAFSLS KLEFASAYHR LKGDRTLFPF AFHCTGMPIK ACADKIAKEI AAYGNPPVFP DASVMEAEAE AKAKAEAANA GPADPTKFVA KKSKATAKKG TQATQWAIMQ ASGIPDEEIP SFAESMHWLN YFPPLAKRDV IAMGCQVDWR RSFITTDANP FYDAFVRWQF NTLKKIGKIV KAKRFAVYSP IDGQPCADHD RASGEGVGPQ EYLLIKMAVY DECLTGDLAP LAGKKVFLAA ATLRPETMYG QTNCWILPDG DYGAYELANG EVVVMCERAA LNLSYQEQFA EEGKPKCLLT FKGQSLIGCA VKSPRAELEK IYCLPMMTIL MNKGTGVVTS VPSDSPDDFM ALSDLKAKPA LREKFGVKDE WVMPFEVVPC VHIPEFGDAC APMVCAELKI QSQNDRVKLD EAKHRTYLKG FTEGVMILGN HKGKPVKEAK PLIRQEMIDD NTGMVYSEPE RTVMSRSGGE CVVALTDQWY LEYGEEAWKA KAEKCLENMN CYHDEARHSF EHTLGWLRQW ACSRSFGLGT RMPWDEQYLI ESLSDSTIYM AYYTVAHLLQ GGDMYGEARP SVDPSKLTDE VWDAIFLGTA KPSEDDFPRD LLDRMINEFN FWYPFDLRVS GKDLIQNHLT FAIYNHTAIW EDEKMWPRSF RTNGHLLLNN EKMSKSTGNF KTLKQAIEEF SADAMRFTLA DAGDTVEDAN YVDDTANAAI LRLTKEITWY EEQMAEIEAG NLRTTEPNKF IDRVFTNAMN TAIAQTQEHY ENMMFREALK SGFYDLQSAR DAYRLMSAEE GGMHADLTKR FIEVQTLLLA PICPHTCEHI YGTILKKEGS VTSAGFPSGE VEDVALTAAN KYLADLITNM RKGIAKCTAP PKKGPKGPPK VAKEGTIVVA SEFVGWRAVC LSILAESYDT KSKTFPPVPD ILAKVKSSEL SADANFKNVM KMVMPFIKFK MDEANVAGAS ALNTKIIFDE MDVLKENIDF IKRALSLSTL TICYTTGENA GSKADDATPG APAFEFVVTS DEDLAAGVAN MLLG
|
| |