Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_45389 |
Symbol | |
ID | 5001641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 51171 |
End bp | 54144 |
Gene Length | 2974 bp |
Protein Length | 949 aa |
Translation table | |
GC content | 59% |
IMG OID | 640417062 |
Product | predicted protein |
Protein accession | XP_001417128 |
Protein GI | 145345247 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0060] Isoleucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00392] isoleucyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.000425012 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGGGGCGCGA CGGTCGCGGG ACGCGCGCGG GGCGAAGGCG GTGACGGTAA GGGAAAGAAG GAGAAGAACG CGTACGGCGC GACGGTGCGA TTGCCGCAGA CGACGTTCGA GATGCGGGCG AACAGCGTGC AAAAGGAACC TAAAATGCAG GCGTGGTGGG CCGAGCGCGG GGTGTACGAG CGGTTGCGAG CGAGGGAGGA CGGGGCGGCG TTCACGCTGC ACGATGGACC GCCGTACGCG AACGGGGATT TACACATCGG ACACGCGTTG AATAAGATTT TGAAGGATTT CGTGAATCGG TGGGAGATGA TGAACGGGAA GCGCGTGCGA TACGTGCCGG GATGGGACTG TCACGGGTTA CCGATCGAGT TGAAGGTTTT GCAGAACATG GACGCGGAGG CGCGGAAGGC GCTGACGCCG ATTAAACTGA GGTATAAGGC GAAAGCGTTC GCGATGAAAA CGGTGGCCAG TCAGCGGGAG CAGTTTAAGA GGTATGGGAT TTGGGCGGAT TGGGACGAGC CGTACATGAC GCTGCTGCCC GAGTACGAGG CGGCGCAGTT GGAGGTGTTC GGGAAGATGT TTCTCAACGG GCACATTTAT CGCGGGGAAA AACCCGTGCA CTGGTCGCCT TCGAGCATGA CGGCGCTCGC GGAGGCGGAG CTGGAGTACC CCGAAGGCCA CGTATCGCAG AGTATTTACG TCCAATTTCC GGTGAGCGAC GTTCCCGAGT CGACGCCGAG CGAGCTCAAG GCGCTTCTCG ACGGCGCGTC TTTGGCGGTT TGGACCACGA CGCCGTGGAC CATGCCCGCA AACGCCGCGG TGGCGGTGAA CGCCAAGCTC GATTACTCCG TCTGTCGAGT GGAATCGACG GGAGCGACGC TCGTCGTCGC CGAAGGTTTA CGCGAAGCCG TGGCGAGCAA GTTGGAAACC ACGCTCACGA CGCTGGGCAC GTTCAAGGGT CAAGACTTCG AGAACGTGCA ATATCAGCAC CCGTTCTACG AACGCAAGTC TCCAGTGATT ATCGGTGGGG AGTACATCAC CACCGAAGCC GGTACCGGTC TGGTGCACAC CGCGCCGGGG CACGGGCAAG ACGATTACAT CTCCGGCATG AAGTATGGCT TGCCGTTGTA CTCTCCGGTG GATAACGCCG GTTTGTTCAC GGCCGAAGCG GGGAGTGATT TAGAGGGGAA AGATGTTTTA GGCGATGGCA ACGTTTTGTG CATCGAAAAG TTGACCTCTG CTGGGGCGTT GCTCAAGCAA GAGGCGTACA ATCACAAGTA CCCGTACGAC TGGCGCACGA AAAAGCCGAC GATTTTCCGC GCCACCGAGC AGTGGTTCGC TTCTGTTGAG GGCTTCCGTG AGAAGGCTTT GAGCGAGTTA GATAAGTTGG AGTTCATTCC GGAATCTGGT TCGAAGCGCA TGCGACCGAT GGTGAGTGGG CGCGCGGACT GGTGCATCTC GCGCCAGCGC GCGTGGGGTG TGCCGATTCC GGCGTTTTAC CACAAGGACA CGAATGAGGT TTTGATGGAC GAATCCATCA TCACGCACTT CACTGAAATC GTTCGCACAC GAGGGACCGA TGCGTGGTGG GAATTAGAAG TCGAGGACTT GTTGCCCGAG CAACACAAGG CCAAGGCGAA TGATTACGTG CGCGGTTCGG ACACGATGGA CGTGTGGTTT GACTCCGGCA GCTCTTGGGC GGGCGTCGTG CAATCTCGCG GTCTTTCGTT CCCGGCGGAT ATGTATTTAG AAGGTAGCGA TCAGCACCGA GGATGGTTCC AGTCGAGTTT ACTCACCGGT GTCGCGTCCA CCGGTCAAGC GCCGTACAAG AAGATTTTGA CACACGGCTT CGTGCTCGAC GAAAAAGGAT ACAAGATGTC AAAGTCCTTG GGGAACGTCA TCGACCCGCG AACGGTGATC GAGGGAGGAA AGAACCAAAA GACAGAGCCT GGATATGGCG CCGATACCTT ACGACTGTGG GTGGCGAGCA CGGATTACAC GGGTGATGTG AGCATCGGTA TGAACATCAT CAAGCAAACG AGCGAGGCGT ACCGCAAACT CAGAGGAACG ATTCGCTTCT TGATGGGCGT GTTGGACGAT TTCAAGCCAA CCGAGGGCGT GGCGTACGAG TCGCTGCCGG CGTTCGAGAA GTACATTCTT AGCCGCCTGG ATGCGACGAT GCAGGAGATC GAGACGTCGT ACAAGTCGCA CGAGTACAGC CGCGTCGTCG CGGCCATCAC CTCCTTCACC ACGTTCCTGT CCAACGTGTA CTTGGACGTG AGCAAGGACA AGTTGTACAT TGGCGAACAG AACGACATCA AGCGCCGGGC GTGTCAAACC GTGGTTTCGG CCATCGTCGA GCGATTGATC GCCGCCATCG CCCCGCTCAC CCCGCACATG GCTGAAGAGG CGTTTCAAGC GTTGCCTTAC GACAAACCCG GCGACGCCAT CTCCGTCTTC ATCGCCGGAT GGCCGTCGAG ACCCAGTACC TGGGCCTCGA TCGATGCCGA TGAGGTCAAG TTCTGGGATT CCTTCTTGGA GATTCGCGAC ACGGTGAACA AGGTGCTCGA AGATTCTCGT AACGCCAAAC TCCTCGGCGC GTCGCTCGAA GCCAAGATCA CCCTGCACTG CTCCGACGCC GATTTCGTCG CCAGGCTCAA CCGCGAAGAA ATCGCGCGAG ACTTGCGTTA CCTTTTCATC GTCAGTCAAG TCGAAGTCGT GACATCAAGT GACGCCGCCA CCGCGGGATG CGACTTTAAA TCCATCGTCG ACGTCCCCGG CGCCGGCACC GTCTCCGTCG GCGTCGCGCG CGCCTCCGGA GCCAAGTGCG CTCGATGCTG GAACTTTTCC AACCTCGTCG GCGTCGACGC CAAGCACGCC ACCCTGTGCG AACGATGCGT CCCGATCGTC AACGCCTCGC ATCCCGATTT AGTCGTCGCC GCGCCCGCAC CCGCGTCATG AGCGCCTAGC CGTT
|
Protein sequence | MRANSVQKEP KMQAWWAERG VYERLRARED GAAFTLHDGP PYANGDLHIG HALNKILKDF VNRWEMMNGK RVRYVPGWDC HGLPIELKVL QNMDAEARKA LTPIKLRYKA KAFAMKTVAS QREQFKRYGI WADWDEPYMT LLPEYEAAQL EVFGKMFLNG HIYRGEKPVH WSPSSMTALA EAELEYPEGH VSQSIYVQFP VSDVPESTPS ELKALLDGAS LAVWTTTPWT MPANAAVAVN AKLDYSVCRV ESTGATLVVA EGLREAVASK LETTLTTLGT FKGQDFENVQ YQHPFYERKS PVIIGGEYIT TEAGTGLVHT APGHGQDDYI SGMKYGLPLY SPVDNAGLFT AEAGSDLEGK DVLGDGNVLC IEKLTSAGAL LKQEAYNHKY PYDWRTKKPT IFRATEQWFA SVEGFREKAL SELDKLEFIP ESGSKRMRPM VSGRADWCIS RQRAWGVPIP AFYHKDTNEV LMDESIITHF TEIVRTRGTD AWWELEVEDL LPEQHKAKAN DYVRGSDTMD VWFDSGSSWA GVVQSRGLSF PADMYLEGSD QHRGWFQSSL LTGVASTGQA PYKKILTHGF VLDEKGYKMS KSLGNVIDPR TVIEGGKNQK TEPGYGADTL RLWVASTDYT GDVSIGMNII KQTSEAYRKL RGTIRFLMGV LDDFKPTEGV AYESLPAFEK YILSRLDATM QEIETSYKSH EYSRVVAAIT SFTTFLSNVY LDVSKDKLYI GEQNDIKRRA CQTVVSAIVE RLIAAIAPLT PHMAEEAFQA LPYDKPGDAI SVFIAGWPSR PSTWASIDAD EVKFWDSFLE IRDTVNKVLE DSRNAKLLGA SLEAKITLHC SDADFVARLN REEIARDLRY LFIVSQVEVV TSSDAATAGC DFKSIVDVPG AGTVSVGVAR ASGAKCARCW NFSNLVGVDA KHATLCERCV PIVNASHPDL VVAAPAPAS
|
| |