Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_42662 |
Symbol | |
ID | 5003137 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 615767 |
End bp | 618580 |
Gene Length | 2814 bp |
Protein Length | 924 aa |
Translation table | |
GC content | 57% |
IMG OID | 640418558 |
Product | predicted protein |
Protein accession | XP_001419436 |
Protein GI | 145350050 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAAGC CCGGAAAACG ACACGGAGAC CTGCCGAAGA ACTTTGAGCA CCAAAACGTG GAGGAAAACC TGTACGAGTG GTGGGAGTCG CGCGGATACT TTGCCCCGAA CGATGAGACG GCCACGGGAC CGCCGTTCGT GATCCCGATG CCGCCACCGA ATGTCACGGG AGCGCTGCAC ATGGGACACG CGATGTTCGT GACGCTGCAG GACGTGATGA CTCGAAGCGC GCGAATGCGT GGTCGCAAGA CGCTGTGGCT GCCGGGGACG GACCACGCGG GCATCGCCAC GCAATTGGTC GTGGAGAGAA AGTTAGAGAG CGAGGGAGTG AAACGAACGG ATATGACGCG CGATGAGTTC GTGGAGCGAG TGTGGGAGTG GAAGGCCGAG TATGGGGGAC GGATTCAGCA GCAGATTAAG AGGTTGGGGG CGTCTTGCGA TTGGTCTCGC GAGAGATTTA CGCTCGACGA GGGGTTATCT GAGAGCGTGC TCGAGGCATT CATCACGCTG CACGATCGCG GTCTCATTTA TAAAGGAACC TACATGGTGA ACTGGGCGCC GAAGTTGCAA ACCGCGGTGA GTGACTTGGA GGTCGAGTAC ACCGAAGAGC CGGGGACGTT GTACTTTTTC AAGTACCCGG TCGAGGGCGG CGGCGCGGAC GACTACCTTC CGGTGGCAAC GACGCGTCCG GAAACGATTC TTGGTGACAC TGCAGTCGCG GTCAACCCAG AAGACGATCG CTTCAAGCAC ATGATCGGGA AGAGATGCGT CGTGCCCTTC ACCAACGGTC GCACCGTGCC CATCATCGGC GACTCGTACG TGGACATGGA ATTCGGCACG GGGGCGTTGA AGATCACGCC GGGACACGAC CCGAATGATT ACGAAATCGG CAAGCGTGTG GGCTTGGATT TGATCAATAT TATGAACAAA GATGGTTCCA TGAACTCTAA CTGCGGCAAG TATGCCGGCA TCGATCGCGC GGATTGCCGA AAGCAACTCT GGGCGGATAT GGAGGCGGAA GGTTTAGCAA TCAAGGCGGA GCCGTATACG AACCGCGTGC CTCGCTCGCA GCGGGGTGGC GAAATCATCG AACCAATTGT TTCTGAGCAA TGGTTCTGTA AGATGGAAAC CATGGCGGAG CCCTCGCTGA AGGCTGTGGA GACTGGCGAA CTCACCATCA TCCCGCAGCG TTTTGAGAAA ATTTACAAGT CTTGGTTGAC GGATATCCGA GATTGGTGCA TCTCGAGACA GCTGTGGTGG GGTCATCAAA TCCCAGTGTG GTACGTGCAC GATTCTGAAG AAGATCTCGC GCGTGCGAGA GAGGGGGAAG GTAAAGGCAC GAATAAGCGA TACGTAGTGG CGCGAAATGA CGCCGACGCC GCAGAAAAAG CGAAGGCGCA ATACGGCGAA AACGTCGTGT TGTACCGTGA AGAAGACGTG CTCGACACTT GGTTTTCTTC TGGTCTGTGG CCGTTCAGCA CGTGTGGGTG GCCCAACGAA GAAGCTCCTG ACATGAAAAA TTTCTTTCCG GCGAGCGTGC TTGAGACTGG GCACGATATT TTATTCTTTT GGGTGGCGCG TATGATCATG ATGTCGTACG GGATGACGGG GAAATTACCC TTCCACACTG TCTTTCTCCA CGGCCTCGTC CGCGATTCGC AAGGTCGCAA AATGAGCAAA TCGCTTGGTA ACGTCGTGGA TCCGTTAGGC GTCATCGCTG AGCAAGGCTG CGACGCTTTA CGATTCACGC TCGCCACCGG CACCACGCCC GGTCAAGACT TGAACTTGAA CCTCGAACGT TTGGCGTCGA ACAGAAACTT CACCAACAAG ATTTGGAACG CCGGTAAGTT TGTGCTGTAT AGTATGGAGG ACATGACAGA CGATGAGCGC ATGGCGCTCG TCGACGAAGG TGCTGCGCTT TGCGCCGATG AAGCGTCAAT TGCCGAGCTT CCGCTCGCCG AGCGTTGGAT CGTGAGCAAA TTGAACGCGA CGGTGGATCA CGTCACCGCC GCGCAAGACA AGTACGACTT TGGTGAGGCT GGTAGATCGA CGTACACGTT CTTCTACGAC TCCTTTGCCG ATTGGTTCAT TGAAGGCGCT AAAAGTCGCA CGTACGGAGA CGATGCCGAC GCCGCGCGAG TCACCAAAGC GGTGACGCTG TACGTTTTGG ATCAAACGCT TCGATTGCTC CACCCATTCG TGCCGTACGT GACGGAGGAA GTTTGGCAAG CGCTTCCGCA TCGCGGCGAG GCGTTGATAG GCCAAGACTG GCCCGCGCTT GAAGCTTACG TGGACAAATC CGCAGTGTCG ACGTTTGAAA ACTTACAATC GATCGTCACG CGTATTCGCA ACGCGCGCGC CGAGTACTCG GTGGAGCCGG CGAAGCGTAT TCCCGCCGTC ATCGTAGCGA CGGATGCCGA AGCGAACGCG GCGTACAGCG CCGAGTTGAA TCTTATCGCC ACGCTCGCGC GTTTAGACGC CGAGCAAACG TCCGTCGCGA GCGCGCCACC CACCGAAGTC GCGAGCGCGC CGGAGAATTT CGTTCAAATC ATCGTCAACG AAAGCCTGGA GGTGTACTTG CCGCTCGCGG GTCTCGCGGA TCCGGTTAAA GAAATCGCCC GCCTCACGAA GCAAGAGACA AAGATGCAAA AGGAAATCGA TGGGCTGGCT GATCGCGTCA ACTCGCCCGC GTTCGTCGAC AAAGCCCCCG CGGCGGTCGT AGAAAAAGCG CAGAAGGAGT TGAGCGAGTT GCAAGAGCAA CTCGCCGTCG TTCGATCTCG CATCGAACAA ATGCAGGCGC TCGTCGCCAA GTGA
|
Protein sequence | MPKPGKRHGD LPKNFEHQNV EENLYEWWES RGYFAPNDET ATGPPFVIPM PPPNVTGALH MGHAMFVTLQ DVMTRSARMR GRKTLWLPGT DHAGIATQLV VERKLESEGV KRTDMTRDEF VERVWEWKAE YGGRIQQQIK RLGASCDWSR ERFTLDEGLS ESVLEAFITL HDRGLIYKGT YMVNWAPKLQ TAVSDLEVEY TEEPGTLYFF KYPVEGGGAD DYLPVATTRP ETILGDTAVA VNPEDDRFKH MIGKRCVVPF TNGRTVPIIG DSYVDMEFGT GALKITPGHD PNDYEIGKRV GLDLINIMNK DGSMNSNCGK YAGIDRADCR KQLWADMEAE GLAIKAEPYT NRVPRSQRGG EIIEPIVSEQ WFCKMETMAE PSLKAVETGE LTIIPQRFEK IYKSWLTDIR DWCISRQLWW GHQIPVWYGE GKGTNKRYVV ARNDADAAEK AKAQYGENVV LYREEDVLDT WFSSGLWPFS TCGWPNEEAP DMKNFFPASV LETGHDILFF WVARMIMMSY GMTGKLPFHT VFLHGLVRDS QGRKMSKSLG NVVDPLGVIA EQGCDALRFT LATGTTPGQD LNLNLERLAS NRNFTNKIWN AGKFVLYSME DMTDDERMAL VDEGAALCAD EASIAELPLA ERWIVSKLNA TVDHVTAAQD KYDFGEAGRS TYTFFYDSFA DWFIEGAKSR TYGDDADAAR VTKAVTLYVL DQTLRLLHPF VPYVTEEVWQ ALPHRGEALI GQDWPALEAY VDKSAVSTFE NLQSIVTRIR NARAEYSVEP AKRIPAVIVA TDAEANAAYS AELNLIATLA RLDAEQTSVA SAPPTEVASA PENFVQIIVN ESLEVYLPLA GLADPVKEIA RLTKQETKMQ KEIDGLADRV NSPAFVDKAP AAVVEKAQKE LSELQEQLAV VRSRIEQMQA LVAK
|
| |