Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_33832 |
Symbol | |
ID | 5000870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 461962 |
End bp | 465132 |
Gene Length | 3171 bp |
Protein Length | 1056 aa |
Translation table | |
GC content | 56% |
IMG OID | 640416291 |
Product | predicted protein |
Protein accession | XP_001416669 |
Protein GI | 145344290 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.828638 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGAGGG CGACGATCGA TCCGGAGAAG GCGGCGAAGA AGGCGGCGAA GCTGGCGGAG AAGGAGGCGA AGAGGGCGAA GGCGGCGGCG AAGGCGGCGA AGGCGGCGGA GGAAAAGGCG AAGAGGGACG CGGGCGGCGG CGGCGAGGGC AAGGCGAAGA GGGAGAAGAA GGCGAGCGGG CCGAGCGAGG AGGACGTGCG AGCGCTGGAG AGCGCGCTGA AAACGCCGAG GGGGGAGATG AAGGATTTGG TGGGGACGCC GATGGCGAAG TCGTATAATC CGGTGGCGGT GGAGGCGGCG TGGTACGACT GGTGGGAACG GTGCGGGATG TTTACGCCGA CGATGGGGAC GAATAAGCCA AAGTTTGTCA TCGTGATTCC GCCGCCGAAC GTCACGGGTG CGTTGCACAT TGGACACGCG CTGACGAACG CTATTCAAGA CACCATCGTG CGGTGGCGAC GAATGCAGGG ATATGAAGCG CTGTGGGTTC CGGGGACGGA TCACGCGGGC ATCGCGACGC AAACCGTGGT GGAGAAGAAA TTGCAGCGCG AGGAAGGCGT CACGAGGCAC GATCTCGGGC GAGAAAAGTT CCTCGAGCGC GTGTTCGAGT GGAAAGACGT GTATGGGGGC AAGATCTGCA ATCAGCTTCG CCGAATCGGC TCTTCTATGG ACTGGACTCG AGAGGCGTTC ACGATGGATG AAAAGTTGAG CAAAGCCGTC AAGGAAGCCT TCGTGCGCAT GTTCGACGAA GGTTTGATTT ATCGCGACAA TAGATTGGTG AACTGGAGTT GTCAGTTGAA GACTGCGATC AGTGATATCG AGGTGGATTA CATCGAGCTC GATGGGCCGA CGATGCTCGC GGTGCCGGGG CACACCAAAA AGGTGGAGTT CGGCGTCATC ACATCCTTTG CGTACCCGTT CGAAGACGGG CAAGGCGAAG TTGTCGTGGC GACGACGCGT ATTGAGACCA TGCTTGGTGA CACCGCTGTT GCGGTGCACC CCGAAGACGA ACGCTACAAG AGCTTACACG GAAAATTCGT TCTTCACCCG TTCAACGGTC GTCGTATTCC CATCATTTGC GACGCCGAAC TTGTCGACAT GGAATTCGGT ACAGGTTGCG TGAAGATCAC CCCAGCCCAC GATCCGAACG ATTTCAACAC CGGTAAGCGT CACGATCTTG AGTTTATCAA CGTATTCACC GAGGAAGGAT TGGTGAACGA GCAAGGCGGC GAGCAATTCC AGGGCATGAA GCGCTTTGAG TGCCGCGTCG CCATCACGGA GGCGCTCGAC AAGCTTGGTC TTTACCGGGG CAAGGCGAAC AACCCGATGC GTTTGGGTCT TTGTAGCCGT AGTAAGGATG TCATCGAACC GATGCTCAAG CCGCAGTGGT GGGTCAATTG CCAAGACATG GCAAAGGAGG CGTGCGATGC CGCTCGCGAC AAGCGACTTG AAATTCTCCC TACTTTCATG GAACCGACGT GGTTCAGATG GTTGGAGAAC ATTCGTGACT GGTGCATCTC CCGTCAACTT TGGTGGGGTC ATCGTATTCC AGCCTTCTAC GTTCGTTTCG CCGGCGAAGG TGACGACGAC TGTGGGATGC CAGGCGGTAG CTCTGAGAAG ATGGATCGAT GGGTCATCGG TCGTGACGCC GATGAAGCGC GCGTTTCGGC CGAGAAGAAG TTCCCGGGTC GAGAATTCAC ACTCGAACAA GACGAAGACG TGCTCGACAC TTGGTTCTCG TCCGGTTTGT TTCCGTTTTC CGTCTTTGGC TGGCCCGACG AGACACCGGA TTTGGCCGAG TTTTACCCAA CGTCCCTCCT CGAAACTGGA CACGACATTT TGTTCTTCTG GGTCGCTCGT ATGGTAATGA TGGGCATGAA ATTGACTGGT AAGGTGCCGT TCAAACAAGT GTACCTGCAC GCCATGGTTC GTGACGCGCA TGGTCGTAAA ATGTCAAAGT CTCTCGGTAA CGTCATCGAT CCTCTCCATG TCATTGAAGG TATCGACTTG GCCGCCCTCA ACGAGACTCT CGTCGGCGGT AACCTCGACG AGAAGGAACG CAAAAAGGCG CAAGCGGGTC AAAAGGCTGA CTTCCCCGAT GGCATTCCAG AATGTGGGAC CGATGCTATG CGCTTCGCGC TCGTGTCTTA CACCGCTCAA GGTCGTGACA TCAACCTGGA CGTTCTCCGT GTCGTTTCGT ACCGCCACTG GTGTAATAAG CTTTGGAACG CGACTAAGTT TGCGATGATG AATTTGGGTG ACGAATACGT GCCGCCGGCC GATTTCAACG CTACTTTCGA TGTACAAAGT CTTCCGCTCG CGGCCAAGTG GGTTCTCAGC CGCTTGAATG CAACATGCGC GTCGACAAAC GCGGGTATGG AAGTGTACGA TTTTAACACC GCCACGAACT CTGTGTACGC GTTCTGGCAA TACGAACTCT GTGACGTTTA CATCGAGATC ATCAAGCCTG TTATGAGTGG ATCGGATGAA ATGGCGAAGA AACAACTGCG AGATGCGCTT TGGATTTGCC TCGACGCTGG TTTGAGACTT CTCCACCCGT TCATGCCCTT CGTCACCGAA GAGCTCTGGC AGCGACTTCC GCGCACTCGC AATGAGAACA CACCGAAGAG TATTATGATT GCTGAGTACC CAGTCGCGGT CGATTCTTGG GCCAACGCCA CGGCTGAGTC GCAGATGCAA ATCATAATGG ACTCTGTTAA GGCGTTCCGA TCGTTGAAGT CCAACTACAA CCTTCCTCCC AGAGCGAGAC CGGATGTTTT CTACAGTGTC AAGACGGATG ACAGCGAAGC GGCGATGAAG ATTGATCCAG AAGGTTTGGT CTCTCTCGCG GGCGTCGGTG AGTTGAAACA GCTGGCTCAA GGTGAATCTG CGCCGCCGGG TTGCGCGGTG AGCATCGTGA ACGAGTCCGT GACGGTCTAT GTTCTCTTGA AGGGAATCGT CGACGCCGCG ACGGAAATTG CCAAGCTCGA TAAGAAGCTT GATTTGCTCA CCAAATCGAC AGAAGCGCTC GTCAAAAAGA CGCAAGAAGA TGGCTACGAG ACCAAGGTTC CGGAGAAGGT TCGCAACGAA AACGCCGACA AGATCGCTAA GCAAGCTGAA GAAATTGATT CCATCAAGGC CGCACGTGCG GACTTTGAAG CGTTGCTGTA G
|
Protein sequence | MERATIDPEK AAKKAAKLAE KEAKRAKAAA KAAKAAEEKA KRDAGGGGEG KAKREKKASG PSEEDVRALE SALKTPRGEM KDLVGTPMAK SYNPVAVEAA WYDWWERCGM FTPTMGTNKP KFVIVIPPPN VTGALHIGHA LTNAIQDTIV RWRRMQGYEA LWVPGTDHAG IATQTVVEKK LQREEGVTRH DLGREKFLER VFEWKDVYGG KICNQLRRIG SSMDWTREAF TMDEKLSKAV KEAFVRMFDE GLIYRDNRLV NWSCQLKTAI SDIEVDYIEL DGPTMLAVPG HTKKVEFGVI TSFAYPFEDG QGEVVVATTR IETMLGDTAV AVHPEDERYK SLHGKFVLHP FNGRRIPIIC DAELVDMEFG TGCVKITPAH DPNDFNTGKR HDLEFINVFT EEGLVNEQGG EQFQGMKRFE CRVAITEALD KLGLYRGKAN NPMRLGLCSR SKDVIEPMLK PQWWVNCQDM AKEACDAARD KRLEILPTFM EPTWFRWLEN IRDWCISRQL WWGHRIPAFY VRFAGEGDDD CGMPGGSSEK MDRWVIGRDA DEARVSAEKK FPGREFTLEQ DEDVLDTWFS SGLFPFSVFG WPDETPDLAE FYPTSLLETG HDILFFWVAR MVMMGMKLTG KVPFKQVYLH AMVRDAHGRK MSKSLGNVID PLHVIEGIDL AALNETLVGG NLDEKERKKA QAGQKADFPD GIPECGTDAM RFALVSYTAQ GRDINLDVLR VVSYRHWCNK LWNATKFAMM NLGDEYVPPA DFNATFDVQS LPLAAKWVLS RLNATCASTN AGMEVYDFNT ATNSVYAFWQ YELCDVYIEI IKPVMSGSDE MAKKQLRDAL WICLDAGLRL LHPFMPFVTE ELWQRLPRTR NENTPKSIMI AEYPVAVDSW ANATAESQMQ IIMDSVKAFR SLKSNYNLPP RARPDVFYSV KTDDSEAAMK IDPEGLVSLA GVGELKQLAQ GESAPPGCAV SIVNESVTVY VLLKGIVDAA TEIAKLDKKL DLLTKSTEAL VKKTQEDGYE TKVPEKVRNE NADKIAKQAE EIDSIKAARA DFEALL
|
| |