Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25190 |
Symbol | |
ID | 5004302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | - |
Start bp | 21962 |
End bp | 25360 |
Gene Length | 3399 bp |
Protein Length | 1132 aa |
Translation table | |
GC content | 61% |
IMG OID | 640419723 |
Product | predicted protein |
Protein accession | XP_001420398 |
Protein GI | 145352104 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGC TGGACGACGT CGATGCGCTC GATGCGGTCG TGCGCGACGC GTTCGCGCGG ACGAACGACG CGCGCGCGAC GGCGAAGGCG GCGGACGCGT CGCGCGCGCT GGACGCGCTG AAACGCGACG CGCGGGCGTG GGCGGTGTGC CTGCGCGCGT ACTCGCGCAC GACGTCGAGC GAGACGAAGT TTTGGTGCCT GCAGACGCTG ACGGAGGCGC TGGCGAAGGA GGCGAGAGGA GGCGGGACGA CGATGTCGGA CGAAGACGCC GAGACGCTGC GACGAGCGCT CGGGGCGTGC GTGAGCGAAG CGACGGGCGA GAGCGCGACG GGAGGAGACG GTGGAGATGA TCGACGGAGT TCGGCGGCGA CGGCGGCCTC GTCGACGTCG ACGCCGGCGT TCGTGAAAAA TAAGCTCGCG CAGGCGTGCG CGTACGCCGT GGCGCTGGAG TACCCAGAAC GATGGCCCTC GTTCTTCGCG GATTTGGCGA ATCTATTGAG CCGAGGGACG CAAGGCGTGG ATATGTTCAC GCGTGTGCTC GAGGCGATCG ACGAGGAGCT GATCGCGACC GTGGACGCGG GAAGAGCAGG GAAGGAGGAC TTCGCGCGTT CGATGCGCAT CAAGGAAGCG ATGCGAGCGG ATGGGAGCTT GCGCTTAGTT TTCGATGCTT GGAGGCAGTG TTTGGAACAT TATGCGCGCG TCGATCCACG CGTGGCGACG CGGGTATGGT CTGTCGCGCG GCGATACGTC GAGTGGGTGG ACGTCGGTCT TGTCGCGAGC GAAGAGTACG TGAAGGGCGC CAAGGAGTGT TTGATGCTCG ACGATTTGAG CGGCGCGGCG GATGAAAGTT TGCGCGCCGC GGCGGTAGGG TATTTGCACG CCGTCATAAC GAAAGGCATG GAAACGAGTG CAAAAGTGCG CCTGATCATC AGCACGAAGA TCGTAGAAGT GTGCGCTCGG TTGCAGGTGA TTTGCGCGGC TACCCAGGAT GATTTCGACG AAGAATTCGT AACTCAGGTG ACCAATCTCG CCGCCGCGGT CGCCGCGGAG CTTTTGAACG CCAACAAAGT CGAGAATATA ACGGCTCTCG GTGTTGAACT GTCCGCTGAG GTCAGTTCGT CCTTGCATCA GGTGACTCCG TTGGTTTTGT CGAGCATCAG TTTTAAGCAC GAGCGCGCCG TGCTCGTGGC GCTACCATTC CTCACGGCGT ACATTGGGTA CATGAAATCA CAGCCGGCGT TGTTGAATGC GGCGCAGCCG GCGCTTACGT CGGCGTGTCA GGCGCTCATA GCGCGTGGGG CGTTCCCTAC AGAACACATA GACGGCTTGG ATTGGAACGA CGGCTCCAAC GCCCTCACGC AAGAATTTGA AGCGGACGTC TTCAGTCTTC GCGCTGAACT AAACGTGCAG CTGAAGAACA TAGCGCGGCT CGCACCACAC TTGGCGCGCG AAGTCGTTCG TCAAGTCTTG ATGAGCGCCG TCGTGGGTAG TGGCGAAGGA AACCAGCTTT GCTGGCAAAA CGTCGAGGTC GCGATTTCGG CGCTGTTCAC CCTGGGAGAA GGCGCCGACG ACGCCGCGGT GAAACCCATG TCCATCGCGG ATCGAGCCAA GGCGACGAAC GGCACCGGTG AGGTCACAGA CACCCCCCTC GGAGCGCTAG TGGTATCGCT TATACGCGAA TGGGGCACGA GTGTTGGTCG CGCGGCGTAC CATCGCCTCG TAGCGCCGAC GTTTTTGGAG ATTTGCGTCA GATATCACGC CGTTTTGGAG CGCGATGACG CCGCTCTAGT CGCCGCCTTG ACTGCGTTCT TAGACGAGCG AGGTATAGGT CACGTCGATC TCGCCGTGCG GTCGCGTGCA TGTTACTTGA TTTCGCGTCT GTCCCGACCG CTTCGCTGCA AGCTTTCGGA TAAGGTGGAA AACATCATGT GCGTACTCCC GGCCTATCTC ACGGAATTCG CGAGATCGTT GCCGGAACCG GCGACGCAGA ACGCTGCGTT TGTATCTGTG AGCGCGGCTG GTATTCAGTC TCGAGCGATG GCTGAGAGCG GAAACGACGA TCGGTTGTAC CTGTTCGAAG CCTTCGGCAC GATGCTAGGC GCGGATGAGG TGAAAGAAGA GGAACAGTAT AGGTATTTAT CGCAAATCGC CGCTGGGCTT TGTCGTCAAA TCGAGGAAGT CGCGGCGGGA GGACAGTCGG GCGAGGACGC GCCAATTCGC ATCGCATTGG CAACGCGCGC CATCGTGGCG TTCGGGAATA TATCCAAAGG ATTTTCGCAG CGAACATGTT TGACGAGTAG ACCTCGCACT GGAGAAGTGT TTAGATCGTG TTTGGAGATG TCGCTTCGAT GTCTCGACGT GTGGCCGCGA GACGCGAGCG TGCGCAATCG TGCCACTGGT TTCTTGCACA GAATGATTGA TTTGCTCGGA CCGACTGTGA CACCGTACTT GGCTCCGACG GTCCATAAGC TCCGTCGCGA TGCCGACGCC GTGGAGCTTC GCGAGACGTT GGTGCTTTTC AATCAACTCG CGTCGACGTA CGCCGCCGAG CTCGCGCCAT TTGTTGTTGA AGTATTACCA GGTCTTGCCG CTCAAATCTT CAACACGATC TCCAGCGCGT ACGCACAGGC ATCGGTTGAG TCCGTGGGGG GGAGTATTGC CACGAACACG GAAGTTGTGC GCGAAGCCGA CGAGCTCGAA CGTATGTGGC TTACTACGAC GGCGGCCCTC GGCGCCAACG CGCTCATTGC GCCCACATTC ACGGGATATC CCAACACCAA GCGTACCGCG CCGCTCAGAG AGCAGCTGTT GTCGCATCTT GTCCAAGCTG CGCAGTCGCA CGGCATGGTG AGCGCGCGCA AAGTCGCCCT GACCGCGCTC AAGAGCTTCG TTGAAGAATG GACGTTGGAC ACATCGCCCG ACGAACCACC TTCGCTCGAA GGTGCGCCGA CGTCGTCAGC GCCTGCCAAA GGACCGAGAG ATGAACGCGT ACCAGGCTTT ACGCGATTCG TTGTCGAACG CGTGTGCGTG GAGTGCTGCA TCTTACCGCC TATACGCGGT GATTTAGACC TCTCCGACGC CGTCTCCGTC GGCGCGCTCA ACGAATCCTT CGCCATCCTC GCCGTCGTCC ACGCCCGGCA ATCCGACGCC TTGACCACCG CTCTGACGCA CATCTTCCGT CACTCCATCT TACCCAGTCA TTCTCCCGAT CGCATCGACG CCATCGTCAA CGAGTACATC CGCGTCCTCG CCGCCGCCGC CGCCGCCGGC AAGCCGCATT TGCGTTCGAC CAAACCCGCG CGCGCGCTCG TCGACGCCGT CCGTCGCGAG ATCGGCGCCG CGCCCGACCG CGGTCGCACC TTAGACCTGA CCCCTAGACT CGCCAAGCGA GGCGTGTGA
|
Protein sequence | MTTLDDVDAL DAVVRDAFAR TNDARATAKA ADASRALDAL KRDARAWAVC LRAYSRTTSS ETKFWCLQTL TEALAKEARG GGTTMSDEDA ETLRRALGAC VSEATGESAT GGDGGDDRRS SAATAASSTS TPAFVKNKLA QACAYAVALE YPERWPSFFA DLANLLSRGT QGVDMFTRVL EAIDEELIAT VDAGRAGKED FARSMRIKEA MRADGSLRLV FDAWRQCLEH YARVDPRVAT RVWSVARRYV EWVDVGLVAS EEYVKGAKEC LMLDDLSGAA DESLRAAAVG YLHAVITKGM ETSAKVRLII STKIVEVCAR LQVICAATQD DFDEEFVTQV TNLAAAVAAE LLNANKVENI TALGVELSAE VSSSLHQVTP LVLSSISFKH ERAVLVALPF LTAYIGYMKS QPALLNAAQP ALTSACQALI ARGAFPTEHI DGLDWNDGSN ALTQEFEADV FSLRAELNVQ LKNIARLAPH LAREVVRQVL MSAVVGSGEG NQLCWQNVEV AISALFTLGE GADDAAVKPM SIADRAKATN GTGEVTDTPL GALVVSLIRE WGTSVGRAAY HRLVAPTFLE ICVRYHAVLE RDDAALVAAL TAFLDERGIG HVDLAVRSRA CYLISRLSRP LRCKLSDKVE NIMCVLPAYL TEFARSLPEP ATQNAAFVSV SAAGIQSRAM AESGNDDRLY LFEAFGTMLG ADEVKEEEQY RYLSQIAAGL CRQIEEVAAG GQSGEDAPIR IALATRAIVA FGNISKGFSQ RTCLTSRPRT GEVFRSCLEM SLRCLDVWPR DASVRNRATG FLHRMIDLLG PTVTPYLAPT VHKLRRDADA VELRETLVLF NQLASTYAAE LAPFVVEVLP GLAAQIFNTI SSAYAQASVE SVGGSIATNT EVVREADELE RMWLTTTAAL GANALIAPTF TGYPNTKRTA PLREQLLSHL VQAAQSHGMV SARKVALTAL KSFVEEWTLD TSPDEPPSLE GAPTSSAPAK GPRDERVPGF TRFVVERVCV ECCILPPIRG DLDLSDAVSV GALNESFAIL AVVHARQSDA LTTALTHIFR HSILPSHSPD RIDAIVNEYI RVLAAAAAAG KPHLRSTKPA RALVDAVRRE IGAAPDRGRT LDLTPRLAKR GV
|
| |