Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_15149 |
Symbol | |
ID | 5001524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 705086 |
End bp | 708706 |
Gene Length | 3621 bp |
Protein Length | 1206 aa |
Translation table | |
GC content | 59% |
IMG OID | 640416945 |
Product | predicted protein |
Protein accession | XP_001417327 |
Protein GI | 145345671 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00128251 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGACG CGCCGCCGGC GCGCGGCGAG ATCCTCGCGC ACTTCTGGTC GCTGAGCGCC GACGACGCGT CGACGCGGAA GCGCGCGTGC GAGGCGCTGA TGCGCGATCT TCGAGCCGCC GCCGCCGCCG ACGGCGCGCG CGCGCGGCGT GGCGACGACG ACGGCGACGA AGGCGCGGAC GCGCGGGCGT ACGCGCTGCG AAGACTCACG CGAGGGCTGA GCTCGGGACG GGCGGGCGCG AGGCAGGGGT TCGCGCTGGC GCTGAGCGAA CTGGCGACGC ACGCGGCGAC GCCGGCGGAG GCGCTGGACG CGCTGGACGC GAACGTCGCG CCGATCACGA AGGCGACGAA GGGACAGGAG GCGAGAGACA TACTGTTGGG AAGACTGTTC GGGGCGGCGG CGATCGGATT AGCGCTGGGA GGGCGAGAAG ACGTGCGCGA GGAGGAGCGG AGACGGTGCG GGGCGGAGGT GGCGAGACGC GCGGAGACGC TGTCGAGGGA GAAGACGTAC TTGGCGGAGC CCGCGGCGGC GTGCGTGATC GAGCTCAGGG CTTCGTTGGG GGACGAGACG TTCGCGGGGG TGGTCGAAGA CGCGGGAGAA GGGTTGGAGC GGTGGTTGAG CGGGGATTGC GGCGGCGACG CCGGGGCGGA TACGCTTTGG CTGGCGTGCG AGACGTTTGA GGCGTTGCCG CGCGAGACGC GCGATAGAGT TCAGTGCGTG CGGGCGACGA AGAAGGGCAA GAAGGTTGAT TGGGCCGAGA TGTTTACTCG AACGCATTTG AGTAAGATTT CAAAGGCGCT TTTGGACACC GCGCACACGC ACCCGCGCAT GCACAGCGCG TGGGAGATGA TGCTTCGCGA AGCCCCGGGA GCCCGAGGCG TCGTACCGCT GTGGGAGATT GTGTGCGAAG ACGGCTTATT CGTCTCCGGC TCGCATCAGC GTCGGTTTCT CGGATTCCGC GTGTTCGACA CCTTACTCTC TTCCGCGGAA GCTCACGAAA TCCCCGCACT GTTTTCTAGC AACTTTATCA AGTGCTTGTT GAACAACCTG AGCGCGCCGG ACAACTACCT CCACGAATGC GCGGTGGACT GTTTGGCGAG AATCGTTGCG TTCGCGTCGG ATAAGAAGAC GAGCTCGGAG AAGAAGATTG CCGTCATAGC CGCGCTTCAG CGTCAAGGTC CGACACGTTT CGATAACGTG ACGAAGACAA ACGCGGTGCA GGACTTGGTG AAGAGTCTGG ACAGCGACGA TGCGTTGCAC TACTTGCAGA GCATGTACGC CGTCGTGACT AAGGCGCCCG TACAAGATTC CGACGTCGTC GGCACAGAAG AAGAACTCGC GAGCGCGTTA GCGAACGGTA CGGGTCAAAA GCGCCGCCTG TGGGCGCTCG AACAAATGGC TGGATTGGCG CCGATGCTCC CGAGCGATAA AGTCGTGGAA TTGATGCAGT TTATGCTTTT CCACGCATAT TACAAAGCCA CGGATGGTAA GGCGGGGAAG AAGGGCAAGT CCAACATTCC AGCGAGCATC CTGAAGTCAC CGCTCGAAGA ACCCACCGGG TCCGTGCGCT CGGCGTGCTC GACGCGCTTC TTAGCGATGA TCAATTCTAA CGTTCGCGCG CAACGCGCGG CGGCAAGTAA GGAGTCGGAT GACAAGCAAG ACGTCGTCGA TTTATTGAGC GAAGCCACTT CGTTTTGCCG CGCTCTCGAG GGCGAGACAG CGGTGGACAT GATCGACAGT ATTCCGGACG AGTGCAGAGA AGTGCGCGCT GAGCTGTTCA AAGCCCTCGA CGCGTGTGTC GGTAGCGGCG ACGAGTTGGC GGCGAAGGTC GCACCACTCA TTCGCGTCTT GTCCGTGCTC CAAGTCGGTG ACTGGCGCGA ATTCACGCCG GCATTGCAGG ACCTTCCGCG TTGCGTCGGA GAACTCGTGA ATCCTAAGAA AAAGTCAAAA AAGTCGAAAA AGTCGAAGAA AGACGGAGAG GACGAGCCTG AAGCGATCGA TGTGCTCACT GATATTCTTC TCAGCTTGCT TGCTCAACCG AGCGCGTTGC TTCGGGATGT CGTCGAGCAC ACCTTCAAAG CCGTCTCTGG ACAAGTTTCC AAAGAAGGTA TTCAGGACAT GCTGCGAATC ATCGCCGGGC CAGAAGTCGG TGAAGATGCT GGCGAGGGTG AGGGCGACGG TGAAGACGAA GACGTGCTCA TGGAAGATGA TGATAGCGAC GTCGATGACG ACGATGACGA CGACGAAGAG AGCGACGACG ACGACGATGA CGAAGAGAGC GATGACGAGG AAGACTACGG TGAAGCTAAC GATGCGGAAA TCGCGGCTAT GCGCGCTGCG GCGAGTAAAA TCGTCGGGAC TGCGGCGGAA GATTCGGATG ATTCTGACTC AGAGTCTGAA GGTATGGATG ACGCCGCGAT GTTCCGAATC GACAAACTTC TCGCCGAGGC GTTCAAGAGT CGACAGCAAG ATTTGATGCG TAAGAAGAAT CTGAAGCGCG CAACGCGCGA CTTCAAATTC CGAGTCATCT CTTTGTTCCA GTTGTACGCG AAAGCGCAGC CGGGAAGTGC ATATCTACCT AACGCGGTCG TAACACTCTT GGAAGCTATG CGCGACTCGC TCGGCAAGCA AGATCCACAA AGCGCGCAAC TCGCTGAGCG CATCGCGGCG CTAATCAGCA AGCACATCGC GCACGCGCGC GATTTGCCCG AGCTCCTTGG GGACGAGGTG ACGTCAAAAA CAATCCAGTC CAAGTTGCTG GAAGTCATTG TCGCCGCAAA CCGGGGTGCC AGTGACGCGC AGGTGTTCAA CAAAGCAGCC GGCGCTGCTG CGGCGTATTT GTTGCGCGTT CTCGAAGCCG TCGCTCTTCA CGAGAAAGGC GGAAAAGCCG CAAAGGTGGG TGAAGAAGTT GCGAGCGAAA ATGCGATCGA TTGTTTCCGC GAAGCATTGA AAATGTTCAA GTCGAAGAAG AGCAAGCTGA AAACGGGCTT CTTTTCGCAA ACTTTTGCGA GACATCCGGC GCTCGCTTCG GCGCTCTTGC CCGAATTGTT CAGTCTCGTC GCCATCGATG CCGACAAACC CAACGCAAGA GGCGAGTTCC TGCGACTGGA GGCCTTGAAA CTTGTGAACC CAGTGATTCA GTCGGGTAAG AAACGTTATC CGCCGTTGGC GAAGAGCGCG ACCAAGTCGA TGAAGACGCT TTCGGTATCT TTGGCGGCAG CCATCGGCGC GCCTTATAAG AACAAGAACA CACGCGCAGA CGCGTGCCAA CAAGCGGCGA ACTGCATCGA GTCACTGAAT CGATTGATTG GCGAGATAGA AATCAAGACT ATCATCGACG TCGACGCCAT CATCGACGCT GTGGCGAAGC AAATGTCTCG CCCGCCCGCG TTGCCGCAAA AGGCGCAAAA AGCGTTCCAG CGCATTTGCG CTCTCCTCGA TCGCGCTGTG CCGGATGTCG AAATGCAACC CAAGAGCGAT GCGAACGACG ACGACGGCGA CGACGGGAGC GAGTCTGAAG AAGACGCCCC GAAACAGAAG AAAGATAAAA AGTCGAAGAA GCGGCGCGAT TCCTCAGGTG GCGAAAACTC GAGCAAGAAA AAGGTCAAGA AGAACCGTTA G
|
Protein sequence | MTDAPPARGE ILAHFWSLSA DDASTRKRAC EALMRDLRAA AAADGARARR GDDDGDEGAD ARAYALRRLT RGLSSGRAGA RQGFALALSE LATHAATPAE ALDALDANVA PITKATKGQE ARDILLGRLF GAAAIGLALG GREDVREEER RRCGAEVARR AETLSREKTY LAEPAAACVI ELRASLGDET FAGVVEDAGE GLERWLSGDC GGDAGADTLW LACETFEALP RETRDRVQCV RATKKGKKVD WAEMFTRTHL SKISKALLDT AHTHPRMHSA WEMMLREAPG ARGVVPLWEI VCEDGLFVSG SHQRRFLGFR VFDTLLSSAE AHEIPALFSS NFIKCLLNNL SAPDNYLHEC AVDCLARIVA FASDKKTSSE KKIAVIAALQ RQGPTRFDNV TKTNAVQDLV KSLDSDDALH YLQSMYAVVT KAPVQDSDVV GTEEELASAL ANGTGQKRRL WALEQMAGLA PMLPSDKVVE LMQFMLFHAY YKATDGKAGK KGKSNIPASI LKSPLEEPTG SVRSACSTRF LAMINSNVRA QRAAASKESD DKQDVVDLLS EATSFCRALE GETAVDMIDS IPDECREVRA ELFKALDACV GSGDELAAKV APLIRVLSVL QVGDWREFTP ALQDLPRCVG ELVNPKKKSK KSKKSKKDGE DEPEAIDVLT DILLSLLAQP SALLRDVVEH TFKAVSGQVS KEGIQDMLRI IAGPEVGEDA GEGEGDGEDE DVLMEDDDSD VDDDDDDDEE SDDDDDDEES DDEEDYGEAN DAEIAAMRAA ASKIVGTAAE DSDDSDSESE GMDDAAMFRI DKLLAEAFKS RQQDLMRKKN LKRATRDFKF RVISLFQLYA KAQPGSAYLP NAVVTLLEAM RDSLGKQDPQ SAQLAERIAA LISKHIAHAR DLPELLGDEV TSKTIQSKLL EVIVAANRGA SDAQVFNKAA GAAAAYLLRV LEAVALHEKG GKAAKVGEEV ASENAIDCFR EALKMFKSKK SKLKTGFFSQ TFARHPALAS ALLPELFSLV AIDADKPNAR GEFLRLEALK LVNPVIQSGK KRYPPLAKSA TKSMKTLSVS LAAAIGAPYK NKNTRADACQ QAANCIESLN RLIGEIEIKT IIDVDAIIDA VAKQMSRPPA LPQKAQKAFQ RICALLDRAV PDVEMQPKSD ANDDDGDDGS ESEEDAPKQK KDKKSKKRRD SSGGENSSKK KVKKNR
|
| |