Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_36160 |
Symbol | |
ID | 5000136 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | + |
Start bp | 769838 |
End bp | 772918 |
Gene Length | 3081 bp |
Protein Length | 995 aa |
Translation table | |
GC content | 54% |
IMG OID | 640415557 |
Product | predicted protein |
Protein accession | XP_001416241 |
Protein GI | 145342543 |
COG category | [R] General function prediction only |
COG ID | [COG0488] ATPase components of ABC transporters with duplicated ATPase domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGGGA TTTGCAAGGG GGTCGGCGTC GCCGCGCTCG CGGACTTACA CGTCGTGGAT GAGATTAAAA AGGCTATCGA GGATAAAAAG GATCCCGTCG CTCGCGCAGG GGCGTTGTTG ACGTACGCGC ACATGTGTCG AACCGCTGGT CGTGGATTTG AGCCGTACGC CATCGGTGAA GCTCCCATCG TTTTCACGTT GCTCGGTGAT AGAAATGCCG ATGTTCGCGA GGCGGCGAAT TTGGCGCAGG CTGCCGTCGT CAAGGCGTTG CCTTTGACGG CGATGAAGCT TTTATCTCCG GCATTAAATG CTGGCATGCA GCACAAGGAC TGGCAGAGTA AGCTCGGCTC GTTACACATC ATGGGCGACC TCGCGAACAG AGTGCCGCAA TCGTTCATGC GAGCCATTCC GGATCTGTTC CCGTCGTTTT TGGACACTCT ATTCGATACG CACCCGAAAG TTTCTGCGCT TTGTGAAGAA ATTCTTCCCA GTATTTGCTG TTGCGTAAAA AATGCGGAGG TTTTGGGCAT GATGGACCTC GTATTGAGTG CGATTCGCAC ACCGCAAAAA GCCACCGAGG ACTGCTTGGA CAAGCTTATG GAGACGACAT TCGTGAACTC CATGGACGCC CCGTCCTTGG CTGTGATTTT GCCCGTCATC TTGCGCGGCC TGCGTGAGCG AACGAAAGAG CTCAAGCAAA AGGCTGCAAC GACTTTCGGA AACATTTGCG CTTTGGTCGA CGACCCTCGC GATCTGCTCC CGTTTATTCC GGTGTTGTTG CCCGAGTTGG AAAAGGCTGA AGAGCACTCG CATCCAGACT TGCGCGAAGC CGCGACGCGC GCAAAAACGA GTTTGATGAA AGGTATCGAC GCGAGCCAGT CCGAGGAGCG CAAGGTTGCG TCGAATATCG TGAAGGAGGC CATCAATGGC GCCGGCGTCA ACGTTGACGT CGAGACGCAG TCGTATGCGG CCGCGCTCGG CGGTTGGGTC ATGGATTCCG CCCCGTTGCG CATTCCTCCC TCCATCTTAT CGAATGACAT CAAGCGTGAG CTCACTCCGG TGCTCGAAAG CGCGTTCGAA GGGAACATTC AAGCCATTGA AAAGATTGCC AATGTATCCG TGATGGCATA CAAGGGCCTC GACGAATCAG CCCTGCTCGA CGAAAGCACG AAGGATTACA TCGTCGATCT TCAAGGCATT ATTTTAGCCT TTGCCGGTCG CGTGTTACTT CAACGCACGA ATTTCACGCT CGAACGTGGT CGCACGTACG GCATCGTAGG TCAAAATGGT ACCGGTAAGA CGACACTTTT GAATCGCGTC GCGGCGAAGG ATATCGCTGG TTTCCCGGAG GACGTCTCTG TATACTACAT TCAGCACGAA ATTATGAGTG ATAAGGAAGA AACGATCGTC GACTTCATGG TTCAAATGGT ACCTGAAGGT GTCACACGAG ACACCGTCGT CAACACTCTC AAAGAGGTTG GTTTCGACGA CGAAAAGATG GCTGCCACTA TTCAATCCCT CTCTGGCGGG TGGCGTATGA AACTCGCCAT CGCGCGCGCC ATGTTGTGGG ACGCCGATGT TCTCTTGTTG GATGAGCCAA CAAACCACTT GGACACGAGT GCTATCGCGT GGTTGACCAA CTATTTGAAG TCCTTAACGA ACACCACCAT CTGCCTCGTG TCGCACGACT ACGACTTCCT GGCCGAGGTG CTGACGGACG TCATTCACTT GAGTGAAAAG ACGCTCACAT ACTATCCCAT GAGCTTCCGC GATTTCCAGT CGTTGAAACC TGAAATCGTC GCCGCGTTGC CGAGCAACGA CAATGCGATC GCTAAGCAGT CAAACATCGA AGGTGGCTCT GCCGACGGAG CGAAAGAGGC GAAAAAAGAG AGCGGCTTTG CCATCGACCA AATCGGCGAC GACCAGCCGT CGCACATCAA ACCGATTCGA TTCCCCGAAC CGGGTGATTT GGAGGGCGTC AAGTCTCGCG CAAAATGTGT CATGTACATG AAAGAGGTAT CGTTCGGCTA TCCAGGAACT TCAAAGAAGA TTCTCAACGG TGCTACCGTG AAAATTACGC AAAACTCTCG AGCGGCGCTC GTGGGCCTCA ACGGCGCCGG TAAGACGACG CTTTTGAAGT TACTCATCGG CGCACTTCAA ATCGATGAAG GTGTCGGGGA AGTCTGGCGC CACCACAACT TGCGACTGTC CTACATCGCG CAACACTCCA TGCAGCACTT GGAAGAGTCT TTGGAGAACA CGCCGTTGGA GTACTTGCAA AACCGCTTCT ACAATGGTCG CGATAAAGAA ATCGCCAAGC GCGCCTCGCA CAACTTGAGC AAGGACGAGC TCGCAACATC GCAAGAGCGT GGTAACATCA TGGATGTCAT TGGTCGCGTG ATGCGAGGTA AACATCTGTT TTACGAAGTG CGCCGCGCCG GACGCGAAGA AAACGACACC GATTGGGAAA TGATGAGTTC GTTAGAGCGC AAGGATCCGT ACGTGATGAA GATGGTGCGT AACTTTGACG AAAAGCTCAA GGCGATGCAA TCGGGTATGG ATTTACGACC GCTCACCAAA GAAGAAGTGC GCATTCACTT GGAAAACTTT GGTATTGATC AAGATTTGGC CATGGGGAAA ATCAAGCGCA TGTCCGGCGG GCAAAAGAGT CGACTCGTCC TCGCCGCGGC CATGTGGACG AATCCACACA TCATCGCTCT TGATGAGCCG ACAAACTACT TAGATAACGA TACCTTAGCT GCGTTGACCA AGGCGCTCAT TGATTTCAAG GGTGGGGTGA TTACGATTTC CCACAACGAG CCGTTCGTGA ACGCCGTGTG TGACGAATTG TGGCGAGTCG GCGACGGAGT CGTCGTGACG GAGCCTGTAG CGGGTAAGGC GCCGAAGAAA CTTTCTGTGG CTGAACGACG GGCTCTCAAA GCCGGCATCG ACGCCGAGGA GGCGACGATG AAAGAAGCAC AGAACAACAA AAAACTCACC GCCAAGGAAA AGAAGGAAGC GGCGGCGGCG AAGAAGGCGG CTGCTGGTCC GGTCAAGGAT CTGTACGGTC GCGGCAAATA A
|
Protein sequence | MAGICKGVGV AALADLHVVD EIKKAIEDKK DPVARAGALL TYAHMCRTAG RGFEPYAIGE APIVFTLLGD RNADVREAAN LAQAAVVKAL PLTAMKLLSP ALNAGMQHKD WQSKLGSLHI MGDLANRVPQ SFMRAIPDLF PSFLDTLFDT HPKVSALCEE ILPSICCCVK NAEVLGMMDL VLSAIRTPQK ATEDCLDKLM ETTFVNSMDA PSLAVILPVI LRGLRERTKE LKQKAATTFG NICALVDDPR DLLPFIPVLL PELEKAEEHS HPDLREAATR AKTSLMKGID ASQSEERKVA SNIVKEAING AGVNVDVETQ SYAAALGGWV MDSAPLRIPP SILSNDIKRE LTPVLESAFE GNIQAIEKIA NVSVMAYKGL DESALLDEST KDYIVDLQGI ILAFAGRVLL QRTNFTLERG RTYGIVGQNG TGKTTLLNRV AAKDIAGFPE DVSVYYIQHE IMSDKEETIV DFMVQMVPEG VTRDTVVNTL KEVGFDDEKM AATIQSLSGG WRMKLAIARA MLWDADVLLL DEPTNHLDTS AIAWLTNYLK SLTNTTICLV SHDYDFLAEV LTDVIHLSEK TLTYYPMSFR DFQSLKPEIS GFAIDQIGDD QPSHIKPIRF PEPGDLEGVK SRAKCVMYMK EVSFGYPGTS KKILNGATVK ITQNSRAALV GLNGAGKTTL LKLLIGALQI DEGVGEVWRH HNLRLSYIAQ HSMQHLEESL ENTPLEYLQN RFYNGRDKEI AKRASHNLSK DELATSQERG NIMDVIGRVM RGKHLFYEVR RAGREENDTD WEMMSSLERK DPYVMKMVRN FDEKLKAMQS GMDLRPLTKE EVRIHLENFG IDQDLAMGKI KRMSGGQKSR LVLAAAMWTN PHIIALDEPT NYLDNDTLAA LTKALIDFKG GVITISHNEP FVNAVCDELW RVGDGVVVTE PVAGKAPKKL SVAERRALKA GIDAEEATMK EAQNNKKLTA KEKKEAAAAK KAAAGPVKDL YGRGK
|
| |