Gene Information |
Locus tag | OSTLU_14831 |
Symbol | |
ID | 5000812 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 941189 |
End bp | 946052 |
Gene Length | 4864 bp |
Protein Length | 1563 aa |
Translation table | |
GC content | 61% |
IMG OID | 640416233 |
Product | predicted protein |
Protein accession | XP_001417100 |
Protein GI | 145345183 |
COG category | |
COG ID | |
TIGRFAM ID | |
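
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the record itself. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene span includes the stop codon. The helper names `span_length` and `coding_length` are hypothetical and chosen only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the Gene Information fields above.
gene_len = span_length(941189, 946052)  # Start bp, End bp
cds_len = coding_length(1563)           # Protein Length in aa, plus stop codon
non_coding = gene_len - cds_len         # bases in the span that are not coding

print(gene_len, cds_len, non_coding)
```

Under these assumptions the span length matches the listed Gene Length of 4864 bp, and the difference between the span and the coding length is what one would expect to be intronic sequence, which is consistent with the "Is gene spliced | Yes" field. The record itself does not state intron positions or sizes.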
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTCGAC GGGCGAAGCG CGCGGTGAAA GACGCGCGCG TCACCGCGAA GGCGCGCGCG ACGGTCGAGA CGCGCGACGA GGCGTACGCG CGCGTGCCGC TTCGAATCGA CTTGGACGGC GCGCGCGCGA GCGATGACGA CGAGGACGCG CGCGACGAGG ACGCGCGAGC GTCGCCGCGC TTGGTGGCGC ACTACGGATT CGACGCCACG ACGTCCGCGG TGTCGTACGC GAGGGACCAA CACGCGCTGT TCGCGGCGAG CGAACACGGC GTGAAGGCGT TCGGCGCGCG CGGGACGGAG TGCTTGTTCG CGTCGGCGTC GGGAGGCTCG TCGGAGGCGG TGCGAGGAGA GTACGTCGCG CCGTCGCGCG TGTATCGATG CGGACGAGAC GGCGACGTCG AGGTGTGGGA CGGGCGCGAG CGAAGGTCGC TGGCGAGCGA AAATTTAGAC GCGGGCGACG ACGAGCCGAG CTGCGCGAGC GAGGCGATGC GAGGGACGAA TTTCTTCGTC ACGGGAACGG TGAGGGGGGA CGTGTGCGTG CACGCGTTGG GAGTGAACGC GCGAGGAGAG ACCACGGTGG CGAGAAGGCG CGGGTACAGG GTGTCGGCGT CGCGGGCGTT GTCGCGCGTT TTGCCGACGC GAAACGCCGT GGCGGCGGTG CGCGCGCGAC CGGGAAGAGA CGAGCGAGCG ATGATGCTCA TCGCGTGGTC GGACGGGGCG CTCGCGCTGT GGCACTTGCA CGAACAACGG TCGGTGGCGG TGACGTCGCC GCGAGGCGAC GCTGTGGACG AGACGGAGGC GACGGACGCG TCTTTGACGT GCGCCGAGTG GCTCGACGCC GAATGCGTCG TCGCGGGGTA CAGCGACGGG CGCGTGAGAG TGTGGAAAGT GCGCGCGGGT ACGGTGATGA CGGAGGAAAT CGTTGAAAAG CAGTGCATCG TGCCGCACGT TTTGATTAAT CCGTCGTACA AGGGCGCGCT GACACCCATT CGCGCGTTGA AGACGTACGT GAGCGAAGAC GACGACGCGG CTCGCGACGT CGCCACGTGG TTCGCGTGCG TCGGCGGCGA GCCAATCGCG TGTCCAGATC CCGTGATTTG TTTGCGAGCG ACGCGATGTG ACGGGGAGTT TCGAATCGAA AGCGCGGGTG CGATTGCGCT GCCTTGGTTC GGTCCAGTGC TCGACGCGAC GTTGGTGCCA TATCGCGACA CTGTGGAATC GATTTGCGTT CTGTCCGAAG GCTCGCAGCT ACACCTACAC GACGTGCGTT ACGGAGTTTC GAGCGACGAC ATCGCGCGCC CGCAGGAAGT AATGGTGATG CGACCGACAC TTTCGAAGAC GTGCGCGCCG AGTATCTGCG CGACGTCGCG CGACGTCGCG CGGGCATTTG AGCAGAACGC CAAACGCGTC GCGGTTCCAG AAAATGAAGT AGACGCGTCG TTCGCTTGGA AAGGGTCGAA ATGGCCAATC AGCGGTGGAT GCGACGTAGT CGATGACACG ACATCGCCTT CATCCGCGCT GCGTGCGCGC ATCGTCGTCG GCGCGTTCGG CAGAGACGGT TCTGGAGTGA AAATCTACAT CGATCGCGAC GGACGATTGA TGTCTGGTGG CGCGATCGCG CCGAACGGCG ACTCGATCAC GAGACTGCAC GTCGACGCCG GTGGTGCTTT GCTGATCGTC GGTCGGGTGA GTGGAAACCT TGAAATATAC GCGTTGCGCG AGTATCCAGC CGACGGTGCC GAAGCGACTT CGCGAGTACG TACGCACGCG CGCAACTTAA ACAAGAGCGA CGCCGAAAAC 
GTTGACGAGG AGCGTGCATT TTTCGCCGGT GAATCTCGCG ACTTTGTCGA TACGGATTCC GCAGCGGCGT GTATGACTTC GGTATACACA CTCATCGGTA GATTTAGCTC GGCTGGCAAA GCCATATCGT GCGTTCGAAC GAACGCCGCG GCGACACTCC TCGCCGTCGG CGACGTCGCA GGGTGTGTGT CCTTACTCAA TCTTCAACGA GGAACGAAAA TGTGGACGGT GTCGTTACCT ACTTCTGCCG AAGGAAAGCC ATCGGCCGTG GCGGATTTCG ACTTTGGGTT ACCGCTTCCA GACGCTCCGG ATGAGTGCGT CCTGGCGGTT CTAGAATCAA ACTGTAGCGT TCGTTTCCAC GCGCTTTCGA CCGGTTCGCA GATTGGGAAG ACTATGACCC CGAAGTCAGC GGCAGAAGAC GTGGCGCTTG CAATCTCGCT CCTTCGGCTC GATGGTACGG CGTCAGATAT AATTCCACCA GCGCCCGTCG CCAAAAGTTG GTTCACACCG CCGACATCGT TTGCTTATCA ATTTCTCGCG TCGGAGTGGA CTGTGGACGA CGCGTCTGCG TCAGACGACG AAGACAAAGT GCTGTCGACG GACGAGGAAG ACCGTGTCGA CAGCGACGAC GTCATTAAGG GAAACCCAAA GCTGACCGCC ATTGTCGTTA CTGTGGCGAA AGATTCGATG CGCGTATACC ACGCAGTCGG CTGCTCTCGA GGAGAGCGAT TTACGTTGCG AAAAGAACAC CTCGACGAGC CGCTCATCGG TGCGTTCGCC GTTCGAGATG ACGGTAGCGA AAACGAATGT GCGTTCGGTC ACGGACGGAA ACGTTCTCAC ATCGTCGCCT TGACGGAGTT TGGGAGAATG GTGGCGTACG CGTCGCCGTC CCTGCAAATG CGCGGCGTAT TTGGTCCGGT GCCGGCGTTG TCCAACGCCA ACGCGACGTG TTGCTCTCGA GGCGGCGCCG TCGTCGTCGT CGCAGATGAC GGCTTGTCTA TCGCTCGACT CGAAGCATTT GGTGACAGCT TTCCCGCGCA AGGTTGTATC ATCGATCTCG AAGTCGAGAG CGCCGCGGAA GCCGCGCGAG CGGCGCGACA CGCCATGGAA GAAGACGATC CCGAACTCGC GCAGCAAACC AAGTCACCTC GACGAGAAGT ATCGTCAGCG TCGACGACAC CCATGAAGAC CAAAGCGCTG ACGATGAGTG AAGCGCTACG ACATCGCGCA AAGGCGGCGT TTGAAAAGCT GGAAGAAAAG TTCGCGCAGT CACCACGTGA CTCATCGTCG TCGCCAGCGA CGCGCAAGAT GTACACGACG ACGGACCTGG CTGTCTTATT CGCAGATGCT CGCATAGAGG ATCCCGCGCC GGTCGAAAGG ACGACAGAAT CTGCAGAGCG CGATGAGTTG TTCGCGACAT CGTCGTCCAC GCCAGCCATC GCGCCTGTGC GTCGGAGCGC GTCTTCAGTC CGAGCCAAGT ATGGACGAGA AGTTACTTCG CAAATGAACG AAACAAAAGA CATGCTCGTC GAGCGTGGAG AGAAACTTAG CAGGCTTCAA GATAAATCAG CCTCGCTCGA AAATGACGCG GCGGATTTCG CCTCGCTCGC TCGCGAGATT CGTAAGCAAT CCGAACGTCG GTGGTTTTAG ATGCGATTTT AGAATTTTTA GTAGCAGCGC GACGCCGAGC GGCGTCACAA GTACACGCGC ACCTTTCATG CCTCCCGCTG GAAATTGGCC CAAGTGGGCC AAAAAGCGTC ACGCTCGAGA CTTCACGGCG CGCGACACCG TGCTCGAGGA GCTGCGCAAG 
AAGACCCACC ACGGCGCCTC GCTCCTCAGA CGCGCGTTGA AGAAGGCCAA AACCTTTGAC GAAGCCAAGC TCCGCCGGCG TCTCAAGGCG CCCGAGGACG CCGATAAGCT CAAGCGCGCG TTGCACGCGA CGCGTAACGT CGATATCGAC GTCTTGGCTC GGCAATGCGC GTCGGCGTGC GCGCAGAGAG TGGAAGAGTC GCTCGAGTTG GCATTTCGCG AGGTGGCGGC GGATGGCGAC GCGAACGCCG TCCGCGCGAC GCACGCCGTG GACTTGGGCG ACGGCTTCGA CAGTCAAGAC GTGTTACGCG CTATTTGCGA CGACGCCGCG AGCGCAAAAG CGAGCGAGAG CGAAGGGAAG GAAGTCGTCG GGGAGAATGA ATACGTGAGC GCGGCGAGGC GGTTGTTGCG AGCGCAGGCG ACGCGCGCGG AGACGGATGC GCTGAAGGAA CGATTGTTGG CGATCGCGGG TCGAATGAAT CGTGCGGTGG TTGGGAAGCA AAAACGCGAA GAGTGGACGC GTGAGAAGCA GGAGCGACGG CAGAAGCGCG AGTTGGCCAT CGTCAAAGCG GAGGAAAAGG CGAAACGGCG AGCGGCGGGA GAGGAGGTGT CGTCGAGCGA GGAAGAGGAG GTCGTGCACG CGAAGCCCAC GTCTCGCCGC GATGCTGACA ACGAGGATAG TGCATCCAAA TCGGATTCTG AATTCGAATC CGACGTCGGT GAAGATGGCT ATGCGAGTTT GAGCGAGGAT ACGTTGGCGG AGCTTTACGC GGCGAAGGCG GCGGCGAAAA AGTCGAAAAA GCAGGCGAAA TCGGCGAAGG ACGGGGACGA TGTCGAGGAT GTAGATCTAG GACCAAAGAA GAAAAAAGTG AAAAAGCGCA TGGGGCAGCG CAAGCGTCGT CAAATCGCGG AGGCTAAGTT TGGCTCAAAC GCCGCGCACA TCATCGCCGA ACGCGAAAAA GCTGCAGCCG AGCGTAGGGC GAAGGAGGAA GAGGAAAGAA ATATGCACCC GTCTTGGAAA GCGAAACGTA AGCAAGCACC CATCATAATC GCTGGTGCAA AGGGTAAGAA GGTCAAGTTC GGCGATGACG ATGGTGCGAA GAAAACGCCG ATAGTTTCGA AGAAGCAGCA ATACGCGCCC AAGGTTCCGG AGGGACCGCT GCACCCATCG TGGGCAGCTA AGCTGAAGGC TGATCAACAG GCCTGGGGCG GCGGCGGTGT CAAGCCCGAG GGTAAAAAGG TCGTATTTGA TTAA
|
Protein sequence | MLRRAKRAVK DARVTAKARA TVETRDEAYA RVPLRIDLDG ARASDDDEDA RDEDARASPR LVAHYGFDAT TSAVSYARDQ HALFAASEHG VKAFGARGTE CLFASASGGS SEAVRGEYVA PSRVYRCGRD GDVEVWDGRE RRSLASENLD AGDDEPSCAS EAMRGTNFFV TGTVRGDVCV HALGVNARGE TTVARRRGYR VSASRALSRV LPTRNAVAAV RARPGRDERA MMLIAWSDGA LALWHLHEQR SVAVTSPRGD AVDETEATDA SLTCAEWLDA ECVVAGYSDG RVRVWKVRAG TVMTEEIVEK QCIVPHVLIN PSYKGALTPI RALKTYVSED DDAARDVATW FACVGGEPIA CPDPVICLRA TRCDGEFRIE SAGAIALPWF GPVLDATLVP YRDTVESICV LSEGSQLHLH DVRYGVSSDD IARPQEVMVM RPTLSKTCAP SICATSRDVA RAFEQNAKRV AVPENEVDAS FAWKGSKWPI SGGCDVVDDT TSPSSALRAR IVVGAFGRDG SGVKIYIDRD GRLMSGGAIA PNGDSITRLH VDAGGALLIV GRVSGNLEIY ALREYPADGA EATSRVRTHA RNLNKSDAEN VDEERAFFAG ESRDFVDTDS AAACMTSVYT LIGRFSSAGK AISCVRTNAA ATLLAVGDVA GCVSLLNLQR GTKMWTVSLP TSAEGKPSAV ADFDFGLPLP DAPDECVLAV LESNCSVRFH ALSTGSQIGK TMTPKSAAED VALAISLLRL DGTASDIIPP APVAKSWFTP PTSFAYQFLA SEWTVDDASA SDDEDKVLST DEEDRVDSDD VIKGNPKLTA IVVTVAKDSM RVYHAVGCSR GERFTLRKEH LDEPLIGAFA VRDDGSENEC AFGHGRKRSH IVALTEFGRM VAYASPSLQM RGVFGPVPAL SNANATCCSR GGAVVVVADD GLSIARLEAF GDSFPAQGCI IDLEVESAAE AARAARHAME EDDPELAQQT KSPRREVSSA STTPMKTKAL TMSEALRHRA KAAFEKLEEK FAQSPRDSSS SPATRKMYTT TDLAVLFADA RIEDPAPVER TTESAERDEL FATSSSTPAI APVRRSASSV RAKCDFRIFS SSATPSGVTS TRAPFMPPAG NWPKWAKKRH ARDFTARDTV LEELRKKTHH GASLLRRALK KAKTFDEAKL RRRLKAPEDA DKLKRALHAT RNVDIDVLAR QCASACAQRV EESLELAFRE VAADGDANAV RATHAVDLGD GFDSQDVLRA ICDDAASAKA SESEGKEVVG ENEYVSAARR LLRAQATRAE TDALKERLLA IAGRMNRAVV GKQKREEWTR EKQERRQKRE LAIVKAEEKA KRRAAGEEVS SSEEEEVVHA KPTSRRDADN EDSASKSDSE FESDVGEDGY ASLSEDTLAE LYAAKAAAKK SKKQAKSAKD GDDVEDVDLG PKKKKVKKRM GQRKRRQIAE AKFGSNAAHI IAEREKAAAE RRAKEEEERN MHPSWKAKRK QAPIIIAGAK GKKVKFGDDD GAKKTPIVSK KQQYAPKVPE GPLHPSWAAK LKADQQAWGG GGVKPEGKKV VFD