Gene Information |
Locus tag | OSTLU_48514 |
Symbol | |
ID | 4999916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 421505 |
End bp | 424313 |
Gene Length | 2809 bp |
Protein Length | 931 aa |
Translation table | |
GC content | 55% |
IMG OID | 640415337 |
Product | predicted protein |
Protein accession | XP_001415488 |
Protein GI | 145340762 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
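
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the unspliced coding region ends with a single stop codon. The function names are hypothetical and exist only for this illustration.

```python
# Values copied from the record above.
START_BP = 421505
END_BP = 424313
PROTEIN_LENGTH_AA = 931


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_bases(protein_length: int) -> int:
    """Bases needed to encode the protein plus one stop codon (assumed)."""
    return (protein_length + 1) * 3


gene_length = span_length(START_BP, END_BP)
cds_length = coding_bases(PROTEIN_LENGTH_AA)

# Bases in the gene span that are not accounted for by the protein and stop codon.
extra_bases = gene_length - cds_length

print(gene_length, cds_length, extra_bases)  # 2809 2796 13
```

Under these assumptions the 2809 bp span holds 2796 coding bases (931 codons plus a stop codon), leaving 13 bases beyond the stop codon.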
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGATGC GGTTGGACAT CAAGCGCAAG CTCGTGCAGC GGAGCGACCG GGTGAAGGGG GTGGAGATAC ACCCCACGGA ACCGTGGATA CTGACGAATC TGTACTCGGG GAACGTGGCG ATATGGGATT ACGAGACGAA CGCGCTGGTG AAGTCGTTCG AGGTGACGGA ACTGCCGGTG AGGACGTCGA AGTGGATCGC GCGCAAGCAG TGGATCGCGA CGGGGGCGGA CGACATGTTC CTGAGGGTGT ATAATTATAA CACGTCGGAG TTGGTGGTCG GGTTCGAGGC GCACAGCGAT TACATACGGT CGATCGCGGT GCATCCGACG CAACCGTACG TGGTGACGTG CTCGGATGAT ATGCTCATCA AGCTTTGGGA TTGGGAAAGG CAGTGGGACT GCGCGATGGT GTTCGAGGGA CACTCGCACT ACGTGATGCA CGTGGTGTTC AACCCGAAGG ATACGAACAC GTTCGCCAGC GCCTCGCTCG ATCGCACCAT CAAGGTGTGG AACGTGACGT CGCCGGTGTG CAATTTTACG CTCGAGGGTC ACGAAAAAGG GGTCAACTGC GTCGACTACT TCGCCGGCGG CGATCGCCCG TACCTCATCT CCGGCGCTGA CGATAAGTTG GCTAAGATTT GGGATTACCA GACGAAATCG TGCGTTCAAA CGCTCGAAGG TCACGCGCAC AACGTTTCGG CGGTGAGCTT CCACCCCGAA CTTCCCGTGA TCATCACCGG GAGCGAGGAC GGGACGCTGC GCATCTGGCA TCAGAATACG TATCGCTTGG AGAACACGCT CAACTATGGA TTGGAGCGGG TTTGGGCGAT CGGTTGCTTG AAGGGATCGA ATTCGGTGGC GATCGGTTAC GACGAAGGGA CGGTGATGTT CAAGATCGGT CGCGATGAGC CCGTGGTGAG CATGGACAGC ACCGGTAAGA TCATCTGGTG CAAACACAAC GAAGTGCAGA CGACAAACGT CAAGGCGCTT CCGGCGGATT ACGAAGCCGC GGACGGTGAG AGACTGCCGT TACCGGTGAA GGAACTCGGT AACAGTGAAC TGTACCCGCA GAGCTTGGCG CACAACCCGA ACGGACGGTT CGTTGCCGTT TGCGGCGACG GCGAATACAT CATCTACACC GCCCTCGCGT GGAGAAATAA GAGCTTCGGG AGCGCGATTG AATTCGCTTG GAGCATCGAT CCCAGCGAGT TCGCTGTTCG TGAAAGTTCG AGTAAGATTA AAGTTTTCAA GAACTTTACG GAAAAGAACG CTTTCCGCCC TAATTTCACC GCTGAAGGTT TGCACGGCGG CGCATTGCTC GGTCTTCGCT CGACGGATTT CATTTGCTTT TACGATTGGG ATGAGTGCCG TGTCATTCGT CGCTTGGATG TGTCGGTTAA AAACGTCATC TGGAGTGAGT CGGGTGAAAT GGTCACCATA GTAAGCGATA CGAGCTTCTT TATCCTGCGA TACAACCTTG AAGCGACTGC GGAGGCTTTC GCATCTGGAC ACGTGGACGA GAGCGAGGGC GTTGAAGAGT CGTTCGAGCT GATTTCCGAA ATCAACGAGT CGGTCAGCAC GGGTATTTGG GTGGGAGATT GCTTCATCTA CACTAACACC GATAAGCGTT TGAACTACTG CGTCGGTGGC GAAGTGACGA CCCTTACACA CTTGGATCGT TCCATGTTCA TCTTGGGTTA CCTCGCGGCG CAAAACCGTG TCTTCTTGAT GGACAAGAAT TTTGCCGTCG TGTCCTTTAC TTTGCTGTTG ACCGTGGTCG AATTCAAGAC ATTGATCCTT 
CGCGGTGAGT TGGAGGCCGC AGAGGAAGTT CTCGAGACGA TCCCGGTCGA TCAGCACAAT TCCATCGCGC GCTTCCTCGA GTCTCGCGGC TTGGTGAGCG ACGCCTTACG CATTGCGACC GACCCGGATT TCAAGTTTGA ACTCGCTGTT CAATTAGGTG AACTCGATAT CGCCCGAGAA ATCGTAGAAA CGGAAGGTGC GAATGAGTCG AAATGGAAGC AACTCGGTGA GCTTGCGATG TCGAATGGTG ACCTCGAGCT CACGAACAAG TGCTTAGAAA AGTCTGGAGA TTTGTCTGGT CAGTTGTTAC TTGCGACCTC ATCTGGATCA CCAGAGACGC TCAAGCAACT CGTGGAGGAA TCGAAACTCA AGGGTAAGAA CAATGTGGCG TTTGTATCGA TGTTCATGCT GAAAGATATC GATGGTTGCA TCGACTTATT GATCGAGACA AAGCGTATTC CCGAGGCGGC GTTCATGGCG CGCACATATG CGCCGAGTCG TGTATCTGAA ATCATCGCAT TGTGGAAGGA TGACTTGAGT AAGGTAAACA AGAAGGCAGC CGAGGCTTTA GCCGATCCAG CGGGACATCT GGAGTTATTC GAAGGCTTTG ATGAGGCACT CGACGCGGAA AAACACGCCA GAGCGCAAGC GGGCGCCCAG GCCGACGCGT GCGAATATGG CGTCGGGCTC GCCGTTGACA AGCTTACGGA TGCGATCGAC GACATCGATG TGAACGAGCA ACAAGATCAG TCAGAAGCTG TAGCGGAGCC CGAAATCGAG GAAGAGGCGT CGCCGGAATC CGATATCGCG GTGGAGCCAG AATCTGAAGT GGTGGATGAG GCGGAACAGG AAGTCGAGGC AGAACAGGAA GTCGAGGCGG AACAGGAAGT CGAGGCGGAA CAGGAACCCG AGGTATCATC CGGCGAGCCA GCCGCAGATG GTGGTGAAGA GGATTGGGGT TTGGACGATG ACGCCGCACC GGCGAAGGCT GATTAAAATC ACACTAGCT
|
Protein sequence | MPMRLDIKRK LVQRSDRVKG VEIHPTEPWI LTNLYSGNVA IWDYETNALV KSFEVTELPV RTSKWIARKQ WIATGADDMF LRVYNYNTSE LVVGFEAHSD YIRSIAVHPT QPYVVTCSDD MLIKLWDWER QWDCAMVFEG HSHYVMHVVF NPKDTNTFAS ASLDRTIKVW NVTSPVCNFT LEGHEKGVNC VDYFAGGDRP YLISGADDKL AKIWDYQTKS CVQTLEGHAH NVSAVSFHPE LPVIITGSED GTLRIWHQNT YRLENTLNYG LERVWAIGCL KGSNSVAIGY DEGTVMFKIG RDEPVVSMDS TGKIIWCKHN EVQTTNVKAL PADYEAADGE RLPLPVKELG NSELYPQSLA HNPNGRFVAV CGDGEYIIYT ALAWRNKSFG SAIEFAWSID PSEFAVRESS SKIKVFKNFT EKNAFRPNFT AEGLHGGALL GLRSTDFICF YDWDECRVIR RLDVSVKNVI WSESGEMVTI VSDTSFFILR YNLEATAEAF ASGHVDESEG VEESFELISE INESVSTGIW VGDCFIYTNT DKRLNYCVGG EVTTLTHLDR SMFILGYLAA QNRVFLMDKN FAVVSFTLLL TVVEFKTLIL RGELEAAEEV LETIPVDQHN SIARFLESRG LVSDALRIAT DPDFKFELAV QLGELDIARE IVETEGANES KWKQLGELAM SNGDLELTNK CLEKSGDLSG QLLLATSSGS PETLKQLVEE SKLKGKNNVA FVSMFMLKDI DGCIDLLIET KRIPEAAFMA RTYAPSRVSE IIALWKDDLS KVNKKAAEAL ADPAGHLELF EGFDEALDAE KHARAQAGAQ ADACEYGVGL AVDKLTDAID DIDVNEQQDQ SEAVAEPEIE EEASPESDIA VEPESEVVDE AEQEVEAEQE VEAEQEVEAE QEPEVSSGEP AADGGEEDWG LDDDAAPAKA D
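
The relationship between the two sequence fields can be illustrated by translating a short stretch of the gene sequence. The sketch below assumes the standard genetic code (NCBI translation table 1), since the record's "Translation table" field is empty; the helper name `translate` is hypothetical. Only the first ten codons and the last five codons of the coding region, copied from the gene sequence above, are used.

```python
from itertools import product

# Standard genetic code (assumed; the record does not state a translation table).
# Amino acids are listed for codons in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore an incomplete trailing codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence field.
gene_start = "ATGCCGATGC GGTTGGACAT CAAGCGCAAG"
# Last four sense codons of the coding region followed by the stop codon.
gene_end = "GCGAAGGCTGATTAA"

print(translate(gene_start))  # MPMRLDIKRK
print(translate(gene_end))    # AKAD*
```

Both results agree with the start (MPMRLDIKRK) and end (AKA D) of the protein sequence field.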
|
| |