Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_26340 |
Symbol | |
ID | 5004301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | - |
Start bp | 18756 |
End bp | 21620 |
Gene Length | 2865 bp |
Protein Length | 954 aa |
Translation table | |
GC content | 55% |
IMG OID | 640419722 |
Product | predicted protein |
Protein accession | XP_001420397 |
Protein GI | 145352102 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCTCG CGCGCGACGC GCGAACGCGC GGTGCGGAGC GCGAGGGCGA CGACGCGGAG ACGCAGTCGA GAGGGAAGAA GCGCGCGCGA CGCAAACGCG ACGACGCGGG AAGGCGGGAC GCGCGGACGA CGTTCGGAGG ATTTGCGACG CTGCCGACGG AGCTGGTGGT GAAGATAATG AAAGAACTTG ATGGTTACTC GCTGGCGATG GCGACGTGCG CGTGTAAGGA TTTTGAAGCG ACGGGGAGGG AAAACGACGA ACTGTGGCTC GAGTTGTTGA TTAAACTCGA ACCGCGCGCG CTGATGCGAG AGGATTTGAT GCAAAACGGC AGTGGGAAGT TTCGTGCGTT CAGCTACAAG GAGTTGTTCA TGTGGCGTCT TCACACGCTG AAAGGGTGCG CGATGATGAA CACGAAGGCG TACGCCGAGA TGGCGGCGCG GGGCGACGCC ACGCGCGCGT TGAGCGCGGA CGCTTCCGGA CCGAGCTCGA GCGAATTGCG GTCGAAGATT GGTCACCTCG GTAGACTGGC CGTGATAAAT CAAGAACAAT TGTTCACGTG CGATCCACGA CCGAAGGGGG GACTCATTTA CAACGTATAC GACAGCGGGA TGAACTCGTT TGTTCCGCAT ACGCTCCTGT GGTCGCCGAG GTCGGAATTA CTGGCGTGTG CTTTCCTCGT GAACGGCGCG GAGTCCAAAA CGTGTCATAG CAAGATTATA CTGTCTGCGC CGAAGGCAGT ACTCACAGGG CAACTGAAGT TTGCCCGGGG CGACGCGCTT ACAAACACAA ATGCAGAGGT GTCGACGCCG CGACACGGGC GACATCCATA CGAAGCTCCG CCCATGAATA ATATCATAGC GTTACCGCAT GGGTTACAGT GCGCACACAT GGCATTTGCG CCTTGTGGGA CGATGTTGAA TATCTTGCAC AAAGACCGCA TGGAGTCGTC GCTATATACG CTCGATTGTG CAATCAGCAT TACGTCCTTG TACGGTCCCA GTAACGGCAA GAGTGGCACT TCGCCCGTGC CGCCACCGTC AGAGCACGTC ACGAGGGTTG CGAGCGGTAC GGATAGCATC CGATTCGCCA TCTCTCCCAT CGATAACAAT TCCTTGCTGA TGTTCGGCGA TCAACGAGAA ATCATGTTGC TGAGGAAACA AGGAATACGC GGGGGCGCGA TCCCGCTGTC TGATCGTTGG CCGGGAACTC GAGCGTTTGA CGAGGTTTCG TCAGACGACG ACAGTCTGTA CGCATCAGAT GGTTTTGAGC GCGAATACGA AATGCCCCCG GATGCGGTCG ATGTTCCGAG ATCTCTCAAT CATTCTCTCA CATCGGCAAA CTCAAACGCG GCGCACCCAG CGTGCGACAC AGCGTCTGAG CCGCTCGAAT CCGCCTTTCA GGACGACATT CACGAGCGAG CGACACGCGA TTGTCAAGAA GTGGTGTCAA CGCCCAAGAT GACTTGGTGG CAACAGCTCA GTCGTAACAT TCTCGCCGGC GTAAAGTCTG CTACTTTGGG CGCGCGGGGC AACGAAAAAC CCTCGACTTC GGCAAAACGC GATGAAGATT CGGAGCATGG CGCGCTTAAA CGTATAAAAG ACTCTGCAAA TAGTATTGAT CGCAGTGATT GGAGTTGTCT CAAGGCGGGC ACGTGGTGGT CAAACATCCC CAACTTTCGT AAGTTGACGG ATATCTTGGT GAAGGAACAC GCCAAACAAC CGCTCGCGCG ACTGTACACT GACGAAACGG CAAAGCGCAT CGAATGGGTT CCTTCCAAGC GAGGCGACGT TAATGGGCGT GGGTTTTGGC TTTTACCGTC GTCTATCCCG GAGCGTTATG ATGATGATCA TGATACTGCG TACTACGCCT TCCTCGTCAT GGTTCCGGTG CCGTCGGAAA AAGCGATCAA GGAACGAAAG ATAAAGCCCT TCATCAGCGG GGACGACGAC GAACTGTTTC AAAATATTAG TGAGTGCGTG ATAACTGAAA TCTCGCCCGC ACCAGCGTAC GCGGAGCCCA ACATAGTCGC TCGGTTTCTT TATGCTGGAA GGGCGGGAGG TCGCCAAGTC GTTTGGTCGA CCGTAGACGG CCTCTTCGTG CGCTGTGTAG ACTTTAATCT CGACGCCGAG GGCGACTGGA TGGGTCCTCC CACTCTGGGT CCGATGATGC AGGTGATCGA TTTCACGAGC ATCTTGAACG GTGCGAACAG ATGGAACAAA TCATACGATG AAGTTAGCGT GCGAAACTAT GCTTCCTGGA ACGGTTCGCG CCAAGTTCCC GTGTTATCTG TGGTGAATGA GGTCTTGCGT TACACCATCG AGGTGATTCA GTGGAGCCCG AGCGGCGAGC GCCTGCTGAT CCTTATGGCG GTTCACCTCG CGTACGAAGA GCCGGTGCAG GCCGGTGAAT ATTATACTGT GCATCAGTGG ATTTGTTGGG ATCCACCCCC TGTGATGTCC GACGATTCTG AGAAAGCGCT GCACCGCGTC CCGGCGGGGG AGTGTCACGG CGTTTTATCG TTCGGGTCTC GATTCATCCC GTCGCAAACG TTCCGGTCGG AGTGTTCCGA GAAGATGGAC AACTTATCCG GAGGTATTAA TTTGTGGAGT CCTGAAGAAA CTGCCATAGC GTTCGGCATT CAAGTACCCA GAGTCGTGCC GACGCATGAA AGCAGCGATT ACATAGTCAT CCAAAATTTC CCTCGCGTCA ACTTTGACGA AATTGAGCGA CGAAGCCTCA CGGGTGAGGC GCCGCCGCTT CAACAAGACG GGTCCGTGGC ATCCTTGCCA TATCATTTGA ACTACGTCGA TAGCGCGTTA GAGTACGTCT GCGAAGGCAC GTATTGTTCG TGGAGTCCGA CGTGA
|
Protein sequence | MTLARDARTR GAEREGDDAE TQSRGKKRAR RKRDDAGRRD ARTTFGGFAT LPTELVVKIM KELDGYSLAM ATCACKDFEA TGRENDELWL ELLIKLEPRA LMREDLMQNG SGKFRAFSYK ELFMWRLHTL KGCAMMNTKA YAEMAARGDA TRALSADASG PSSSELRSKI GHLGRLAVIN QEQLFTCDPR PKGGLIYNVY DSGMNSFVPH TLLWSPRSEL LACAFLVNGA ESKTCHSKII LSAPKAVLTG QLKFARGDAL TNTNAEVSTP RHGRHPYEAP PMNNIIALPH GLQCAHMAFA PCGTMLNILH KDRMESSLYT LDCAISITSL YGPSNGKSGT SPVPPPSEHV TRVASGTDSI RFAISPIDNN SLLMFGDQRE IMLLRKQGIR GGAIPLSDRW PGTRAFDEVS SDDDSLYASD GFEREYEMPP DAVDVPRSLN HSLTSANSNA AHPACDTASE PLESAFQDDI HERATRDCQE VVSTPKMTWW QQLSRNILAG VKSATLGARG NEKPSTSAKR DEDSEHGALK RIKDSANSID RSDWSCLKAG TWWSNIPNFR KLTDILVKEH AKQPLARLYT DETAKRIEWV PSKRGDVNGR GFWLLPSSIP ERYDDDHDTA YYAFLVMVPV PSEKAIKERK IKPFISGDDD ELFQNISECV ITEISPAPAY AEPNIVARFL YAGRAGGRQV VWSTVDGLFV RCVDFNLDAE GDWMGPPTLG PMMQVIDFTS ILNGANRWNK SYDEVSVRNY ASWNGSRQVP VLSVVNEVLR YTIEVIQWSP SGERLLILMA VHLAYEEPVQ AGEYYTVHQW ICWDPPPVMS DDSEKALHRV PAGECHGVLS FGSRFIPSQT FRSECSEKMD NLSGGINLWS PEETAIAFGI QVPRVVPTHE SSDYIVIQNF PRVNFDEIER RSLTGEAPPL QQDGSVASLP YHLNYVDSAL EYVCEGTYCS WSPT
|
| |