Gene Information |
Locus tag | OSTLU_24407 |
Symbol | |
ID | 5001326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 405074 |
End bp | 408451 |
Gene Length | 3378 bp |
Protein Length | 1106 aa |
Translation table | |
GC content | 57% |
IMG OID | 640416747 |
Product | predicted protein |
Protein accession | XP_001417232 |
Protein GI | 145345470 |
COG category | |
COG ID | |
TIGRFAM ID | |
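The Start bp, End bp and Gene Length fields above are consistent with each other if the coordinates are read as 1-based and inclusive. That reading is an assumption, not something the record states. A minimal sketch of the check follows; the function name `gene_length` is hypothetical.

```python
# Sketch only: assumes Start bp / End bp are 1-based, inclusive coordinates.
# The function name is hypothetical; the numbers are copied from the record.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the feature length for 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


length = gene_length(405074, 408451)
print(length)  # 3378, matching the Gene Length field
```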
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.100413 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGACGC GCGCGAGCGC CGCCGGCGCG ATGGTTTCGC TCGCGATGGC GAGCGGGACG GCCGTCGCGT ACACGGCGGG GACGGCGTGT TACCCCGAAG CGGCGTGCGA GTCGGGCGGA AGCAACACGT TCGATAAATA CCAGCGAAGA GTGTGTCAAA TGCTCGCGGA CACGAGCAAG TATTGCATCA TCCCGAGCGA ACCGACGGAC GAATTGAGTG GGTGTTCGCG CGAGCTGCTG TATCACGGTT GGTGCGACGC GAACAAGTAC GTCTTTGAAT CGGGATGTTT GGCCAAACAT CCGACGACGG ATTCTTCGGC GTGCTCGAAT TGCAATTTCG ATTCGTATCC GACAGCGTAT TGGAGCGTGG GGGCGCCATA TCCTCTACAC CCTGGAATGG TTAATCACGC TTTCCGAGAG TGCGTGGTCG ATTACGTCAA GGCGAAGATC GAAGAAGGTT TTGCCGCTGA AGGTATGACC GCGGATGCGA CATCGCTTCA GCAGCTCGCA GAGCAAGAAT TTCCGGTGGA ACAAATGGTG GACGTTGACG ATTACAAAGA GCAGTATTCA AGTTATTACT TAGATGATGG AGACTTCGAC GAGTTGGACG AAGAGATGAC CGATGAACAC CTCGACGACG TCTCCGGATC CGTTGACGAA AGCGTCATGA CGAAGACAGT AACGGCTTCG TGCTCTTCGC CCTCAGACGA CAAAATCTGC TACAAAATGT GCGCTGGTGA CCTCGGTCTT AATCTCGAGT ATAAGATCAA GGTGCCGAAA GTCTGCGCTA AAATCAAAAT TCTCGGGAAG AAGTTTAAGA AATGCATCAG AGTCCCGACG ACGAAGTTTA AAATTCCAAA CAAGTGCACG AACTTGTGCG TCAATATTCC AGGTTACTGC GAGATGCAAG AAGCGGCGAG TGCGATTACA CAGATAAAGA ACATCAGATC ACTCGGTGAT TTGGCATTAC CCTGCAAAAC ACTGGGTGGG CCGGGCGATG TCTGTAATCT GCTCGAAGAA GCCGACGACG CGTTTAGAGC GATGGAGTCG ATGGCGAAAA TAGCCACGAC GTCAACGGTC GACGCCTTTC TCGATCTGCG AGTGTTACCG TCGATATTGC AAAACGTCCT CGACGAAGCT ACGGACGCTT TGGAGGACAT CGCCAACGGC CTCGAAAATA AACTCCGCAA CTTGGTGGAA CATGTCTGGG GTACGGTTGC GAGCTCGTCG TCCGAAGTCG TATCGTTCAT CGAAAATAAC GTCAAGGGTT CAATATGCGC ATCGTCGAGT TCCACGGCAT CGCTCGGAGC CGCGCGCGAA GAGCGACGAC TCGCCCTCGT AAATGACATT CATCGCGGCG TGCGCGCTGC GTTCCGGGGC GACTCGACTC CGACGCACTC AGCGCGTCCA ATCGTTCCTA ATCTAGGCGC TGGGCAGTGT TGCTATCACA TCCCGTTCGC GTGTAGCAGC GAGGTGGATT TCCCAATGCC TTGGCCGAAG GCCTTGGAGA ATATATCGGA CTCACCGGGT GCCGTCGTTG TGAACATGCC GGGCCTGGAA TTCAACATCT GTGGCGAGAT CACGCAGTTT AAGGTGGACG AGAAAGTTGC GACGAAACTG GTGAATGCTT TTGGTGATAT GTTCGAAGCG CTCTTTGCAG CACTTTACGA AGAGAGCGGA CTGAAAAAAG TCGTGGATGA CGTGAAAGAT CTGACAAAGG ACATGTTTGG GTCGTCGGCG GCGCTCGGAT CGTATGATCG ACCGTCTTTC TCCACCGACG ACCGCAAACA TCTGTTGAGG 
AAATACGTCG AAGTCAAAAA CCGGATGGCA GAGACGGAGA CAATCGTTCT GGAGGAACTT TTGAAAATAT CTGACACCAT TCATTCGCCC GAATACTTGA CGCATACAAC ACGGACGCCG TCGCGCGATG ACGTGCCGTC TCTGGGCGGC GAGAACTTGT TCGAAAAGGT CCTGAATGAT TTCGCCGACG ATTTGCAGAC GGCTCTTAAA TCTATGGCAG ACACGACAGT CGTGAAGGCG GATATGTCAA TCAACGTGAA AGGAGACACG TCCATCAATG CGAAGGCGAG CGTCTACAAG ATCGGCGACA TCATCGACAA GCTCGATATT CCAAATAGCT TTGCGGGCGT CCACGTCGCC CCTCTGTTTC CCGGTTTAAC CGCGGCGCTT CAATACGACG TGTTGCTTTC GATGCCATAT TACGTCAACA TCGACATGGA GGCATCTTTT GCCTTGGGTC TCGATATCGA TATCCCGATA TCTCTCGAAC TTTCGAATAC ACCAAACTTC GCCATGGGCT CGCCCGCCGT GAACTTGGTT CCGACGTACA CCGCAGCCGG TAAGTCGAGC GCGCAGGTAG GCGCGAGTGT TGAGATAAAG AAAGGCTGGA TAGCGCTGTG CGCGGGGACT CACTGCGTCG GCCCATGGAT CAAAGCGCGC CAAGACGTCT ACGTCGGGGT CGACACCTGG GCTTTCGCCA ACTGCGACTC GGGATATGGC GAGCTCGTGC CAAAATGGAC CGACGGATTC TCTTATTCGA GCAAAAATCA AGCAGCATGC GCAGGTTCAC TCGCAGGCGC TGGTGGCTAC GCGCAAGTGC CGAAGACGGG CAATATTCTC GCTCAGGTAC TGTTCGCCCC CATGCCGATC ATGCCCTCGG GCGCGAGCGC CGCCTCGCAA TCCGCGGCGC GTCTCGGCGA CGACGACGAC GAGTGCGAGG CTCAGCCAAT GGGTATGATA CTGGCCGATT ACACCAACAC CGTGCATAAC GCCGTCATGG CGGGCGGCGA TAACTGGTAC ACGACGGATT TGTTTGCCGA GTGCCCGTCC GCGGGCAAGT GCCCCGCGCC GTCACCGCCG CAACCGTACA CGCCAACGGC GAACACAAAG CTTGCCAAGT ACAACGCCCG GTGCTGCGGC AACTCCGGCG ATTCGCGCTG CGCCGCAGCG ATCGAAGGTA CGTACGAAAC GCTCTGCGCC CAGCTCTGCG ACGCCTGCGC GGGTTGTGTC GCCTTCGACT ACCAAGCCAG CAAGAACAAA TGCGGCTTCA GCAAAAGCGA TAGCACGGAG GACCGGGTGG GATTCACGCA CTTCGGTCAC GGCGCGACGC CCGCGTCCAA ATCTTCCCTC GGCTCTCGCA TGCGTTTCAG CGCGCGCGCG TCGTCGCCAA CGTCACCATC CGCGCTCACC GCCGTCTTCG TCGTCGCCTT CGCCGCCGCC TTCGCCTCGG CCGCGTTTCG CCGCCGCCGC CTCGACGACG CTCGCCGCGC GCGCGTCGAC GATTACGGCG CGTTCGACTG ATCGACGCGT CCCTCCGGCG CGTCTCGCGC TCGAGCGTTC GAGTCGATCA TCGCTCCA
|
Protein sequence | MWTRASAAGA MVSLAMASGT AVAYTAGTAC YPEAACESGG SNTFDKYQRR VCQMLADTSK YCIIPSEPTD ELSGCSRELL YHGWCDANKY VFESGCLAKH PTTDSSACSN CNFDSYPTAY WSVGAPYPLH PGMVNHAFRE CVVDYVKAKI EEGFAAEGMT ADATSLQQLA EQEFPVEQMV DVDDYKEQYS SYYLDDGDFD ELDEEMTDEH LDDVSGSVDE SVMTKTVTAS CSSPSDDKIC YKMCAGDLGL NLEYKIKVPK VCAKIKILGK KFKKCIRVPT TKFKIPNKCT NLCVNIPGYC EMQEAASAIT QIKNIRSLGD LALPCKTLGG PGDVCNLLEE ADDAFRAMES MAKIATTSTV DAFLDLRVLP SILQNVLDEA TDALEDIANG LENKLRNLVE HVWGTVASSS SEVVSFIENN VKGSICASSS STASLGAARE ERRLALVNDI HRGVRAAFRG DSTPTHSARP IVPNLGAGQC CYHIPFACSS EVDFPMPWPK ALENISDSPG AVVVNMPGLE FNICGEITQF KVDEKVATKL VNAFGDMFEA LFAALYEESG LKKVVDDVKD LTKDMFGSSA ALGSYDRPSF STDDRKHLLR KYVEVKNRMA ETETIVLEEL LKISDTIHSP EYLTHTTRTP SRDDVPSLGG ENLFEKVLND FADDLQTALK SMADTTVVKA DMSINVKGDT SINAKASVYK IGDIIDKLDI PNSFAGVHVA PLFPGLTAAL QYDVLLSMPY YVNIDMEASF ALGLDIDIPI SLELSNTPNF AMGSPAVNLV PTYTAAGKSS AQVGASVEIK KGWIALCAGT HCVGPWIKAR QDVYVGVDTW AFANCDSGYG ELVPKWTDGF SYSSKNQAAC AGSLAGAGGY AQVPKTGNIL AQVLFAPMPI MPSGASAASQ SAARLGDDDD ECEAQPMGMI LADYTNTVHN AVMAGGDNWY TTDLFAECPS AGKCPAPSPP QPYTPTANTK LAKYNARCCG NSGDSRCAAA IEGTYETLCA QLCDACAGCV AFDYQASKNK CGFSKSDSTE DRVGFTHFGH GATPASKSSL GSRMRFSARA SSPTSPSALT AVFVVAFAAA FASAAFRRRR LDDARRARVD DYGAFD
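The sequences above are printed in space-separated blocks of ten characters, so the spaces need to be removed before computing a length or a GC fraction. The sketch below shows one way to do that on short samples, namely the first two blocks of each sequence. The helper names `ungroup` and `gc_fraction` are hypothetical.

```python
# Sketch only: helper names are hypothetical. The samples are the first two
# ten-character blocks of the gene and protein sequences in the record.

def ungroup(seq_text: str) -> str:
    """Remove the spaces and line breaks used to group a printed sequence."""
    return "".join(seq_text.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (case-insensitive)."""
    dna = ungroup(dna).upper()
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


sample_gene = "ATGTGGACGC GCGCGAGCGC"
sample_protein = "MWTRASAAGA MVSLAMASGT"

print(len(ungroup(sample_gene)))     # 20 bases
print(len(ungroup(sample_protein)))  # 20 residues
print(gc_fraction(sample_gene))      # 0.75 for this 20-base sample only
```

Applied to the full gene sequence, `gc_fraction` could be compared with the GC content field, and `len(ungroup(...))` with the Gene Length and Protein Length fields.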