Gene Information |
Locus tag | OSTLU_29010 |
Symbol | |
ID | 4999469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 678712 |
End bp | 681579 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | |
GC content | 55% |
IMG OID | 640414890 |
Product | predicted protein |
Protein accession | XP_001415567 |
Protein GI | 145340924 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
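The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic, which the sketch below checks. It assumes that the coordinates are 1-based and inclusive, that the CDS is unspliced (the record says it is not spliced), and that the coding span includes a terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS; the stop codon encodes no residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values copied from the record for OSTLU_29010.
length = gene_length_bp(678712, 681579)
residues = protein_length_aa(length)
print(length, residues)  # 2868 955
```

Both results agree with the Gene Length (2868 bp) and Protein Length (955 aa) rows of the record.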
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00158423 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGCACG CGAACGCGCG AGGGGCGATG CACGAGGCGG TCGTGCGCGC GGTGCACGGG TTCAAGGGAG GGATGGCGGC GGCGGCGGCG GCGGGGCGAG ACGCGGCGGC GCGCGCGGCG AAGGGGCTCG CGCGCGATGA GGACGTGATG AGCGTGAAAG AGGTCGTGGC TTACTTGAAT ATGGTGATGT TTCCGAGGCA GAGCGAGGAG GATGTGTTTT GCGCGCGTCC GGGGTTTGTG ATGATGTACG ATTTGACGCC GTTGAAGCGC GCGGCGCCGA CGACGAAGAA GCCGCCGGCG AGCAAGTTTG AGGGAGATAA TGGTCGCGTG AAAGTTGATC GAGGGCCTTC GGCATCCGCG AGTGGGGCGT CGACGGACGG TGGGGATCGG AGTGCGGCGC CGGCGCCGTC TGGGGCGGTG ACGGCGCCGC GCGCCACGGC GACGAGCGGC GCCACGATTT CGCCGCACAT GGAAAAGCTC GAGTTGGCGC TTCAGGAGAC GATGGATAGC GCTTTGCTCG CAGAAGAAGC GCAGTCAAAA CTAGAAGCAC TGTTGGCGAA AGAACAGAAG TTACAGAGCG AGTTGGCCGC GACAAAGGTT CCACAGGTAA AGCTCGACTT TTCTGACATG CCGAGCTTCG AACTGGACGC TGAGGCGAGG GAATACAAGG GAGATCCCGA CGACAGACGA GAAATTTTAG CGCATAGGAA GCGAATTGAG AAGGAAAGCG ACCGTTTGGC GGTCGCCAAA GAAAAATGGA TCGCGGGTAG AAGGAAGGAG GCGAAGCAGG CGGCGAAAGT ACAAGCGAAT GCGACGAGGA ACGTGAAGCT GAGTGCTCTC AGAGGAGAAA TCGCGCAGGC GAAAAAGGCA GTGCAGCAAA CATTTTCCGT GGCGACTTCT CACGCGAAAA AGTTAGAGCG CCTTCGTTCG AGAGCAACTG CAGCACAGCA GCATGAAGCC GAGCTGGCGA GCAGAAAGAG AGAAGCTGAA GCCTACCGTG CGCAGCAGCA GCAGGAACGA GCGGCGAAAA AGTCAAAGAT TCAGCTCGAA CGCGAAGCTC AGAAGTTACA AGACGCCGAA CGCAAACTCA AAGAGGCGGA GCAGCGAGCC AAAGTGAGAG CAGAGGCGGC TCGCTACCCT ATGGATGATA ACGAGCTCGT CGCATACGAC GCGGCGCAAG CGAAAGAACA AAATCGCGAC CCTTGGCCCG CCGTCGCGAA CGGGACGGCG TGGAAGCCAT CCGCGGTAGA TGTCTCGCAG ATGCAAATTT GCGAATTCTT TAGCACATTT GGACGTATTC TAAAACCGAG CGTGGACGAA TTCGGGGTAG AGTACCTGAA CACTATTCTG GATGATTCCG ACTTGACAAA ACTTTCGAAG TTGTACGTAT CCTTGCTTCG AGTGGCCGTG ACGACGACGA CGTACGGAAT CGAGGCCCTC GTGAAGACGT GGTCGGACGC TTTGGAGTTT AGTGGTACTT TCCCTGAAAT CATGAAGCAA TTTGCCAAGG AAAAGAGCCG GGTTGGACAG ATCGATGCCG TTACGCTGAG TACGATTAAC GCGCTCACAG AGAAGCGAAT AGGCATGTTC ACGCCTGAGG AGCACACGCG CGTGCTCGAT TGGTTGTGTG GTGAATGTTT TGAGGCGCCG GAAGTCAACA AAGAAGTCAA TCGACGCGTA AAGGCGCTTG AAGATTTGTC GAAGGAAAAA GAGAGTGATA AGGCGATCGA AAATGCATTC GAGAAAGAGT ACAAGAAACT TGCGCCACAA TTACGCGATG CGAACGCGCT CGTCAAAAAT 
GCATCCGACG AGCTCATCCA AAAAGTTCAG GCTCACGAAG CTTTGATAAC TCTACCGCAG TCTGACGTCG AAGTGCCAAC CGAGTTGCAA TCTGATGTTC GCGCATTCCT TGAGGCGCGA TCGCTTCTCA AAGAGACGGA GGCGAAAAAG AATGAAATAG CGGCCCAGGC TGCTAAACGT CGTCGACAGT TTCAAGAAGA ACGCATTCGA TCGAATTGTT TGGGTCAAGA CCGCAACGGC ACGAAGTATT ACTGGAATCT ATCGTATAGC TCGGGCGCAT TGCTTGCTAT CCATGTCGAT GGTACCTGGT CAAAACTTAC TACAGAGCAG CAGCTACGTG AATGGAGCAA ATCCTTGAAC ACAAAAGGTA TTCGTGAGCA TCGACTACAC AAGAACGTAG CTGAGATTCG CGCCGAACTC GTCGCCGCGT TTAGGCAGGC TGAATTGGAA GCCATGAACG CCTTCCCAAA GTCGAGTCAG CGAACGAAAG TTGAGAGATG GAACGAAATC GCTGCGCGAG AGACACTTTC GATGGAAGGT GCGAAGAGGA TCATCCAACT GTTAATGGCT GATGTGGTAA ATTGCGAGGT ATCCGCGCCC GACGGCACCA TGTCTGGATG GAGAATCTGG GGCAGAGAGT TGGAGAAAAC CACAGAGCTT ACTGAGATGA TACGATACTT GATGCAAATC GAGGAGGCAA TGGTCGAAAT GTGCGATCTC CCTAGAGACG TTACCGCGAA GGATGCGAAC GGTTCGGCGA TCAACGCATC GAACGAGCGT CTCATGGCGA GCCACGAGTG GTGGGAGCTT ATCATTCCGT CGGATGAGAA GAAGGCGGGA AAACGGCTGT GGCGCACAAA TCAAGAGCGC GCCATCTGGC AAGAGGCCAT GTCCACATCT GACACGTATG CACGCATTGC TTACGGCGCC GCAATGCTGG AATCGTATTC TAGACCACTT TTTGAATTTC TCGAGAACAT CAAGAAGAGA GCGAAACGCG AATCCAGTCG ATTCGCCGAT TACACATCAT ACGACGATGG CTATGATTCT TACGGACAAC GCACGTAA
|
Protein sequence | MVHANARGAM HEAVVRAVHG FKGGMAAAAA AGRDAAARAA KGLARDEDVM SVKEVVAYLN MVMFPRQSEE DVFCARPGFV MMYDLTPLKR AAPTTKKPPA SKFEGDNGRV KVDRGPSASA SGASTDGGDR SAAPAPSGAV TAPRATATSG ATISPHMEKL ELALQETMDS ALLAEEAQSK LEALLAKEQK LQSELAATKV PQVKLDFSDM PSFELDAEAR EYKGDPDDRR EILAHRKRIE KESDRLAVAK EKWIAGRRKE AKQAAKVQAN ATRNVKLSAL RGEIAQAKKA VQQTFSVATS HAKKLERLRS RATAAQQHEA ELASRKREAE AYRAQQQQER AAKKSKIQLE REAQKLQDAE RKLKEAEQRA KVRAEAARYP MDDNELVAYD AAQAKEQNRD PWPAVANGTA WKPSAVDVSQ MQICEFFSTF GRILKPSVDE FGVEYLNTIL DDSDLTKLSK LYVSLLRVAV TTTTYGIEAL VKTWSDALEF SGTFPEIMKQ FAKEKSRVGQ IDAVTLSTIN ALTEKRIGMF TPEEHTRVLD WLCGECFEAP EVNKEVNRRV KALEDLSKEK ESDKAIENAF EKEYKKLAPQ LRDANALVKN ASDELIQKVQ AHEALITLPQ SDVEVPTELQ SDVRAFLEAR SLLKETEAKK NEIAAQAAKR RRQFQEERIR SNCLGQDRNG TKYYWNLSYS SGALLAIHVD GTWSKLTTEQ QLREWSKSLN TKGIREHRLH KNVAEIRAEL VAAFRQAELE AMNAFPKSSQ RTKVERWNEI AARETLSMEG AKRIIQLLMA DVVNCEVSAP DGTMSGWRIW GRELEKTTEL TEMIRYLMQI EEAMVEMCDL PRDVTAKDAN GSAINASNER LMASHEWWEL IIPSDEKKAG KRLWRTNQER AIWQEAMSTS DTYARIAYGA AMLESYSRPL FEFLENIKKR AKRESSRFAD YTSYDDGYDS YGQRT
|
| |
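The gene and protein sequences above can be cross-checked by translating the DNA and by recomputing the GC content. This is a minimal sketch in plain Python. The Translation table field of the record is empty, so the standard genetic code (NCBI table 1) is an assumption here. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), assumed because the
# record leaves the "Translation table" field empty.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence as printed in the record.
prefix = "ATGGTGCACG CGAACGCGCG AGGGGCGATG"
print(translate(prefix))  # MVHANARGAM
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` gives a value to compare with the GC content row.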