Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_41924 |
Symbol | |
ID | 5005216 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | - |
Start bp | 469318 |
End bp | 472413 |
Gene Length | 3096 bp |
Protein Length | 979 aa |
Translation table | |
GC content | 61% |
IMG OID | 640420637 |
Product | predicted protein |
Protein accession | XP_001421159 |
Protein GI | 145353732 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.123239 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.112463 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGCGA GCGACGCGGA GGCGGTGCGA ACGCTGGCGT GCGTGACGGT GAAGCGAAGG TGCACGCCGA GGGCGTTCGC GTCGAGGTTG ACGCGCGGCG AGAGGGACGA GGCGAAGCGA GCGCTTCTGG ATCGAGCGAT GACGGCGGAG AGCAAGGCGC TGAGGAATGC GGTGCTGGAC GTTATCGCGA AAATCGCGCG TTGGACGGTG CCGCAGGGGG AGTGGAACGA GTTGTTGGAA TTTTTGGGAC AGTGCGCGAG CTCGCCCGAG ACGGCGCATC GAGCGTTGGC GTTTAAGTTG TTCGAGAGCC TGACGGAGAC GATCGTGAGC TCGCTGAGTC ATCACTTTAA GACGTTGGCG GGATTGTTTG CGAACGGACT CGTGGACGCG CACGATGAGG TGCGGGTGAG CGCGCTTCGC GCCGTCGGGG CGTTGGTGGC GAACGCGTCG GGCGAGCCCG AGGAGGTGGC GGTGATAAAG TCGTTGGTGC CGCACGTGCT CGAGGCGGCG AAGACGGCGG TGTCGAACGA AGACGAAGAG TCGGCATCGA TCGTGTTTGA GGTCCTAGAT GCGCTCACGG AGAGTCGCAC GAGCGCTTTG AGTGGACACG TGCCCGCCGT CGTCGGCTTT TGCATTCAAG TCGCCACGGC GGAACGCGAG CTCGGGACGA GCGCGCGACG ACGCGCGTTA GACGTGCTGG CGTACATGGC GCGTCACAAA CCAAAGGCAC TGACAAAGTC TAAGCTTGTC GAGCCGATGC TGGCCGTGTT GTGCCCGCTG TGTGGTGAGC CCAAGGAAGC CGAGCTCGCG GGCGAGGACG ATCTCGAAGA CGAAGACGAG GTGCACATAC AAACCGTAGC GAGTCAGCTC ATTGATATTT TAGCGCTTAA AGTGCCGGCA AAGTATGTCC TTCCGACGGT TCTGTCATTC GCCGCGGCGA ATATCAACAA TGCATCGAAC GACCGCTTGC GTCACGCCGC AGTCGCCGTG CTCGGCGTCG TCACCGAAGG GTGCGCCGAG GGCGTGCGCG CGCACGCGAG CACCATCGTG CCGAGTGTGG TCGGACGCTT GAGCGATCTA AATGGACCCG TGCGAGGCGC GGCGGCGTTT ACGCTCGGAC AGTTTGCCGA GCACCTTGGG TTGACGTTGG AAGACCCGGA CATGCACAAG CAAGTGCTGC CGAGCTTATT CACCGCGCTT CCGGTTGAGC AAGTGAAGAG CGTGCAAGAG CGCATGATGT ATGCAATGGA TGCGTGGTTG GAAGACGTTC AAGACGAAGT CGGCGTGTAC GTCAAACCTT TGCTCGACAT AGTCTTATTA GCTCTCGATA GCGGTGCCAA GCGCCACGTG CGCGAGATGT TACTTTCGGC GCTCGCGTCC GCGACGGCGT CGAGCGGGGA CAAGGTGCAT CCGTACTTGG GCGAACTCTT GCCTAGACTC GATCGTTGCT TGTCTCTGAC GGCGGATGAA GAATTGAACG TTCGCGCGCG CGCGTTAGAA GTGCTGGGGA TGTTGATTTC GGCCGAAGGT GGCAAGGAGG CCATGGGACC GCACGTGGAA AACGCCATGC AAGCCGGGCT CTCTGGGTTC GAGCTCGATT TTGCCGAGCT CCGGGAATAC GCCCACGGCT TGTTTGGCGA AGTGGCGGAG GCGCTCAAGG AAGATTTCGA TCGCTACCTC GCCGTGTGCG CGCAGAAAGC GTTCGCGTCG CTCGAACTGG ACGATGGTAT CATGTTTGAC AGCGAGGACG AAGCCGATCG CGAAGAGTTG GATTCGGACG ACGACGGCGA CGGCGACGGT GCCGATGGTA TGACGAGGAA GCCGGCGGGT TACTCCATTC GCTCCGGCGT CATGGACGAG AAAGCGTCGG CGTGCAAAGC GCTGAATTGT TATGCTTCGC ATTGTCCTCG CGCATTCGCG CCGTACATCG CCAAGGCGTC CGAGCTACTC GGAGGCATGA CGGATTATAT GCACGAGATG GTGCGCGTGC AGGCGCACCT TGCGCTCGCG CAAACTACGA TCGCGGCACT CAGCATCAAC CCCGAAGGCG CGAAAGAACT GGTGAACGAC TCACTCTCGG CCACGATTCG TTGTGTTTTA GAAGACGAAG ATCGGGATGC GGTGGCCGCG TCCGTGGAAG CCGCGGCGCT TCTCGTAAAC ATTCTCAAAG AACATCGCGG GGTAGACGTC TCGCAGCACG TCATCGATCT CACCGCGGCA AGTTTGGAAA TTCTAGAGGG CAACACGTTC TGTCAAGTCG AAGATGGGTA TGACAGCGAA GAAGGCGACG AAGAAGGCGA TGAAGACGAA GACGAAGACG TCGAAGCGGG CTTGGTTGTC ATCGAAGCCG TCGCCGAGCT CTTACCTGCG CTCGCGATGT ACATGGGAGA GACGTTCGCG ACGCACTTTG TGCCGCACTT CAACGCTTTG ATGAAGCGCA CGGAAGAAAA TCATACCGAA ACCGAGCGTT CACTGTGCTA CGCGACGCTC GTCGAGGTGG TGCGCGCCGT CGGTGCGCCC GCTGCGGGAT GCGCCGTCGT TGCGCTCCCG AGGTGTTTGC GTGACGTCGC GTCTCTGGAC GTCGGTTTGA GGCGCAACAG TATTTATTGC ATCGGTATAC TGGCGCAAAT AGGTGGTGCG AGCGCGATAG ATTTCCACGG CGCCATCGCG GAGGCACTCG CGCCGATGAC GCGAGCGGAC CGAGAATCCG ACGGCGGCGT TCGCGATAAC GCCGTGGGCG CCATCGCGCG TTTGCTACAA GTCATCGACG GCGGTCATGC TCGCGAAAAC GCATCCGCAC TGCTTGATGT CGTTCTCAAC GCGCTACCCT TGCGAAACGA TTTAGAAGAA GGCCCGGACG TCTACCATTG GCTCGCGTCG ACGATTACGG AAAATCCAAC CTCGCTCGCC GACGCCGCGA TGACTCGCAT CGTCGGTATT TTAGCCGAAG TGGTCACCGA TGGTGCGCTC GCACCGATAG ACACGTCGCG AATATTAGGA ATCGCGCTCT CGCGCGCCGA AGATCAGCGC GTGCGAGCTA CGCTCTCCAG TCTGCCCGCG CAGAGTCAAG ACGCGATTCG TCGAGCCGGG GCGTAG
|
Protein sequence | MSASDAEAVR TLACVTVKRR CTPRAFASRL TRGERDEAKR ALLDRAMTAE SKALRNAVLD VIAKIARWTV PQGEWNELLE FLGQCASSPE TAHRALAFKL FESLTETIVS SLSHHFKTLA GLFANGLVDA HDEVRVSALR AVGALVANAS GEPEEVAVIK SLVPHVLEAA KTAVSNEDEE SASIVFEVLD ALTESRTSAL SGHVPAVVGF CIQVATAERE LGTSARRRAL DVLAYMARHK PKALTKSKLV EPMLAVLCPL CGEPKEAELA GEDDLEDEDE VHIQTVASQL IDILALKVPA KYVLPTVLSF AAANINNASN DRLRHAAVAV LGVVTEGCAE GVRAHASTIV PSVVGRLSDL NGPVRGAAAF TLGQFAEHLG LTLEDPDMHK QVLPSLFTAL PVEQVKSVQE RMMYAMDAWL EDVQDEVGVY VKPLLDIVLL ALDSGAKRHV REMLLSALAS ATASSGDKVH PYLGELLPRL DRCLSLTADE ELNVRARALE VLGMLISAEG GKEAMGPHVE NAMQAGLSGF ELDFAELREY AHGLFGEVAE ALKEDFDRYL AVCAQKAFAS LELDDGIMFD SEDEADREEL DSDDDGDGDG MTDYMHEMVR VQAHLALAQT TIAALSINPE GAKELVNDSL SATIRCVLED EDRDAVAASV EAAALLVNIL KEHRGVDVSQ HVIDLTAASL EILEGNTFCQ VEDGYDSEEG DEEGDEDEDE DVEAGLVVIE AVAELLPALA MYMGETFATH FVPHFNALMK RTEENHTETE RSLCYATLVE VVRAVGAPAA GCAVVALPRC LRDVASLDVG LRRNSIYCIG ILAQIGGASA IDFHGAIAEA LAPMTRADRE SDGGVRDNAV GAIARLLQVI DGGHARENAS ALLDVVLNAL PLRNDLEEGP DVYHWLASTI TENPTSLADA AMTRIVGILA EVVTDGALAP IDTSRILGIA LSRAEDQRVR ATLSSLPAQS QDAIRRAGA
|
| |