Gene Information |
Locus tag | OSTLU_25332 |
Symbol | |
ID | 5004924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | + |
Start bp | 346609 |
End bp | 349911 |
Gene Length | 3303 bp |
Protein Length | 836 aa |
Translation table | |
GC content | 60% |
IMG OID | 640420345 |
Product | predicted protein |
Protein accession | XP_001420662 |
Protein GI | 145352672 |
COG category | |
COG ID | |
TIGRFAM ID | |
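
The coordinate fields above agree with each other if the start and end positions are read as 1-based and inclusive: 349911 - 346609 + 1 = 3303 bp, which matches the Gene Length field. The record does not state the coordinate convention, so the following is a minimal sketch under that assumption, and `span_length` is a hypothetical helper name, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from its start/end coordinates.

    Assumes 1-based, inclusive coordinates (an assumption; the record
    itself does not say which convention it uses).
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp / End bp fields of this record.
gene_length = span_length(346609, 349911)
print(gene_length)  # 3303, the value in the Gene Length field
```

Note that an 836 aa product needs 836 x 3 = 2508 coding bases, or 2511 with a stop codon, so the 3303 bp span appears to include sequence outside the coding region.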
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0150408 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AGATCGACGC GCGCGACGCC GACGCCGACG CGCCGTCGCG TTCGCGCGCG CGCGACACCG CTCGAGAGAA TTTCACTCGA ACGCTCGCGC GCGCGCGACG GCGCGCTCGA CGTCGCGGGC GACGCGCGCG ACCGCATTCG CGCGCGCGCG CGCGTCGGAT CGCGACGCGC CGCGCGGCGA TCGTCGCGAC GCGCGCGACG TCGAGCGAGG GTTCGCGCGA TCGAACGCGC GCGGCGGTGG GGCGAAGGCG CGGTGACGGT CGATCGATCG GTCGGATCTC GCGCCGGCGG AGATCGGCGC GGTCCAGAGG TGGATTGGAT CGGATTGGAT TGCGTTGGAT TGCGTTGGAT TGCGTGTGAT TGATTGGTCT GATTCGATCG CGTCGATTCG ATTTGATTGA TTTCGATACG TCGATCTTGG ACGTAGAAAA GGCTCGGTTG AACGCGAACC GACGCCCCCC CCGACGAGAA CGAGCGATCG CGAGGCGGAA CGCGCCGTCG ACGATCGATT GAACGATTTT CATTCTCGCG CGCGCGCTCG CTCGCGAATC GATTTTCTTT CATTGAAAAA CACCAACGCG CTCGACGCGC GGGTAGACGT TCTACGCCGG ACTCGATCGG AGAAATTCCG AGAGGACGAT AATAGACGTC ACCGCGTTTG GCCTGTTAAA TAAATAATAG GAACTTCCTC GAGGCGACCG CCGCGAGAGA AGGGAAGAAC AGAGGCGGTT TCGTCCGACT CGACGCTCGC TCGGCGAGTC TTTGACTGGC GATGACGGAC GTGATCACCA TGTCGTCTTC GTCAGACGAT GATCGACCGC CGATCGCACG CGGTGGAGAG GCATCGTCTT CTTCATCGGA TGGCGACGCT GGGTTCAATG GAGACGTGTT GGCGACTGAG TTGGCAAAGA TGCCGCTCGC GCTTCGCGCG CAGCTCACCG CCGCGGCCAA GTCTTCGACG AGCACCAACC AGCGCACGCT CGTGAGCGAG TGGTCGGAGA AGACGAAACT GTCAGTTGAT TTGGTAGAAA AAATCTTCGC TTCCTTTGCG AAATCGTCGC GTGCGTCAGC GGAGGCAGCG CGCGAAACCG TCGCGCGCCT CGCCGCCGCC GACGGAGTCG ACTCCGCCGA TGGAGACGAG TACTCGCCGG GTGCCGCCGA AGCGGCGACT GCGGATGACG ACGCTATGGA CGACTCGCCG AAAATCCCAG CGTCTAAGCA GCCGGTGCTC GAGGAAGGTG TTCCGTTTCC TGGAATTACT CGAGACGTCT CCGTGGACGT CGCGTGCGGC GATCTCAGAG GCGTTCTCGA GCTGAAGAAG GGATACAAGA AACTTCAGGA GCGCGTGCGA TGCGAGGGCG AAATGATGAC GCCAAGTAAG TTTGAGAGCG AGGGCGGGCG CGGGTCGGCG AAGAAGTGGA AGATTAGCTT ACGCATCGTG CGTTCGGATG GAAAACTCGG GATGACTGTC GGAGACTGGA TCGACAGGTA CGGATACCAT CCGTCGGGCT TAGTCATAGG TGGGGAGGTG GCCGACGCGC CGGCGCCGCC GAAGGTGAAA AAATCAGCGT TTGAGTCACA GCTAGAGACC TTACTCGACT ACGATGGGGA CATTGCCTTG CGTAGCATCG CTAAATTTGT ACGAATGATG CGAGAGACGA CCAAGCCTAA GGAACGCGGA CTTTTGCTCC AAGTCATTCG AGGCACGAAA AACAAAGAGT GTTTGCGCCA GTTCGGTCAG TCGGCGGAGA TTAAAGGATT AGACACGCTG CAAGATTGGA TGGACGACGC TAAAAGGAAG 
TTTCAATCCA CTCTTTTAGT GAGCATTCTT AGAACATTGA AGATGATTCC AGTTACATTG GACGCGCTCA CGCGTACTTC GATCGCCCCG AATCTGGGCA AGTTGAAGTC GTACGTGGTG CCAGAGGGTG AAGAGGAGTT TGCCAACACC GAGATGAACA CTAAAGTTGT CTTGTTATCC AAGTCGGTCA AGAACGCGTG GAAAGCGCAA ATCACGGCGC CCCACACGGC GCCGGCGCCG GCGCCGAAAC CGGCGCCCGT GGTCAACCCG GCGCCCGCGG CGGTGCCCGC GTCCAAGGCT GTCGAACTCG GTGACGATGA CTTGTTCGGT GCGAAGTCGA AAATCTCACC AGCGCCTTCG AAAGCGCCGG TGGTGAAGAC CACGGTGACG AAAATCACGA TGGAAAAGAA GGTGGCTCCG CCGAGCGTGA CGACGAAGAA ACCGAGCGTG AGCGTGAACG ATTTGCTCAA GACGAGTTCG CAATCGACGA AGATCACCGC ACCGCCGGTG AAGACGAAAG AGAAGGCAAA GGACGATGAT AAAATCGACG AGAAGACTGG GAAGAAGCGC AAGCGGAAGA CGGTGACTTG GGCGAAGGAT GAAAATTTGG AGCAAGTCAG AATCTTTGAG AAGGACGCCA AGCAACCGAA GGAAACGGCT TTCCCAGATC CGACGAGAGA CGGTGGGACC GATGGCGCAA GCCGCAAGGC GCTCGAGAGG AGAGACAGGG AAGTGGAGGC GGAAAGAAAG GCGGCGGCAA AGCAGCACCA GCGTCGACTC GACGAGATGC GCGCCACGAC AACCTGGCGC CCGCCTCGGC GCATCGAAAT TCCTCGTTGG GAAGAGGAAG AGTCGGACAG AGTCCCCGGT GACGAGTCGG AAGAATCCCA GAGAATTCTT CGCATCGAAG CCGAGAAACC GAGCGTCAAG TACAGGAGTC TGAAAGACAT TCCAGACTCT CCGGCGGAGG CGCCGAACGA GGACGCGCAA CTCGACCTCG ACAACACGCC AGCCTTCTAC ATGAAGTTTC AAGAAGAACC GCTGTCTCAA GACAGCGGCG AACCTTCACC CGCGATGCCC CAACAGCTCC AAATGCCACA GGCTCCGGGA TCGTTACTTC CTCCGAACAT CGACTTTGCC GCGCTTCAAC GCACGTTGCT CGCCGCCAAC GCGCCGCATC AAAACGCCGG TTACCAGTAC CCCCCGCAGG CGCCTCTTCA ACACGCCTTC CCACCACCAC CCCCTCAAGC CGCGTACGCA CCTCAGCCCC CTTACCAACC CGGCGCGCAA CAGCCCGCGC AGCGACCGAC GAAACAGGCG CTCGTCGGCG GCGCCCCCGT GCCCGCGCAG CAGGCGCTCA ACCTCAACGG CAAGACGTAC CGCGGCGTGT GCGCCTTCTT CAACACCCCT CGAGGATGCA GCTGGGGAGA TAAGTGCGGC TACCTCCACC AAGTCGGCGT CAATCCTCCG TCGACGGGTT GATCCCGCGT CCGCGACTCG CCC
|
Protein sequence | MTDVITMSSS SDDDRPPIAR GGEASSSSSD GDAGFNGDVL ATELAKMPLA LRAQLTAAAK SSTSTNQRTL VSEWSEKTKL SVDLVEKIFA SFAKSSRASA EAARETVARL AAADGVDSAD GDEYSPGAAE AATADDDAMD DSPKIPASKQ PVLEEGVPFP GITRDVSVDV ACGDLRGVLE LKKGYKKLQE RVRCEGEMMT PSKFESEGGR GSAKKWKISL RIVRSDGKLG MTVGDWIDRY GYHPSGLVIG GEVADAPAPP KVKKSAFESQ LETLLDYDGD IALRSIAKFV RMMRETTKPK ERGLLLQVIR GTKNKECLRQ FGQSAEIKGL DTLQDWMDDA KRKFQSTLLV SILRTLKMIP VTLDALTRTS IAPNLGKLKS YVVPEGEEEF ANTEMNTKVV LLSKSVKNAW KAQITAPHTA PAPAPKPAPV VNPAPAAVPA SKAVELGDDD LFGAKSKISP APSKAPVVKT TVTKITMEKK VAPPSVTTKK PSVSVNDLLK TSSQSTKITA PPVKTKEKAK DDDKIDEKTG KKRKRKTVTW AKDENLEQVR IFEKDAKQPK ETAFPDPTRD GGTDGASRKA LERRDREVEA ERKAAAKQHQ RRLDEMRATT TWRPPRRIEI PRWEEEESDR VPGDESEESQ RILRIEAEKP SVKYRSLKDI PDSPAEAPNE DAQLDLDNTP AFYMKFQEEP LSQDSGEPSP AMPQQLQMPQ APGSLLPPNI DFAALQRTLL AANAPHQNAG YQYPPQAPLQ HAFPPPPPQA AYAPQPPYQP GAQQPAQRPT KQALVGGAPV PAQQALNLNG KTYRGVCAFF NTPRGCSWGD KCGYLHQVGV NPPSTG
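
Both sequences above are printed in space-separated blocks of ten residues, so the spaces have to be removed before the lengths or the GC content can be compared with the Gene Length, Protein Length, and GC content fields. The sketch below shows one way to do that; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first few blocks of each sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace that groups residues into blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence and first two blocks of the
# protein sequence, copied from this record as sample input.
dna_prefix = clean_sequence("AGATCGACGC GCGCGACGCC GACGCCGACG")
protein_prefix = clean_sequence("MTDVITMSSS SDDDRPPIAR")

print(len(dna_prefix))      # 30
print(len(protein_prefix))  # 20
print(round(gc_fraction(dna_prefix), 3))
```

Applied to the full gene and protein strings, the same helpers give values that can be checked against the 3303 bp, 836 aa, and 60% figures reported in the Gene Information table.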