Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31390 |
Symbol | |
ID | 5001546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 790301 |
End bp | 794524 |
Gene Length | 4224 bp |
Protein Length | 1407 aa |
Translation table | |
GC content | 57% |
IMG OID | 640416967 |
Product | predicted protein |
Protein accession | XP_001417358 |
Protein GI | 145345738 |
COG category | [Z] Cytoskeleton |
COG ID | [COG5022] Myosin heavy chain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.590275 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGGGA AGATGAAGGA CGCGCGCGCG CGAGGGGACG CCGTCGACGA CGCGCGGCGG CCGAGCGCTC GCGCGAGCGG GGAGCGACGA CGGGCGGTTT TGAGCGAGCC GGACGCGGTG CCGGTGGTGC GATTCGGCGC GGTCGACGTG GGAGGTACCG GGTGCGAGGG GTTGGATATC GTGAACGACA CGGCGTTGAC GCAAGAAGTC GCGTTCGCGG ATTGCGAAGC GATCGCTCGA GACGGCTTCT CGCTCGATCG AGAGACGTTG ACGATTCAAC CCGGCACGAC GGAGCGCGTA CAGTTGACGT GGACGCCGCA GAGGGCGATG GCGGCGACGT ACTGCGGGCA AATCTCGTTC GTGGTCGTCG CGGAGGCGCC GGTACCCATG CCCGAGGTGG AGATGAAGGC GCGCATTCGC GGCGTGGCCA AGGGCGAAGT TTTACGACCG TTGACGAATG AAGCGCCCGA GGAACGCGTG GCGAGCGCGA CGAAACGTCC GCGAGATGCG AGGTTGGCGA CGCGGCGGTC GCTCGCGACG TCGCCGCGCA AACTCGCGGG TGAGGATAAG TTGGCGTTGG ACGCGATGCG TGCGAAGCAG CGTCGGCTGG AGGACGAACC GGCGACTTTG GCGCTCGCGC CGGTGGCGCG AGCGCTGCAA CTTCAGCGAG AAGATTCGGC GTCGGGCGAC GCCCACGAGG GTTCTACGAG TCGAAACGTG TCCGACGAGT TCCAAGCGGG AATATGGTTG CGTCAGCAAG AGCTCGCGTT CATTGCGTGG TTGAATCACA CCATCGTCAT CGACGACGTC GGCACGATGG GCGACGACTC GCCGTCTGCG AACCGAGGAG GGAACGCCTC GGCTCGCGAA GTGCGGCAAA CTGTTCGAAA CAAACTCACG TCATTGTACA GCTACGATGA CGAGCTCGGG AGAGTCCTGA AAAAGACCTA CAGACACGTT GACAACGCTC GATTCAGATT GAACACCGGA CAGACGTTCA TGGATAACGT CGCGCTCAAG GAGGAATTTG CGCGCGCGCT GTCGTGCTTC TCACCGTTTT GGTTGCAACT CGGCGTCGAT GTGGTTGTCG GCGGAGGCAT CGTGTGGAAG CGACGCGGCG ATTTGCACGA GATTCAGAAG GAATGCATCG CCGCCTTGTT CCGTGATAGA GATTTGGAGA TTGAATTTGG AACCGGGCAC GTGCCCGGCG CGCCGCCGTT CGCACACGGA TACGAAGAGG CGTTGAGCCG AAGCGTTCTC AAGCGCGTTT TACTGCTGGT GTTCATTCTC GATCGCGCAG CGATGAGTGG GTTACCCCCG AACACGCCAT TACTTATGCG TCCTCACGCG GCGCTCAAAC GAAGTGAAGA CATACTTCGC ACGGCGTTGC AAGGCTCGAT GTACGGCGAA GGCGACGTCA TTCGCAACTT GAGTCAATGT TCGTACAAGT TGCACTACAA ACAAAATCCC ATTCGTGAGT ACGACTTTCA GTGCACGAAC TTGGCTGTCG ATTTGCGCGA CGGCGTGCGT CTATGCCGAC TCATGGAAGT TTTGAACGCC GACGTACTCT TCATGAGCTA CGACGAAAAG AACAAGGAAT GGAAGCGAAG TCTGTTGAGC GAGGTGCACT TCCCATGCGC ATCCAGGGCT CACAGGGTGC AGAATGTCGA GGTTGCGCTG CGAGCGATCA AGGATCAACA AGTTGGCTTG CCTGGTACGT GGAGCCGAAT CAAAGCTGAG GATATCGTCG ATGGTCACCT CGAGCACACG ATGGGGTTGT TGTGGGCGTT GATGATGCAC TATTCGGCGC CGGGCTTGCT CCTGCCCAAG TCGTTGGACT CAGAGATCAC TAGATTGGGC GGTAAAGTGC CAGATATCAA GCGTATCGAA CGGCTGTCAG CGGCACGACG CGGTGATTCC GTCATAGAAG CGCCGCAGTG CGCCATGGAA GCGCGCCTAT ACGCCTGGGC GCGGGCTGCG TGCGCGACTC AAAACGTTGA GCTCAACAAT CTCGGCGGCG CTTTCACCGA TGGTCGCGCT CTGTGCGCTC TTATTCGCGC TTACGCGCCC ATGATGATTC CAAAGCGTCG CATTGGTAAT GCTCCGCTGA AGCTCGACGA TGCGAATGCG GATACGGCAA AACACGCTCG GGAGTTGGCG CGTGACAACT TTGCTGCGGT TGCCAAGGCA CTTCAAGCGC TCGGCGGCGT GCCGAACCCA ACGTTTGATA TACGATTTAC CAGTGACGAA GGCCTGGACT CTCCTGATCC ACGGGCAGTA AGTGGATACT TGCTGTTTCT CAGCGCGCGC TTGCTGTTAT TGCGGCAGCA AGAAGTCGCT TGCGTCCGCA TCCAACGCTG GTGGAGATGG AATCGTCCAA ACCGACCCAA GTTTGCCGAA GTCGTTCGCA AGTGGAACGC GGCGTCGACG GTAATTGCAT CGCACGTTCG CCGCGTACAG GCAGTGGACG CGGTAAATGC GCGAAAGAAT GCAATCGTAA AGCTGCAATC TTTCCGACGC GCGTGCGTAG CGCGGCGAGA GTTCCTCAAT ATGAAGAATG CCGCGGTGAA GATTCAATCC TTCAAACGCA TGCACACGGC GCGCCTGGAA TTCCAAGACA CAAAGTGGGC GGTGGAAAAG GTGCAAAAGA TGCGACGAGG TTGCGCGCAA AGAAATCAGT TCTTGCGCAA AAAGCAAGCT GCGACTTTGA TCCAAGGTTG GTACCGAACG GTTTGTGCGC GTAACGAATA CGTGAACAAG ACGTGCGCTG CGACCATTAT ACAGATGCAC TGGAGGGCTT TTGCCGCGCG CGCCGAAGCA AAGCGCATCG TCGAAGCGCG AATGAAAATA ATTCACAGCG CTGCAACGAA GATTCAAGCT GCGTTTCGAA AATGTATGAT GCGCAAGCAC TTCCTTCGCT TGCGTTGGTT CGTGATTTTA TCGCAAGCGC GCGCTCGCGC CGCCGCCGCG CGCCGGACCT TTGTGGCGCA AAAGAAGGCT AGCGTGACAA TCCAACGGCG TGTTCGACGA TTTTTGGACT ACAATGCGTA CAAACGTCGT TCGCAAATGA TAGAAAACGA GCGTCAGAAG AAAGCTGCGA CGACGATTCA ACGTCACTGG CGTGGATACA ACACGCGCGA CGGACTCGAC AACATTCAAT GGAAGACGTA CTTCGTCACG TTGCTGCAAG CATACGTGCG ACGTTGGCAA ACTCGTCGCA AGTTCGTGAA CGAAATTTTA CCGCGTCAAA AAGAGCTCAA GTTGCAAGCG CGGAAGACGC AGATGCGTGC GCGAATGGCT CGTGAGCGCG AGGCGGCGAC ATGCATTCAA AAGTTCTGCC GCGGTCACCT CGCTAGAAAA ACAGTGAGAA AGATGCGGCG CAAGGCGTCC AAGGCGAAAC GAGCCGAGAA GGATGCCGCC GCGAATCTAC TTCAAAAGGA AGAGAAATCT AGCGAGGAAA TCACTCAGCG AAGTCGGCGT CCTGTGTCGG CGTTCACGAG AAAAGCTTTG CTCGAACAAC ACGTGGTGGT AATCCAAGCC TTCGTTCGCG GCTGGCTCGC GCGAAAGCAC GCGGTGCATA AATTGGAGTG GCATCGCAAG AGAATCGCCG CCAAGGCGCA GCCTGTCAAT CCTTTACACG CGCGCGCTGA ACAAGCTGCG AATATGATCG CCGCACCGCA CGCACGCGAT GACTGCCTGC GTGGTTGCAC GTTCTTCACC GAACACTGGA ATCTGTCCAA GACGTGTCGA GGCATCGTCA CCTCGCCTCG CGTGCTGCAC GCGTTGATGC GCAACGTTCG ACAGTGCAGC CGCTCGGCGT CACAAGTACC TTTGTTGACC GCGGCGTACG ATTTGTTCGA GATCATCGCA CGCGATAAGC ACTACGCGAG TGCGTTGGAA CAATGCCCCG ACAGTGTGAT GACCATGACG GAGCATCTGC AACAGTACCG CGACAGGCCG ACGCTTTTGG AATCCGCGGT GAACACGATG GTGGCGTTGT TTGAAAACTC GTCCAACAAG CGGTCTTTGG TAAGCGAAAA GTTTTTGACG CGCGTCGAAA AAATGAGAGA TATCATCGAC AGCAATCGCA TCGTGCACAG ACGTCGGGCG ATGACTTTTG CGCAGCAAGG AAGATACAAG GAAGAAACCG AAGCGCGCGA CGCGTTGGTC AAGACGGAGC AAACGCTCGC GTGCTTGAAA AAGTTGACTC GTACGTTGAC GTAA
|
Protein sequence | MGGKMKDARA RGDAVDDARR PSARASGERR RAVLSEPDAV PVVRFGAVDV GGTGCEGLDI VNDTALTQEV AFADCEAIAR DGFSLDRETL TIQPGTTERV QLTWTPQRAM AATYCGQISF VVVAEAPVPM PEVEMKARIR GVAKGEVLRP LTNEAPEERV ASATKRPRDA RLATRRSLAT SPRKLAGEDK LALDAMRAKQ RRLEDEPATL ALAPVARALQ LQREDSASGD AHEGSTSRNV SDEFQAGIWL RQQELAFIAW LNHTIVIDDV GTMGDDSPSA NRGGNASARE VRQTVRNKLT SLYSYDDELG RVLKKTYRHV DNARFRLNTG QTFMDNVALK EEFARALSCF SPFWLQLGVD VVVGGGIVWK RRGDLHEIQK ECIAALFRDR DLEIEFGTGH VPGAPPFAHG YEEALSRSVL KRVLLLVFIL DRAAMSGLPP NTPLLMRPHA ALKRSEDILR TALQGSMYGE GDVIRNLSQC SYKLHYKQNP IREYDFQCTN LAVDLRDGVR LCRLMEVLNA DVLFMSYDEK NKEWKRSLLS EVHFPCASRA HRVQNVEVAL RAIKDQQVGL PGTWSRIKAE DIVDGHLEHT MGLLWALMMH YSAPGLLLPK SLDSEITRLG GKVPDIKRIE RLSAARRGDS VIEAPQCAME ARLYAWARAA CATQNVELNN LGGAFTDGRA LCALIRAYAP MMIPKRRIGN APLKLDDANA DTAKHARELA RDNFAAVAKA LQALGGVPNP TFDIRFTSDE GLDSPDPRAV SGYLLFLSAR LLLLRQQEVA CVRIQRWWRW NRPNRPKFAE VVRKWNAAST VIASHVRRVQ AVDAVNARKN AIVKLQSFRR ACVARREFLN MKNAAVKIQS FKRMHTARLE FQDTKWAVEK VQKMRRGCAQ RNQFLRKKQA ATLIQGWYRT VCARNEYVNK TCAATIIQMH WRAFAARAEA KRIVEARMKI IHSAATKIQA AFRKCMMRKH FLRLRWFVIL SQARARAAAA RRTFVAQKKA SVTIQRRVRR FLDYNAYKRR SQMIENERQK KAATTIQRHW RGYNTRDGLD NIQWKTYFVT LLQAYVRRWQ TRRKFVNEIL PRQKELKLQA RKTQMRARMA REREAATCIQ KFCRGHLARK TVRKMRRKAS KAKRAEKDAA ANLLQKEEKS SEEITQRSRR PVSAFTRKAL LEQHVVVIQA FVRGWLARKH AVHKLEWHRK RIAAKAQPVN PLHARAEQAA NMIAAPHARD DCLRGCTFFT EHWNLSKTCR GIVTSPRVLH ALMRNVRQCS RSASQVPLLT AAYDLFEIIA RDKHYASALE QCPDSVMTMT EHLQQYRDRP TLLESAVNTM VALFENSSNK RSLVSEKFLT RVEKMRDIID SNRIVHRRRA MTFAQQGRYK EETEARDALV KTEQTLACLK KLTRTLT
|
| |