Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_37187 |
Symbol | |
ID | 5001247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | - |
Start bp | 768794 |
End bp | 770785 |
Gene Length | 1992 bp |
Protein Length | 664 aa |
Translation table | |
GC content | 54% |
IMG OID | 640416668 |
Product | predicted protein |
Protein accession | XP_001417597 |
Protein GI | 145346232 |
COG category | [Z] Cytoskeleton |
COG ID | [COG5059] Kinesin-like protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.00858148 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGCGC AAGTTGAGAC ACCTCTCACG TCTACGTCGA CCTCTGATGA CTCTATATAC GTCCGCGTGC CACCATCTCG GCTTCGTGGA GTCGAAAAGC AAGCTCGACG ACCAGTGACG AGTCCGTCGG CGCAGACGGC CAAGCTTCTT GTCGCAGTGC GAACGAGACC AGTTGAAGTA TCGGTGGACA ACGCGGCGAG AGGCGAAAAC GCGGAGCGTA GCGCTACATC CGTATCGACG CCAAACGCAC GTTTAACGAG CTCGACGAAC GCACTGTCGT CTTCGGTGCA GTCCTTATCT GGAATTGGTC GCAAGAAGAG CATATTACGC GTCGTCAATC GCGACACAGT GGTCGTGATG GACCCCGACG AAGAGAAAGC ATACTTGGAC CAAGTGCAAC GGCGGAGCAA GGCGCGGCGA TACACCTTTG ACGTCGCCTT CAACGACACG GCGACGAATG CGGAAGTCTA CGACGCTACT GGTCGTGGTT TGATATCGGG CGTCGTAAAC GGCATGAACT CGACCGTGTT CGCGTACGGC GCGACGGGTT CAGGCAAGAC GCACACGATG ATCGGCAACT ACGATGAACC GGGAATGATG TTTCTTTCAC TCGTTGACAT ATTCGATCAA ATCAAGTCAT TGCGCGAGAG CTACGAATTC GAAGTCAAGT GCTCGTACTT GGAGGTGTAC AACGAGCTGA TTTACGATTT ACTCATCGTT GACAGCCCAC CGCTCGAACT GCGCGAGGAT CCCGAGCGCG GCCCAGTTCC GATGGGCTTG ACGCGCATCG CAGTCAAGGG TCCGGACGAC ATCACGAAAC TCTTGCACGA AGGTAACGAG CGTCGTAGCA TCGATCACAC CGAAGCAAAC GCGACGAGTA GCCGTTCTCA CGCAGTCTTA GAGATTAGCG TGAAGCGTTG GGAGAAGAGC GCCAAGGGCA AAGAGAAACA CGTTTTATGC GGCAAGCTCT CCTTGGTTGA TTTGGCAGGG AGCGAGCGAG CGTCGGACAC GCAAAACTGC GGACAGAAAC TCCGTGACGG TGCGAATATC AACAAGAGCT TACTGGCATT GGCAAACTGC ATCAACGCTC TGGGACGTCA CAGCAACTCG AAGGCGAAGG GCCGCATGTA CATCCCATAC CGCAATTCCA AGTTGACCCG TCTGTTGAAG GATGGTTTGT CGGGAAATTC ACGAACAGCG ATGATAGCCA CCGTGAGTGC TTCGAGCGAA CAGTACAACC ACACCATCAA TACCTTGAAA TATGCCAATC GGGCGAAGGA GATCAAGACC AACGTGGCGC AAAACGTCGT CATCACTCGC GAGCGGCATC ACATCGGCGA TTATCAACAC GTGATTGATG ATTTGCAATC GAAAGTCGCG CGTTTGAAGA CACAGTTGGC GAATAAAGAG GCGACGCCGA GCACGAGTCA GCCGAGAATG AGCGGGGGTG GTTCTTTCTC CGAGGCTTGG TTGGATGCGT TGAGCGACGA TATAAATGAA AACGTCGAGG AGCGCATTAA TATTCAAAAG GCACTGTTTG AGCTCGAAGA TATCAACGCA CAAAACGTGT ACGAGCGTTC GGCGCTCAAA CGCAGGCTGG TGGAAATCAC AGAAAAGAAC CCGAACAACG TCGAACGACG GAATCTTCGC GGGCGCATCG ACGCCATCGA TGAAACCATT CGCGCGAATG AATCCCTCGG CGCAAAGTAC CGAAAAGACA TAGAAGCGAA TGAACTCGTT CGAGTCGCCA TTCAATCGCG CATAGACGCC GCGATCGCTG AAGGTAAATC GCCTAGTTTC TTGCGAATTC TTTCACAGTA CAGGCTCGTC GGGGTGAAGA ACATGGAGTT GCACTTTCAG CTCGCCATTC GCGATCAAAT AGTCAGCGAG CAGCGTGAGA TGATTCGTGG GTTTTGGGAA GTCATGGCCC ACGCTGGATT GAGTAAGACT CTAGTGCAAG ACATCGCAAC TCGTGAGGGT ATACGCATCG AC
|
Protein sequence | MRAQVETPLT STSTSDDSIY VRVPPSRLRG VEKQARRPVT SPSAQTAKLL VAVRTRPVEV SVDNAARGEN AERSATSVST PNARLTSSTN ALSSSVQSLS GIGRKKSILR VVNRDTVVVM DPDEEKAYLD QVQRRSKARR YTFDVAFNDT ATNAEVYDAT GRGLISGVVN GMNSTVFAYG ATGSGKTHTM IGNYDEPGMM FLSLVDIFDQ IKSLRESYEF EVKCSYLEVY NELIYDLLIV DSPPLELRED PERGPVPMGL TRIAVKGPDD ITKLLHEGNE RRSIDHTEAN ATSSRSHAVL EISVKRWEKS AKGKEKHVLC GKLSLVDLAG SERASDTQNC GQKLRDGANI NKSLLALANC INALGRHSNS KAKGRMYIPY RNSKLTRLLK DGLSGNSRTA MIATVSASSE QYNHTINTLK YANRAKEIKT NVAQNVVITR ERHHIGDYQH VIDDLQSKVA RLKTQLANKE ATPSTSQPRM SGGGSFSEAW LDALSDDINE NVEERINIQK ALFELEDINA QNVYERSALK RRLVEITEKN PNNVERRNLR GRIDAIDETI RANESLGAKY RKDIEANELV RVAIQSRIDA AIAEGKSPSF LRILSQYRLV GVKNMELHFQ LAIRDQIVSE QREMIRGFWE VMAHAGLSKT LVQDIATREG IRID
|
| |