Gene Information |
Locus tag | OSTLU_25378 |
Symbol | |
ID | 5005104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | - |
Start bp | 74538 |
End bp | 76467 |
Gene Length | 1930 bp |
Protein Length | 518 aa |
Translation table | |
GC content | 64% |
IMG OID | 640420525 |
Product | predicted protein |
Protein accession | XP_001421042 |
Protein GI | 145353486 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
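The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding region ends in a single stop codon. The function names are hypothetical and are not part of the source database.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein: 3 per residue, plus an optional stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the record above.
gene_len = inclusive_length(74538, 76467)  # Start bp, End bp
cds_len = coding_length(518)               # Protein Length in aa

print(gene_len)            # 1930, matching the Gene Length field
print(cds_len)             # 1557
print(gene_len - cds_len)  # 373 bases of the listed span lie outside the open reading frame
```

Under these assumptions the reported gene length agrees with the coordinates, and the listed span is longer than the 518-residue reading frame requires.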
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.211316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0354367 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GACGCCGCGA GCGCCGATCG ACGGCGTCGA TCGACGCGAC GCGACGCGCT CGAATCACCG CGCGCGCCGT GCGCGAACGC GACGCGACGC GCGCGACGCG AAGAGGGACG CGACGCGGGA CGCGACGGCA TACCGCGGAC GACGGCGTCC GGACGACGGC GCGCGCGGAC GCGGGACGCG ACGCGACGCG ACGCGAGCGG CGCTCTTCCG CGCGTCGAGT GAACGATGCC GAAGCGATTG GTGTCGCAGG GAATGCTGCC GGACGTGTTC CAGCAGTACA TCATGAAGTA CCTGGACGAC ACGGATTGCA ACGTGCTGTC GCGCGTGTCG CACGGGTGCC TGGACGCGGT GATCGAGTCG GGACGCGATC CGGTGTGCGC GATGCGGGTG AGTAAGTTCG TGCACTCGCC GGCGATGTTG AGGTGGGCGC AGACGCAGGG CGTGCCGTGG GATTGGCGGT TGTGTCGCGC GGCGGCGGCG AGCGGGAAGC TGGAGAGCGT GCAGTGGCTG CGCGAGCAGG GGTTCTACGC GGGGCCGAAA CCGGAGGAGT GGGGGTATCG AAACGGGTGC CCGTGGACGG CGGACACGTG CGTGGCGGCG GCGCGGGGAG GGCACTTGGA TATTTTGAAG TGGGCGCACG CGCACGGGTG CCCGTTGAAC TCGGCGGTGT TCGCGAACGC GGCGAACGGG GGGTACTTTG AAATGATGTG CTGGCTTCGC GAGGTGGGGT GTCCGTGGGA CGAATTGACG CCGATTTGCG CCGCTCGAGG AGGGCATTTG GAGATTTTAC GCTGGTTAAA GGCGGAAGGG TGTCCGTGGG GAAGCGCGGT GTGCGCGAAG GCGGCGAAGA ACGGCCAGCT CGCGGTGCTG AAGTGGCTGA AACGACACAA CTTCCACTGG CACTCGAGCG CGATTCAGTA CGCGTGCGAG GGCGGGCACT TGGAGGCGCT GAAGTGGTTG CGCGCGGAGG GCGAGACCTG GGACGAGCAA GCGGTGTGGG ACGCGGCCAA GGGCGGACAC TTACACGTTC TCGAGTGGTT GCGAGCGGAG AACTGTCCAT GGCCGCACTC GGCGGGCGAC GTCGCCGCGC GCTTCGGCCA CCTCGACTGC CTCAAGTTTT TGCACTCGCA AGGGGGGATC CTGCACGACT GGACGTGCCA AGAAGCCGCG GTGGGTGGGC ATTTGGACAT GCTCAAGTGG TTACGAATGC AAGGCTGCCC CTGGAACTTG TGGACGCCCG TGCAAGCGGC GCGACACGGC CATCTCGAGG TGTTGAAGTG GGCGCATAAA CAGGGGTGCC CCGTCGACGC GCGCGTGTGC GCCGCCGCGG CGTACGGCGG TCACGTCGAG TTGCTCGAGT ACCTTCGCGC CGAAGAAGTG CCTTGGGATG AACAAACGTG CGCGCACGCC GCGCTCGGTG GTCACTTGAG CGTGCTGCAG TGGGCGCGCG CGCGCGGTTG TCCATGGAAC ATGTACACGT GCGAGTACAG CGCGTGGGAG GGAAATTTGC ACATTTTGAA GTGGGCGCGT CAGCACGGGT GCTTGTGGAA CTCACGCACG TGCGCGTTCG CCGCTATAGG CGGACACTTG GACGTCTTAA AATGGCTTCG TCAGCACGGA TGCCCTTGGG ACGGTTGGAC GATTCAGAAA GCCACGGAAG AGGGCAACTT TGAACTTCGC GACTGGGCGC TCAAGCACGG CTGCCTCTCG ACTTCGGCGT CGACGGAGTT TAATTTGTCC AATTTCCCCA ACGATGAATT TTTACAAATG CTCGATGGAT AACATCGTTG CATACGCTTT 
AACGCGCACG CCGCTTTCAT CGATGCAGTA CCGACGGTGA ACGACACAGT AGATTAGTAG GGCGCTTCGC CAAAGCGCGT TTCACACGAA ACGCTCAGCT TTGCAATCAT GATTACAGAC TAAGCATTTC
|
Protein sequence | MPKRLVSQGM LPDVFQQYIM KYLDDTDCNV LSRVSHGCLD AVIESGRDPV CAMRVSKFVH SPAMLRWAQT QGVPWDWRLC RAAAASGKLE SVQWLREQGF YAGPKPEEWG YRNGCPWTAD TCVAAARGGH LDILKWAHAH GCPLNSAVFA NAANGGYFEM MCWLREVGCP WDELTPICAA RGGHLEILRW LKAEGCPWGS AVCAKAAKNG QLAVLKWLKR HNFHWHSSAI QYACEGGHLE ALKWLRAEGE TWDEQAVWDA AKGGHLHVLE WLRAENCPWP HSAGDVAARF GHLDCLKFLH SQGGILHDWT CQEAAVGGHL DMLKWLRMQG CPWNLWTPVQ AARHGHLEVL KWAHKQGCPV DARVCAAAAY GGHVELLEYL RAEEVPWDEQ TCAHAALGGH LSVLQWARAR GCPWNMYTCE YSAWEGNLHI LKWARQHGCL WNSRTCAFAA IGGHLDVLKW LRQHGCPWDG WTIQKATEEG NFELRDWALK HGCLSTSAST EFNLSNFPND EFLQMLDG
|