Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_18694 |
Symbol | |
ID | 5006283 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009371 |
Strand | + |
Start bp | 186403 |
End bp | 188796 |
Gene Length | 2394 bp |
Protein Length | 797 aa |
Translation table | |
GC content | 68% |
IMG OID | 640421704 |
Product | predicted protein |
Protein accession | XP_001422120 |
Protein GI | 145355763 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0344474 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000276573 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCGCCGT TCGCGCGCGA CGCGGTCGTC GCGCTGGACG CGGAGGCGGC GGCGCAGGCG CGCGCGCACG CGGGCTGGGG CGGGACGAGC GGGACGGGAC GCGCGCGCGC GACGCGGGCG CTCGAGGCGT ACGCGACGAT GGACGACGCG ATGGGCGACG AGGAAGAGGT CTGGGACGTC AGGGGGGAGC GGGTGCCGGC GGAGGCGCCG CTGGTGATCT GCGTCGCGGG ACGGCTGGGG GAGGCGGCGG CGAACGTGCG GCGGTGCGTG CGGGCGCGGC GACGCGCGAC GCGCGCGACG GTGTTCGTCG GATGCGACGA GGGCGATGAG GCGCACGGGG CGTGCGTGGA GGCGCTGACG GAGGCGTGTG AAAGGATTTT GAGCGACGCG GCGGCGGAAT TCGGGGAGAC GCCGACGGGG AATCGTTCGG GGGTGGGACG GAACGAAGAA GGAGACGAAG ACGACGAAGA GGACTGGGGG AGTTGGGGCG ACGAGGACGA GGCGAGAGAC GAGGCGCCGG TGGAGGACTC GAGGGAGGGG ACGGGCGATG GGTGGAACGA TGCGCCGACG CCGACGTCGA AACAAGGGCG AAATCGCACC GCCGCCGCGG ACGCCGTCGC GGGTCGGTTT TCGGTCAAGT TTTTCCCTCC TATGATGTAT CGCGCGCTCG GCGACGGTGC GTTCACGCTT CCGCGAACGC GAACGATGGG GTTGGTGAAC GATGCCGCGG CTGCGTTGAC GGGGGAGTCG AAAGAGAATC GCGTCATCGG GCATCACTTG GCTGAAATCG CCGCGCACTG GGCGTTGGCG CCGGATTATT TCGCCCTCGG TCCGAACGCC GAAGCCGTGT CGCGCGTCGC AGCGCAAGCG AAGACCGATC CCGTGGGCGT GGACACGACG GTGAAACCAC GAACGGCGGC GATCATCGTC GTCGATCGCG AGGTGGACTT GATGACGCCG AGCGTGAGTC GAGACGGTTG GCTCGAGCGC GTGTTGGAGA CGACGGACGA CGAAGACGCG GCGTCTTCAT CGACCTACGT CGACCGCGTC ACGGCGACGT TGTCCCCGCT CTTGAGCGAT GAAAATGTTC TCACGTTAGA TGAAGCTCTG TGCGCGAAGA CTGCGCGTGA TGGCGCTGTG CACGTGCGGA AGTTGCTGCG CGAGGCGGCT CGCGTGGAAT CCGTCGCCGC GCCGGCGGCG GATGGAAAGA GCGCGCGCGT CGTCGGTGCT GACGATATTC TTAGCTTAGT GCGAGCGCTG GAGGTTGATC CTAGCGTTGC TTTGCGTCAT CGCGCGTTGA TTCAGCGCGC AAAGCTCACC GCGCGGAGTT TGACGGATGA GAACGACATG AAGGCGAATA GGCAAATCAT CGCTTTGCAA CGACTCACCG CCGCTGCGCT CGAGCGCCAA GCCACGGGCG TGTGCGCGAC GGTGGTAGAG ATACTGAAAG TGATGTATTC TGCGGGAGGT ACCGCCGTAG GTCACCCGAG CGAGGCGCTG GCGCTCGTCT TAGCGGCGTA TGTACTCGCG ACGGAGGCGA ACGTTCAAGC GCAGGCGCCG ACGAACGCCG CGTCGCCGTT CACGGCGCAA GACGAGGCGT CCGTGCGCGA CGCACTTTTG GGCGCACTAC TGGCGAGCGA TCTCGCCGAT GTGAAGAAGC AAGTTCCAAG TTTCAACGCC AGTGCGCTCG AAGCGCTCGA AGCGCTTCAG AACGCGACCG CGGCCGCGGC CGCCGACGCG ACGCCGACGC CCTCGAAAGA TGACGACGGT TGGGACGACG ACGACGACGA CGACTGGGGC GACGATGAAT GGGGCAACTC TCCGACGGCG CCGAGTAAAT CCACGACGAC GAACGCGATG ACGGCGATCG ACGATCCCGA ACTCGCCGCC GCCGCGCTCG AGGCGCGAGA CGCCGTCGAG CGCGCGCTTC ACAACTTCGC CCTCGCCGCG CATCGCGGCC GCGCATCGCT CAAGCACAAC ATCCCAGAGT CTAACTCACT CTACGCCAAC GGTCTACCCA ACTCCATCCT GCTCGACATC ATCTCCCGCG TCAAGACGTC CGCCGACGAC GGCGGCGCGT GCGCCGATTT CGTCCACGTC GCCGCCTCCC TCGGCGGCTT ACTCAAGCAC GCCGCCGCCA CCGCCACCGC ATCCATCGTC ACCGGCGCCA TGGGTCGTCT CGGCAACCTC ATCAACAAGG TCACCGCCGC CCCCAAACCA TCCGATCGCG ACGTCGTCGT CGTCTTCCTC CTCGGCGCCC TCTCCTCCGG CGAATTCGCC GCCGCCCTCA CCGCGCGCGC CCCCGATCCC GCCGCCGCGC TCTTCGCCCG TCACAGGGAT CGCGAGTTCA TCTTCGGCGC GCTCGACCTC GCGTCCGCGC GCGCCATCGC GTGA
|
Protein sequence | MAPFARDAVV ALDAEAAAQA RAHAGWGGTS GTGRARATRA LEAYATMDDA MGDEEEVWDV RGERVPAEAP LVICVAGRLG EAAANVRRCV RARRRATRAT VFVGCDEGDE AHGACVEALT EACERILSDA AAEFGETPTG NRSGVGRNEE GDEDDEEDWG SWGDEDEARD EAPVEDSREG TGDGWNDAPT PTSKQGRNRT AAADAVAGRF SVKFFPPMMY RALGDGAFTL PRTRTMGLVN DAAAALTGES KENRVIGHHL AEIAAHWALA PDYFALGPNA EAVSRVAAQA KTDPVGVDTT VKPRTAAIIV VDREVDLMTP SVSRDGWLER VLETTDDEDA ASSSTYVDRV TATLSPLLSD ENVLTLDEAL CAKTARDGAV HVRKLLREAA RVESVAAPAA DGKSARVVGA DDILSLVRAL EVDPSVALRH RALIQRAKLT ARSLTDENDM KANRQIIALQ RLTAAALERQ ATGVCATVVE ILKVMYSAGG TAVGHPSEAL ALVLAAYVLA TEANVQAQAP TNAASPFTAQ DEASVRDALL GALLASDLAD VKKQVPSFNA SALEALEALQ NATAAAAADA TPTPSKDDDG WDDDDDDDWG DDEWGNSPTA PSKSTTTNAM TAIDDPELAA AALEARDAVE RALHNFALAA HRGRASLKHN IPESNSLYAN GLPNSILLDI ISRVKTSADD GGACADFVHV AASLGGLLKH AAATATASIV TGAMGRLGNL INKVTAAPKP SDRDVVVVFL LGALSSGEFA AALTARAPDP AAALFARHRD REFIFGALDL ASARAIA
|
| |