Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_16903 |
Symbol | |
ID | 5003520 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | - |
Start bp | 604954 |
End bp | 607908 |
Gene Length | 2955 bp |
Protein Length | 785 aa |
Translation table | |
GC content | 65% |
IMG OID | 640418941 |
Product | predicted protein |
Protein accession | XP_001419852 |
Protein GI | 145350944 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.500288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00972481 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCCCGCCC TCGCGCTCGA CGACCGCGAT CTTCGCGGCT CCGTCGCGCT CAGCCCCGAC CGCACGCGCG CGCTCGTGCG CGATGGCGTC CATCTGACGC TCCGGCGCGC CGCGCGCGTC GACGAACGCG TCGGGAAGGC GATACAGTTT GAAAGCACAA TCGCGTCTGT CGCTTGGCAC GCGACGCGCG ACGTCTGCGC GGTGGTCCTC GACGACGGCA CCGCGTGCGT CGTCACCGTC GACGACCGCG GGTTGACGCT CGAGCGCGTC GCGCGCGGCA TCGGGGCGTG CTCGCGGGCG TTCCGCGCGG GCGACGGCGC GCTCGCGTTC GCCGGCGACG GCGACGCGTC GACGACGATG CTCGACGACC GAGACGGATT CGACTTCGTT CGAATCGCGC ACGCGAGGAA GAGTAGCTCC AGATTTAGCG CGCACGCGCG GTCGCGAGAT GGACGGTGGA TGGCGTTCGT GACGCGCGAC GACGAAGCGC GCGAACGCGT GGAGGTGTTC TCGGCGACGG CGCCGTACGA AGGCGCGATC TCGTTTCGCT TGCCTCCGGG CTTCGATGCG CAGATGATCG AGTTTGGCGG CGCAGGGGGG CAGGAGATTC TAATTTATGA CGATTCGTGC GCGGTGGAGG CGCGACCGGC GCTGGAGATT TTTTCGGCCG AAGGCCTCCG GCGAGCGACG ATTCGAGGCG CGTCGGCGCC GAGCGTTCGG GCGAAGGATG GACGTCTCAT CGTCACGTGC GTGTCGAACG AGGCCCTTTT ATGGATCGAT GAATTGACTT GGCGGATCAC GCGCGCGCTC ACGCACCCGC TGACGTGCGA CGCGACGGCA TCGACGAGAA TTTTCCGCGA GTCTCAGCGC GGAGACGGAT ACGAAAAAGT CGCGCGGTTT GAACAGAAAA CTTTGGCGTG CGACGCGCGA GGTATCGAAC GAGCGGGCGT GCGACTGGCG CTGAGTTCGT GTAAGACGAT GCTCGCGACT CTGTGCGCGA GCCACAACGA CAAGGTTTTA TTTCTATGGC CGGCGTCCGG GAACGACGCC GACGACGGAG CTCCGCTCGC CGTCTTCGTT CACAGAAAAC CAATCATCGA CTTCAGGTGG TTCGAAGACG ACGACCTGGA CGGTAGATCA CGCCTCGTGT TTCTTTGCGC AGACGAGCCG GCGCTTTATT CTTTCGTCCC GGGCATGCTC GCGCCCGCGC GAGTCGCGCT CGAGCGCGCC GCCGCGTCGT TTTCGCCCCG CGAAATCGTC GGCGCTTCGC TCGAGCGCGC CGACGTCTTC ATCGTGGCGA GCGCGTCGAA ATCATTCGTT CAACAGGCCG TTCGCGCGTA GCCCATCGTT CGATCGTCGC GCGCGCGATC GATCGAAATC CTCCGCGCGC CGCGCCGGGT CAGTCAGGGC TCCGCCAACC GTTGTAACAC CCCATTCCGA CGACGCGTCA TTCATGGCAC CCCACTCCGC ACATTCCGAC GCGTCATGGC GCCCGCGCGC GACGCGACGG AGTCCACGGC GCTGCTCCGC GCGGCGAACG AGCTCGACGT CGCGCGTCAC CGCCGACGCC GCGCGACGAT CGCGTCCGTC GCCGTCGTCG CCGTCGCGGT CGCGTCGATC GTCGCGCGCG GGCGCGCGTC TCGCGAGGGC GCCGGCGCGC CGTTCGAGGT GCAGTTTCAG GTCGACGTCG GGTGCATCGA GCTCGACGTC GTCGATCGCT CGCCGATAGA GGACTTCTAC GCGTCCGACA TCGCGAGCGT GCGGTTGCTG CGACGAGGGG ACGATCCGAT TGAAGAGTTC GGCGGCGCGA GCCGACGCGC GGGCGGGGTC GAGCTGACGC AGAGCGGATG GTCGACGATA TACGTCGGCC GAGCGCGCGT GCGCGCGGGA GAGGAGATCG GGTTCGGGCT GGTGAACGCG CGAGGCGAGA CCATCTTCGA ACTGGGGCAC AACAAGCGGT TTCCGGCGAC GGGCGAACTC GCGAACGCGA CGTGCCTGGC AAACGTCCCC GCGGGCGGGG GCGCGTACAG AAATCGCGTG ATTCCGAAAA GGTCGAGCAT GCGGCTCGTC GACGGCGTGC GTCGATTCGA GACGACGTGG GCTGGATGTC TCGAGCGGTG CCCGCTGACG GTGAAGCTCA TCGCGTGCAC CGGTCCGGCG ACCGGCGACG CGGACAACGT CTGGGGAATC GAAGAATCGG TGGCGAAAAC TGATGTCTGG CGTAACTACA AAGGTCACTT TACCTCGGTG AGTTCGGGAG AGTTCAACAC GTGGGGTTTA AATACCGTGA CTGGAACTCT CGCCTGGACG CGTAACGCCG ACCTCAATCA GTTCTCTCGC GATAACTGGA ATACGGCAAC CAACAATCCC GTAGCCAACG GCGTCGCCCA AGGCACCATG GTTGACTTTG ACGCGGGCTA CGATGCGCTC ATCGGAGTCA CGGACACGAG TATACACACC TCCGGCGGAT ACATGTGGAG TCGTCCCGTC GACGGCTCGG GCGATTGGGG TTTCGCCGAT GACGGCGGCG GAAAAGGTGT CCAAGTGACC ATCGGTCGCA CGCACCACTT TCACATGAAC AAGCAACACA ATATGTGGAG CGCGGAACTG CCTAATGGCG GATGGGTTCG TCAACACGTA AAAACCGTCA AGCAGGTCGA AGTCGGCGAT TCGGACGCGT TCGTCGTATA CCAAGATGGA AGGACGCTGA AACGAAAAGC CTCGGACATG ACCGGCGACT GGTCCACAAT CTCCATTCCG AGTGCGCTCT CGACCGCAAC GATCAGTCAA ATCACCGTCG GCGCCACCGC GCTCTGGCTT CTCGACGGCA ACGGCAATCT CTGGGGCTGT GACCTTCCTT GCAGAGACAG CGGTGGCTTC GTTCGCGCCG CCAACGCGCC CGCGAACATC ATCTCCATCG ACGCCGGCAA GGTGATCCAC AACGTCCCGA ACTAG
|
Protein sequence | MPALALDDRD LRGSVALSPD RTRALVRDGV HLTLRRAARV DERVGKAIQF ESTIASVAWH ATRDVCAVVL DDGTACVVTV DDRGLTLERV ARGIGACSRA FRAGDGALAF AGDGDASTTM LDDRDGFDFV RIAHARKSSS RFSAHARSRD GRWMAFVTRD DEARERVEVF SATAPYEGAI SFRLPPGFDA QMIEFGGAGG QEILIYDDSC AVEARPALEI FSAEGLRRAT IRGASAPSVR AKDGRLIVTC VSNEALLWID ELTWRITRAL THPLTCDATA STRIFRESQR GDGYEKVARF EQKTLACDAR GIERAGVRLA LSSCKTMLAT LCASHNDKVL FLWPASGNDA DDGAPLAVFV HRKPIIDFRW FEDDDLDEDF YASDIASVRL LRRGDDPIEE FGGASRRAGG VELTQSGWST IYVGRARVRA GEEIGFGLVN ARGETIFELG HNKRFPATGE LANATCLANV PAGGGAYRNR VIPKRSSMRL VDGVRRFETT WAGCLERCPL TVKLIACTGP ATGDADNVWG IEESVAKTDV WRNYKGHFTS VSSGEFNTWG LNTVTGTLAW TRNADLNQFS RDNWNTATNN PVANGVAQGT MVDFDAGYDA LIGVTDTSIH TSGGYMWSRP VDGSGDWGFA DDGGGKGVQV TIGRTHHFHM NKQHNMWSAE LPNGGWVRQH VKTVKQVEVG DSDAFVVYQD GRTLKRKASD MTGDWSTISI PSALSTATIS QITVGATALW LLDGNGNLWG CDLPCRDSGG FVRAANAPAN IISIDAGKVI HNVPN
|
| |