Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_15666 |
Symbol | |
ID | 5002443 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | - |
Start bp | 129651 |
End bp | 131569 |
Gene Length | 1919 bp |
Protein Length | 592 aa |
Translation table | |
GC content | 63% |
IMG OID | 640417864 |
Product | predicted protein |
Protein accession | XP_001418383 |
Protein GI | 145347870 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.479693 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0769406 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCAAA ACGACGACGC CCCGAAGGGC ATGTTTGCTC GTCTCGGTTT CGTACGTACA ATTCGCGGCG CGGCGGCGCA AGACGCGACT AATCTCTCAC GAATCCGTCA ACCGCAGATT CGTGTTGAAA CCAAAGATCG GCTCGCGACG ACGCGCACTG ACCGCGAATC GCTCGTTTTC GCGCGGCGCA GGGCGGAGGG AAAAAAGTGA CCAAGGCCAA GCTCGGGCTG GAGTCGGCGT TTTACTTTGA TGAGAAGACG CAGTCGTGGG TGGACGGGAG CGCGCCGGCG AGCGGGACGG CGAGCGGCGC CGCGGAGATC GGCGCGCCGC CGGTGATGCA GCCGCCGAAC GCGACGATGG CGGCCGCATC TCTAGACGGC GACGCGCCGA GCGCGAGCGC GTCGAGGGAA GGGCCGCCGA CGGGGGGAGG CGCGGGACCA ACTCACGCGG GGACGCGCGC GCGATACGTT GACGTCTTCG CGAGGGAAGG GATGACTTCG GCGCAGGTCG CGGCGCCGAT GGCGAACGTG AGCGCGTTCG TGCCGAGCGT GGCGCCGATT GGTGGGACAG GAGCGCCCAT ACCGCACGGG GGGATGCAAT TTTTCGTGCC GGCGCCTCGG GCGGCGGATG AAGGCGGCGA CGATGAGAGC GAGAGTTTAC GCGAGCCTCT GGCGTTGATG CAGCCGACGG TGAGCGATGC GAGCGCGGTA GAGGAAGACA GAGCACCAGT ATTGCATGAT GTCGACGTCG GCGGGACGTC GGTGAGCGAT GCGCCCGCGC CGGTGGTCGA CGCGGGAGAC CGCGAGGCTG AAGGCGCGGA TTTCGGCGGC GGCGCGAGCG ACTGGGCCGA AGCCGCGGAT GGGTCTCCGC GCTGGGCCGA AGCCGACGTA CCGCCAGTTG AGAGTGATCA ACAACACCCG CAATGGGAAA CGCAGACGCA GGAGTGGTTG CAGCACGAAG ACGCGTCGAT GGGCGCGGCC GAGGTGGCGC AGTGGGACAC GAACGCGCCT TCGACGACTG AGTGGACTGA AACAGAAATC GTCGCTTCGG AGACTTACGA AGCGCAGCAA GGTGACGAAC GCGGCGATTG GAAAGGTTAC GAAGGCCACG ACGATTACGA TTACGACCCT CGATGGAAGT ACGATGAAAA CACCGGCGAG TGGTACTGGG ACGGTGGCGA CGACGAAAAC TGGGTCGAAC GCGCGACGCA CGACGCTGCG ATAGCAGAAC TCCAAGCGGC GATCGACGCG AGGGCGTCGG AATTAGAAGA CATCAAATTG AATCGAGACG CTCTTACCGA AGAGCTGGCG CACGCGCACG CGACGACGAA CGAACTATCC ACAAAGGTAC AAGATTTGGA GGCCAAGCTC GCCGAGCGGT CGACGTCGCT CGATTCCGCC GTCGATGCCT CGGAAGAGAG TTTAAGCGCG GCGTTCGAGC GAGGGTTCGA GCGAGGTAAA GAAGAGGGCT ATACAGAAGG TTACGCCGCC GGTAGTGCGG AAGCGCAGGA AGAGCTCGCT GATCTCTTGG TGTGCCTCGG CCAGGAAGGA CGCCGCGTGG AGAAGCTTCG CGAGATGCTC GCGGAAACCG GCGCCGACAT TGACGCCATC ATCGCCGAGT TCGAGGCGGA CGAAGAGGAA CAAATCGCGA ATCTCATCGA TGGCCAAATA CAACACGATG AATTCTCCGC GCCAGAAGAC GTCTCCATCG ACGCCATCGC CGAGCAAGAT CTCTCCAACC TTCCCGAAAT TTCCCAATCC ATGAAAGCCA TGGCGGCGGA AATCGACACC CCCGAGCGTC TTCGCGACTT CGGCGCCGCC TCCAACTTAA ACCCCTCCGC GGAGGAGTTC ATTCTTCCCA CGCCTCCCAA GGCTAAAGGT CAAGAGCGCA ATCTCGCCAA CGCGTTCGAG ATGGCGTAG
|
Protein sequence | MAQNDDAPKG MFARLGFGGG KKVTKAKLGL ESAFYFDEKT QSWVDGSAPA SGTASGAAEI GAPPVMQPPN ATMAAASLDG DAPSASASRE GPPTGGGAGP THAGTRARYV DVFAREGMTS AQVAAPMANV SAFVPSVAPI GGTGAPIPHG GMQFFVPAPR AADEGGDDES ESLREPLALM QPTVSDASAV EEDRAPVLHD VDVGGTSVSD APAPVVDAGD REAEGADFGG GASDWAEAAD GSPRWAEADV PPVESDQQHP QWETQTQEWL QHEDASMGAA EVAQWDTNAP STTEWTETEI VASETYEAQQ GDERGDWKGY EGHDDYDYDP RWKYDENTGE WYWDGGDDEN WVERATHDAA IAELQAAIDA RASELEDIKL NRDALTEELA HAHATTNELS TKVQDLEAKL AERSTSLDSA VDASEESLSA AFERGFERGK EEGYTEGYAA GSAEAQEELA DLLVCLGQEG RRVEKLREML AETGADIDAI IAEFEADEEE QIANLIDGQI QHDEFSAPED VSIDAIAEQD LSNLPEISQS MKAMAAEIDT PERLRDFGAA SNLNPSAEEF ILPTPPKAKG QERNLANAFE MA
|
| |