Gene Information |
Locus tag | OSTLU_26149 |
Symbol | |
ID | 5004150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 308322 |
End bp | 311564 |
Gene Length | 3243 bp |
Protein Length | 1080 aa |
Translation table | |
GC content | 53% |
IMG OID | 640419571 |
Product | predicted protein |
Protein accession | XP_001420141 |
Protein GI | 145351561 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
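The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is only a consistency check and makes two assumptions that the record itself does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 308322
end_bp = 311564
gene_length_bp = 3243
protein_length_aa = 1080

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: an unspliced CDS consists of whole codons, the last of which
# is a stop codon that does not contribute an amino acid.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # prints: 3243 0 1080
```

Under these assumptions the reported gene length (3243 bp) and protein length (1080 aa) agree with the coordinates.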
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0731433 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGA AATTGAACGA GGACGCCGTG CGCGCGGACG CGGACGCGAA AATGAAATCA GACAGGATTG AAACGCTCGC GAGAGAACGC CAACGCGAGG AAGCGAACGC GAGCGACGAC GAAGACGAGG ACGAAGACGA AGGGTTTCGA ATGAAGCGAA AAGAAGAGGA TGCGGCGGTA CCGGCGTACA CGTGGACGGC GACGCTGCGA CGACGGGTGC GAGAACTCGA GGAGGGCGCT GACGTCGAGG CGCGAGAGGC GGGGAGGCTT CGGTTGGACG AGGCGCTCGG CGCCGCCGAC GACGAAGACG CCGGTGATGT CAGCACGTTT GTGAATGGCG CTCCGACAGT GATTACTAAA ATGAAGAGTC GAATCAATTC GATGAATGCG CCGACGGGCC CGACCGGTGT GCTGGTGAAG ACCGCGCACG CAGAGGATAA GCCGATGCAG CAAGTCGTGG AAGACTCGTA CTACAAAGCT GTCATGAGCG AAAAGCTCCC GGCGTACAAA GCACCGCGCC GACACATCGC GCGATTGGAG AATCTTGAGT CTGCGGTTTT AGACAAAGAG GGCAACCAGC TTCCTTTGTG GAAATCGACG AGCGACGATC TCGACGTTCT CGGCCCGGGT ATCAGCTTGT GGTTCGCTCA GTTGAAGACG TTGGGTAAAA CGTTTGTCTA CATCACAATG CTCGCGTCTG TCGCGCTCTC GCACTATGTG TATATCGCCA ATAAAAGTAG TAGCGTCGAG GTAGAGCCGG AAATAACGAC GACACTCGGT GTCACGGCGG CGGGCGTTTA TGCTTCGGCG TATGGTATAG ATATTCGTAC CATCATGGCT GTTCTCTCGT GCTTGGACGT TTGCATGGTT TTGATGTTCA TGCTCATAAC GGCGATTTTA TCCAGACGTA TTCGAAGCTT TGTCGTCCGC GTTGATGAAG CTTTACTCAC TTTGGCGGAC TATTCGCTCC AAGTGACCGG CTTGCCAGTA GACGCGACGG AGGACGAAGT GCGCGAGCAC TTTGAAGAAT TCGGACCCGT GGCAGATGTC GTCATAGCTC GTTCGTATGG TGCAGTTTTA CGCGCGCGAA TACATCGTGC GCGCTTATTT AAACGTGCAG AACAGCTGAA ATCTGAACTG AGCGCGATCC GTCACACGCT GAAGAAGCGC GGCGAGGATT ACAACGCAAA CTACGCCTTC AAACGCAAAT GGAAACAATT TTACAAAGCA CGCGATAAGA TGAACGCCTT GAAGGAAAAA ATCGAGCAGC GTATGCTCGA GCCTTTTAAC ATCGTTTACG CTTTCATCAC GTTTGAACAA GAGTCGGACA AAATGACATG TCTGGACGAG TACGCTCCGT ACTTTAATTT TTTCAGAAGT AAGCGAACGA GATTTCGCGT CCACCCTCGC GAAGACGGCA AGGGCGACGC ATTTCACTAC TTGCGGGTTA CCCAAGCCGT GGAAGCCTCT GATATCATGT GGGAAAATAT GGCTAACGTG GGTTGGAAAC AATACTTCGT ACGACGGGCG ATTACAACGA TTGCTATCCT TGGTCTTCTC ATCGGCAACA TTATCCTTGT CGTTTTTGCC ACCGACTGGG TAAAATCCGG CGGGCGATTA CTCGTCAACT GTGGCGACTT GTTCGCCTCG GGGGCTTCTG ATAACATGTA CTGCCCTGCG ATCTGGAACA TCAACGAAAA CAGTCTCGAA ACCGATTTGG GATTGATTTC GAACGTCAAC TTCCGAAAAC AAGTCGAGAG CTCAGACTGC AACCCATTCA TCGAGTCGGC AATGTGGTCG 
TATGACATGA CGCAATACTC GCCATATTAC GAAGCCGTGG GCGCGTACTC TTCATTAAGC GTCTCTAGCG CTAACGCTTA TGGATATTCT GGTGGAGTAT GGAATGGAGG CATGGACGCC TCTACGAAGG CCGATGAGTG TGCGGCGAAA ATTTGTTATG ACTGCATGTG TGAGAGCGCG ATACGGTCGG GACGCGTGAC AACCATCTGC AAAGACTACT TTTACGATCA ACTACTAATC TTTGGTTTCG AAGTCGGTCG GCTGTCCGTC GTCGCATTGA CGTCTATGCT CTTGTTGTGG ACGTCGGGAA AGTTTGCGCT CTTCGAGCGC CACAAGACTG TATCTGCCAC GGAAAGAACG ACGAGTCGTT TTGCGTTCTT CACCCTCATC ACGAACGCCC TGATTCTTCC GTTACTCGTT AACATCGAAA TCAAGGGTTT CTCAGGCTTC CCAATCTTGT TCAGAGGAAG CTACGAAGAC GTGACGTTGG ACTGGTACGC GCTCGTGATG AGATCGCTCA TGATTACTAC ATTCATCAAC GCCTCGTGGT TTGGACCGTC GCGTCTTGTG CAAATGTGGC TCACGCAACT CTGGCGTTAT TGGACGGCGC GATGGTGCAC CACGCAGTAC AACTTGAATC GTCTTTATCA ACGACCAAAA TTTACGCTCG CTGAACGTTA CGGTCAGTGT ATGACGGTGA TATTTACCGC AATCGTTCTC TTCGCGGCCG CTCCAGTGCT CGTTCCGGTG GCGGCGTTCT ATTGCTTCTT GGCGTACTGG AGCGACAAGA CGATGATTTT ACGTCACTCG CGCTACCCTT CGCTGTATGA TCACAAGTTG GCGAGACAGT TTATGTCGTA CGCCCCGCTG GCGTGCTTGG CGCACTTCGC CTTCTCATCT TGGGCGTTCT CTCAGTGGGA CATACCGTCC TACTTTCTCA CTGGGTTGGA CGGGTGGGCC GAGGAAATTT ACGACCAACG CGACTCCGAT TTCTGGACCG TTAGAGCGAA TATGACCAAG TACGAACAGC TCGACTTCAA AGAGCGATTC TATCGCGTCA ATGGATTGAT CCAATTGATC CCACTCATGG CGTACTTTAT CTACTTGCTC ATCAAAGCCT TCTTCTCGAG CGTCGGTAAT ACGGCGCTCT ACTTGCTCGG CTGTAAAGGC TTGGGAGCGA GCGAATGGAA GGCTGACGTG CAATTCAGCA ACTTCTCTGA AGCTCGAGAC AACATGTTAA AGGATGGTGA TCATGAAACT TCACTCTCTG GTTTGCCGAG TTACCGCGTG CAGGACAATC CGGAATACAC GGCGTTATTC CCAGAGGCGA AACAGGTGCA CGGCGCGTTC ATCAGTCACG ACGACGCCGA GCGCGCGACC GACGCCCCCG CGCCGGCTTC GACGCCCAAA TGGGCCAATC GAGTCGCATC ACCCCGCAAG TGA
|
Protein sequence | MATKLNEDAV RADADAKMKS DRIETLARER QREEANASDD EDEDEDEGFR MKRKEEDAAV PAYTWTATLR RRVRELEEGA DVEAREAGRL RLDEALGAAD DEDAGDVSTF VNGAPTVITK MKSRINSMNA PTGPTGVLVK TAHAEDKPMQ QVVEDSYYKA VMSEKLPAYK APRRHIARLE NLESAVLDKE GNQLPLWKST SDDLDVLGPG ISLWFAQLKT LGKTFVYITM LASVALSHYV YIANKSSSVE VEPEITTTLG VTAAGVYASA YGIDIRTIMA VLSCLDVCMV LMFMLITAIL SRRIRSFVVR VDEALLTLAD YSLQVTGLPV DATEDEVREH FEEFGPVADV VIARSYGAVL RARIHRARLF KRAEQLKSEL SAIRHTLKKR GEDYNANYAF KRKWKQFYKA RDKMNALKEK IEQRMLEPFN IVYAFITFEQ ESDKMTCLDE YAPYFNFFRS KRTRFRVHPR EDGKGDAFHY LRVTQAVEAS DIMWENMANV GWKQYFVRRA ITTIAILGLL IGNIILVVFA TDWVKSGGRL LVNCGDLFAS GASDNMYCPA IWNINENSLE TDLGLISNVN FRKQVESSDC NPFIESAMWS YDMTQYSPYY EAVGAYSSLS VSSANAYGYS GGVWNGGMDA STKADECAAK ICYDCMCESA IRSGRVTTIC KDYFYDQLLI FGFEVGRLSV VALTSMLLLW TSGKFALFER HKTVSATERT TSRFAFFTLI TNALILPLLV NIEIKGFSGF PILFRGSYED VTLDWYALVM RSLMITTFIN ASWFGPSRLV QMWLTQLWRY WTARWCTTQY NLNRLYQRPK FTLAERYGQC MTVIFTAIVL FAAAPVLVPV AAFYCFLAYW SDKTMILRHS RYPSLYDHKL ARQFMSYAPL ACLAHFAFSS WAFSQWDIPS YFLTGLDGWA EEIYDQRDSD FWTVRANMTK YEQLDFKERF YRVNGLIQLI PLMAYFIYLL IKAFFSSVGN TALYLLGCKG LGASEWKADV QFSNFSEARD NMLKDGDHET SLSGLPSYRV QDNPEYTALF PEAKQVHGAF ISHDDAERAT DAPAPASTPK WANRVASPRK
|
| |
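The GC content field can be recomputed from a sequence formatted like the one above, where bases are grouped into space-separated blocks of ten. This is a minimal sketch, assuming GC content is defined as the percentage of G and C bases among all bases; the record does not say how its 53% figure was calculated or rounded, and `gc_percent` is a hypothetical helper name.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # Remove the spaces and line breaks used to group the bases.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# The first two ten-base blocks of the gene sequence, as a small example.
first_blocks = "ATGGCGACGA AATTGAACGA"
print(round(gc_percent(first_blocks), 1))  # prints: 45.0
```

Passing the full gene sequence to the same function would give a value to compare against the reported GC content.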