Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31755 |
Symbol | |
ID | 5002100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 482284 |
End bp | 484251 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | |
GC content | 57% |
IMG OID | 640417521 |
Product | predicted protein |
Protein accession | XP_001418023 |
Protein GI | 145347115 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0216247 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGG CGTGCGAGCG GTGCGAGCTC GCGATCGTCG GCAGTGGACG CGCGTGTTTG AGCGTCTTGT CGAGGCTGAG CAGAGGTCGC GCCGAACGAG CGGTGGTGAT CGACCCGTCG GGGGCGTGGT TGTACTCGTT CGCGAGGACG CAACTCAGGC TGGGGGCGAC GCACTTGAGA TCGACGACGA CGCAAGTGCC GTTTGAGAAC GCATGTGGAC TCGAGCGGTA CATAGAGACG TTGGGGAAGA AACGCGATGT GGTGCGGACG GGTAGTGGGT TCGCGGGAGT GCCGAGTGTG AGGGTGTTCG CGGAGTACTG CGCGAAGACG GTGGCGGAGC GATTCGGTGG CGTGCGCGTG GAGCGAGGCA CCGTCGTGGA CGTGCGGTGG TGCGACGAGA CGTCGGAAGA GGTTCGTGAC GCGTTCAAAG CAATTGACGA GAGCGAAGGC GCTGCTGGCG TGGGCCAGGA CGAGCGTGAT GCGGTGGCTA TGATGCGGTG CGGCGCGATT TTATTGACGC TCGACACCGG AAAGACGTTT CTCGCCGCGC GATGCGTGTG GACGCCGAAG TTTTCCCTTC CGCTAGTTCC GTCCTGGGTC CTGGAAGCGA AAGCGTCGTA CGCAAAGTAC AACGCGTCTT ACGACCGCCA AAGCATCGAT TGTGGAATCA TGAACGCAGC CGACGTGGAT ATGAGCGCCA CTGATTGCGC GCGAGGAAAG TGCATCTTGG TCGTCGGCGG AGGAACGACG GCGGCGACGC TCGCGCTCGC TGCGCAGACG CGCGGCGCCA AGGTAGTGAC GCTGATGTGC CGAAGGAAAA TCACCGTGAG TGAGTTTGAA TGCGACGTGA AGTATTTTGG GAATAAAGGA CTGTACGAAT TTCACGCGTG CGCCGACGCG CAAATTCGCG CCAACAAGCT CGAGTCCTTC AAGTCGAAGG CGAGCGTGAA CGAGCACACG CACCGTCGTT TGCGAGATGC AGCCTTAAAG ACTGATAATA TTCGAGTTCT GGAGCAACGC GTGTTGAATG GCGCCGTGTG GAGCGACAGA GAAAAGAAAT GGCGCGTTCG ATCGGCGCCT ACGGATGAAG CCAAAGTGGA GTTCGAGTCG GCGATGTACA GAAGGTATCG CGACGAAGGA ATCGAGCCCG ATAGTTCAGC GCTCGCCATT TTCGAAAAAG AAGTCACATC TACACACGAT GAGATATGGC TAGCGTGTGG TGAATTCGTG GACCTGGCGA AAGATCCAGC GCTCCGCACG CTCGTGGAGA CAACGTCCGT AGAAATCGCT CGAGGCTTCC CTGCGCTCGC GGAGGAAAAA ATTGAATGTG CGCACGACAA AGGACAAACC GCGGCTGCAG GCGGAGGTGG AGGCTGTCGA TGGCCAGGAA CGTCGATGTA CGTGTTGGGT GCGTACGCCT CGCTAACCAT AGGTCCCGGG GCAGACCTCC CCGTGGGGCA TCGAATGGCG GCGAAGCAAG TCGTCGACGC AATGAAGAAA CACGAGACGG CGATATTGCG CAATAAGAAC CCGTATCAAG TCGCCGAAAC GAGCAGTGAA GCACAGCGAA CGCCGGATAG AGGTGAAACG TTCGATCGCT TCAAAAAGCT CCCACCTGAG CTCGCGGACA AAGGTTTGAT TGATATTGAA AGCTTGATCG CTGGCGCCGC GATGGAGCGC GTCGAGTTGG ACAATTACGA AATGTACGAA GAAGATATGA GGGCGGAGAT TCGTTTGAAA ATTCCCGAGG CCATTCTTGC GCGCGATGTT TACGTGTGCT TTCAAGATCG AGCGTTAGAG ATGTGGGCGC TTGGTAAACA AAATGCTTAT CGCTTCTTCA TCCGCAAGCT CTACAAGAAC GTCATCGTTG ATCGCTGTTC GTACAGGGTG TACGCCAATA AAAATCGCGT CGTGCTCAAC ATTCACAAGT ACACGAACCA TTATTGGCGG TATCTTAGAG ACAGGTAG
|
Protein sequence | MATACERCEL AIVGSGRACL SVLSRLSRGR AERAVVIDPS GAWLYSFART QLRLGATHLR STTTQVPFEN ACGLERYIET LGKKRDVVRT GSGFAGVPSV RVFAEYCAKT VAERFGGVRV ERGTVVDVRW CDETSEEVRD AFKAIDESEG AAGVGQDERD AVAMMRCGAI LLTLDTGKTF LAARCVWTPK FSLPLVPSWV LEAKASYAKY NASYDRQSID CGIMNAADVD MSATDCARGK CILVVGGGTT AATLALAAQT RGAKVVTLMC RRKITVSEFE CDVKYFGNKG LYEFHACADA QIRANKLESF KSKASVNEHT HRRLRDAALK TDNIRVLEQR VLNGAVWSDR EKKWRVRSAP TDEAKVEFES AMYRRYRDEG IEPDSSALAI FEKEVTSTHD EIWLACGEFV DLAKDPALRT LVETTSVEIA RGFPALAEEK IECAHDKGQT AAAGGGGGCR WPGTSMYVLG AYASLTIGPG ADLPVGHRMA AKQVVDAMKK HETAILRNKN PYQVAETSSE AQRTPDRGET FDRFKKLPPE LADKGLIDIE SLIAGAAMER VELDNYEMYE EDMRAEIRLK IPEAILARDV YVCFQDRALE MWALGKQNAY RFFIRKLYKN VIVDRCSYRV YANKNRVVLN IHKYTNHYWR YLRDR
|
| |