Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_119396 |
Symbol | Unk4 |
ID | 5000210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | + |
Start bp | 200959 |
End bp | 202743 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | |
GC content | 54% |
IMG OID | 640415631 |
Product | hypothetical protein |
Protein accession | XP_001416101 |
Protein GI | 145342033 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCTTCA AAGAGCTCCT CAAAGACTCC GTCACGGAGG TCTTCACCTA CCGCACCACG AAGCTCGTGA AGACGAACGA TAAGTTCTTA GTGACGCTGC ACTTCATATT CATGAGCCTG ATCGCGACGT TCGCGCTGGT GTCGATCTTA TTGAGTCATA ACTACATGCT TTTCGAGCTT CCGGCGCTGT ACGTGACGAC GACGTATCAG AAATTCGGAC CGGATCCGGT GACGAGCAAC GTGGTGCTGG CGCGTGAAAT CGCGCAGAAT GCGACTGTGG ATTATTGTAA CACAGGGTAC GTGAATTGGC GTCGAGGGGG GAAGGTGTTT GACGACGCTT ACATGACGGC GCTCGAATGT ACGGCGCAGC ACGCGCCGGC TGAGTACGTG TGGCCGTTCG ATGCGGGGAA TGGGATGACC ATCGGCACGT TTGCAGAAGT ACACTCCGTG ACGAGATCGT GTACATCACC GGGCGCGTCG ACGTACGGGG CGGCAGGGTG TACGGAGACT CAGACGTCGC CGCAAGCGTA CATGGTGGTC GGCGTAGACT TGCTGAGCAT GCAGTTAGAT ATTCGGTACC AAACAGCCAC TGGAACTCTT AACGAGGCGG AAGTCGTTGA CGTAAATGGA GTAGAGTATA TCACCTCGAT AACCAAGCCG ACGGTCACGT TTCCGCAGTT GTTAGCCATG GCGGGGATCA ACTCGCTCGA CGAAACAAAT CCCTCAATCG TCGGCGACCG AGGCTCCGTG ACAGGGCTGC CCTATCGCAT GTCGGGCCTG CGCCTGAACG TGAAGGTGTC GTTTACAAAC ACTTATTTTA GTAAACCGTT GAAGACGACG GTCACGGCGA CTTTGTCTGC GGAACAAGTG AAGACGAATA ACTTCAACGC GAAGACGACG GTGACGTATT TGCCTCACCC AACAGATGCC GGGATACATT CCATATCGCG GTACTTCCAC ACGTGGGCCA CGTCAACCGT CACGTTTGTG TCGTCCGGGC ATATCGGCAA GTTCGACTTG TTCGCCTTGG CTCTCGCCAT CACGAACGCT TTCGTGCTAA TTGGCATGGC GACGACCATC GTAGATTTTG TTGGCGTCAT GTCTTCTGAA ACGTTTTTAG ACGACAAGTA CGAAGACGAC GGCGAACGTT TCGGTTTAGA AATGATGCTA GCGAACATCG AGAACGATGA TCACCCGGGA GTGCCGTTCG ATCCCAACGA CTTGCGCTTG AAGGACGCTG CCGGCGACCC TGGCCTGAGC TACGAAAAGA CACTCGAGCA GTTACTCGAC GAGGTGCGGG AAATCCAAGA ACAGCTCAGT CTGCTGCCGG AAGATGAAAA TGAGCTTCGC GCTATCACTA CTGGCCATGC TGAGGAAGAA GAGTATAGAA AGCTGCGCTT GATTTACGTC CCTGACCCGC TCTCAACTGA GGCAAACGAC AAGTCCTACG TTCCTCCAGA GATTTTATTG CACGACGGTC AGCAAACGAT CGGTCGTGGC ATGGGGGGTA TTGAAAACAA GGGCGTGAGT AGACAGCAGT TCTCAATCGC GGTCATCAAG GAAAGGATCC GCATGAAGTC TCTGCACGAA GGACCTGGGG TGTGGCGCCA GAGCTCAGGT CGTTGGGAGA TGCTCCCAGT CGGCAAGGCC GCTGTCTTGA GTGTTGGCGA TCGTTTGTGC TTTAGAATGC GAGAGGGTAA ATTGGGGGGG CACGAAGGCG TGTTCACGCT CGATTTCCAA GACACGCGCA TGGAGTGCAC GGTGTTTGGA ATCCCGTTGC GTTGA
|
Protein sequence | MGFKELLKDS VTEVFTYRTT KLVKTNDKFL VTLHFIFMSL IATFALVSIL LSHNYMLFEL PALYVTTTYQ KFGPDPVTSN VVLAREIAQN ATVDYCNTGY VNWRRGGKVF DDAYMTALEC TAQHAPAEYV WPFDAGNGMT IGTFAEVHSV TRSCTSPGAS TYGAAGCTET QTSPQAYMVV GVDLLSMQLD IRYQTATGTL NEAEVVDVNG VEYITSITKP TVTFPQLLAM AGINSLDETN PSIVGDRGSV TGLPYRMSGL RLNVKVSFTN TYFSKPLKTT VTATLSAEQV KTNNFNAKTT VTYLPHPTDA GIHSISRYFH TWATSTVTFV SSGHIGKFDL FALALAITNA FVLIGMATTI VDFVGVMSSE TFLDDKYEDD GERFGLEMML ANIENDDHPG VPFDPNDLRL KDAAGDPGLS YEKTLEQLLD EVREIQEQLS LLPEDENELR AITTGHAEEE EYRKLRLIYV PDPLSTEAND KSYVPPEILL HDGQQTIGRG MGGIENKGVS RQQFSIAVIK ERIRMKSLHE GPGVWRQSSG RWEMLPVGKA AVLSVGDRLC FRMREGKLGG HEGVFTLDFQ DTRMECTVFG IPLR
|
| |