Gene Information |
Locus tag | OSTLU_31572 |
Symbol | |
ID | 5001982 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 153023 |
End bp | 154710 |
Gene Length | 1688 bp |
Protein Length | 536 aa |
Translation table | |
GC content | 60% |
IMG OID | 640417403 |
Product | predicted protein |
Protein accession | XP_001417933 |
Protein GI | 145346927 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00756] pentatricopeptide repeat domain (PPR motif) |
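
The Start bp, End bp, and Gene Length fields above agree with each other if the coordinates are read as 1-based and inclusive. That convention is an assumption here; the record does not state it. A minimal Python sketch of the check, where `span_length` is a hypothetical helper name:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp fields of this record.
length = span_length(153023, 154710)
print(length)  # 1688, matching the Gene Length field
```

Because the record flags the gene as spliced, this genomic span may include intronic bases, so it need not equal three times the protein length.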
Plasmid Coverage Information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.314666 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.270485 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGGCGG TGGATGGGGC GGTGATGCTG CGCGCGGCGG ATACCGGAGA CGCGCCTGAG GCGATTTCGG CGCTGCAGCG GATGAAGAAG GCGGGCGAGC GAAACGGCGT GGCGGTGCCG CTGGCGTTTT ACAACATGAC GCTGCGAGCG TGCAAGCGCT CGCGTCCGCC GGCGAGCGCG GACGCGACGA GATTGCTGCG CGAGATGCGA GAGCACGGCC CGGGTCCGGA TGCCAAGACG TATCACGAGG TGATCGCGGC GTACGCGCGT GCGATTGAGT GGAAGCTCGC CGAACAAACG TTTGAGGAGA TGAAGCGCGA CTTCAGAGGT CGCGGGCCCA CGTGGCACCC GAGCGTGCGC GTGTACACGT CCCTCATCAG CGCGTACGGC AAGGGCGGAC AATTCGAAAA GGCGAGCGAG TTATTCGAAA GCTTGTTTGC GAGCTCGCAC GTGCAATTGG ACACGGGTGT GTACAACGCA TTGCTCTCTG CCGCGGTGAA CTCGGGCCGT TACAAGGACG CCGCCGCCGT GTTTGAGCGC ATGCAAACGG AAGGTGTCAG GCGAAACGTG ACGACATACA ACGGTATGTT GCAATCGCTC GGAAGGCAGC GACGCATTCG TGACATGGAA AATATGAGCC AATCCATGCA GCGCGCGGGG GTCATGCCGA ACGAAACCAC GTACAGCGTG TTAATCACCG CGCACGGCAA TAGCGGCAAC ATCGATCGGG CTCTCGAGCT CCTGCATCAA GTCATCATTG CCCCGCGTTT GCACGCGACG GCCGTGATAT TCAACAGCGC GCTCGGGGCG TGTGTCAAGG CGGGTAATCT TGAAGGCACG CAACGGGTTT TACGAGTGAT GGAGACGGAG GGCGTGCGAT CGACTCTCGT CACGTACAAC ACGCTCCTGA TGGAGGCGAG CGCAGAGCGT GACTGGGTGC GCGCGACGAA AATATATAAA GAACTTCTTC TTTCGGGATT CGCGCCGGAC ACCATCACAC TCGATTGCTT GTGCGGTATT GAAAAGCTTC AGGCGTGTCG CGAGGAAAGG CTTCGCGAGG AAATCAAGCG AGCCGAATTG GAGGGCATCG ATTTGCCCGA ATTCGAACGC ACGTGCGACG TGAGCGATAG TCCAGTCGGG AACCTCCCGG TTCTTATACG CGCGCTCGCC GACGATCGCG AGCTCGAAGA AGTTCCAGGA TGGCGAGGGT TCGTTTCCGA CGCCCTGCTC CGAGTGCTTC ACGTTAACAA CGAGTACGCC GAGGTGGAGG ACACATTCAA ATACATGCTG ACGAGCGACG TCACGCGCAC CGTGCACACG TACAACTCGC TATTGATTTC GTATGAGGCT CGTAAAGAGT GGCAAAAGGC GGGCGAGGCG ATGACGCAAA TGACGAGTGA AGGGATCGTG CCAAACGCGC TCACGTTTGA CGCGCTCATC GATGTTTGCG AGGAGATGGG TCAATGGGAT CGCGCAACGA CGTGGCTCGA ACAAGCTCAA GCGGCTGGGC ACTTCCAATG CGAGGACGAT CTCGGTGTTT TAGACCTGCA CCGTATTCGT TCCGCCGGCA CCGCGCAGTG CGTCTTACGT TGGTGGTTGC GCCGAATGCG TCAACGCGCT TTGGCACCGC TCGACGTTCG AGCCGCGGGC AAAGGAACGC GTGCGCTCGT ATCAGGGTTG AAGAATAA
|
Protein sequence | MWAVDGAVML RAADTGDAPE AISALQRMKK AGERNGVAVP LAFYNMTLRA CKRSRPPASA DATRLLREMR EHGPGPDAKT YHEVIAAYAR AIEWKLAEQT FEEMKRDFRG RGPTWHPSVR VYTSLISAYG KGGQFEKASE LFESLFASSH VQLDTGVYNA LLSAAVNSGR YKDAAAVFER MQTEGVRRNV TTYNGMLQSL GRQRRIRDME NMSQSMQRAG VMPNETTYSV LITAHGNSGN IDRALELLHQ VIIAPRLHAT AVIFNSALGA CVKAGNLEGT QRVLRVMETE GVRSTLVTYN TLLMEASAER DWVRATKIYK ELLLSGFAPD TITLDCLCGI EKLQACREER LREEIKRAEL EGIDLPEFER TCDVSDSPVG NLPVLIRALA DDRELEEVPG WRGFVSDALL RVLHVNNEYA EVEDTFKYML TSDVTRTVHT YNSLLISYEA RKEWQKAGEA MTQMTSEGIV PNALTFDALI DVCEEMGQWD RATTWLEQAQ AAGHFQCEDD LGVLDLHRIR SAGTAQNACA RIRVEE
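
Both sequences are displayed in space-separated blocks of ten characters, so the spaces have to be removed before any length or composition check. The sketch below joins the blocks and computes a GC fraction. The helper names `join_blocks` and `gc_fraction` are hypothetical, and whether the GC content field of this record was computed over exactly the displayed span is not stated, so the result may differ slightly from the reported value.

```python
def join_blocks(display_seq: str) -> str:
    """Remove the whitespace separating the display blocks."""
    return "".join(display_seq.split())


def gc_fraction(display_seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = join_blocks(display_seq).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence in this record, used as a short demo.
prefix = "ATGTGGGCGG TGGATGGGGC"
print(len(join_blocks(prefix)))
print(gc_fraction(prefix))
```

Pasting the full Gene sequence field into `gc_fraction` gives a value that can be compared against the GC content field, and `len(join_blocks(...))` can be compared against the Gene Length and Protein Length fields.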