Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_16119 |
Symbol | |
ID | 5002997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | - |
Start bp | 351862 |
End bp | 353262 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | |
GC content | 64% |
IMG OID | 640418418 |
Product | predicted protein |
Protein accession | XP_001418910 |
Protein GI | 145348962 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.485455 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0341809 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAAGC GCGCGCGTCG AGCGGATCGC GCGCCCACCG ACGCCGTCAA GCGGCGCATA TGGGACCCGC GCGTGCGCGC CTTGGACGTG GACGACCGGA GGAGCTCGGA CACCCCCACG GGCGTGCGCG TGCGCGGCGT GCGCGAGCGC GCCGACGTCG GCGACGCGCG TCGGGCGGCT GCGCGCGAGC TCGCGCTGTT TCGCGGCGTG AAGCGCGCGA GGAGCGCGCT GCAGCGCATC GCGCGGGCGA GGACGGGGCG GACCGACGAG GGAGGGACGC CGCTGCTGCT GTTTGAGCGT TGGTTGGCGA GGTGCGCCGT CTCGGGCGCG CTCGCGGCGC CGGTGTTGCC GGCGCAGGGG CTCGGTTTGG CGAAAGATTT GATTAGACAC GGGGCGAGCG AGCGAGACGG TGCGGAAGGG GCGGCGGAGG CGCTCGCGGT CGCCAAGGAG AGCGCGGCTC GATGGGCCGA GGCGCGCGAC GACGGCGGAG AGGCGCGCGA CGCCGTCGTC GTTCGCGACA AAGGTGCTTT CTTGACGATG CAATTGGGAA CAGAGAAACC GTACGTGAAG TGTGCGAAAG CACATCTCGG TAAACTGCGC GCTTTATATT GCAGAACAGT GCGAGGCTGT GAACCGTTGA CGGAGGACGT CGATTCAGAC GAGTACCAAA AGTTCGCGTG CGCCGTGTTC GCATTGTTGA TGCGATACGA ATCGCTGGGC GGGGCTGGAT ACCAAAGCGC GCTCGCCGAG GATGCGTTCG ATGTTTTGAA CGAAAAGTTG GGCGTGTCGT GCGAGTGTTT CGCGTCGCCG CTCAACGCTC GGTACGGGCA ATTTTGTTCG CAATTTGGTT TTGACGAGGA CCGCGCGCCA GACGTCGACG CGTTCTTCGG ATCGCTCGGA AGCTTTTTTA GCGACGACTT TGCACCAAAA CGCGGATCGT TCGAGATGAA CCCGCCTTTC GTCCCGGAAA CGATGTCGCG CGCGGTCGAA AAGGCGAACG ATTTGCTCGA TCGCGCCGCG AACGCGAACG AGGCGCTCAG TTTCGTAGTC ATCGTACCGC TGTGGAAGGA ATGCCATTAC TGGAGTGCGC TCTTGGAAAG TCGACATCTG CAGCACGGTC CAGACATCAT CGATGCGCAG TCTCACGGCT TTTGCGACGG CGCCCAGCAC GCTCGTCCGA GTCACGAGCG CCATCGCGTG TCGAGTTTCG ACACGGGCGT CTTCTACCTG CGAACGTCGC GCGCCGAGCG CGAGCGACCG GTGGATGAGG AAATCAGAAA GCGCGTGTTG CGCGGCATGA AGACCGCGTT GGGGTCGTGC AAAGACGTGC AAGAGTTGGA AGTGCGATAT CGCGGCGAGC GGGCGCGCGG CGGGCCCGCT AAAATAGAAG ATAGAAAATA G
|
Protein sequence | MPKRARRADR APTDAVKRRI WDPRVRALDV DDRRSSDTPT GVRVRGVRER ADVGDARRAA ARELALFRGV KRARSALQRI ARARTGRTDE GGTPLLLFER WLARCAVSGA LAAPVLPAQG LGLAKDLIRH GASERDGAEG AAEALAVAKE SAARWAEARD DGGEARDAVV VRDKGAFLTM QLGTEKPYVK CAKAHLGKLR ALYCRTVRGC EPLTEDVDSD EYQKFACAVF ALLMRYESLG GAGYQSALAE DAFDVLNEKL GVSCECFASP LNARYGQFCS QFGFDEDRAP DVDAFFGSLG SFFSDDFAPK RGSFEMNPPF VPETMSRAVE KANDLLDRAA NANEALSFVV IVPLWKECHY WSALLESRHL QHGPDIIDAQ SHGFCDGAQH ARPSHERHRV SSFDTGVFYL RTSRAERERP VDEEIRKRVL RGMKTALGSC KDVQELEVRY RGERARGGPA KIEDRK
|
| |