Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_24762 |
Symbol | |
ID | 5003051 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | + |
Start bp | 140708 |
End bp | 142406 |
Gene Length | 1699 bp |
Protein Length | 348 aa |
Translation table | |
GC content | 63% |
IMG OID | 640418472 |
Product | predicted protein |
Protein accession | XP_001418620 |
Protein GI | 145348364 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00131529 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.027105 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGTTCCGTGC GACGCCTGCG CGCACGGCGT CGCGCGCCTG TACTGCGCCG CGGACGACGC GAAATTGTGC CTGCGATGCG ATCGAACGGT GCGCGACGCG CGAACGCGAA CGCGATTTTC GACACGCCGC CGGAAAGCGG GCGAGGGCAC TTTACGCGGG AACGACGCGC GGCGCGCGCG CGCGCGCGAC GAGGACGCGC GCGGGGGCGG GATTTCGGTG ACGCGTATTT TGTGCCACCG TCGCGACGCG TCGAGGAGGT CGACGACGCG CGTCCGCGGG ATCGCGCGAC GACGTTCGTC GCGGAGGCGT TCGTTACGAA TGCGTGAATT GTCGACGACT GACGACGACG ACGACGACGC GATGGACGCG CGAGCAGGTG CACAACGCGA ATAAATTCGC GGAAAAACAC TCGAGGCGTT GGTTGTGCGA GATGTGCGCG CACGCGTGCG CAGAGGTGCG AGAGAGGCGA GGCGAGGCGA ATTCGAACGG CGGTTGAGTA GTTAGGGCTC GGACGAGGAC GCGGCGAGAC GCGAGAAGGC GAGGGCGAAG GCGCGCGCGC GAACGGGGAC GCGATGGGGC GTTTCGAACT CGCGAGAGAC GATGTGCGAG GATGTGGTGT CGACGAGAGA CTGACGAAAC GTTCACGTCG CGCGACGACG ATCGGCGGAA ACGTAGGTTA TGCGCGAAAA CGAAGACGGC GTCACGGGGA CGACGACGTG CGAGGCGTGC GATCGAGCGC ACGGTTCGGG ACGCGCGGTG GCGCTCAGAG CGTTTCACGA ATGTGCGATG TCGGATGAGG CGTATCGCGC GGTCGGGAAA GGGATGAAGC AGACGCGGCG GGGGAGGCCG TCGAATAAGA GCAAGTCGGA GGTTCAAGCG CTCGACGCCG AGGTGTCTGA TGAAACGTTT TGGAGCCTGG TGAAAAAGTC GCAAATTCGT CCGAGCGAAG AGCCCTCGTT CGATCGAGTG GACCTTTTTA GCGGCGACGA CGGTCGGGGT CAAATGACGA GCGATGATAA GGACGATGAA GCGTTCATCG ACGGTTTGAT TCGCATGCCG TCGTTTACGA TGCTCGACGC GGAGATGAAC GTGCGAGTGG ACGGTTCGGC GGCGAATTTC GGGACGCCGT CGCCCGCAGA AAGCTCGGGC GCACCGGTGC GACAGGCGTA TAACAGTATT TTCGAATCGA GTCAGGCTAC GGGGCAAATG TCGCGCGCCC GTGCGACGTC TCAATCGGAC AGCGCCGCCG ACGGCATCGA TTCCATGTTC ATGCCGGGGG GCGTGAGATC GAGCAACGAG CACGTAGAAG AGCTTTTGGC GCCGCGAGTG GATGGCGAAA GAAACATGTT CTACGACATC GACGCCGCGG GACAGTGGCG AATGAAGAAA CCTAAGAAGA GTGGGCGTCC CAAGAAGATG TCCGCCGCCC GCAACGCGCG CAAACCCGCG GCGACGAATT CGTCGGCGGC GTACGCCGCG AACGCGCCAG ATCATTCTGC CAAACTCCCG GGCGGTGCAT CTCGTCAACA AGTGCTCGAT CGGTACCACG AGAAGCGTAA GGCTCGCACA TACGGTAAAA CAATCCACTA CGAGGCGAGA AAAGTCCGCG CCGAAACGCG CGTTCGAGTC GGTGGTCGAT TCGCCAAAGC GGAAGACAGA TCTTGCGACG TCTCAATCAA GGGTGCCGTA CCCGCGTGA
|
Protein sequence | MCAHACAEVM RENEDGVTGT TTCEACDRAH GSGRAVALRA FHECAMSDEA YRAVGKGMKQ TRRGRPSNKS KSEVQALDAE VSDETFWSLV KKSQIRPSEE PSFDRVDLFS GDDGRGQMTS DDKDDEAFID GLIRMPSFTM LDAEMNVRVD GSAANFGTPS PAESSGAPVR QAYNSIFESS QATGQMSRAR ATSQSDSAAD GIDSMFMPGG VRSSNEHVEE LLAPRVDGER NMFYDIDAAG QWRMKKPKKS GRPKKMSAAR NARKPAATNS SAAYAANAPD HSAKLPGGAS RQQVLDRYHE KRKARTYGKT IHYEARKVRA ETRVRVGGRF AKAEDRSCDV SIKGAVPA
|
| |