Gene Information |
Locus tag | OSTLU_4352 |
Symbol | |
ID | 5004830 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | - |
Start bp | 117077 |
End bp | 118000 |
Gene Length | 924 bp |
Protein Length | 277 aa |
Translation table | |
GC content | 59% |
IMG OID | 640420251 |
Product | predicted protein |
Protein accession | XP_001420761 |
Protein GI | 145352879 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 0.63409 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.270845 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAGC TTCCCGAGGT CGAAAAAGCT CGTCGCCTGG TGCACGACCT CGCGATCGGA TCCCCGATCT CGCGCGTCCA TCGACCCATC ATCGACGACA AAGTTTTTGT CGACGTCGCA TCGGGACAGT TTGAGCGCGC GCTCTCGGGA CGGAAAATCA CCCACAGTAA GCGCCACGGT AAGCAGTTGT GGTGGCAGCT CGACGGGAAC GACGCGCTCG TTCCGTGCTT TCACTTCGGG ATGACGGGGG CGTTCGTGGC GCGAGGAATC GATGGGATTC AGTATTACAA TAGCAAGGCG AGCGGAACGG GCGACTGGCC GCCGCGGTTC GCCAAGCTCG TCGTCGCGTT CGAGAACGGC GTCGAGCTCG CGTTCGTAGA CCCGAGAAGG TTTGGAAAGA TCAAACTCGT CGCGGACGTC GCGGAGGTGA TCGGCCAACT CGGGCCGGAT CCGCTGTTGG AGATGCCGAA CGAGGAGGCG TTCGCGGCGC TATGGCGACG AAGGAGCGCG CCGATAAAGA CGGCCATCAT GGACCAGAAG GTGATCGCTG GGATAGGGAA TTGGATGGCG GACGGTGCGT CGAATCGAGC GATCGCGATC GCGAGGCGTT TTCTCGAATG ACTCGGCGTT TGATTGACAT TGAGAATATT GGTTTCGTCG AGCGTAGAAA TTTTATACCG AGCGCGAGTG CATCCGGAGA CTCGAGCGAA CGAGTTGAGC TCGACGCAGC TCGAAGCGAT TAGATTTCGC GTCACAGAAG TCGTCAAAGT AGCGTGCGAG GCAAACTCTG ACCACGATTT GTTTCCAGAC GACTGGTTGT TCCACCATCG CTGGGGGAAA ACCGGCGGAG CGAAAGTCAA CGGAGACGCG ATTAAGTTCA TCGAAGTCGG CGGTCGCACC ACCGCCTTCG TGCCCAAACT TCAG
|
Protein sequence | MPELPEVEKA RRLVHDLAIG SPISRVHRPI IDDKVFVDVA SGQFERALSG RKITHSKRHG KQLWWQLDGN DALVPCFHFG MTGAFVARGI DGIQYYNSKA SGTGDWPPRF AKLVVAFENG VELAFVDPRR FGKIKLVADV AEVIGQLGPD PLLEMPNEEA FAALWRRRSA PIKTAIMDQK VIAGIGNWMA DEILYRARVH PETRANELSS TQLEAIRFRV TEVVKVACEA NSDHDLFPDD WLFHHRWGKT GGAKVNGDAI KFIEVGGRTT AFVPKLQ
|
| |
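
The length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates (an assumption, though it is consistent with the listed Gene Length) and that each residue of the listed protein uses three bases. The variable names and the `gc_percent` helper are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 117077, 118000
protein_length_aa = 277

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Bases needed to encode the listed protein (three per residue).
coding_bp = 3 * protein_length_aa

# Bases in the gene span that are not translated into the listed protein.
# The record marks the gene as spliced, so these are presumably intronic.
untranslated_bp = gene_length_bp - coding_bp


def gc_percent(seq: str) -> float:
    """Return the GC content of a nucleotide string as a percentage.

    Spaces (used in the record to group bases in tens) are ignored.
    """
    bases = seq.replace(" ", "").upper()
    if not bases:
        return 0.0
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


print(gene_length_bp, coding_bp, untranslated_bp)
print(gc_percent("ATGCCCGAGC"))  # first ten bases of the gene sequence
```

Passing the full Gene sequence string to `gc_percent` gives a value that can be compared with the GC content field. The spaces between groups of ten bases do not need to be removed first.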