Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_33669 |
Symbol | |
ID | 5003858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | - |
Start bp | 608032 |
End bp | 609078 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | |
GC content | 62% |
IMG OID | 640419279 |
Product | predicted protein |
Protein accession | XP_001419853 |
Protein GI | 145350946 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3145] Alkylated DNA repair protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.421608 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0171071 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGACG CGCGAGACGC GACGGGCGGC GAAGGCGCCA GAGGGCGGTG TCAGAAGCTC GCGAAGGCGT TGCGGGGACA GCTGGCGAAG AAAGCGTACG ACGCGCTGCG CGCGCACGCG CGGGCGTTTC AGCGCCGGGA GGTGACGACG TCGGAGTTCG CGAAAGTATT GGTGGAGTGC GCGCGGACGG GGGGGGTGAC GAGGGCGACG GCGCGAGAGG TGATCGCGAC GACGCCGTCG GCGTTGGACC GGAGGCGGCT GCGAGAGTGC GTCGGACGGG ATTTAGACGA CGACGCGGCG GTGAAATCGA CGCGTGAACG AGAGAAAAAG CCGCACGGAT TTAAGACGGA GCGTCTCGGA CCAGGATTGG TGTGCTTGAG GAAGTTTTTG AGCGTGGAGG CGCAAATGTG GTTGGCGAGC GAATCGTTCG CGCTCGGCGA ATCAGGATCC GACGACGCCG CGCGCGGGCA AGGTTTCTTC GCGAAGATGG GAGATGGGAC GTTCAAGCTG AATCAAGGTA GTCGTGGGCG GATGATCCTC GAACCCGATG CGTTTCCAGA CGGGATTTTG ACGCAGATGT GCGAGGACGC GGTGGCGGCG GCGTGCGCCG CGGACGCCGA GATGCCGACA AACATGAACC CGACGACGTG CCTGGTAAAC TTTTACAAAG ACGGCGCCGA GTTCAAGTGG CACAAAGATA GTGAAGATCC AAAGCTCGTA AAATCGCGCA CGGGTCCGCC CATCGTGAGT TTCTCCGTAG GGCTGAGCGG CGACTTTGGG TACAAATATT CGTTTGACGA TCCCGAGCAC AAAGTCGTGC GCCTGAACTC GGGCGACGTC TTGCTCTTCG GCGGCCCTTC GCGCATGATC GTGCACAGCG TGTTAAACGT GTACCCGGGA TCGATGCCCG GTCACTTGCG TGGGAAAATG CTCAACGGTC GCTTGAACGT CACTGTGCGA GACATCGGTT GCGGCGTCAT CGACGCCAGC CAATTCCCGG CGTACAGAGT CTCCTACGAC GGCGTCCAGG CCGACGGCAA CGTCTGA
|
Protein sequence | MDDARDATGG EGARGRCQKL AKALRGQLAK KAYDALRAHA RAFQRREVTT SEFAKVLVEC ARTGGVTRAT AREVIATTPS ALDRRRLREC VGRDLDDDAA VKSTREREKK PHGFKTERLG PGLVCLRKFL SVEAQMWLAS ESFALGESGS DDAARGQGFF AKMGDGTFKL NQGSRGRMIL EPDAFPDGIL TQMCEDAVAA ACAADAEMPT NMNPTTCLVN FYKDGAEFKW HKDSEDPKLV KSRTGPPIVS FSVGLSGDFG YKYSFDDPEH KVVRLNSGDV LLFGGPSRMI VHSVLNVYPG SMPGHLRGKM LNGRLNVTVR DIGCGVIDAS QFPAYRVSYD GVQADGNV
|
| |