Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_28121 |
Symbol | |
ID | 5006062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009370 |
Strand | - |
Start bp | 261784 |
End bp | 264638 |
Gene Length | 2855 bp |
Protein Length | 897 aa |
Translation table | |
GC content | 60% |
IMG OID | 640421483 |
Product | predicted protein |
Protein accession | XP_001422022 |
Protein GI | 145355547 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0188] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0542452 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0190346 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGCGGTGCAT GTTGACGCGG AGCGCGCGGT CGTTCGGCGC TTCGAGCGCG TCCGTCGTCG CGCGCGCGTG TCGAGCGATG GCGACGACCG CGCGCGACGT CGCGCGGGCG CGAGGCGCGA CGACGCGCGA CGCGCGCGAG AGAAGTTGGC GAAGACGCGC GATGGGTTCA AATTTCAAAC CGACGACGAC GCGAACGACG CGCGCGACGG CGCGACGCGC GACGCGAACG TACGCGACGG CGACGGGAAG CGCGGAGGAG AAGATCATCG ACGTTGAACT GGCGAGCGAG GCGAAGACGT CGTACCTGTC GTACGCGATG AGCGTGATCG TGGGGCGAGC GCTGCCGGAC GCGCGCGACG GGCTGAAGCC GGTGCATCGA AGGATCCTGT ACGGCATGCA CGAGCTGGGA TTGCGGGCGG ATAAACCGCA CCGAAAGTGC GCGAGAGTGG TGGGAGACGT GCTGGGGAAG TATCACCCGC ACGGGGACGG ATCGGTGTAC GAGGCGCTGG TGCGGTTGGC GCAAGATTTT TCGATGTCGG CGCCGTTGGT GGACGGACAC GGGAACTTTG GGTCGTTGGA CGACGATCCG CCGGCGGCGA TGCGTTACAC GGAGTGCCGA TTGAATAAGT TGGCGGAGAA GGGGTTGTTG GCGGACATCG GGAACGAGTG CGTGAATTTC ACGGAGACGT TTGACGGGAG TCAAACGGAG CCGGAGGTGC TGCCGGCGCG GGTGCCGAAT CTTTTGATCA ACGGCTCGAG TGGGATCGCG GTGGCGGTGG CGACGAACAT GGCGCCGCAT AATCTCGGTG AGTCTGTCGA TGCGCTGTGC GCGCTGGCGA AAAACCCGGA TTGCTCGTTG GACGAACTCA TGGCGTTGCT GCCGGCGCCG GATTTCCCCA CGGGCGGGGT GGTGACGAAT AAGAGCGGGA TGAAGGAAAT TTACGAAACC GGCAAGGGCG GGGTGACGCT TCGCGGGCGG GCGACGATCG AGCGCGTGTC GGCGGCGCGC GGTTCGCTGG ATAAGGACGC GGTGGTGATC AGCGAGATTC CTTACCAAAC CAACAAGGCG AGGTTGGTGG AACAAATCGC CGACCACGTC AACGGGCGCA CCATCGACGG CATCAGTGAT ATTCGCGACG AAAGCGATCG CGATGGCATG CGCGTCGTGA TCGAGATTAA GCGTGGATAT GATGCAGCGA GCGTGCTGGA GGAGCTTTAC GCCAAGACGA AGCTCGAAGT GAAGTTTTTT GTGAACAACG TCGCGCTCAT AGACAACAAG CCGACGGTGA TGCCTCTTCG CCAGATTCTC GACGAGTTCA TCAAGTTTCG CGTCGATACG ATCGAGCGAC GGACGAGATT TATGCTCTCA AAGGCGCAAG ATCGCAAGCA TCTCGTTGAA GGCTTCTCGA TCGTGCTCGC CGACGCGGAT GGAGTGGTGA AGATTATCCG AAAATCGAAA GACGGCCCGT CGGCGTCGAA AAAGTTGCGC GAATCGCACG GTTTGTCCGA CATCCAAGCC GACTCGATTC TCGCCATGCC GCTTCGTCGA TTAACCGGAC TCGAGGCGGA TAAGTTAGAC GCCGAGCTCA AGGAGTTGAA CGAGCAGATC GCACACTTCC AAGGTTTGTT GAGCAACAAG TCAAAGGTCA TCGACGTCCT CGTGCAAGAG GCGATGGAGG CGAAAGAGGC ATTCGCGCGT CCTCGACGTA CGTCCGTCGA GCAAATCGAA TCTTTAAGCG GCGTCGAAGA CGATTCGCCA CCAAAGGATA ATATTTTGAC CCTCTCTGAG CGTGGATACG TCAAGCGCAT CTGCCCGAAG AACTTTGGCG CGCAGAATCG AGGCACTCGA GGGAAGCGCA TGAGCAAGCT CCGCGCCAAC GATGAGCTTT CCAAAGCCAT GCACTGCAAA GACAGCGATC AAATATTATT CTTCTCCGAT CGAGGGCGAG TCCAAAAGCT CAGTGCGAAG GCGATTCCGC AATCCGAACT GAACACGATA GGAGTTCCAG CGACCAGTCT GTTGAACACG TTCGCAAAGC GCAACCAAAA CGTCACCGCC ATGTTGTCGA CGAACATGAA ACAGGGTGAA GTAGCGGACG ACCAAGTGGT GGTGATGTTA ACAAGCCAAG GTAAGGTATC CGTGGCGTCC GCGGCGTCCA TGCTCGGGCA CAAGGGTAAG AAGGTGATCA CGCTCGACAA GGGCGATAGG TTGCAGCAAG TGATGTTCGC GCGCACGTCC GACCATCTCT TCATCACAGG CACCGGGAAA GCTGGAAAAG GGCTCATCCT TCACTGCCGT GTCGGAGACT TCCGAGTCGT GAAGTCGGCG TGCAGGCCAA TCTCGGGCAT CAAAACCATG GGCGAGAAGA AGGTGGCAGA AGCCGTCGTC GAAGACGCGG GCGACGACGA AGACGACGAC GAAGACGACG GCGAACTTTT GCCGCCGAAA ACTGTCGGTA TGGCTATCGT CCCAGGTGAA CGCATGGTGT CCGCGAGCGA AGAATTCGGT CCGTTTATCT TATTTACGAC GAAGAAAGGT AAAGGCAAAG TGGTCGCCGC GAACTCGTAC CGCCTGCTCG GCCGCGGTCG CTCGGGCGTC ATGTGCATGA AATTCAAAAA GGGTGACGAC GACGCCCTGG CCACCATCAC TCTCGTCGAC CGCATCGGCG ACGACGTCAC GGATGAAGTA TTGCTCTCCA CCACGGGCGG AATCTCCAAC CGCATCGCCG TCAACGATTT ACCCAAGCGC TCGGATCCCT TGGCTCTGGG CGCCGCCATC ATCAAGCTCG ACGCCACGGA CGCCCTGAAA TCCGCCAATT TACTCCCGAG CGAAGTCGCG AGCGAGCTCG CGTGA
|
Protein sequence | MGSNFKPTTT RTTRATARRA TRTYATATGS AEEKIIDVEL ASEAKTSYLS YAMSVIVGRA LPDARDGLKP VHRRILYGMH ELGLRADKPH RKCARVVGDV LGKYHPHGDG SVYEALVRLA QDFSMSAPLV DGHGNFGSLD DDPPAAMRYT ECRLNKLAEK GLLADIGNEC VNFTETFDGS QTEPEVLPAR VPNLLINGSS GIAVAVATNM APHNLGESVD ALCALAKNPD CSLDELMALL PAPDFPTGGV VTNKSGMKEI YETGKGGVTL RGRATIERVS AARGSLDKDA VVISEIPYQT NKARLVEQIA DHVNGRTIDG ISDIRDESDR DGMRVVIEIK RGYDAASVLE ELYAKTKLEV KFFVNNVALI DNKPTVMPLR QILDEFIKFR VDTIERRTRF MLSKAQDRKH LVEGFSIVLA DADGVVKIIR KSKDGPSASK KLRESHGLSD IQADSILAMP LRRLTGLEAD KLDAELKELN EQIAHFQGLL SNKSKVIDVL VQEAMEAKEA FARPRRTSVE QIESLSGVED DSPPKDNILT LSERGYVKRI CPKNFGAQNR GTRGKRMSKL RANDELSKAM HCKDSDQILF FSDRGRVQKL SAKAIPQSEL NTIGVPATSL LNTFAKRNQN VTAMLSTNMK QGEVADDQVV VMLTSQGKVS VASAASMLGH KGKKVITLDK GDRLQQVMFA RTSDHLFITG TGKAGKGLIL HCRVGDFRVV KSACRPISGI KTMGEKKVAE AVVEDAGDDE DDDEDDGELL PPKTVGMAIV PGERMVSASE EFGPFILFTT KKGKGKVVAA NSYRLLGRGR SGVMCMKFKK GDDDALATIT LVDRIGDDVT DEVLLSTTGG ISNRIAVNDL PKRSDPLALG AAIIKLDATD ALKSANLLPS EVASELA
|
| |