Gene Information |
Locus tag | OSTLU_33842 |
Symbol | |
ID | 5000903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 660702 |
End bp | 662624 |
Gene Length | 1923 bp |
Protein Length | 641 aa |
Translation table | |
GC content | 56% |
IMG OID | 640416324 |
Product | predicted protein |
Protein accession | XP_001417013 |
Protein GI | 145345003 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | |
|
|
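The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the listed gene sequence contains only whole codons; the function names `span_length` and `codon_count` are hypothetical helpers introduced for illustration and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def codon_count(length_bp: int) -> int:
    """Number of whole codons in a coding sequence of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return length_bp // 3


# Values copied from the record for OSTLU_33842.
gene_length = span_length(660702, 662624)
codons = codon_count(gene_length)

print(gene_length)  # compare with the "Gene Length" field
print(codons)       # compare with the "Protein Length" field
```

Under these assumptions the computed span matches the listed Gene Length of 1923 bp, and the codon count matches the listed Protein Length of 641 aa.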
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCGCC AGAGCGCGCG CGGAGCGATC GGCGCGCGCG TGTTGAACGT GGCTGAGAAA CCATCGGTGG CGAAAGAGAT CTCGCGCGTG CTGTCGAACG GACGCGCGAG CGCGCGCGAA GGGACGTCGA GGTATAATAA AGTCTGGGAA TTCCCGTACG AAGTGCGCGG TCGGCGGGTG ACGATGGTGT TCACGTCGGT GACGGGGCAC TTGAGTAATT TTGAATTCGC CGACGACAGG CACAGACGGT GGAACGGCGT CGATCCGCGA GAGCTGCTCG TGAACGCGGC GGTGGCGAAG CGGGTGCCGG AGGATAAGAG ACAGGTGGCT GATAATGTGA AACGAGAGGC GCGAGGGTGC GATTCGGTGA TTTTATGGTT GGATTGCGAT CGCGAGGGAG AGAATATCGC GTTTGAGGTG CTCGCGGCGT GTCGAGAAGC GAATCGGGGC ATCGCTGCGT TTCGAGCGCG GTTTTCGGCG CTGAGTCGGG GTGATGCCGA CCGAGCGTTG ACGAATCTCG TGGAACCGAA CCAACACGAG TCCAAGGCGG TAGATATGCG CATGGAGTTG GATTTGAGAC TCGGCGCGGC GTTTACGCGA TTCAACACGT TGGCGTTGCA GCGCGCAGGC GTGGGGCTGC CGGTAGACGA TAAAGGCAAA TCGATCGTGT CGTACGGGCC GTGTCAATTT CCCACGCTCG GTTTCATCGT GCAAAGGAAG TGGGACATTG ATGCGCACGT GAGCGAAGAT TTCTGGGCCA TCAAGTGTTC GCATTCTCGC GAAGGCACAA CGACGCAGTT TGAATGGAGC CGCGGAAGGT TGTTCGATCG TGCTTTCGCC TCTGCGTTGC ATGACCTGTG CGTGCGGGCC AACTCTGCCA CTGTCATCGA CGTGGACGGG CAAGAAAGCA AACGTTGGCC ACCGCATCCC TTGAACACCA TTGAGATGCA AAAGCGCTTG AATAGAGTGT TGCGCATTTC CCCCGAGCAA ATTATGAAGA TTGCTGAGGA CTTGTACAAC GATGGTTTCA TCTCTTATCC GCGCACGGAG ACAGACAAAT TTCCAAATGA TTTCGATTAC GATGGAACGC TGCGCGAGAT GCATCAGCAC CCGCAGTTTG GTTTTTACGT CGAGCGATTG ACGACAGGCG GGCAGTTTCG ACGACCTCCG GGCGGCACCA AGGATGACAA AGCGCATCCC CCCATCTACC CAACCAAGCT CGCCACGGAT GCACAGTACG CTCAAATGCG CAACAAAAAC CACAACGCCC CTAAAGTGTA CGAGTTCGTC TGCAGGCACT TTTTGGCCAC ATGTTCATAT CCAGCTGTGG CGTTGAAGAC GCACGTGGAT GTCGACATCG CCGGCGAAAC CTTCCGTGCG ACAGGCGTGA TGATTCGTGA ACGTAACTAT TTGGATATCT ACGGCCCTGG TCCTCCTGAA GGTCCACGGT TAGCGCCGAC TTACGATAAC TGGGGCAATA GCACGCTGCC TGTGTACACC CCAGGAGAAC AATTTGTCCC AACTCTGAAT TTACACGAAG GATCGACGAG ACCTCCAGAT TATCTCAGCG AGGTGGATTT GCTTTCGCTC ATGGAGTCAC ACATGATCGG AACAGACGCT ACGCAGGCGC AGCACATAGA AAAAGTAGTT GGCGAACGAG GATACGCTCG AAAAGTGGGC GATAACAGAT TGATGCCGAC AGAGCTAGGG GAAGCGCTGG TTCTCGCCTA CGATCGAATG GGAGTCGCGG ACATGTGGCT GCCGACGAAA CGAGCAAAGA TGGAAGCGGA TGTAGACGCT 
GTCGCACACA ACCGGATGGA TCCCAACGCA GGTTTGCGAC TGCATTTACA AACTATGCTG CAAGCTTACG ACCGTGTCGC GACTGATGAA AACATGTTGA CAAACACCGT CGGATCGTAC ATG
|
Protein sequence | MQRQSARGAI GARVLNVAEK PSVAKEISRV LSNGRASARE GTSRYNKVWE FPYEVRGRRV TMVFTSVTGH LSNFEFADDR HRRWNGVDPR ELLVNAAVAK RVPEDKRQVA DNVKREARGC DSVILWLDCD REGENIAFEV LAACREANRG IAAFRARFSA LSRGDADRAL TNLVEPNQHE SKAVDMRMEL DLRLGAAFTR FNTLALQRAG VGLPVDDKGK SIVSYGPCQF PTLGFIVQRK WDIDAHVSED FWAIKCSHSR EGTTTQFEWS RGRLFDRAFA SALHDLCVRA NSATVIDVDG QESKRWPPHP LNTIEMQKRL NRVLRISPEQ IMKIAEDLYN DGFISYPRTE TDKFPNDFDY DGTLREMHQH PQFGFYVERL TTGGQFRRPP GGTKDDKAHP PIYPTKLATD AQYAQMRNKN HNAPKVYEFV CRHFLATCSY PAVALKTHVD VDIAGETFRA TGVMIRERNY LDIYGPGPPE GPRLAPTYDN WGNSTLPVYT PGEQFVPTLN LHEGSTRPPD YLSEVDLLSL MESHMIGTDA TQAQHIEKVV GERGYARKVG DNRLMPTELG EALVLAYDRM GVADMWLPTK RAKMEADVDA VAHNRMDPNA GLRLHLQTML QAYDRVATDE NMLTNTVGSY M