Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_43237 |
Symbol | |
ID | 5005512 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | + |
Start bp | 242092 |
End bp | 244953 |
Gene Length | 2862 bp |
Protein Length | 918 aa |
Translation table | |
GC content | 61% |
IMG OID | 640420933 |
Product | predicted protein |
Protein accession | XP_001421245 |
Protein GI | 145353917 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1525] Micrococcal nuclease (thermonuclease) homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.31647 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACCG GATGGCTCCG AGGCGTCGTC AAAGCCGTCC CCAGCGGTGA TCAAGTCATC ATCGCCGCAC CGTGCGCGCC AGGGGTGCGT TCGCGCGCCG TACCGACCGA TTTCGTCGCG CGATGTCGAC GCGACGGTTG CCCGATCGCG CGAAACTGAC CACTCTCGCG ACGATTCTCG CGCTCGCAGG CCCCGCCCGG CGTCGAAAAA ACGCTCACGC TCGCCGGTAT CGTCGCGCCG CGTCTCGGTC GCCGCGACGG GTCGAGCGCG GACGAGGCGT TCGCGCGCGA GTCGAGGGCG TCGCTGCGGC GCGCGCTCGC GGGACGACGC GTGTCGTTTC GCGTCGAGTA CGCGGTGGAG TCGATAAATC GCGAGTTCGG CGTCGTGTTC ACGGAGAGCG GGGAAAACGT GAGCGTGATG CAAGTGTCTA AGGGGCTGGC CAAGGTGAAG GCGCCCGGGG GGAACGATCG AGCGGTGGCG AACGCGGAGG AGTTGGAACG ACGCGAACTC GAGGCGCGAG AAGCCGAGGC GGGGATGTGG AGTAAGGATC CCGCGGTGCT CGCGGCGGCG AGTCAGCGAA CGGTCGTGCA GGCGATGAAA GCGGAGGACG TGCTGGGTGC GTTGCGGATG AAACCGACGC CCGCGGTGGT GGATTACGTG CTGAACGGTG GGACGGTGAA GCTTGTGCTG ACGGGGGACG GCGCGACGCG CGATCAGAAT ATCACGTTGT CTATCGGTGG GATTTCAGTG CCGTCCGTCG GGCGCAAGGG GGCGAAGAAC GAAGATGGGA CAGATCAAGG TCCAGAGCCG TTCGCGCTCG CGGCGAAGCA TTTCACGGAG ATGGCGCTCC TGCATCGAGA CGTGCGGGTG ATTTTGGAAG GTCTCGATCG TCGTAATAAT TTCATCGGTT CAATCTTGCC CGCGGACGTG AACGATACGT CGTTCGTGAA CGTCGGCGAA GAGTTGTGTC GGCTAGGTCT CGCGCAAGTG CACGAGGCGA GTGCGGCGGC GTTGATCGGT GGCGCGGCGA CGCTTCGCGC GGCGGAGAAG ATGGCCAAGG ATCAGCAGTT GCGACTTTGG CATGGATACG TCCCGCCAAT ATCTTCCTTG AACGCGATGA CGACGAAAGT CTTCGATGCC AGAGTAGTAG AAGTCATCAG CGGTGATTGC ATTTCCGTGG TGCCGACGTC AGGGCCGGAT ACGTCTGAGA GACGAATCAA TCTGTCGTCG ATTCGGGCGC CTAGAATTTC CAACTCACGA GATGACAAGT CCAATCACGA ACCTTGGGCG ATAGAGGCAA AAGAGTTTTT GATCTCGCGT CTGATCGGGC GCACCGTATC GATTAATATG GATTACGCAC GCAAGATTGG AGAAGGTGCG AACGAACGAA CGTTGCACTT CGCCACGGTG AAGCTGCCAA ACAACAAGAC GGGCGGTGAC CCGCTCAACG TTTCAGAGAT GCTTCTCATG CGCGGTTTCG CGTCGTGCAT TCGTCACCGT TCTGAGGAAG AACGTGCGGC AGACTACGAT GAGCTCATCG CGGCGGAAAA GAAGGGCGTG GAGAGCAAAA AGGGAATGCA CAACAAGAAT CGCGAGGCGC CTGTACACAG GACGAATGAT TTTAGCATCA ACGCGCATAA GGCGAAGACG TTTTTGCCGT TTTTGCAACG CGCGGGTAAG TGCGTCGCTA TGGTAGACTA CGTCGCCGCT GGACACAAAA TTCGAGTTTC AATTCCCAAA GAAGGCGCGG TGATCGCCTT TTGCTTGGCG GGCGTTCGCT GTCCCCAGCG CGACGAGCCG TACGCCGCCG AGGCGTTGGC GTACACGCGT TCTCGAATTC TTCAGCGAGA GGTGGAAATC GTGGTAGACT CCGTGGATAG AACTGGAATT TTCCTTGGCA CCTTATTTGC GGACAACGGG CGATTAAATC TCGGTGAAGA ACTCCTTCGA GCCGGATTAG GAAGCTTGCA CCCGGCGTTC CCGGTGGATC GCGTTCACTA CGGTCGCGCG CTCGCGGACA TTGAAGCCGC GGCACGGGAA GTCAAGGCTG GTTTATGGAA AGACTGGACC CCTCCGATCG TCGAAGTAGA CGGGCCTGAG GATAGTTCGA CCGGCGAACT CGTGCGAGTC GGCGTCACCG AGTGCGTCGC CGGGGGCCGA TTCTTCGTGC AGAAGTTAGA TGGGAGTAAG ATTCAAGAGG TCACGGACAA ACTCGCCGAG CTTTACGACG GCGTGGACAC GAGCAAGCCG CACGATGGCG TGTTCGAACC AAAGCCTGGC GATGCCGTCG CCGCCAAGTT CACCGGAGAT GACAAGTGGG CGAGAGCCAT CGTCACCGCG AAGCGCGTCG GTGATAAGCC CGTCAGCGTC TTCTACTGCG ACTTTGGCAA CGTCGAGGAC ATCGGTTTCA ATCGTCTTCG ACCTTTGAAG GATCCAACGG TCACCACAGT TGCTATCCCA CCCATGGCCA ACTTCTGCGC GCTTTCCTTC CTCAAGATTC CTCGCATCGA TTCCGATTAC GGCTACGCCG CCGCTTCGCA CGTCGGCAAA CTCATCTCTG GCCAGGCTTT CCACGCCCGA ATCGACGCCC GCGATCGTTT CCCCACCACA AAACCATGGG AAATCGACGC ACAGCCCGCG TTCTCGCTCA CATTATTCCC CGACGCCAAC GCTCGCGCCG CTGAATCCGT CGCCCTCGAC CTCCTTCGCG CCGGCTTTGC GCGCGTCCAC CGCCGCGCCG CCGCCCGTCG TCTCGATCGC GACGTCTTCG ACGCCATGGT CGACGCCCAG GAGTCCGCGC GTCGCGCGAG GGTCGGTCAG TGGGAGTACG GCGACGTCGA TTCCGACGAC GACGCGTCTT AG
|
Protein sequence | MSTGWLRGVV KAVPSGDQVI IAAPCAPGAP PGVEKTLTLA GIVAPRLGRR DGSSADEAFA RESRASLRRA LAGRRVSFRV EYAVESINRE FGVVFTESGE NVSVMQVSKG LAKVKAPGGN DRAVANAEEL ERRELEAREA EAGMWSKDPA VLAAASQRTV VQAMKAEDVL GALRMKPTPA VVDYVLNGGT VKLVLTGDGA TRDQNITLSI GGISVPSVGR KGAKNEDGTD QGPEPFALAA KHFTEMALLH RDVRVILEGL DRRNNFIGSI LPADVNDTSF VNVGEELCRL GLAQVHEASA AALIGGAATL RAAEKMAKDQ QLRLWHGYVP PISSLNAMTT KVFDARVVEV ISGDCISVVP TSGPDTSERR INLSSIRAPR ISNSRDDKSN HEPWAIEAKE FLISRLIGRT VSINMDYARK IGEGANERTL HFATVKLPNN KTGGDPLNVS EMLLMRGFAS CIRHRSEEER AADYDELIAA EKKGVESKKG MHNKNREAPV HRTNDFSINA HKAKTFLPFL QRAGKCVAMV DYVAAGHKIR VSIPKEGAVI AFCLAGVRCP QRDEPYAAEA LAYTRSRILQ REVEIVVDSV DRTGIFLGTL FADNGRLNLG EELLRAGLGS LHPAFPVDRV HYGRALADIE AAAREVKAGL WKDWTPPIVE VDGPEDSSTG ELVRVGVTEC VAGGRFFVQK LDGSKIQEVT DKLAELYDGV DTSKPHDGVF EPKPGDAVAA KFTGDDKWAR AIVTAKRVGD KPVSVFYCDF GNVEDIGFNR LRPLKDPTVT TVAIPPMANF CALSFLKIPR IDSDYGYAAA SHVGKLISGQ AFHARIDARD RFPTTKPWEI DAQPAFSLTL FPDANARAAE SVALDLLRAG FARVHRRAAA RRLDRDVFDA MVDAQESARR ARVGQWEYGD VDSDDDAS
|
| |