Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_16988 |
Symbol | |
ID | 5004221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | + |
Start bp | 117181 |
End bp | 119175 |
Gene Length | 1995 bp |
Protein Length | 664 aa |
Translation table | |
GC content | 55% |
IMG OID | 640419642 |
Product | predicted protein |
Protein accession | XP_001419907 |
Protein GI | 145351064 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1793] ATP-dependent DNA ligase |
TIGRFAM ID | [TIGR00574] DNA ligase I, ATP-dependent (dnl1) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.606353 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.523448 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAAGG CGAAGGCGAA GGCGAAGACC AAGGCGAAGA AGGAAGAACC GGCCACGGCT GGGCTCGGAG ACAAGTCCGT GGAGGCGGCG GAGCGGTATT TGGAGTATGA TCCGGTGTCG AAGGTGACGT GGAAGCTCGG CGACGCGACG CCGTATTTGT ATCTCGCGAA TATTTTCGAG TCCATCGCGG AAACGACGAA AAGATTGGAG ATCGCGGAGC TGCTGACAAA TGCTTTCCGG ACGATTTTAG CGAGTAAACC GACTGATCTC TTGGCGGCGG TGTACTTGGC GTCGAACACC ATTCATCCGC AGCACGAAGG TATCGACCTC GGCATCGGGG ATGCGACGCT CATCAAGGCG CTCGCTGAGG CCACGGGGCG GAAAGACGAA TCGATTAAGA ATGACTATAA GGAAGCTGGC GATCTTGGAA GCGTGGCCAT GGCGAGTCGT TCGACGCAGC GCATGATGTT CCCGCCGCCA CCGTTGACGG TGACGGGCGT GCTCAAGGAG TTTAGAGCCA TCGCCACGAC CGATGGCGAA AAGAGCGTCG ATCGCAAGAA AGGTATGATC AAGAAGCTCC TCGTCTCCGC GCGCGAGTGC GAGGCGGGTT ACGTCATTCG CTCGCTCCAA GGTAAACTAC GAATCGGCCT CGCTCAACAG ACGGTGACGC AAAGTCTCGC GCACGCCATC GTCTTGCACG GCGACGCCGG AAAGAAGAAG AAGGGCGCCG AGCTCGCCGA TGCCTTGGGT GCAGCATTCG ACGTGCTCAA GCAAGTATTT AGCGAATGTC CGACTTTTGA TCAAATCGTC CCGGCGCTAC TTGAGGTTGG TATCGAAGGC TTGCAAGAGC GTTGCAAATT CACCCCGGGT GTCCCCGTCA AGCCGATGTT AGCCAAGCCT ACGACTGGGG TGTCTGAAGT GTTGAAGCGC TTCGAAGACA TCGCGTTTAC GTGTGAATAC AAGTACGATG GAGAGCGCGC GCAGATTCAC CTCCAAGAAA ATGGCAAGGT GTCCATCTTC TCCCGCAACC AAGAGGACAA CACGGCCAAG TTTCCCGATC TCATAAACGG TCTCAAGCGA TACATCAAGC CTCATGTGAA GTCTGTCGTC ATCGACTGCG AAGCCGTTGC ATACGATAGA GAACAAAACA AGATCTTGCC GTTCCAGATT TTGAGCACGC GAGGAAAGAA AAACATCGTT GAATCGGAGA TCAAAGTCAA AGTCGCTCTC TACGCGTTTG ACTGCCTGTA CCTCAACGGC GAACCGCTGT TGCGCGAGCC GATGCACAAG CGCCGCGAAG CACTTTACAG CGCATTCCAA GAAGTTCCCG GCGAATTCTT CTTCGTCACC GAGAAGACAT CTCGCGATAT CGATGAACTG CAAGGATTTT TGGACGAATC TATCGCAGAG AACACCGAAG GTTTGATTGT CAAGACGCTC GACGCGACGT ACGAACCGTC CAAGCGCTCC CTCAACTGGC TTAAGCTCAA GAAGGATTAC ATGGAAGGCT GCGGTGACTC ATTAGATTTA GTTCCAATCG GCGCTTGGCT CGGACGCGGT AAGCGTACGG GCGTTTATGG CGCGTACTTA CTCGCCTGTT TCGACGAAGA CGGCGAAGAG TACCAATCCA TCTGTAAAAT AGGCACCGGC TTCAGCGAAG TCATCCTCGA GGAATTGGCC AACGCCATGA ACCCACACGT CATAGACGGT CCGCGTTCGT ACTACAAGGT ATCCGACGCC ATGAAACCCG ACGTGTGGTT CGAACCGAAG CAAGTGTGGG AAGTCAAAGC CGCCGACTTG TCAATCTCTC CGGTTCACCA AGCCGCGTGT GGATTGGTCG ATCCGCAAAA GGGCATCGCG CTGCGCTTTC CTCGCTTCTT ACGTCGCCGC GACGACAAAG AGCCCGAGAT GGCGACCAAC TCCGAGCAAG TCGCCGAGTT TTACAACGCC CAGGCCAACA AGCAAGAGTT TAACGCCAAC GGAGATGACG ATTAA
|
Protein sequence | MPKAKAKAKT KAKKEEPATA GLGDKSVEAA ERYLEYDPVS KVTWKLGDAT PYLYLANIFE SIAETTKRLE IAELLTNAFR TILASKPTDL LAAVYLASNT IHPQHEGIDL GIGDATLIKA LAEATGRKDE SIKNDYKEAG DLGSVAMASR STQRMMFPPP PLTVTGVLKE FRAIATTDGE KSVDRKKGMI KKLLVSAREC EAGYVIRSLQ GKLRIGLAQQ TVTQSLAHAI VLHGDAGKKK KGAELADALG AAFDVLKQVF SECPTFDQIV PALLEVGIEG LQERCKFTPG VPVKPMLAKP TTGVSEVLKR FEDIAFTCEY KYDGERAQIH LQENGKVSIF SRNQEDNTAK FPDLINGLKR YIKPHVKSVV IDCEAVAYDR EQNKILPFQI LSTRGKKNIV ESEIKVKVAL YAFDCLYLNG EPLLREPMHK RREALYSAFQ EVPGEFFFVT EKTSRDIDEL QGFLDESIAE NTEGLIVKTL DATYEPSKRS LNWLKLKKDY MEGCGDSLDL VPIGAWLGRG KRTGVYGAYL LACFDEDGEE YQSICKIGTG FSEVILEELA NAMNPHVIDG PRSYYKVSDA MKPDVWFEPK QVWEVKAADL SISPVHQAAC GLVDPQKGIA LRFPRFLRRR DDKEPEMATN SEQVAEFYNA QANKQEFNAN GDDD
|
| |