Gene OSTLU_16988 details


Gene Information

Locus tag: OSTLU_16988
Symbol:
ID: 5004221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009364
Strand:
Start bp: 117181
End bp: 119175
Gene Length: 1995 bp
Protein Length: 664 aa
Translation table:
GC content: 55%
IMG OID: 640419642
Product: predicted protein
Protein accession: XP_001419907
Protein GI: 145351064
COG category: [L] Replication, recombination and repair
COG ID: [COG1793] ATP-dependent DNA ligase
TIGRFAM ID: [TIGR00574] DNA ligase I, ATP-dependent (dnl1)
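
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
START_BP, END_BP = 117181, 119175

length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1995 664
```

Under these assumptions the computed values agree with the Gene Length (1995 bp) and Protein Length (664 aa) fields.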


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.606353
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.523448
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCAAGG CGAAGGCGAA GGCGAAGACC AAGGCGAAGA AGGAAGAACC GGCCACGGCT 
GGGCTCGGAG ACAAGTCCGT GGAGGCGGCG GAGCGGTATT TGGAGTATGA TCCGGTGTCG
AAGGTGACGT GGAAGCTCGG CGACGCGACG CCGTATTTGT ATCTCGCGAA TATTTTCGAG
TCCATCGCGG AAACGACGAA AAGATTGGAG ATCGCGGAGC TGCTGACAAA TGCTTTCCGG
ACGATTTTAG CGAGTAAACC GACTGATCTC TTGGCGGCGG TGTACTTGGC GTCGAACACC
ATTCATCCGC AGCACGAAGG TATCGACCTC GGCATCGGGG ATGCGACGCT CATCAAGGCG
CTCGCTGAGG CCACGGGGCG GAAAGACGAA TCGATTAAGA ATGACTATAA GGAAGCTGGC
GATCTTGGAA GCGTGGCCAT GGCGAGTCGT TCGACGCAGC GCATGATGTT CCCGCCGCCA
CCGTTGACGG TGACGGGCGT GCTCAAGGAG TTTAGAGCCA TCGCCACGAC CGATGGCGAA
AAGAGCGTCG ATCGCAAGAA AGGTATGATC AAGAAGCTCC TCGTCTCCGC GCGCGAGTGC
GAGGCGGGTT ACGTCATTCG CTCGCTCCAA GGTAAACTAC GAATCGGCCT CGCTCAACAG
ACGGTGACGC AAAGTCTCGC GCACGCCATC GTCTTGCACG GCGACGCCGG AAAGAAGAAG
AAGGGCGCCG AGCTCGCCGA TGCCTTGGGT GCAGCATTCG ACGTGCTCAA GCAAGTATTT
AGCGAATGTC CGACTTTTGA TCAAATCGTC CCGGCGCTAC TTGAGGTTGG TATCGAAGGC
TTGCAAGAGC GTTGCAAATT CACCCCGGGT GTCCCCGTCA AGCCGATGTT AGCCAAGCCT
ACGACTGGGG TGTCTGAAGT GTTGAAGCGC TTCGAAGACA TCGCGTTTAC GTGTGAATAC
AAGTACGATG GAGAGCGCGC GCAGATTCAC CTCCAAGAAA ATGGCAAGGT GTCCATCTTC
TCCCGCAACC AAGAGGACAA CACGGCCAAG TTTCCCGATC TCATAAACGG TCTCAAGCGA
TACATCAAGC CTCATGTGAA GTCTGTCGTC ATCGACTGCG AAGCCGTTGC ATACGATAGA
GAACAAAACA AGATCTTGCC GTTCCAGATT TTGAGCACGC GAGGAAAGAA AAACATCGTT
GAATCGGAGA TCAAAGTCAA AGTCGCTCTC TACGCGTTTG ACTGCCTGTA CCTCAACGGC
GAACCGCTGT TGCGCGAGCC GATGCACAAG CGCCGCGAAG CACTTTACAG CGCATTCCAA
GAAGTTCCCG GCGAATTCTT CTTCGTCACC GAGAAGACAT CTCGCGATAT CGATGAACTG
CAAGGATTTT TGGACGAATC TATCGCAGAG AACACCGAAG GTTTGATTGT CAAGACGCTC
GACGCGACGT ACGAACCGTC CAAGCGCTCC CTCAACTGGC TTAAGCTCAA GAAGGATTAC
ATGGAAGGCT GCGGTGACTC ATTAGATTTA GTTCCAATCG GCGCTTGGCT CGGACGCGGT
AAGCGTACGG GCGTTTATGG CGCGTACTTA CTCGCCTGTT TCGACGAAGA CGGCGAAGAG
TACCAATCCA TCTGTAAAAT AGGCACCGGC TTCAGCGAAG TCATCCTCGA GGAATTGGCC
AACGCCATGA ACCCACACGT CATAGACGGT CCGCGTTCGT ACTACAAGGT ATCCGACGCC
ATGAAACCCG ACGTGTGGTT CGAACCGAAG CAAGTGTGGG AAGTCAAAGC CGCCGACTTG
TCAATCTCTC CGGTTCACCA AGCCGCGTGT GGATTGGTCG ATCCGCAAAA GGGCATCGCG
CTGCGCTTTC CTCGCTTCTT ACGTCGCCGC GACGACAAAG AGCCCGAGAT GGCGACCAAC
TCCGAGCAAG TCGCCGAGTT TTACAACGCC CAGGCCAACA AGCAAGAGTT TAACGCCAAC
GGAGATGACG ATTAA
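
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be stripped before the sequence can be measured. The following is a minimal sketch of that cleanup together with a GC-fraction calculation; the helper names are hypothetical, and only the first displayed line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = (
    "ATGCCCAAGG CGAAGGCGAA GGCGAAGACC "
    "AAGGCGAAGA AGGAAGAACC GGCCACGGCT"
)

seq = clean_sequence(first_line)
print(len(seq))                    # number of bases on that line
print(round(gc_fraction(seq), 2))  # GC fraction of that line only
```

Applying the same two functions to the full sequence text would give values comparable to the Gene Length and GC content fields, assuming the database computes GC content over the whole gene in the same way.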
 
Protein sequence
MPKAKAKAKT KAKKEEPATA GLGDKSVEAA ERYLEYDPVS KVTWKLGDAT PYLYLANIFE 
SIAETTKRLE IAELLTNAFR TILASKPTDL LAAVYLASNT IHPQHEGIDL GIGDATLIKA
LAEATGRKDE SIKNDYKEAG DLGSVAMASR STQRMMFPPP PLTVTGVLKE FRAIATTDGE
KSVDRKKGMI KKLLVSAREC EAGYVIRSLQ GKLRIGLAQQ TVTQSLAHAI VLHGDAGKKK
KGAELADALG AAFDVLKQVF SECPTFDQIV PALLEVGIEG LQERCKFTPG VPVKPMLAKP
TTGVSEVLKR FEDIAFTCEY KYDGERAQIH LQENGKVSIF SRNQEDNTAK FPDLINGLKR
YIKPHVKSVV IDCEAVAYDR EQNKILPFQI LSTRGKKNIV ESEIKVKVAL YAFDCLYLNG
EPLLREPMHK RREALYSAFQ EVPGEFFFVT EKTSRDIDEL QGFLDESIAE NTEGLIVKTL
DATYEPSKRS LNWLKLKKDY MEGCGDSLDL VPIGAWLGRG KRTGVYGAYL LACFDEDGEE
YQSICKIGTG FSEVILEELA NAMNPHVIDG PRSYYKVSDA MKPDVWFEPK QVWEVKAADL
SISPVHQAAC GLVDPQKGIA LRFPRFLRRR DDKEPEMATN SEQVAEFYNA QANKQEFNAN
GDDD
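
The relationship between the gene and protein sequences can be illustrated by translating the first and last codons of the gene. The record leaves the Translation table field empty, so the sketch below assumes the standard genetic code; `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, assumed here because the record does not name a table.
# Codons are enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 15 and last 15 bases of the gene sequence shown above.
print(translate("ATGCCCAAGG CGAAG"))  # prints: MPKAK
print(translate("GGAGATGACG ATTAA"))  # prints: GDDD
```

Both results match the start (MPKAK) and end (GDDD) of the protein sequence listed in the record.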