Gene Information |
Locus tag | OSTLU_26493 |
Symbol | |
ID | 5004601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | + |
Start bp | 327320 |
End bp | 330304 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | |
GC content | 53% |
IMG OID | 640420022 |
Product | predicted protein |
Protein accession | XP_001420309 |
Protein GI | 145351923 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1793] ATP-dependent DNA ligase |
TIGRFAM ID | [TIGR00574] DNA ligase I, ATP-dependent (dnl1) |
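
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the record does not state either convention explicitly. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values taken from the record above.
start_bp, end_bp = 327320, 330304
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 2985, matching "Gene Length"
print(protein_length_aa(length))  # 994, matching "Protein Length"
```

Because the gene is listed as unspliced, no intron lengths need to be subtracted before dividing by three.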
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.164071 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGACGACG TTGCGAATGA TTCCGGGCAC CGTCTCGGAA CGCGCGACGG CGGCGACGAC GACGACGACG ACGCGGAAGA GGACGGCGCG CGGTTGACGA GCGAACCTTC GACGTCGGGC GCGTCGCTGA GATACGCCAC CGCGCGCGAC GTGCCGTTTC GAAAGCTGTG CGGCGCGTTC GAGGCGTTGC AGAACGAACG CGGTCGGTCC AAGGCGACGA GCGCGCGGCG GTATCGAGTG CTCGATACAC TGCACGATCG ATGTCTCGTG AGCGCGCGCG ATGGAACCCG CGTCGACGCG TTCGACGCGT ATCGGTTGAT ACTGCCGGCG TTGGACAAGG AACGAGGGGC GTATTTCTTG AAACACGACG CGTTGGCCAA GGTGGTGGTG TCGGCGTTCG ACATGTCTCG AGCGAGCGAG GACGCGAAAA AGTTGGAGGA TTGGAAACGA AAGGGTGGCG GTAACTTTCC GCAAGTTGTT CGAGACGTTT TGGCGGGTGG ACATCTCGCG GATCGTGAGA AAGATGAAGA CGCGCGCACG TTGACCATCG GAGACGTGAA CGAGGCGTTG GACGCATTGG CGGCGTGCGA TAACAAGGAT CAGCGCGCGT CGACGCTGCG GGCGCTGTTT TCAAAGATGG ATGCGACGCA AGTGAAATGG ACGTGCGCAA TCATATTAAA GGAAACAAAA ATTTCAATGG GAGAGAAGGC GATATTGCGT CATTATCACT CCGAAGCGAA TGACTTGTGG GATTGGACTA GCGATTTGCG AAGAGTTTGT GAAGACTTAC CGCGGCGAGA CTTCCGGATG AAGCGCGAGG ACATCGATGT GGGCGTAGGT TTAATTAATC CTCAGATGGC GAGAAGGCAA AACAGCATCG ATGCCGTGTA CCGCGCGTTG AAAGGTACGC AGTTTGTCAT CGAAACCAAA TTCGACGGCG AGCGCATTCA AATTCACAAA GACGGCGACA TCATCAACTA CTGGACGAGA AACATGAACG ATTTTGGCCC TCGTGGCTAC GATGTGATGA ATGCTCTCTT TCGTCATTTG CCAAAGCGAT GCATTCTCGA CGGCGAACTC ATGGTTTGGA ACAAGCTTCG CGATCAGTCG TACCCGTTCG GCGCGTTGAA GAATCTCATC AAGGCTGCAA ATATGCGCAA GGCCAAGGAT GAATTGTTTC CACTCAAGGC ACACCTCGAA AACCGTGATG GTGGTGATAG CGACGGTGAG GAGGATCTCA TGAGCTCCAA GTATGCGTGG TACGCCGAGA ACGTAAAGAA ACTCACGTAT GGAGACTTGG AATTGGTCTA CGTTCCGTTT GATATATTGT ATGCCGTCGA TCGAAGTGTG AAGACTCATA AGCTGAGAGA GCGGCATGAA CTTTTGAAAG AGCATGTGCG TGAAGTATCA GTCATGTGTG GGAATATTCG TGCTAGAATT CTTCTCGCGA CGCCCGATTG CGCGTTCTCA CGTGTAGGCT CAACGAAGGA GGATATCGAG CGCGCGCTGT TAGAAGCGAT GGATAACGGA GAGGAGGGAT TGGTAATCAA AGATCTCGAC GGTCCATGGA TACCAGGTGA TCGTAGCAAT AACTGGATGA AAATCAAACC AGACTACTTA TCGAGTGAAG ATTTGGACGT CGTTCTCATC GGCGGCTATT ACGGCGCCGG TGAGTTGCGC GGGGGCAAAA TTTCGCAGTT CCTCTGTGGC ATTGTGGAGG CAACGAACGA TCCGATTAGC ACCGCTTCAG ATGGAGTCAA AATCATGAGC TTTTGCAAGG TAGGTACTGG TATCAGCGCC 
ATGCAGCTCG ACGATTTGAG AAGTCGGCTA GGCGACTTCA TGCACAGAGA AAAGCCTCAA GACATGAACT ACAACGTCAC GGATGCTTAC AACGAAAAGC CGGACGTTTG GATCTGGCCT CCTCAGCGAT CATTGGTTGT GACGGTGAAA GGTGACATTC GTGCGATTCG CACGACAACT TTCGCAACTG GTTACAGCCT GCGTTTCCCC CGCGTCACCG GTATTCGATA TGACAAAAAA TGGTCCGACA TCTTGACACT CTGCGATCTC TTAAAGACGA TTGAAGAGGA AGTACCGACA TTGAAAGTGA ATATCGACGA TACACGAGGC GGTACGAATG CCCGAAAGCG TTCGCGCACG ACGGGACCCG CGACGCTACT ACCTGCACAT CTCATGCCGG TTGACGTTTC GGGCGTCGTG CAGGAATCTG AAATCTTCAA AAACATGCAA GTGCACATCG CGAACTGTGA AAGTCGAGAA CAAAAGCAGG AGCTGTTCAA AGCTGTCATC GCACGTAGTG GTACTACGAG TGAGTTGTGG CACAAGCATG TGACGCACAG CGTTGCGTTG AACCGCGACG GCTCAAAGTT CAGGACGGCG AGTCGGGAAG GCGATGTGTA CACGATTGCC TGGCTCCAAG AGTGCATTCA CGAAGGACGT GTGGTACCGC CGAAACCTCG ACATCGCTTA CATCTCTCAG CGACGACGTG GTATGGTACG GATGCCATGG ATCGATACGG AGATGATCAT TTCTCGGACT GCAATTTGGC CGACGTACAC GCGCTCGTGC ATCAAGTTGG CGATCAAACT AAGCGTTGGA AAACTGCCGA AGTTCCAGCA TTGGTGGAGC TCGATCGCGA GTATCCATCG GTGTGTGCTG ATGTGAAACT TTTGACGTTT AGGGGTTGTG TGTTCGAGAT TGACGATTCT CTCGGCAAAG ATGCGCGTGT TGATGAGCAG AGTAGTATGT TTCCAAGTGT GCTGGCGGGC GAACGTCGAA ACGTCGAACA GATTCTTCGG ATTTATGGTG GAAAGCTCGC ACCTGAAGGC AGAATGGGGA CACATATTGT GCGCATGTAC GATGACTCCG TTGTCTTTGA AGATATGGAT GATGGAAGAA AGCAAGTTTC GCTCTCTTGG GTGAAACGTT GCATCGACCG GTCGACAACG GCTGATGTAG TCTAA
Protein sequence | MDDVANDSGH RLGTRDGGDD DDDDAEEDGA RLTSEPSTSG ASLRYATARD VPFRKLCGAF EALQNERGRS KATSARRYRV LDTLHDRCLV SARDGTRVDA FDAYRLILPA LDKERGAYFL KHDALAKVVV SAFDMSRASE DAKKLEDWKR KGGGNFPQVV RDVLAGGHLA DREKDEDART LTIGDVNEAL DALAACDNKD QRASTLRALF SKMDATQVKW TCAIILKETK ISMGEKAILR HYHSEANDLW DWTSDLRRVC EDLPRRDFRM KREDIDVGVG LINPQMARRQ NSIDAVYRAL KGTQFVIETK FDGERIQIHK DGDIINYWTR NMNDFGPRGY DVMNALFRHL PKRCILDGEL MVWNKLRDQS YPFGALKNLI KAANMRKAKD ELFPLKAHLE NRDGGDSDGE EDLMSSKYAW YAENVKKLTY GDLELVYVPF DILYAVDRSV KTHKLRERHE LLKEHVREVS VMCGNIRARI LLATPDCAFS RVGSTKEDIE RALLEAMDNG EEGLVIKDLD GPWIPGDRSN NWMKIKPDYL SSEDLDVVLI GGYYGAGELR GGKISQFLCG IVEATNDPIS TASDGVKIMS FCKVGTGISA MQLDDLRSRL GDFMHREKPQ DMNYNVTDAY NEKPDVWIWP PQRSLVVTVK GDIRAIRTTT FATGYSLRFP RVTGIRYDKK WSDILTLCDL LKTIEEEVPT LKVNIDDTRG GTNARKRSRT TGPATLLPAH LMPVDVSGVV QESEIFKNMQ VHIANCESRE QKQELFKAVI ARSGTTSELW HKHVTHSVAL NRDGSKFRTA SREGDVYTIA WLQECIHEGR VVPPKPRHRL HLSATTWYGT DAMDRYGDDH FSDCNLADVH ALVHQVGDQT KRWKTAEVPA LVELDREYPS VCADVKLLTF RGCVFEIDDS LGKDARVDEQ SSMFPSVLAG ERRNVEQILR IYGGKLAPEG RMGTHIVRMY DDSVVFEDMD DGRKQVSLSW VKRCIDRSTT ADVV
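
The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the display formatting might be stripped and a GC percentage computed for comparison with the "GC content" field. The helper names are hypothetical, and the example string is only the first 30 bases of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three ten-base blocks of the gene sequence shown above.
example = "ATGGACGACG TTGCGAATGA TTCCGGGCAC"
print(len(clean_sequence(example)))  # 30
print(f"{gc_percent(example):.1f}% GC")
```

Running `gc_percent` on the complete gene sequence, pasted as a single string, should give a value that can be compared with the reported GC content, subject to how the database rounds.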