Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_119559 |
Symbol | Exo1c |
ID | 5000485 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | - |
Start bp | 550231 |
End bp | 552336 |
Gene Length | 2106 bp |
Protein Length | 701 aa |
Translation table | |
GC content | 53% |
IMG OID | 640415906 |
Product | Exodeoxyribonuclease I |
Protein accession | XP_001416426 |
Protein GI | 145343647 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGTCG ACGCAGGGTG GAAAGCGTTC GACGATGCGG TCGTCCGCAC GTCCATCGGC GGCGAGTTCG CCAACGCGAA AGCGGGGGTA GACGCAAACA TCTGGATACA CCAGGGGTGG GCGATGCGAA AAACTGGTTC TGTTCAGGAT AAACTGGATT CGTCTGTAGA AACGGTCATT TCGCGCGCTA CGAAGCTCGT GAATGCAGGT GTCTTCCCAG TCCTTGTTTT CGACGGTGCG AGGACTGCAC ACAAGAAGGA GACGCACGCG AAGCGAGCAG GTACGATAGG TGAAAAATTC GAACGATGTT ATTTGCCTCG GATACTGAAT GAAGTGCGGA AGCGTGGATT CGTGTACGTC GTGGCGCCAA ACGAATCAGA TCACCAGTTG AAGTACATGG AGTCGAGTGG TCTCGTTGAT TTCGTGTTGA CGGATGACAC AGATGCAGTG GTGCTCGGGT GCGCCAAGGT CGTCCACAAG GTGTCTTGGA GCGTTTCGAC GTTAAAGTGC AACGTTTTTA ACAGAAACAA CCTGAGGATG CCGGTAGACA CGAACGAGGT GAACACAGTT GCGGAGCTCA TGTGCATGCA CGGCGACGCC GCCATGCGCT TGTGGGCGGC GGCGAGCGGG TGTGATTACA GGGAGGGGAA AGTTCCCGGA CTTGGACCCC AAACGGCTCT GAAAGCCATT GTAGCGTGCA TACAGAGTGC AGAGACTCTC AGTATCCGCT CTTTTGTGAA ATACCTCGTC AAGAAGGACG TCGTCGACGC GAGTGATGAG GACACGCAAA TTCTCAAACT TGAGGAAAGC CTGGCGGGCT TTGAGCGAGC AATCGTGTAC GATATGCGGA CGAAAGAGAG ACGCTGGCTC AATGACGCAA CCATATTCAA CGCAAATCAC TCAGAAAACG AAGAATTCGC TTTAGGTTTG CGCGATGCGG ATACTCACGA ACCCGTTGAA TTGGTTGCAG TGGCTGCCCT TTTTAGACAT GGTGAAACAC GGGCGAGACG AATCCCCAAG TATCTCATCA AAGGCGCCGT TTTACCGGAA AAGAGAGTCG AGGACAACTC CAAATCGGAT TTAATACGTT GGCTGAACGT GCGTAGGAAC GATCGCCGCC GCGGATGTCA GGATATTCGG GACAAAGATG TCATCATCTC AGAGGTCATA CAGAGGATGG AGCTTGAGCG TCGGTACGAG GAGCTAGATA TCGACGTCGA CGGCGACGTC CAGGATCCAG AAGGGAAAAG TCTTCATACA TATTTGGTTC ATCATTTTCA TCTTCCGGTG ACGCAATTCC CTGAGCTAGA TCCCGATCTG GACGCACCAA TGGATGCTGA AGTGTGGAGC ACTGACGTTG AGCTTTTCCG CGAGACGTCA CCGCTTATGG GTGAGGACAT CATCACGACG TGGCTCGCGG GGATGTCCGT TTACGGCGAC CCGGTTCGAT CCAAAGCGTA TCGACAAGGA TACGCACGAA TACACGCCCG GACGCCCCTT CCGATTCGTT TTGCGCACGT GGGGCATCCG TGGATGCAAA CTCACTTCCG CGTGTGGTTC CGAGTTGGTA TTCCGGCATC ACTCAAAGCA GAGCGTTATG CCGTCGCTGT ATGCTTGTTA TGCAAGTATG GGTGCATCGA CGAAGAAACG AATGAACATG TCCATGACCA TGTCGTCAGT GTCGAGCGTG GGACGTGCGC GTGCAAGGCG GGAGCGGGCG GAGACTTCGG AGGGTGCATC CATGTCATCG CAGCGCTCTG GTATTTTGCC AAGTTGCAAC GACCAGCTGA TCCATGCACG TCGCTCGAAA GTGAGTGGTT CGGCACGAGT GGCGAAGGGG ATCCGATGAA TCGCAAAAAT CCTCTTAGCA ACATTGACTT TAATCGATTC GAAGCTGGGC GCGCAAAGCG ACGATGCACT GTGGACACTC GCGGTGACGG TGATTTGGAG CTCCCTGAAG TTGCACCTGA TGTAAGCGCG ATGTGGTCGC GAAACAAGCC ATCAACACTT CTCAGGGAGT ATTTCGACGT ATATGAAAAA GAGAACAATC ACAAATGTAC TTTACACAGG GTTACAGACG CGTGCGAGAC GCGGAATCTA CCTTAG
|
Protein sequence | MGVDAGWKAF DDAVVRTSIG GEFANAKAGV DANIWIHQGW AMRKTGSVQD KLDSSVETVI SRATKLVNAG VFPVLVFDGA RTAHKKETHA KRAGTIGEKF ERCYLPRILN EVRKRGFVYV VAPNESDHQL KYMESSGLVD FVLTDDTDAV VLGCAKVVHK VSWSVSTLKC NVFNRNNLRM PVDTNEVNTV AELMCMHGDA AMRLWAAASG CDYREGKVPG LGPQTALKAI VACIQSAETL SIRSFVKYLV KKDVVDASDE DTQILKLEES LAGFERAIVY DMRTKERRWL NDATIFNANH SENEEFALGL RDADTHEPVE LVAVAALFRH GETRARRIPK YLIKGAVLPE KRVEDNSKSD LIRWLNVRRN DRRRGCQDIR DKDVIISEVI QRMELERRYE ELDIDVDGDV QDPEGKSLHT YLVHHFHLPV TQFPELDPDL DAPMDAEVWS TDVELFRETS PLMGEDIITT WLAGMSVYGD PVRSKAYRQG YARIHARTPL PIRFAHVGHP WMQTHFRVWF RVGIPASLKA ERYAVAVCLL CKYGCIDEET NEHVHDHVVS VERGTCACKA GAGGDFGGCI HVIAALWYFA KLQRPADPCT SLESEWFGTS GEGDPMNRKN PLSNIDFNRF EAGRAKRRCT VDTRGDGDLE LPEVAPDVSA MWSRNKPSTL LREYFDVYEK ENNHKCTLHR VTDACETRNL P
|
| |