Gene OSTLU_119559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_119559 
SymbolExo1c 
ID5000485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp550231 
End bp552336 
Gene Length2106 bp 
Protein Length701 aa 
Translation table 
GC content53% 
IMG OID640415906 
ProductExodeoxyribonuclease I 
Protein accessionXP_001416426 
Protein GI145343647 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGTCG ACGCAGGGTG GAAAGCGTTC GACGATGCGG TCGTCCGCAC GTCCATCGGC 
GGCGAGTTCG CCAACGCGAA AGCGGGGGTA GACGCAAACA TCTGGATACA CCAGGGGTGG
GCGATGCGAA AAACTGGTTC TGTTCAGGAT AAACTGGATT CGTCTGTAGA AACGGTCATT
TCGCGCGCTA CGAAGCTCGT GAATGCAGGT GTCTTCCCAG TCCTTGTTTT CGACGGTGCG
AGGACTGCAC ACAAGAAGGA GACGCACGCG AAGCGAGCAG GTACGATAGG TGAAAAATTC
GAACGATGTT ATTTGCCTCG GATACTGAAT GAAGTGCGGA AGCGTGGATT CGTGTACGTC
GTGGCGCCAA ACGAATCAGA TCACCAGTTG AAGTACATGG AGTCGAGTGG TCTCGTTGAT
TTCGTGTTGA CGGATGACAC AGATGCAGTG GTGCTCGGGT GCGCCAAGGT CGTCCACAAG
GTGTCTTGGA GCGTTTCGAC GTTAAAGTGC AACGTTTTTA ACAGAAACAA CCTGAGGATG
CCGGTAGACA CGAACGAGGT GAACACAGTT GCGGAGCTCA TGTGCATGCA CGGCGACGCC
GCCATGCGCT TGTGGGCGGC GGCGAGCGGG TGTGATTACA GGGAGGGGAA AGTTCCCGGA
CTTGGACCCC AAACGGCTCT GAAAGCCATT GTAGCGTGCA TACAGAGTGC AGAGACTCTC
AGTATCCGCT CTTTTGTGAA ATACCTCGTC AAGAAGGACG TCGTCGACGC GAGTGATGAG
GACACGCAAA TTCTCAAACT TGAGGAAAGC CTGGCGGGCT TTGAGCGAGC AATCGTGTAC
GATATGCGGA CGAAAGAGAG ACGCTGGCTC AATGACGCAA CCATATTCAA CGCAAATCAC
TCAGAAAACG AAGAATTCGC TTTAGGTTTG CGCGATGCGG ATACTCACGA ACCCGTTGAA
TTGGTTGCAG TGGCTGCCCT TTTTAGACAT GGTGAAACAC GGGCGAGACG AATCCCCAAG
TATCTCATCA AAGGCGCCGT TTTACCGGAA AAGAGAGTCG AGGACAACTC CAAATCGGAT
TTAATACGTT GGCTGAACGT GCGTAGGAAC GATCGCCGCC GCGGATGTCA GGATATTCGG
GACAAAGATG TCATCATCTC AGAGGTCATA CAGAGGATGG AGCTTGAGCG TCGGTACGAG
GAGCTAGATA TCGACGTCGA CGGCGACGTC CAGGATCCAG AAGGGAAAAG TCTTCATACA
TATTTGGTTC ATCATTTTCA TCTTCCGGTG ACGCAATTCC CTGAGCTAGA TCCCGATCTG
GACGCACCAA TGGATGCTGA AGTGTGGAGC ACTGACGTTG AGCTTTTCCG CGAGACGTCA
CCGCTTATGG GTGAGGACAT CATCACGACG TGGCTCGCGG GGATGTCCGT TTACGGCGAC
CCGGTTCGAT CCAAAGCGTA TCGACAAGGA TACGCACGAA TACACGCCCG GACGCCCCTT
CCGATTCGTT TTGCGCACGT GGGGCATCCG TGGATGCAAA CTCACTTCCG CGTGTGGTTC
CGAGTTGGTA TTCCGGCATC ACTCAAAGCA GAGCGTTATG CCGTCGCTGT ATGCTTGTTA
TGCAAGTATG GGTGCATCGA CGAAGAAACG AATGAACATG TCCATGACCA TGTCGTCAGT
GTCGAGCGTG GGACGTGCGC GTGCAAGGCG GGAGCGGGCG GAGACTTCGG AGGGTGCATC
CATGTCATCG CAGCGCTCTG GTATTTTGCC AAGTTGCAAC GACCAGCTGA TCCATGCACG
TCGCTCGAAA GTGAGTGGTT CGGCACGAGT GGCGAAGGGG ATCCGATGAA TCGCAAAAAT
CCTCTTAGCA ACATTGACTT TAATCGATTC GAAGCTGGGC GCGCAAAGCG ACGATGCACT
GTGGACACTC GCGGTGACGG TGATTTGGAG CTCCCTGAAG TTGCACCTGA TGTAAGCGCG
ATGTGGTCGC GAAACAAGCC ATCAACACTT CTCAGGGAGT ATTTCGACGT ATATGAAAAA
GAGAACAATC ACAAATGTAC TTTACACAGG GTTACAGACG CGTGCGAGAC GCGGAATCTA
CCTTAG
 
Protein sequence
MGVDAGWKAF DDAVVRTSIG GEFANAKAGV DANIWIHQGW AMRKTGSVQD KLDSSVETVI 
SRATKLVNAG VFPVLVFDGA RTAHKKETHA KRAGTIGEKF ERCYLPRILN EVRKRGFVYV
VAPNESDHQL KYMESSGLVD FVLTDDTDAV VLGCAKVVHK VSWSVSTLKC NVFNRNNLRM
PVDTNEVNTV AELMCMHGDA AMRLWAAASG CDYREGKVPG LGPQTALKAI VACIQSAETL
SIRSFVKYLV KKDVVDASDE DTQILKLEES LAGFERAIVY DMRTKERRWL NDATIFNANH
SENEEFALGL RDADTHEPVE LVAVAALFRH GETRARRIPK YLIKGAVLPE KRVEDNSKSD
LIRWLNVRRN DRRRGCQDIR DKDVIISEVI QRMELERRYE ELDIDVDGDV QDPEGKSLHT
YLVHHFHLPV TQFPELDPDL DAPMDAEVWS TDVELFRETS PLMGEDIITT WLAGMSVYGD
PVRSKAYRQG YARIHARTPL PIRFAHVGHP WMQTHFRVWF RVGIPASLKA ERYAVAVCLL
CKYGCIDEET NEHVHDHVVS VERGTCACKA GAGGDFGGCI HVIAALWYFA KLQRPADPCT
SLESEWFGTS GEGDPMNRKN PLSNIDFNRF EAGRAKRRCT VDTRGDGDLE LPEVAPDVSA
MWSRNKPSTL LREYFDVYEK ENNHKCTLHR VTDACETRNL P