Gene OSTLU_526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_526 
Symbol 
ID5002514 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009360 
Strand
Start bp685235 
End bp688012 
Gene Length2778 bp 
Protein Length802 aa 
Translation table 
GC content58% 
IMG OID640417935 
Productpredicted protein 
Protein accessionXP_001418304 
Protein GI145347709 
COG category[A] RNA processing and modification
[D] Cell cycle control, cell division, chromosome partitioning
[L] Replication, recombination and repair 
COG ID[COG5049] 5'-3' exonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.306945 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.90792 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGTCC CGAGCTTCTA CCGCTGGATC GCGCAGAAGT ACCCGAAGAT CGTCGCCGAC 
GTCGTCGAGG ACGAGCCCGT CGACGCGCTC GGACGCCGGG TGGAGCTGAA CAGCGCCGAG
GCGAACCCGA ACGGCATCGA GTTCGACAAC CTGTACCTGG ACATGAATGG CATCATACAC
CCGTGCTTTC ACCCGGAGGA CCGACCGGCG CCGACGACGG AGGAGGAGGT GTTTGAGTGC
ATATTCGATT ACATCGATCG GTTGTTCTTG ATGATACGAC CGAGGAAGGT GCTGTACATG
GCGATCGACG GGGTGGCGCC GCGAGCGAAG ATGAATCAAC AGCGAAGTCG ACGGTTTCGA
AGCGCGCAGG AGGCGAGAGA GAAAGCGGAG GAGGAAGAAA AGTTGAGGGA GAAGTTGATC
AGGGAGGGGG TGAAGGTGCA GCCGAAACAA GAGTCGGGGG TGTTTGATTC GAACGTAATC
ACGCCGGGGA CGCCGTTCAT GGGACGGCTG AGCGAGGCGT TAAAGTACTA CGTGCACGAT
AAGTTGAACA ACGATCCGGG ATGGCGGGGA ATTGAGGTTA TCTTTAGCGA TGCGAGCGTA
CCGGGGGAGG GAGAGCACAA GGCGATGCAC TACATACGGC AACAAAGGGG GCTGCCGGGG
GCGAATCCGA ACACGCGGCA CGTCGTCTAC GGTTTGGACG CGGATTTGAT CATGTTGGCG
CTCGCGACGC ACGAGCCCCA CTTTTGGATC TTGCGCGAGA TTGTGTTTCA GAAGAAGGAT
AACGAGGCGC CGCAGACGCT GGGGCTCGGG ACGGAGGAGA CGAAGAAAAA GGTGGCCATC
GCGCGTAAGC CGTACCAGTT GCTCAGCGTG AGCGTCTTGC GCGAATACTT GGCGTTGGAC
ATGCGCCCTG TCGCACCGAC GCCGTTCAAG CTCGACCCAG AGCGGATGTT TGACGACTTT
ATCTTCATGT GTTTCTTCGT CGGAAACGAT TTCTTACCGC ATTCGCCGAC GTTGGAAATC
CGTGAGGGCG CCATCGACTT GTTGATGACG CTCTACCGAA ACTGTCTTCC GACGCTCGGT
GGATACCTGT GCGCGGACGG CCGTCCCAAT CTCTCCATCG TCGAAAAGTT TGTGCGGCTC
GTGAGTGAGC ACGAGGACGC CATCTTTCAG AAGCGTGCCA AGAAGGAGGC GCGAATGCGG
AGTTCAAGAC AACGCGATAA ACAAAAGGCG AAGGATTACT ACGAGCGCCA GCGCAAAGGG
AGCGGTACCA ACGTGCCGCA ACACCGCGTG CTCGGTGGAT CGAGAAACGC GAGTGATCGC
GCGCCCGCAG CGCCGACAGA ACAGCTCGTC GCGCTCGGTC GCGGAAAACC GACGCCACCT
CCGAGCGCGC CGAAGACGGC GGCGGAGAAC AAGAGCGCCG CGGATGCGCT GCGCGAACGA
TTAAAGACGC GAGGTAAGCG CGCCGCCGAA GCGACGGCTG ACGTCGCGCC GGAAGCCGAT
ACCGACGCCA AGAAGGCTAA AGTATCGGAC GCGGATAAGG CTGAAAAGGA CGAAAAAGCA
AAGGAGTTTT GGAATCAGCT CGCGGAGAAG GCTGAATCTG AATCAGCCAC GACGGTTGAA
ACCATCGACA CCGTCGAAAG CGATGAAGTG CCGTCGCACC CTTTTGTACA ATCGACGGCA
CCCGACGCGC AGCCCGGCGA TTGGATGTGT CCCACAGGGT GCGGAAGCAT GTACGCATCT
AAAGGTTCGT GCTTCAGGTG CGGATGCCCA CGTCCGAGCG AGGTGCGAGA GTTCAAAGCG
GGCGAGGTGA TGGACAGTAA ATCCTTCTTG AAGCAACTTG AAGGCATCGT CAAGGCCATG
GGTGAGCGCG AGGAAGAAAC GGACAACATT CGGCTCGGAG AAAGTGGCTG GAAAGATCGC
TATTACGAAG CAAAGATGCA GGCAACGCCA CAAACGCGAG ATGAGATCAT TCGGGGCATG
GTCATTGAGT ACGTGCGCGG CTTAATTTGG GTGTGTCGAT ATTACTTCGA AGGGTGTTGC
TCTTGGAGTT GGTTCTACCC ATACCATTAC GCCCCGTTCG CGAGTGATTT ATATGACTTG
AGTACGATCT CTACCGATTT TGATCTCGGC AAACCGTTCA AGCCGTTTTC GCAGCTCATG
GGCGTGTTGC CGGCGGCGTC TTCGCACGCG TTGCCCGCAG CGTTCGCGCC GTTGATGTCG
GACAAAGACT CTCCTATCAT CGACTTTTAT CCCGAAGACT TCGCGCTCGA TATGAACGGG
AAGCGTTTTA CGTGGCAGGC CGTGGCACTG CTCCCTTGGA TCGACGCCAA CCGTCTGCTA
GAACAAACGG AGATGCTCGA ATACACGCTC ACCGCCGAGG AAAAGCGCCG AAACTCCATC
AACGAGGAGG AAATATACGT CAATGCCGCG CATCCGCTCG CAAAACAGTT TCTCGAGCTC
GAAGAGCGGG AAGACGAAGA CGTGGAAAAA ACGCTCAAGA TGGATCCGAA GCTTAGTAAA
GGCATGAACG CCACGCTCGT GTCGGTGAAA CGCGACGCGC AGCCCACGAT GATACCGTCG
CCGATGTCGA GTCGCCAAGA CATCTCGAAC AACAAAGTAG TCGTGGCGTC GATGCGTTTG
CCGACCGATA GATTTGTGCC GCCGGTGCTG ATGCCCGGTG CGGTGTTACC GACGCCTATG
GTGACGGAGG CTGACTTGCC GCCGCCGCCG CAGTTGTTTC ATCAGACGGA CAACTACGGG
CGCAACAACA ACAACAAC
 
Protein sequence
MGVPSFYRWI AQKYPKIVAD VVEDEPVDAL GRRVELNSAE ANPNGIEFDN LYLDMNGIIH 
PCFHPEDRPA PTTEEEVFEC IFDYIDRLFL MIRPRKVLYM AIDGVAPRAK MNQQRSRRFR
SAQEAREKAE EEEKLREKLI REGVKVQPKQ ESGVFDSNVI TPGTPFMGRL SEALKYYVHD
KLNNDPGWRG IEVIFSDASV PGEGEHKAMH YIRQQRGLPG ANPNTRHVVY GLDADLIMLA
LATHEPHFWI LREIVFQKKD NEAPQTLGLG TEETKKKVAI ARKPYQLLSV SVLREYLALD
MRPVAPTPFK LDPERMFDDF IFMCFFVGND FLPHSPTLEI REGAIDLLMT LYRNCLPTLG
GYLCADGRPN LSIVEKFVRL VSEHEDAIFQ KRAKKEARMR SSRQRDKQKA KDYYERQRKG
SGTNVPQHRV LGGSRNASDR APAAPTEQLV ALGRGKPTPP PSAPKTAAEN KSAADALRER
LKTRGKRAAE ATADAMGERE EETDNIRLGE SGWKDRYYEA KMQATPQTRD EIIRGMVIEY
VRGLIWVCRY YFEGCCSWSW FYPYHYAPFA SDLYDLSTIS TDFDLGKPFK PFSQLMGVLP
AASSHALPAA FAPLMSDKDS PIIDFYPEDF ALDMNGKRFT WQAVALLPWI DANRLLEQTE
MLEYTLTAEE KRRNSINEEE IYVNAAHPLA KQFLELEERE DEDVEKTLKM DPKLSKGMNA
TLVSVKRDAQ PTMIPSPMSS RQDISNNKVV VASMRLPTDR FVPPVLMPGA VLPTPMVTEA
DLPPPPQLFH QTDNYGRNNN NN