Gene OSTLU_43237 details

Gene Information

Locus tag: OSTLU_43237
Symbol:
ID: 5005512
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009368
Strand:
Start bp: 242092
End bp: 244953
Gene Length: 2862 bp
Protein Length: 918 aa
Translation table:
GC content: 61%
IMG OID: 640420933
Product: predicted protein
Protein accession: XP_001421245
Protein GI: 145353917
COG category: [L] Replication, recombination and repair
COG ID: [COG1525] Micrococcal nuclease (thermonuclease) homologs
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.31647
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCACCG GATGGCTCCG AGGCGTCGTC AAAGCCGTCC CCAGCGGTGA TCAAGTCATC 
ATCGCCGCAC CGTGCGCGCC AGGGGTGCGT TCGCGCGCCG TACCGACCGA TTTCGTCGCG
CGATGTCGAC GCGACGGTTG CCCGATCGCG CGAAACTGAC CACTCTCGCG ACGATTCTCG
CGCTCGCAGG CCCCGCCCGG CGTCGAAAAA ACGCTCACGC TCGCCGGTAT CGTCGCGCCG
CGTCTCGGTC GCCGCGACGG GTCGAGCGCG GACGAGGCGT TCGCGCGCGA GTCGAGGGCG
TCGCTGCGGC GCGCGCTCGC GGGACGACGC GTGTCGTTTC GCGTCGAGTA CGCGGTGGAG
TCGATAAATC GCGAGTTCGG CGTCGTGTTC ACGGAGAGCG GGGAAAACGT GAGCGTGATG
CAAGTGTCTA AGGGGCTGGC CAAGGTGAAG GCGCCCGGGG GGAACGATCG AGCGGTGGCG
AACGCGGAGG AGTTGGAACG ACGCGAACTC GAGGCGCGAG AAGCCGAGGC GGGGATGTGG
AGTAAGGATC CCGCGGTGCT CGCGGCGGCG AGTCAGCGAA CGGTCGTGCA GGCGATGAAA
GCGGAGGACG TGCTGGGTGC GTTGCGGATG AAACCGACGC CCGCGGTGGT GGATTACGTG
CTGAACGGTG GGACGGTGAA GCTTGTGCTG ACGGGGGACG GCGCGACGCG CGATCAGAAT
ATCACGTTGT CTATCGGTGG GATTTCAGTG CCGTCCGTCG GGCGCAAGGG GGCGAAGAAC
GAAGATGGGA CAGATCAAGG TCCAGAGCCG TTCGCGCTCG CGGCGAAGCA TTTCACGGAG
ATGGCGCTCC TGCATCGAGA CGTGCGGGTG ATTTTGGAAG GTCTCGATCG TCGTAATAAT
TTCATCGGTT CAATCTTGCC CGCGGACGTG AACGATACGT CGTTCGTGAA CGTCGGCGAA
GAGTTGTGTC GGCTAGGTCT CGCGCAAGTG CACGAGGCGA GTGCGGCGGC GTTGATCGGT
GGCGCGGCGA CGCTTCGCGC GGCGGAGAAG ATGGCCAAGG ATCAGCAGTT GCGACTTTGG
CATGGATACG TCCCGCCAAT ATCTTCCTTG AACGCGATGA CGACGAAAGT CTTCGATGCC
AGAGTAGTAG AAGTCATCAG CGGTGATTGC ATTTCCGTGG TGCCGACGTC AGGGCCGGAT
ACGTCTGAGA GACGAATCAA TCTGTCGTCG ATTCGGGCGC CTAGAATTTC CAACTCACGA
GATGACAAGT CCAATCACGA ACCTTGGGCG ATAGAGGCAA AAGAGTTTTT GATCTCGCGT
CTGATCGGGC GCACCGTATC GATTAATATG GATTACGCAC GCAAGATTGG AGAAGGTGCG
AACGAACGAA CGTTGCACTT CGCCACGGTG AAGCTGCCAA ACAACAAGAC GGGCGGTGAC
CCGCTCAACG TTTCAGAGAT GCTTCTCATG CGCGGTTTCG CGTCGTGCAT TCGTCACCGT
TCTGAGGAAG AACGTGCGGC AGACTACGAT GAGCTCATCG CGGCGGAAAA GAAGGGCGTG
GAGAGCAAAA AGGGAATGCA CAACAAGAAT CGCGAGGCGC CTGTACACAG GACGAATGAT
TTTAGCATCA ACGCGCATAA GGCGAAGACG TTTTTGCCGT TTTTGCAACG CGCGGGTAAG
TGCGTCGCTA TGGTAGACTA CGTCGCCGCT GGACACAAAA TTCGAGTTTC AATTCCCAAA
GAAGGCGCGG TGATCGCCTT TTGCTTGGCG GGCGTTCGCT GTCCCCAGCG CGACGAGCCG
TACGCCGCCG AGGCGTTGGC GTACACGCGT TCTCGAATTC TTCAGCGAGA GGTGGAAATC
GTGGTAGACT CCGTGGATAG AACTGGAATT TTCCTTGGCA CCTTATTTGC GGACAACGGG
CGATTAAATC TCGGTGAAGA ACTCCTTCGA GCCGGATTAG GAAGCTTGCA CCCGGCGTTC
CCGGTGGATC GCGTTCACTA CGGTCGCGCG CTCGCGGACA TTGAAGCCGC GGCACGGGAA
GTCAAGGCTG GTTTATGGAA AGACTGGACC CCTCCGATCG TCGAAGTAGA CGGGCCTGAG
GATAGTTCGA CCGGCGAACT CGTGCGAGTC GGCGTCACCG AGTGCGTCGC CGGGGGCCGA
TTCTTCGTGC AGAAGTTAGA TGGGAGTAAG ATTCAAGAGG TCACGGACAA ACTCGCCGAG
CTTTACGACG GCGTGGACAC GAGCAAGCCG CACGATGGCG TGTTCGAACC AAAGCCTGGC
GATGCCGTCG CCGCCAAGTT CACCGGAGAT GACAAGTGGG CGAGAGCCAT CGTCACCGCG
AAGCGCGTCG GTGATAAGCC CGTCAGCGTC TTCTACTGCG ACTTTGGCAA CGTCGAGGAC
ATCGGTTTCA ATCGTCTTCG ACCTTTGAAG GATCCAACGG TCACCACAGT TGCTATCCCA
CCCATGGCCA ACTTCTGCGC GCTTTCCTTC CTCAAGATTC CTCGCATCGA TTCCGATTAC
GGCTACGCCG CCGCTTCGCA CGTCGGCAAA CTCATCTCTG GCCAGGCTTT CCACGCCCGA
ATCGACGCCC GCGATCGTTT CCCCACCACA AAACCATGGG AAATCGACGC ACAGCCCGCG
TTCTCGCTCA CATTATTCCC CGACGCCAAC GCTCGCGCCG CTGAATCCGT CGCCCTCGAC
CTCCTTCGCG CCGGCTTTGC GCGCGTCCAC CGCCGCGCCG CCGCCCGTCG TCTCGATCGC
GACGTCTTCG ACGCCATGGT CGACGCCCAG GAGTCCGCGC GTCGCGCGAG GGTCGGTCAG
TGGGAGTACG GCGACGTCGA TTCCGACGAC GACGCGTCTT AG
 
Protein sequence
MSTGWLRGVV KAVPSGDQVI IAAPCAPGAP PGVEKTLTLA GIVAPRLGRR DGSSADEAFA 
RESRASLRRA LAGRRVSFRV EYAVESINRE FGVVFTESGE NVSVMQVSKG LAKVKAPGGN
DRAVANAEEL ERRELEAREA EAGMWSKDPA VLAAASQRTV VQAMKAEDVL GALRMKPTPA
VVDYVLNGGT VKLVLTGDGA TRDQNITLSI GGISVPSVGR KGAKNEDGTD QGPEPFALAA
KHFTEMALLH RDVRVILEGL DRRNNFIGSI LPADVNDTSF VNVGEELCRL GLAQVHEASA
AALIGGAATL RAAEKMAKDQ QLRLWHGYVP PISSLNAMTT KVFDARVVEV ISGDCISVVP
TSGPDTSERR INLSSIRAPR ISNSRDDKSN HEPWAIEAKE FLISRLIGRT VSINMDYARK
IGEGANERTL HFATVKLPNN KTGGDPLNVS EMLLMRGFAS CIRHRSEEER AADYDELIAA
EKKGVESKKG MHNKNREAPV HRTNDFSINA HKAKTFLPFL QRAGKCVAMV DYVAAGHKIR
VSIPKEGAVI AFCLAGVRCP QRDEPYAAEA LAYTRSRILQ REVEIVVDSV DRTGIFLGTL
FADNGRLNLG EELLRAGLGS LHPAFPVDRV HYGRALADIE AAAREVKAGL WKDWTPPIVE
VDGPEDSSTG ELVRVGVTEC VAGGRFFVQK LDGSKIQEVT DKLAELYDGV DTSKPHDGVF
EPKPGDAVAA KFTGDDKWAR AIVTAKRVGD KPVSVFYCDF GNVEDIGFNR LRPLKDPTVT
TVAIPPMANF CALSFLKIPR IDSDYGYAAA SHVGKLISGQ AFHARIDARD RFPTTKPWEI
DAQPAFSLTL FPDANARAAE SVALDLLRAG FARVHRRAAA RRLDRDVFDA MVDAQESARR
ARVGQWEYGD VDSDDDAS
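
The sequence blocks above are grouped in 10-character chunks, so they need to be joined before any length or composition check. The following is a minimal sketch of one way to do that and to recompute the Gene Length and GC content fields. The function names clean_sequence and gc_percent are illustrative assumptions, not part of the source database, and the sample string is only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-grouped sequence block into one string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from the record, used as a short sample.
sample = "ATGTCCACCG GATGGCTCCG AGGCGTCGTC AAAGCCGTCC CCAGCGGTGA TCAAGTCATC"
seq = clean_sequence(sample)
print(len(seq), round(gc_percent(seq), 1))

# Start and end coordinates from the record, treated as an inclusive span.
start_bp, end_bp = 242092, 244953
gene_length = end_bp - start_bp + 1
print(gene_length)
```

Pasting the full gene sequence block into clean_sequence should give a string whose length can be compared with the Gene Length field. Because the gene is marked as spliced, the protein length is not expected to equal the gene length divided by three.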