Gene OSTLU_38526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_38526 
Symbol 
ID5001719 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp628124 
End bp631834 
Gene Length3711 bp 
Protein Length1153 aa 
Translation table 
GC content59% 
IMG OID640417140 
Productpredicted protein 
Protein accessionXP_001417825 
Protein GI145346706 
COG category[L] Replication, recombination and repair 
COG ID[COG1643] HrpA-like helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.168449 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACGCT CGAACGGCGA CGGCGCGTCG GACGCGAGCG CCGCGCTCGT GTTCCTGCCC 
AAGCGCAAGG GCACGAGCGG GCGGGACGGC GCGAGCGCGA TCGAGGCGCG CGATCGGCGA
CGCGCCGAGG CGCACGCCGC GCGCAGGGTG ACCAAGCTGA GCAATTCGCA AAAGCGCAAA
CTGAAGAAAC TGGCTGAGGA GGAGCGGAAA CGAAGCGAAA GGGCGAGCGT GATGGCCATG
CTGGAGGCAA ACGTGGCGGA TGAACGCGCG CTGGGGCTGA TGCGGGCGAC GACGTCGTTG
GGGGCGAGAG AGACGGCGAA GGAGAAGATG CGGCGCGCGC TCAAGGCGGA ACGCGCGGGG
GTGACGCTGG ACGACTTGGA CGACGCGCGG TTGACGAAGC GACCGAAGGC TGTGGGAAGA
GAAGATGAAG AACAGAGCGA GGACGAGAGC GAGAGCGAAC GGGAAGCGGC GGTTGAGGTC
GGAGGGGTGC GAGATGTCTC GACGGGGAAG ATGGAAGACG ACGTCGATTC GGAGGACGAC
GAACGCGCGG AGGACGACGC GTTGACTCGA GCCGCCATTC GCGCGGCGGA GGCGGCGGGC
ATGGCGTCAC TCGATGACGC CACGCGCGAA CTAGTGAGAT CATTGCGCGC GCGAGCTGGG
GTCGGGGATG ACTCAGATAA GCCGAAATCG AACGAGTACA AAGGCTTGCA TCACGAAGTG
TTTCGAGGAT GTTCCTTTGT CGTTCCCGTT CAACGGACCG GGAAGATTAA CGATTCGCGC
GAGGGGCTTC CGATCGTTCA AGAGGAACAT GAAATCGTAG ACGCGATTAA TACCAATCCC
GTGACCGTCA TCTGTGGCGC CACGGGTTGC GGTAAGACGA CGCAAGTGCC GCAGTTCTTG
TACGAGGCTG GTTACGGCGA TCCGGATTGC GACAGTCACC CTGGGGCGGT GGCGGTGACA
CAGCCTCGAC GTGTGGCGGT GACCTCCACG GCGCGTCGTG TGGCAGAAGA GCTGAACGTT
CCTCTCGGTG GTGATGTCGG ATATCAAGTG CGTTACGACA AGAACGTCGG CGACAACCCG
CGTATCAAAT TTATGACGGA TGGCATCTTG CTTCGAGAAG TACAGGCAGA CTTCCTTTTA
CGCAAATATT CGGTGGTGAT CATCGATGAA GCGCACGAAC GGAGCGTAAA TACGGATATT
CTGTTGGGTC TTCTCTCACG AATCGTTCCT CTTCGAGCCG CACTCGCGGC TGAAGGGAAA
GCTGTGACGC CGCTTCGATT GGTCGTGATG TCCGCGACGT TGCGAGTCGA AGAGTTCGTG
GAGAACAGGA AGCTATGTCC GACACCTCCA GCGTTGCTTC AAGTCGCCAC GCGTCAGTTT
CCAGTCACTG TGCACTTTTC ACGGAAAACG GAACACGCTG ATTACGTCGG TGCGGCGACG
AAGAAGGTAC TCGCGATCCA TCGCAAGCTT CCACCTGGCG GCATTCTCGT CTTTCTGACT
GGTCAGCGAG AGGTGGAGAT GGTCTGTCGC AAGCTTCGAG ACGCGTACCC GCTTCACGGG
AAGCGCGTCA ACGCGGCGGA AAGTTCGGAT GATGAAGATG AAGATAGCGG CGACGCGATG
GATGACACAT ACGACGTCGA CGCGATCGAC GCGGGGGGGG AAGATTTGGG CGGCGACGAC
GACGAACCCG ATTTCGACGG CGAGGACGAT ATGTCGGACG CCGCGAGCGA CATCAGCGAA
GAGGACGAAG TCCTAGTCAT GGGTGGTGAG GGCGTTGGAG AAGAAGAAGC CGCAGAAGCC
GAAGCAGCTT GGACTCGAGC GAATGCTCCA TCTACGGGTT TGGGCGCCGA TAAGACGGCG
GACGGACCGG GAGGTTTAAA TGTGCTTCCG CTCTACGCGC TACTGCCCCC GAATTTGCAG
CAGCGCGTGT TTCAAGCTTC ACCCGACGGC TCTAGAATGG TCATCGTCGC TACAAACGTC
GCGGAAACAT CTTTGACGAT TCCAGGGATA CGGTACGTCG TCGACGCCGG GAGAGCAAAG
GAGCGGGTGT ACGAGCGAGA CGCAAGTTTG TCGCGGTTTC AAGTCGGATG GGTGAGCAAG
GCTAGCGCCG ATCAACGCGC CGGGCGGGCC GGGCGTACGA GTCCAGGGCA CTGTTATCGT
CTGTTTAGTA GCGCACACTT CGTGGATGAG ATGAAAGCGC ACGCAGATCC ACAAATCCTG
GGTGTACCCG TCGAAGGCGT CGTGTTGCAA ATGCGTGCCA TGGGCATCGA CAAAGTGGTC
AACTTTCCTT TTATTTCGCC TCCCGAAAGA TCGGCTCTCG CGGCAGCGGA GAAGACTTTA
CAGATTCTCG GAGCGGTAGA GAAGAGTAGG CATGGCGAAG AGATCGGGCC TTTGACCGAT
TTAGGGCGCG CTATGGCGGT TTTGCCAATC AGTCCACGGC ACTCGAGAAT GTTGTTCGCC
GCCGCTCAAA GCGGCGTGGG AGGTTGCCTC TCACCGGCGA TTGCCATCGC TGCCGCGTTG
AGCCTCGACA GCCCATTCCT GCGAAATAGC AGCGAAACTG TGGAAGACGA CGAAGAAGAA
GGAGAGGCGA AGGCGACACC CAAAGGTCCG CCACCGCACG TTCGATTTCA CCACCCGGCG
AGCGACGCTC TCTCGGCGGC GCAGGCGCTC TTGGCGTACG ACGCCTGCAA GAGCTCGGAC
GCGGTGACGT TTTGCTCTAC GAATAGGTTG CACGAAAAGA CCATGCGAGA AATGTCCGAT
TTGCGGCGAC AATTGAAACG GCTCGTCGTC AATCTCGCGA CGACATCCAA GTTTGGCGAC
GACGTCTTTC CCAACGCCGC AGTGCTGAAC GAACTCGACG ATTCAAACCA GGCTGCTTCG
TCGATGATTT CGCTTCCTCC CGGGGGCGAT GTCGAGCGCA CGTTGAGACA GGCGCTGTGT
GCGGGTTGGG CGGATAGAAT TGCCCGTCGA TCAAAACACA AAGAGATGGA GCAAGCGTCT
CGAGCGAACG AGAAATCGAC CAAGGCGACG CGATACGTTC CGGCGCTCCT CGACGCCGCG
GTGTTTCTTC ATCCAACGTC TTCGTTGCAT CGAAGCTCTC CGGATTACGT CGTCTACACC
GACTTACTAC AAACGGACAA GCGCGCCTAC ATCGTCGGCG CCACCGGGAT TGAGCCCGAG
TGGTTGATTC AGCACTGCGA CGCACTCGTG GATCAAGGCG CTATGCTCGC CGACCCGGCG
CCGAGATACG TCTCGCGGGA GGACCGCGTC GTCGGTTGGA CGGCGCCGAG ATTCGGACCG
CACCGCTGGG ATTTGCCGCT GAATCCAATT GCTGTTAATG ATGTGGATAC CAAATGCGCC
GTCTTCGCCA CCGCGCTTTT GTCGGGGGCG GTGTCGCCAC CTATGGCGGA TTTGCGAGAG
AAGCTCGCCG CCAAGCCGCT CTTGGCGTCT CGACCTGAGG GACGAGCTCA AAAGCGCGTC
GTCGATTTAC TCGGCGCACT GAAACGTGTC GGTGGTGGCA TTTGTACTCG GGCACAATTA
CGACAAATCT GGATGACTCG TGGGAACGAC CGGTACTTGT ACCCAGAGCT CAAGGCGTGG
ATGCGGGCGG GCAAAGGTTA CGCGCTCGAA CAGGCGTGGT TAAAAATAGT TCAGGGCGTC
GTGAATTACG ACGAAGGCAA GGAGCGAAAG AAGAAGAAGG GAAAGAAGTG A
 
Protein sequence
MGRSNGDGAS DASAALVFLP KRKGTSGRDG ASAIEARDRR RAEAHAARRV TKLSNSQKRK 
LKKLAEEERK RSERASVMAM LEANVADERA LGLMRATTSL GARETAKEKM RRALKAERAG
VTLDDLDDAR LTKRPKAVGR EDEEQSEDES ESEREAAPKS NEYKGLHHEV FRGCSFVVPV
QRTGKINDSR EGLPIVQEEH EIVDAINTNP VTVICGATGC GKTTQVPQFL YEAGYGDPDC
DSHPGAVAVT QPRRVAVTST ARRVAEELNV PLGGDVGYQV RYDKNVGDNP RIKFMTDGIL
LREVQADFLL RKYSVVIIDE AHERSVNTDI LLGLLSRIVP LRAALAAEGK AVTPLRLVVM
SATLRVEEFV ENRKLCPTPP ALLQVATRQF PVTVHFSRKT EHADYVGAAT KKVLAIHRKL
PPGGILVFLT GQREVEMVCR KLRDAYPLHG KRVNAAESSD DEDEDSGDAM DDTYDVDAID
AGGEDLGGDD DEPDFDGEDD MSDAASDISE EDEVLVMGGE GVGEEEAAEA EAAWTRANAP
STGLGADKTA DGPGGLNVLP LYALLPPNLQ QRVFQASPDG SRMVIVATNV AETSLTIPGI
RYVVDAGRAK ERVYERDASL SRFQVGWVSK ASADQRAGRA GRTSPGHCYR LFSSAHFVDE
MKAHADPQIL GVPVEGVVLQ MRAMGIDKVV NFPFISPPER SALAAAEKTL QILGAVEKSR
HGEEIGPLTD LGRAMAVLPI SPRHSRMLFA AAQSGVGGCL SPAIAIAAAL SLDSPFLRNS
SETVEDDEEE GEAKATPKGP PPHVRFHHPA SDALSAAQAL LAYDACKSSD AVTFCSTNRL
HEKTMREMSD LRRQLKRLVV NLATTSKFGD DVFPNAAVLN ELDDSNQAAS SMISLPPGGD
VERTLRQALC AGWADRIAPN EKSTKATRYV PALLDAAVFL HPTSSLHRSS PDYVVYTDLL
QTDKRAYIVG ATGIEPEWLI QHCDALVDQG AMLADPAPRY VSREDRVVGW TAPRFGPHRW
DLPLNPIAVN DVDTKCAVFA TALLSGAVSP PMADLREKLA AKPLLASRPE GRAQKRVVDL
LGALKRVGGG ICTRAQLRQI WMTRGNDRYL YPELKAWMRA GKGYALEQAW LKIVQGVVNY
DEGKERKKKK GKK