Gene OSTLU_41641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_41641 
Symbol 
ID5005020 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp129124 
End bp132159 
Gene Length3036 bp 
Protein Length936 aa 
Translation table 
GC content65% 
IMG OID640420441 
Productpredicted protein 
Protein accessionXP_001421062 
Protein GI145353527 
COG category[L] Replication, recombination and repair 
COG ID[COG1643] HrpA-like helicases 
TIGRFAM ID[TIGR01970] ATP-dependent helicase HrpB 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000692526 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGACGC GCGTCGCGAC GACGGACGCG CGCGCGCCGG TCGCGCGGCG CGCGCGCGGT 
GCGCGTCGCG ACGTCGCGCG GCGCGCGACG GGAGACGACG CGCGGCGCGA GGCGCTGCGA
AAAGCGCGCG CGAAGTTGCC CATCGACGCG GTCGTGGACG CGTGCTTGGA CGCGCTCGAG
GTGCGTCGAC GGCGCGCGAA AGCGCGACGC GGGACTGACG AGGCGAACGA TTGGGTTCGA
TGCGAACAGC GCGCGACGCG CGCGGTGATG CAGGCGCCGC CGGGGGCGGG GAAGACGACG
GTGATGCCGC TGGCGGCGGC GCTGGCGGCG GCGAGCGCCG GCGGCGACGG ACGCGCGGGG
AAAGTGATCG TGCTCGAACC GCGACGGTTA GCGGCGAAAG CGGCGGCGAT GCGGATGGCT
GAGATGTTGG GGGAACGCGC GGGGGAAACG GTGGGGTATC AGGTGCGATT TGAACGAAGG
GCGAGCGCGG CGACGCGGGT GGAGGTGGTG ACGGAGGGCG TGCTGACGCG GCGGTTGAGA
AACGATCCAG AGCTGCGGGA CGTGGGATTG GTGGTGTTCG ATGAGTTTCA CGAGAGGAAT
TTAGACGCGG ACGTCGCGCT GGCGCTGTGT CGAGAGGTGC AACAGACGAT ACGGCCAGAT
TTGAGACTGT TGGTGATGAG CGCGACGCTC GGGGAGATGG GCGCGCGCGT GGCGGCGTTG
TTGAGGGATG AGAACGGGCC CGAGGTGCCG GTGATCGTGT CGGAGGGGCG GTCGTATCCG
GTGGAGACGA TTTATTTGGG CGCGCCGGGC GCGGGATGGG GTGAGCTCGA GCGCGCGACG
ACGAACGCGG TGAAGGACGC CGTGCGCGCG TGCCCAGATG GCGACGTGCT GTGTTTTCTT
CCGGGCGCGG CGGAGATCAA TCGCGTGGTG CGAGATCTGC AAAGGGAGCT TCCGAACGGT
GTCGTGGCGT TGCCACTGTA CGGCGCGCTA TCGCAAGAAG AACAGGCGGC GGCGCTCGCG
CCGTCGAAAC CGGGCACGCG TCGCGTCGTC GTGAGCACTC CAATCGCCGA GTCTTCGTTG
ACCATCAACG GCGTGAAAGT GGTCGTGGAC TCGGGATTGT GCAAGACGCC CAAGTTCGAC
GCTCGGAAGG GTATGACGCG ACTGGAGACG ACTCGTGTCT CCCGCGCATC GGCGGATCAA
AGGCGCGGGC GAGCCGGGCG CATCGCTCCT GGGACGTGCT ATCGTTTATG GAGCGAGGCC
TCGAACGCGA AACTTCAGCC AGACACGACA CCCGAGATTT TGCAAGCCGA CTTGACTCCG
GTGGCGTTAG ATTTAGCAGC GTGGGGCGTC GGCGATGGGG CAGACATGGC TTGGCTCGAC
CCACCGCCTG AAGGCCCGCT CATCGCGGCG AGACGGTTGT TACGCGAGCT CGGCGCGTTG
GAGGAAGGCA AACTCGTCCC TAGCGACGTG GGTTCGATCA TGTCCGAGCT TCCAGTGCAC
CCAAGACTGG CGCGTATGTT ACTTTTTGGC GCGTCACGCG GCGCTGAGAG CGCTCGACTC
GCGTGTCAAC TCGCAGCTGT CATCGGCGAC CGCGACTTGA TCTCTGGACG CGACGCCCCT
CTCGACGTTC GGTGTAGACT TCGCGCGCTT TGGGGTCAAG ATCCACTCGC AAGCGCGGGC
GATTTGGATG AAGAGAAACC AGAAGTCGAC CCGACGAAAC CGACGCGCGT TCCAATCGGT
ACCAAACTCC CGAAGAGCGG GAAGAAAATC AAAGGCGCGC CTCGCGGCAA ACGCGCGAGC
GTAAACGCGG CGGTGAGCGT CGCGACGGGC GGCGCGAGCG GTGCGAGCTG GAATGTCGAC
GAGCGGGCGG TTCGCGAAGC GAAACAAGTT GCCGAGCAGT TACTCGGGAA CTTGCGCCGC
TTAGCGTCTT CGCATCCTGA CTTCTGCGCT CCGGGTGTCG GCGAGGGCGA CGCCTCGGCG
GTGGTGTGGT CGTTCTTGTG CGGCGCGGGC GAAAGCGAAG CCGGCCTACT GCTCGCTGTC
GCCTATCCTG ATCGCGTCGC CGCGCGTAAA AACCGAGGAG GGGCGTTTCA ACTCTCTGGC
GGCGGCGCCG CGAGCGTGGG AAGTGAGCAC AAGGATGACG CGCTGCTTCG CTCTGGCGAC
AAGGCGAATG AAACTTTAGT CGTCGTCGAA TTAGCTGGCG ATGGCGCGGG AAACGCGGGA
TCGAGGAACG ACCGCGTGCG GTTGGCGGCA CCGATCGACC GTGCATGCTT AGAAAGCGGT
GGTGCGCTGT ACGAAGCGTT GTCCAAGGAG AGTGACGACG TGTTCTGGGC GAGCGCATCG
AAATCAGTCT TCGCGCGCAG GCGCTTGACG GTAGGATCTT TGGTGCTTCG AGAGATTCCG
TTTTCGGTGA AAGACAATCC CGAGGCGACT GTGAGCGCGA TGTTAGACGG TATTCGCGAG
ATGGGTCTCG CGAGCGCCTT CGGTTTGAAC AAAGCGACGA CATCGTGGCT CAAGCGCGCG
GAATATGTCC ATCGCTCAGG CGTGGACGCC ACGTTTCCAA ATCTCTCCGA AGAAAATTTG
CTGAGCTCCG CCGGTGAATG GCTCGCGCCG TGGATCGCTG GCGCGCAGTC TAAATCGGAT
CTCGCGAAAG TTGACGTCGC ATCCCTAGTC AAGGCGCATT TCTGCACGTA CGACCATCTC
AAGCTCGTGG ACGACGCCTG TCCCGTCGCC GTTCGCCTGC CCAGCGGATC CAACGCCAAA
GTCGACTACG ACGGCGACGT TCCCGTCGTC GCCGCGCGCA TCCAAGAGTT CTTCGGCACT
ACCGAAACTC CACGCGTGGG CGGCGTCAAC TGCGAATTAC ACCTCCTCAG CCCGGCCGGT
CGCACGCAAG CCGTCACGCG CGACTTGGCG TCGTTCTGGC GCAACGCCTA TCGCACGGAC
GTTCGCAAAG AGCTCGCCGG TCGATATCCC AAGCACTTCT GGCCCGACGA TCCCGAGTCC
GCCGCCGCGA CGAGCAAGAC CAAAAAGTAC ATGTAA
 
Protein sequence
MATRVATTDA RAPVARRARG ARRDVARRAT GDDARREALR KARAKLPIDA VVDACLDALE 
VRRRRAKARR GTDEANDWVR CEQRATRAVM QAPPGAGKTT VMPLAAALAA ASAGGDGRAG
KVIVLEPRRL AAKAAAMRMA EMLGERAGET VGYQVRFERR ASAATRVEVV TEGVLTRRLR
NDPELRDVGL VVFDEFHERN LDADVALALC REVQQTIRPD LRLLVMSATL GEMGARVAAL
LRDENGPEVP VIVSEGRSYP VETIYLGAPG AGWGELERAT TNAVKDAVRA CPDGDVLCFL
PGAAEINRVV RDLQRELPNG VVALPLYGAL SQEEQAAALA PSKPGTRRVV VSTPIAESSL
TINGVKVVVD SGLCKTPKFD ARKGMTRLET TRVSRASADQ RRGRAGRIAP GTCYRLWSEA
SNAKLQPDTT PEILQADLTP VALDLAAWGV GDGADMAWLD PPPEGPLIAA RRLLRELGAL
EEGKLVPSDV GSIMSELPVH PRLARMLLFG ASRGAESARL ACQLAAVIGD RDLISGRDAP
LDVRCRLRAL WGQDPLASAG DLDEEKPERA VREAKQVAEQ LLGNLRRLAS SHPDFCAPAG
LLLAVAYPDR VAARKNRGGA FQLSGGGAAS VGSEHKDDAL LRSGDKANET LVVVELAGDG
AGNAGSRNDR VRLAAPIDRA CLESGGALYE ALSKESDDVF WASASKSVFA RRRLTVGSLV
LREIPFSVKD NPEATVSAML DGIREMGLAS AFGLNKATTS WLKRAEYVHR SGVDATFPNL
SEENLLSSAG EWLAPWIAGA QSKSDLAKVD VASLVKAHFC TYDHLKLVDD ACPVAVRLPS
GSNAKVDYDG DVPVVAARIQ EFFGTTETPR VGGVNCELHL LSPAGRTQAV TRDLASFWRN
AYRTDVRKEL AGRYPKHFWP DDPESAAATS KTKKYM