Gene OSTLU_42421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42421 
Symbol 
ID5003109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp543937 
End bp546906 
Gene Length2970 bp 
Protein Length989 aa 
Translation table 
GC content54% 
IMG OID640418530 
Productpredicted protein 
Protein accessionXP_001419195 
Protein GI145349553 
COG category[L] Replication, recombination and repair 
COG ID[COG1643] HrpA-like helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0589659 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.298974 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAACG CAAACAAGTT TGTTACCGAA GGTACCGCCG TCGCGGCCGA AGATGACGAC 
GCGCCAGCCC CGGATGACGA CGCGGAGAAG GTTTTGGATC GCGCGTGGTA CGACGACGAC
GAGGGCGGCG GCGCGCACGG CGACGCGCAC AACCCTTTTA ACACGAACGC ACGGGATGAG
GCGCGTTACG CGAACAAAGA ACAAGAATAC GCGAAAAGGT TGACTCGACG CGACGGGTCG
CTCATGTCCA TGGCTGCGTC GCGACGCGTC AGTCAACTCA ACGCCGATTC AAATCAATGG
GAAGAAAATC GTATGATGAC GTCCGGTGTG ATTCGCACCA AGGAAATTGA TTTGGATTTT
GATGACATGG AAGAAAACCG CGCGGTTTTG CTCGTGCACG ACACGAAACC GCCATTCTTA
GACGGCCGTA TGGTGTTCAC GAAGCAGCAA GAGACTGTCG TACCGGTGAA GGACGTCACG
AGCGACATGG CGCAAATCGC GCGCAAAGGA AGCGCGTTGG TGAAGGAAGT GCGTACGAAG
CGAGAGGAGA ACAAAGGTCG GGATCGATTT TGGGAAATGA AAGGGTCGAA GATGGGATCG
ATCACGGGTA CGACACAAGC TGAAAACAAG GAAGCCGCGG AAAACGCGCA AGCGGCGAAA
GGTCGCGATG ACGACAGACC AGACGTCGTC GGCGCGGACG GCGAAATCGA TTTCAAGGCT
GGCGCCAAGT TTGCCGAGCA CATGAAAGGT TCGAAGGCGA GCGCACAAAG CGAGTTCGCG
AAGACGAAGA CGATCAAAGA ACAGCGTGAG TTCTTACCTG TGTACGGTTG TCGCGAAGAC
TTGATGCATG TCATTCGCGA AAATCAAATC GTAGTCGTCG TCGGCGAAAC CGGAAGCGGT
AAGACGACGC AAATGACGCA ATACATGCAC GAGGAAGGTT ACTCCACATT CGGGATGGTC
GGTTGCACTC AACCCCGTCG TGTAGCTGCA ATGAGCGTCG CGAAGCGTGT GAGCGAGGAA
ATGGGCTGTG AACTAGGTAA GGAAGTCGGT TACGCCATTC GATTCGAGGA CTGCACGGGG
CCTGATACGA TTATCAAGTA CATGACGGAT GGCGTGCTTC TTCGAGAAAC TTTGCGCGAA
CCTGATCTTA ACATGTACAG CTGTATCATC ATGGACGAAG CGCACGAACG ATCGTTACAC
ACTGACGTTC TATTCGGTAT TCTGAAGAAA GTTGTCGCGC GCCGTCGCGA TTTCAAGCTC
ATCGTCACGT CGGCGACGTT GAACGCAGAA AAGTTTAGTA ACTTCTTTGG ATCGGTGCCG
GTTTTCCACA TTCCTGGTCG CACGTTCCCG GTCGATATTC TGTACTCCAA GACACCCGTG
GAGGATTACG TCGAAGCTGC GGTGAAGCAA GCGCTCACTG TGCATCTCTC GTCGGGACCG
GGTGACATTT TGATCTTCAT GACGGGTCAA GAAGAAATCG AGACGGTGAC GTACACGTTG
GAAGAGCGCG TCGAGCAGTT GATGAGCGAA GGCACGTGTC CACCGCTGAA CGTTTTACCA
ATCTACTCAC AACTCCCGAG CGATTTGCAG GCGAAGATTT TTCAAGACGC AGAGGATGGT
AACCGAAAGT GCATCGTCAG TACGAACATC GCGGAGACGT CGCTCACGCT CGACGGCGTC
ATGTACGTCA TCGACAGTGG TTATTGCAAA CTTTCAGTGT TTAATCCTCG AATGGGTATG
AATGCTTTGC AAGTTTTCCC TTGCGCGCAA GCTGCGGTGA ATCAACGCAG CGGCCGCGCC
GGTCGTACTG GACCAGGGAC GTGCTATCGC CTGTACACGG AGATGGCGTT CAAGCACGAA
ATGCTCGTCT CGACGGTTCC CGAGATTCAA CGCACCAACT TGGGTAACGT CGTGTTACTT
TTGAAGTCGC TCAACGTGGA TAACTTGTTA GATTTTGACT TCATGGATCC TCCTCCCCAA
GAAAATATCT TGAACAGCAT GTATTCCCTG TGGATTTTAG GCGCGCTCGA CAACACTGGC
GGGCTCACGA AACTCGGCTC GAAGATGGTT GAGTTTCCCG TCGACCCGCC GCTGGCGCAG
ATGCTCATCA AAGCGGAAGA AACGGGCTGC TCGAACGAAA TGCTCACCGT CGTCGCGATG
TTATCGGTTC CGTCAGTGTG GTTCAGGCCG AAGGATCGAG AGGAAGAATC CGACGCCGCG
CGCGAAAAGT TCTTCGTTCC CGAAAGCGAC CACTTGACGT TGCTCAACGT GTACCAGCAA
TGGAAAAATA ACGGGTACAG GAACGATTGG TGCAACAAGC ATTTCATTCA GGGCAAAGGT
CTGAAGAAAG GTAGAGAGGT GCGCGCGCAA TTGATGGATA TCATGAAGCA ACAGAAAATC
CCGCTCGTGA GCTGTGGGCA AGATTGGGAC GTCTGCCGTC GATCCATCGC CGCCGCGTAC
TTTCATCAAG CGGCGCGTTT GAAAGGCGTC GGTGAGTATG TCAATGCTCG CAATGGTATG
CCTTGCCACC TTCATCCGAG CTCAGCGCTT TATGGTCTGG GTTACACTCC TGATTACGTC
GTATACCACG AACTCATCAT GACATCGAAA GAATACATGC AATGCGTCAC CGCCGTCGAA
CCGCACTGGC TCGCCGAATT CGGACCGATG TTTTTCACGC TCAAGGAGAG CCATTCGAGC
ATGTTGAAAT CAAAGGCGAA GCGCAAAGAG GACAAGGCGA AGATGGAGGC TGAAATGCAA
GCTAAACGCG ATGAGGAAGC ACAGCTGCAA GAAGCGCAGC GCACGCGAGA AGAAGATCGC
CGCGCGAGAC AAAGGAGTCA AATCGTGACG CCGGGGCAAC GCAGCGCGGC GACGACGCCG
CGCGTAGATT ACGGAACGTC GCGTCCCCCA TCGAGCGTTC GGCGCGGTGC GGGTGGCCGG
ACGCCGGGAA GAAAACGCTT TGGACTATAG
 
Protein sequence
MANANKFVTE GTAVAAEDDD APAPDDDAEK VLDRAWYDDD EGGGAHGDAH NPFNTNARDE 
ARYANKEQEY AKRLTRRDGS LMSMAASRRV SQLNADSNQW EENRMMTSGV IRTKEIDLDF
DDMEENRAVL LVHDTKPPFL DGRMVFTKQQ ETVVPVKDVT SDMAQIARKG SALVKEVRTK
REENKGRDRF WEMKGSKMGS ITGTTQAENK EAAENAQAAK GRDDDRPDVV GADGEIDFKA
GAKFAEHMKG SKASAQSEFA KTKTIKEQRE FLPVYGCRED LMHVIRENQI VVVVGETGSG
KTTQMTQYMH EEGYSTFGMV GCTQPRRVAA MSVAKRVSEE MGCELGKEVG YAIRFEDCTG
PDTIIKYMTD GVLLRETLRE PDLNMYSCII MDEAHERSLH TDVLFGILKK VVARRRDFKL
IVTSATLNAE KFSNFFGSVP VFHIPGRTFP VDILYSKTPV EDYVEAAVKQ ALTVHLSSGP
GDILIFMTGQ EEIETVTYTL EERVEQLMSE GTCPPLNVLP IYSQLPSDLQ AKIFQDAEDG
NRKCIVSTNI AETSLTLDGV MYVIDSGYCK LSVFNPRMGM NALQVFPCAQ AAVNQRSGRA
GRTGPGTCYR LYTEMAFKHE MLVSTVPEIQ RTNLGNVVLL LKSLNVDNLL DFDFMDPPPQ
ENILNSMYSL WILGALDNTG GLTKLGSKMV EFPVDPPLAQ MLIKAEETGC SNEMLTVVAM
LSVPSVWFRP KDREEESDAA REKFFVPESD HLTLLNVYQQ WKNNGYRNDW CNKHFIQGKG
LKKGREVRAQ LMDIMKQQKI PLVSCGQDWD VCRRSIAAAY FHQAARLKGV GEYVNARNGM
PCHLHPSSAL YGLGYTPDYV VYHELIMTSK EYMQCVTAVE PHWLAEFGPM FFTLKESHSS
MLKSKAKRKE DKAKMEAEMQ AKRDEEAQLQ EAQRTREEDR RARQRSQIVT PGQRSAATTP
RVDYGTSRPP SSVRRGAGGR TPGRKRFGL