Gene OSTLU_50709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_50709 
Symbol 
ID5004243 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp18547 
End bp21889 
Gene Length3343 bp 
Protein Length1055 aa 
Translation table 
GC content52% 
IMG OID640419664 
Productpredicted protein 
Protein accessionXP_001419877 
Protein GI145351001 
COG category[L] Replication, recombination and repair 
COG ID[COG4581] Superfamily II RNA helicase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.918381 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGTCG CGAGCGTCGC GCCGTTGCGT ACGTCGTTTC GTGGCGTGAA CGCGAACGGG 
AGCGACGCCA CCGGGGAGCG TCGTCGCCTC CTCTCGTTTC GTCGCGCTCC ATCGCGCGCG
CGCCGCGATG AAGCGGTAAA CGCTCGCGTT CGAGCGAGTG GGACGTCTGT AGAACCGAGT
GTGGATGAAG ACGATGAAGA AGTCGACGTC ACGGTGACGT CTACGTCGGA CCGGTTGGAG
GCTCTTTTAA ACAAAGCTTC CACGTCGCCG GTGAACGTGC ATGACATAGA ACAGTTTTAC
CCGTACGAAT TAGATGGATT TCAAGTCGAA GCCACTGAAT TATTGCTACG CGGGTCATCG
GTCGTGGTGT CAGCGCCGAC GGGGAGCGGG AAAACTTTAG TCGGTGAGAC GGCGATTTTA
ACTGCGCTGG CGCGCGGAGA AAAGGCGATA TACACGACGC CGTTGAAGGC GTTATCGAAT
CAAAAGCTCA GAGAGTTTCA AAAGATTTTC GGTAAAAGAC GGTGTGGGTT AAAGACTGGT
GATGTGGACA TCAACGGCGA TGCTGACGTC ATGATTATGA CGACAGAAAT TTTGAGGAAT
ATGCTCTATT CAAGCGCTGC AGGTGGCCGC GACGACGAGC GACTCGCCGA CGTGAGTATC
ATCGTTCTCG ATGAGGTGCA CTATCTCGCA GATAGATCCC GAGGAACAGT GTGGGAAGAA
ACAATTATTT ACTGTCCTTC TCGCATTCAA CTTTTGTGTC TGTCCGCGAC AGTCGGTAAT
CCAGAGGATT TGTCGGGCTG GATTGAGGAA GTTCACGGCG AATGCGAAAC CGTCGTGTCA
AGTTACAGAC CTGTTCCCCT CACTTGGCAA TACAGCATGA AGCCGTCGCG CATGTATCCA
GGGTTGGGAC CTTTGATGAA TTTCAAATCC ACGAAAATTC ACCACGATCT ACGACCTTTT
ACGCGCGAAG GTCTTCAACA AGGATCGTAC GGTAACAACG ATTGGGCACC GGATGCGCAA
AGAGGCGCGA AAGAGTCTGA GCGCGTTCTC AGAAGGCGCT TCGTGCCTCA CGTCGAAACC
ACGGTACAGC AGCTCATCGC GAGCGATATG ATTCCCGCCG TGTGGTTTAT TTTTAGTCGT
AAAGGTTGCG ACCAATCTGT CGATTATCTC GTGCAGGCCG GCGGAAATCT CGTGACGAGC
AAGGAACGGC GAGAGATTGA TGATGCGTTG AAAGAATTTT CGGAGAAGAA CAAATCTGCT
GTGCGAGCAA GCATGGTTGA GCCTCTAAGA CGTGGCATAG CGTCACATCA CGCCGGGTTG
CTTCCAGCGT GGAAAGGTCT AGTAGAGAAA TTGTTTCAGC GAGGACTAAT CAAGGTGGTT
TTCGCTACGG AAACACTCGC CGCAGGAGTG AACATGCCGG CACGATGTTC CGTGTTAAGT
GCGCTTTCCA AGCGAGATGA TCAAGGTCCA CGACTACTAA CTTCCAATGA GTTCATGCAA
ATGGCTGGTC GAGCGGGCCG TCGTGGGTTT GACACCGTTG GCCACGTCGT GTGCTGTCAG
TCACCATTTG AGGGCCCGGA CGAAGCTTTC GAGCTCGTTC TTGCGCCACC TGAAAACTTG
AAGTCGCAAT TTTCTATTTC TTACGGTATG GTTTTGAATT TACTCCAAGG TAGAACGCTC
GATCAAGTCA AGGGAATCGT GGAGAGAAGT TTTGGCAACT ACCTTGGTGG TAAGGCGCGT
TCGATGCGCG AGCGCGAACT TCTTCGTGTC AATGATCAGA TCAGAAAACT GGTGAGCGAG
ATGGAGACAC TCGATGATGA TGAAGAAGCC GCGGAGTGGA GACGTTTCGT GAAGCTGGAT
GAACGGCTAC ATGAAGAAAA GCGCTTGCTA AAAATCTTGA TTCGGCAATT AGCTGAAATG
CGGGCAATTG AAACGCGGGA TCAACTACAG TTCGAGCTCG AACAGACGGG TGCTCCAGTC
ATCGTTACGA TTGATATTGG TGACAACGTT CTCAAACGGC GCAAAGAACG ACGATCGGCT
ACGATCGCTC TATTTGAAGA TGATTCTTTG AAGATGGAGG GTCAAGACTT TGCGGGCGAA
TGGAGATTGC AAGATATTGG TGGCGACGAT GCTCCCGGTT TGGACGAGTT ATTCGGCGAT
TCCGACGATG AAAGTCAAAA CGATGATTTT TTGCAGTCTT TTGACGACGA CGACGGTAAG
TTCCCCAGAG GTCTCATCAC CGCCGCAATC GTCGAGGCTG TTCCGGCGAT GAAAATTGCG
GCGACTGCGA GCACAATCGG TAAGCCATAT CCGATGGGCG AATTCACCGC CCTAGGAAAT
GATGGTGTCT GGTACCGTTT GTACTCCGAT CGAGTAAAAT CTATAAGCCT CGGCGCCGAT
GCGGTGCGCT TGGAAAGTTT TGGCGACATC GGCGTGCCGC CGGCTTCGAG CAGTCTTCGA
TGGATCCGCG CGAGCGGCGG AGGTTTGTGG AAGGCGGATG TGTCTAAAAA GACCAAGTTG
GTCGCTAATG GGATACCAAC TAACTTGAAC GACTTTGAAA TGATCGTCGA ATCGTCGGAT
TCGATGGAAT TTATCGACGC GCAGAAGTTA CAGATTCAAA AGACGCGCGA GGAGATAAAC
GGCTTGAAAA ACATCGCCAC CTTGCGCCGC GCTGCGAAGC AGCAAAAGCG GGCGGAGACG
AAACTGAAAA AGCTAAAAGA AAAACGTGAT GGAATTGAAA AACGTATCAA AGAGTATTCT
GCCGCGGGCT GGGACGATTT CTTGAGAGTT GTCGATATTC TTGTCGAGTG CGGCGCGATC
GAGAGAGACA CCCTAAAATT GTTGGAGTTT GGTGAGACCT GTGCTGACTT GAGAGGGGAA
AATGAATTAT GGCTTGGCAT GGCTATGTCT TCGCCGAGTA TCGAGAATTT GGACGCCGCG
ACTCTTGCAG GTTTTGCAGG GGCGCTCTGT ATGGACAACC GTCCGGCTAC ATGCTACTAC
GGCGCTTCGC AACACCTCGT CGAAGTGCTT GAAGAGCTCG AACCGGAGAT GGGCGACCTT
CAGTACTTGC AACAATCTTC TCGAATCGAC ATGCCTCTGA GCTTGAGTTT CGAGATCGCG
GCGTTGGTAG AGTCATGGGC ATCGGGAACG TCGTGGGACC AGATACGCCG TGATACTTCC
TTAGATGAGG GAGACATCGC TAGATTGTTT CGACGAACTG CAGAACTTCT TGCGCAAATT
CCGCGCACCG CACATCTACC GGAGAGTCTC AAAGCGACTG CAAAGAAGGC GAACGATGTC
GTCAATAGAC CTCCGATTAG TGATCTTTCT TGATCATTAT CAT
 
Protein sequence
MPVASVAPLR TSFRGVNANG SDATGERRRL LSFRRAPSRA RRDEAVNARV RASGTSVEPS 
VDEDDEEVDV TVTSTSDRLE ALLNKASTSP VNVHDIEQFY PYELDGFQVE ATELLLRGSS
VVVSAPTGSG KTLVGETAIL TALARGEKAI YTTPLKALSN QKLREFQKIF GKRRCGLKTG
DVDINGDADV MIMTTEILRN MLYSSAAGGR DDERLADVSI IVLDEVHYLA DRSRGTVWEE
TIIYCPSRIQ LLCLSATVGN PEDLSGWIEE VHGECETVVS SYRPVPLTWQ YSMKPSRMYP
GLGPLMNFKS TKIHHDLRPF TREGLQQGSY GNNDWAPDAQ RGAKESERVL RRRFVPHVET
TVQQLIASDM IPAVWFIFSR KGCDQSVDYL VQAGGNLVTS KERREIDDAL KEFSEKNKSA
VRASMVEPLR RGIASHHAGL LPAWKGLVEK LFQRGLIKVV FATETLAAGV NMPARCSVLS
ALSKRDDQGP RLLTSNEFMQ MAGRAGRRGF DTVGHVVCCQ SPFEGPDEAF ELVLAPPENL
KSQFSISYGM VLNLLQGRTL DQVKGIVERS FGNYLGGKAR SMRERELLRV NDQIRKLVSE
METLDDDEEA AEWRRFVKLD ERLHEEKRLL KILIRQLAEM RAIETRDQLQ FELEQTGAPV
ILFGDSDDES QNDDFLQSFD DDDGKFPRGL ITAAIVEAVP AMKIAATAST IGKPYPMGEF
TALGNDGVWY RLYSDRVKSI SLGADAVRLE SFGDIGVPPA SSSLRWIRAS GGGLWKADVS
KKTKLVANGI PTNLNDFEMI VESSDSMEFI DAQKLQIQKT REEINGLKNI ATLRRAAKQQ
KRAETKLKKL KEKRDGIEKR IKEYSAAGWD DFLRVVDILV ECGAIERDTL KLLEFGETCA
DLRGENELWL GMAMSSPSIE NLDAATLAGF AGALCMDNRP ATCYYGASQH LVEVLEELEP
EMGDLQYLQQ SSRIDMPLSL SFEIAALVES WASGTSWDQI RRDTSLDEGD IARLFRRTAE
LLAQIPRTAH LPESLKATAK KANDVVNRPP ISDLS