Gene OSTLU_12759 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_12759 
SymbolCHR3506 
ID5002700 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp705643 
End bp708540 
Gene Length2898 bp 
Protein Length956 aa 
Translation table 
GC content53% 
IMG OID640418121 
Productpredicted protein 
Protein accessionXP_001419020 
Protein GI145349184 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0485802 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.764276 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCTCG AACGCGTGCG AGAGGATCAG AACAAGAAGA TTGCGGGAGA TAATAAGGCG 
GGGAAGTGGA AGTTTTTGTT GGCGCAAACC GAGGTCTTCG CGCACTTTTT GTCGGGGACG
AAGGCGGCGA ACGAGGCGGC GAACAAGGGC AAGCGCGGGC GCAACAAGAG CCACGCGGCG
GAGGAGTCGG AGGACGCCGA GCTCGTGGAA CACGCTGAGG ATTACCAAGC GGTGCGTTTG
ACGTCTCAAC CGACGTGCAT TAAGTTTGGC AAGATGCGAG AGTACCAAAT TGCGGGTTTG
AACTGGATGA TTCGATTGTT CGATCACGGC ATCAACGGCA TCTTGGCGGA CGAGATGGGT
CTCGGGAAGA CGTTGCAGAC GATTTCTCTT CTCGGTTACC TCGCCGAATA CCGGGGTGTT
ACCGGACCGC ACATGGTTGT CGTGCCGAAA TCCACGCTCG GTAACTGGAT GAACGAGTTT
AAGCGTTGGT GTCCGATGAT TCGCACGTTC AAGTTTCACG GAAACGCCGA AGAGCGCGAG
GCGCAAAAGG CCAAGTTTTT GGTGCCGGGC GGTTTCGACG TGTGCGTGAC GTCTTACGAA
ATGGTCATCA AGGAGAAGAC TGCGTTGAAG AAGTTTCACT GGCGTTACAT CATCATCGAC
GAAGCGCACC GCTTGAAGAA TGAAAATTCT CGTCTTTCCA TCGTCCTAAG AACGTTTTCG
GCGAACAACC GCATGTTGAT CACCGGGACG CCGCTTCAAA ACAACCTTCA CGAGCTCTGG
GCTTTGCTCA ACTTTTTGCT CCCCGAAGTG TTCGGTAACG CCGGTCAGTT CGATGAGTGG
TTTGCGAACG TTGAAGACGG TGAGGGTGGT TCCGGTGCCG TCGTGTCTCA GCTTCACAAG
GTTTTACGCC CCTTCTTGTT GCGTCGATTG AAGACGGAGG TCGAAACCAG CCTCCCGCCG
AAGAAGGAGA CAATTCTCAA GATTGGCATG ACTGAGATGC AAAAAACGTT TTACAAGCGC
ATCTTGCAAA AGGACATCGA CATTGTTAAC AGTGGCGCGG ATAGATCTCG GCTTTTGAAT
ATCGTCATGC AGCTTCGCAA GTGCTGCAAC CATCCGTACT TGTTCCAAGG TGCCGAACCG
GGTCCGCCGT ACATCACCGG GGATCACCTC ATCGAGAGCT CGGGTAAGCT CGCGCTTTTG
GACAAGCTTT TGCCTCGTCT CATGCAACGC GGCAGTCGTG TGCTGATCTT CTCTCAGATG
ACTCGTTTGC TCGACATCTT GGAAGATTAC TTGATGTACA GAAACTACCA ATATTGTCGC
ATCGACGGGA GCACGGATGG CGCCGTCCGC GAAGACCACA TTGACGCCTT CAACAAGGAA
GGTTCGGAGA AGTTCTGTTT CTTGCTTTCC ACGCGGGCGG GTGGTTTAGG TATTAACCTC
GCCACTGCGG ACACCGTCAT CATCTACGAC AGCGACTGGA ACCCGCAAAT GGATCTTCAG
GCGATGGATC GAGCGCACCG CATCGGTCAA AAGAAGGAGG TACAAGTGTT CCGCTTTTGC
ACCGACGGTA GCGTCGAGGA GAAGGTGATC GAGAAAGCGT ACAAGAAACT CGCGCTCGAT
GCTCTCGTCA TTCAACAAGG ACGATTGCAA GAGAACAAGA AGAACCTCGG CAAGGATGAA
CTCTTGGCGA TGGTTCGCTT TGGCGCCGAG AAGATTTTCG ACTCGTCGAC GACGTCGATC
ACGGACGAAG ATGTCGACGC CATCATGGCT CGCGGTGAAG AAGAAACCAA AGCACTCAAT
AGCAAAATGC AAGGTTTCAC GGAGAAGGCC ATTCAGTTCT CTATGGGTGC CGAGAACTCG
CTTTACGAAT TCGAGGACGA AGACGACAAA AACGTCGCCG CGTTGCCAGA GGGCATTGAC
ATGAAGACCA TCATCAGCAG CAACTGGATC GATCCTCCCA AACGCGAACG TAAGAAGAAC
TACAACGAGT CTGATTACTA TCGCAGCGCC ATGGCGCAAG CGGCTCGCCC ATCCAAGCCA
ATGGGACCGA AGATTGCCAA GCTGCAGCAA ATGCACGACT TCCAGTTTTA CAACACCGCG
CGAATTCAAG AAATTTACGA CAAGGATGTT AGACGCAAGA CGTACGAATG GCAGAAAGAC
AAGAAGAAGG AGGAAGCTAA AGCTGCGCAA GATGAGGAGG CGCCCACTGA AGCGGAGGAA
GACGATCCCA ACGCGCCTCC GGCGATCACG GAGGAAGAAA AGGCTGAGCA AGAATCGCTC
CTCTCTCAAG GTTTCACAGA ATGGTCTCGC CGAGACTTCC AAGCCTTCTG CCGACTCAGC
GAGAAGTACG GGCGCGAAGA CGTGGAGTCT ATTGCGAGTG AGATGGAAGG AAAGACTCTC
AAAGAGGTCA AGGATTACGC CGCCGTATTC TGGAAACGTT ACGAAGAGAT TGCCGATCAT
CCGCGCATCA TCAGCAACAT CGAAAAGGGT GAACAAAAGA TTCAGCGTCA ACACGACATG
CTGAAGGCTG TACGAGAAAA GATCGCCAAG TACAAGAATC CGTGGCGCGA GCTCAAGTTG
ACGTATGGTC CGAACAAATT CAAATCGTTC ACCGAAGAGG AGGATAGGTT CTTGTTGTGC
TCCATCCCCG AAGTTGGCTT TGGGAATTGG GACGAGCTCA AGGCGCAGAT TCGCCAGCAC
TGGCAATTCC GATTCGATTG GTTCATCAAA AGCAGAACGC CAAAAGAGCT CGGTCGCCGC
GTGGAGACTT TAATTTCTCT CATCGAGAAA GAGGCTCAGG ACAGAGGCGA CAAGAAGCGA
GACGCCGAAG CCGAAGCCGA AGCCGACGGG AGCGCGCAGA AAAAGGTGAA GGTTGAAGCT
GAAGGCGTGA CCGCGTAG
 
Protein sequence
MELERVREDQ NKKIAGDNKA GKWKFLLAQT EVFAHFLSGT KAANEAANKG KRGRNKSHAA 
EESEDAELVE HAEDYQAVRL TSQPTCIKFG KMREYQIAGL NWMIRLFDHG INGILADEMG
LGKTLQTISL LGYLAEYRGV TGPHMVVVPK STLGNWMNEF KRWCPMIRTF KFHGNAEERE
AQKAKFLVPG GFDVCVTSYE MVIKEKTALK KFHWRYIIID EAHRLKNENS RLSIVLRTFS
ANNRMLITGT PLQNNLHELW ALLNFLLPEV FGNAGQFDEW FANVEDGEGG SGAVVSQLHK
VLRPFLLRRL KTEVETSLPP KKETILKIGM TEMQKTFYKR ILQKDIDIVN SGADRSRLLN
IVMQLRKCCN HPYLFQGAEP GPPYITGDHL IESSGKLALL DKLLPRLMQR GSRVLIFSQM
TRLLDILEDY LMYRNYQYCR IDGSTDGAVR EDHIDAFNKE GSEKFCFLLS TRAGGLGINL
ATADTVIIYD SDWNPQMDLQ AMDRAHRIGQ KKEVQVFRFC TDGSVEEKVI EKAYKKLALD
ALVIQQGRLQ ENKKNLGKDE LLAMVRFGAE KIFDSSTTSI TDEDVDAIMA RGEEETKALN
SKMQGFTEKA IQFSMGAENS LYEFEDEDDK NVAALPEGID MKTIISSNWI DPPKRERKKN
YNESDYYRSA MAQAARPSKP MGPKIAKLQQ MHDFQFYNTA RIQEIYDKDK DKKKEEAKAA
QDEEAPTEAE EDDPNAPPAI TEEEKAEQES LLSQGFTEWS RRDFQAFCRL SEKYGREDVE
SIASEMEGKT LKEVKDYAAV FWKRYEEIAD HPRIISNIEK GEQKIQRQHD MLKAVREKIA
KYKNPWRELK LTYGPNKFKS FTEEEDRFLL CSIPEVGFGN WDELKAQIRQ HWQFRFDWFI
KSRTPKELGR RVETLISLIE KEAQDRGDKK RDAEAEAEAD GSAQKKVKVE AEGVTA