Gene OSTLU_50379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_50379 
Symbol 
ID5003755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp421298 
End bp424369 
Gene Length3072 bp 
Protein Length1023 aa 
Translation table 
GC content57% 
IMG OID640419176 
Productpredicted protein 
Protein accessionXP_001419803 
Protein GI145350838 
COG category[L] Replication, recombination and repair 
COG ID[COG4581] Superfamily II RNA helicase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0819342 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0284916 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCGA CGGCCGAGCT CGAGTGCGCG AACGCGCGCG CGTCCGAACA CTCGGCGCGC 
GCGCGCGACG CGGCGAAGCG ACGCAAGGCG ACGGAGGCGG CGGCGGATGG AATCGCGGAC
GGCGGCGGCG ACGACGACGG CGACGCGCGC GCGCGCGCGA GGACGAGCTG CGTGCACGAA
GTCGCGGTGC CGCGAGACTG GGTCGGCGAC GTGAAAGCGC TGCGGGACCC GCGGTACGAC
GGCGCGAGGG CGAAGGAGTA CCCGTTCGAG CTGGACGCGT TTCAGCGCGC GGCGACGGCG
GTGCTGGAAC GAAACGAAAG CGTGCTCGTC GCCGCGCACA CGTCGGCGGG GAAGACGGTG
GTGGCGGAAT ACGCGATCGC GATGGCGTTT CGGGATAAAC AGCGGGTGAT ATATACGTCG
CCGTTGAAGG CGCTGAGTAA TCAAAAGTAT CGGGAGTTGA GCGAGGAATT CGGCGACGTC
GGGTTGATGA CGGGGGACGC GTCGATTAAT CCGAATAGTA CGTGCATCGT GATGACGACG
GAGGTGCTGC GGTCGATGTT ATATCGAGGC GGGGACGTAA TTCGCGAGGT GAAGTGGATC
GTGTTCGACG AGGTGCATTA CATGCGGGAC AGAGAACGCG GGGTGGTGTG GGAAGAGTCG
ATCATCTTTG CGCCGAAGGA CGCGCGGTTG GTGTTTTTGA GCGCCACGCT GCCGAATGCG
CTCGAGTTCG CGCAGTGGGT GACGAGCTTA CATAATCATC CGTGTCACGT GGTGTACACG
GATCATCGAC CGACGCCGCT GCAGCATTAC GCGTTTCCCA AGGGCGGGAG CGGTTTACAT
TTAGTCGTCA ACGAGCAGAG TCAGTTTCGT TCGGACAATT TTGCGAGGTT GCAGCAGGCG
ATCGCGGATG GGGCTGAGAA GAGCGGTGGT TCAGGAGGCG GCGGTCGCGG TCGCGGTCGT
GGCGGTGGAC GCGCACGCGG CGGCGGCGGC GGTCGTGGCG GTGGCGGTGG CGGTCGCGGC
GGCGGGTCGA TGGCGGACGC CGATATTTTG CGCATCGTGC GTATGGTGAA GGAGAAAACC
TTCTTCCCCG TCATCGTGTT TAGTTTTAGC CGACGCGAGT GCGAAGAGTA CGCCAAATTC
GTGTCAAAAT TGAATTTCAA CACTCCCGAG GAGGCTGAGC AGGTTCGTGA GGTGTACAAC
GCCGCACTGC TGAATTTGTC CGAAGAAGAC CGTCAGTTGA CGGCGGTGCA AGCGATTTTG
CCATTGCTCG AGGCGGGCAT CGGCATTCAT CACAGTGGTT TACTTCCGGT TTTGAAGGAG
CTCATAGAAA TTCTGTTCGG CGAGTCGCTG ATTAAGTGCT TATTTGCAAC TGAGACGTTC
GCCATGGGAC TGAATATGCC GGCGAGAACC GTCATTTTCA CCGCTGTTAA AAAGTTTGAT
GGCACCGATA TGCGCGTTCT CGCGCCCGGA GAGTATACGC AAATGTCCGG CCGAGCTGGT
CGACGTGGTA AAGACGACCG TGGTATCTGC ATCGTCATGT GCGATGAGCG CATGGAAGAA
CACGCGATGA AGGAGATGAT TCTTGGGAAG CCGCAGCCGT TGAACTCAGA GTTTAAGCTG
AGTTATTACA GTATTTTGAA CCTCTTAAAA CGCGCGACGG GGACGATTGA CGCAGAGTAC
GTCATCGCTC GCTCGTTTCA TCAGTTTCAG CACGCCAAAC AGTTACCAGA ATTAAAGGCT
CGGCTCACTG AAGTACAACA GGAGGCGGCG AAGATAAAGT CGGTGGGTAG CGAAGAGATT
CAAGAGTATA TCAAACTCAG ACGCGATTAT CGCGAAGCCG AGAAAGTGGT CTTGCGCACG
ATGCTCCAAC CGGCAAACTG CTTGCGATTC TTCACTTCGG GTAGACTAGT TCGCATAAGA
GATGGCGACA CGAATTGGGG GTGGGGTGTT GTCATCCAAG TTTCCACAGT TAAAGATGCG
AAGGGTGGCG ACGTACACGT GCTCGACTGT TTGCTTCGTT GCGGTCCAGG CGCGGCAGAG
GGTAGACTTG CGCCTGCGGA CGCAAAGAAT CTGAAGATGA ACACAACGGA AATCGTACCT
GTGGGCACAC ATCTCGTTGA TGCTATTAGC GCGATGCGCT TCACGCTTCC AGGTGATTTG
CGCACGAAAG AAGCGCGCGA AAGCGTTTGG ATTGCCGTCG AAACTGTTAC GAAGAAACTC
ACCGAAAAAG GCCAGGTGAT TCCGCAAATA CATCCTGTCG ATGATATGGG GATTAATGAC
GTCGCATTTG TGCGCACATA CCGTTCACTT GGCGCGTTAC GCGACAAGTT CCACTCGCAC
GCGTTGTACA GCGAAGCGGA TGCGCTCGAG CGCAGCGAAA TGACGGCAAA AATCGACGTC
ATCGAGCAGA AATCAGAGCT CCTCGCTGAA GCGTCGAGAC TGGAGACACA GATTCAATCG
AGCGAGTTGA CAAAGTTCCG CGACGATTTG AGCGCGCGAA GTCGAGTTTT GAAGAAACTC
GGGCACATCG ACAATGATGG CGTCGTTTTG ACGAAAGGTC GCGCCGCGTG CGAAATCGAC
ACCGCTGACG AACTCCTAGT CACCGAGCTC ATGTTCAATG GCGTATTCGC CGGTCTAAGC
CCTCACGAGC TCGTTGCCTT GGCGTCGTGC TTCATGCCCG TAGAGAAAAG CAACACATCG
AACATGGATA AATCCGCAAA GGCGCTCGCG AAGCCGCTCA AAGCCCTTCA GGACGCCGCT
CGAGAAATTG GCAACGTACA AAAAGAGTGT AAAATCGACA TCGAAGTCGA CGACTTCGTC
GAATCATTCA AGCCAACCAT GGTCGAAATC GTGTACTGCT GGGCCAAAGG CGAACCGTTT
TCCGAAATCG TCAAAAAGAC AGATCTATTC GAGGGCACCA TCATTCGCGC CATGCGTCGC
CTGGACGAAC TCATGATGGA ATTACATCGC TCGTGCGTCG CCGTCGGCGA CGACGGCTTG
GCGAAAAAGT TCGAGCAAGG CGCGGAGAGT CTGCGCCACG GCATCGTCTT CGCCGATTCA
CTGTACACCT AG
 
Protein sequence
MSATAELECA NARASEHSAR ARDAAKRRKA TEAAADGIAD GGGDDDGDAR ARARTSCVHE 
VAVPRDWVGD VKALRDPRYD GARAKEYPFE LDAFQRAATA VLERNESVLV AAHTSAGKTV
VAEYAIAMAF RDKQRVIYTS PLKALSNQKY RELSEEFGDV GLMTGDASIN PNSTCIVMTT
EVLRSMLYRG GDVIREVKWI VFDEVHYMRD RERGVVWEES IIFAPKDARL VFLSATLPNA
LEFAQWVTSL HNHPCHVVYT DHRPTPLQHY AFPKGGSGLH LVVNEQSQFR SDNFARLQQA
IADGAEKSGG SGGGGRGRGR GGGRARGGGG GRGGGGGGRG GGSMADADIL RIVRMVKEKT
FFPVIVFSFS RRECEEYAKF VSKLNFNTPE EAEQVREVYN AALLNLSEED RQLTAVQAIL
PLLEAGIGIH HSGLLPVLKE LIEILFGESL IKCLFATETF AMGLNMPART VIFTAVKKFD
GTDMRVLAPG EYTQMSGRAG RRGKDDRGIC IVMCDERMEE HAMKEMILGK PQPLNSEFKL
SYYSILNLLK RATGTIDAEY VIARSFHQFQ HAKQLPELKA RLTEVQQEAA KIKSVGSEEI
QEYIKLRRDY REAEKVVLRT MLQPANCLRF FTSGRLVRIR DGDTNWGWGV VIQVSTVKDA
KGGDVHVLDC LLRCGPGAAE GRLAPADAKN LKMNTTEIVP VGTHLVDAIS AMRFTLPGDL
RTKEARESVW IAVETVTKKL TEKGQVIPQI HPVDDMGIND VAFVRTYRSL GALRDKFHSH
ALYSEADALE RSEMTAKIDV IEQKSELLAE ASRLETQIQS SELTKFRDDL SARSRVLKKL
GHIDNDGVVL TKGRAACEID TADELLVTEL MFNGVFAGLS PHELVALASC FMPVEKSNTS
NMDKSAKALA KPLKALQDAA REIGNVQKEC KIDIEVDDFV ESFKPTMVEI VYCWAKGEPF
SEIVKKTDLF EGTIIRAMRR LDELMMELHR SCVAVGDDGL AKKFEQGAES LRHGIVFADS
LYT