Gene OSTLU_119483 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_119483 
SymbolHel1 
ID5000193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp420670 
End bp422010 
Gene Length1341 bp 
Protein Length446 aa 
Translation table 
GC content47% 
IMG OID640415614 
ProductDEAD-box helicase, probable 
Protein accessionXP_001416141 
Protein GI145342120 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.112592 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGCTT TCGTGAGCGG CGAGAAGCTA AGGAAACGAA ATGGGCACGA CATGAACGGC 
GAATGTCCAC GACAAATTAC ACCAGTTTCA CCTCTTGATC ATCGTCTTTA CGCTACTCTT
GCGCGTGGTC AGCTTCATTC TCTTTTACCG GTACAGAAGC AGACGCTATC GCGCGCTCTT
GCGGGGAGCT TCGAGAGGGA CTTGTGTGTC ACAGCTCCTA CCGGGAGCGG TAAAACTCTC
TCCTATTTGC TACCTGTGTT ACAGATACTT TCAAAGCGAG TATCGAAAGC CGAAACAGTG
TGCCTGATTC TTGTACCGAG TGGCGATCTT TCCGCACAGG TATGCGTCGC CGCTAGTGAA
CTCTGCAAAG CATTGCACGT ACAGATTTCT ATTGTTGGGA AAAGTCGCAA TACAACCAAT
TCAACTAAAC TAGTGAACAA GCGATATCGA CGGCTATTGG CTCAAAATAG ACAGCGTATG
TATCAGTGTT GTATCACGCG CCAGTTTGCG CGACATAATT CGCTGGATGT GCCTTCACAA
ATACTTGTAG CGTCTCCTGG AAGGTTTGCT GCACATCGGA CGAGTATTCT CAAGAAATTG
AAATTGTTAG TCATCGATGA GGCGGACAGA ATGCTTCAAC AATCATATCA AAACTGGCTC
GTTACGCTGG ACTATGAAAT GTGTGCGCGA ACACGTCAGT GTCGGCGTCT CACTGTGGGT
GAACGAAACT CTGAAAGACT GCAACTTATA TTTTGTTCAG CAACATTGCA AAGTTCGTGT
CTACAAAGGG TTGGAGCTTT CGCACTTGAT CGAATCAATG CTTATGATAG TGTCTGTCCA
CTTTTGCCTA TTACGCTGTC TGAGTATGCG ATTGTTGCGG AACACACTGA TAAGTTTGAT
GCTCTTGTCT CGCTACTTGA ATTTTTCAAG GGTGAAAAAG TTATTGTCTT CTCTGCCTCG
GTATATCGTG CTAGACATAT ACTACAGCGG TTGAATAAAT TAGAAAATTT GCCCTGTTTC
GAGTATAGCA GTGACGCAAA CCTTCGAAGA CGAGCATCTA CGCTTCAAAA CTTCCAGCGC
TGCCAGGGAG GTGTCCTTGT TGCTTCAGAT GCCGCTGCAC GAGGTTTACA CATCGATGCA
GTATCGGCTG TGATATCTTT TGACGCTCCT GAGCATTTCG AGACGTATCT GCATCGAGCT
GGAAGAACGG CGAGGGCAGG CAAAACGGGC AAATGTATCA CGATTTGTTC GACAGCACGT
GAAGCACAAA CTTTCATAAA ACGAGTTCAG CGCCATGTCC CAATACTCCT TTCAACAGAG
CTTCAAGGCT CGACTATTTA G
 
Protein sequence
MRAFVSGEKL RKRNGHDMNG ECPRQITPVS PLDHRLYATL ARGQLHSLLP VQKQTLSRAL 
AGSFERDLCV TAPTGSGKTL SYLLPVLQIL SKRVSKAETV CLILVPSGDL SAQVCVAASE
LCKALHVQIS IVGKSRNTTN STKLVNKRYR RLLAQNRQRM YQCCITRQFA RHNSLDVPSQ
ILVASPGRFA AHRTSILKKL KLLVIDEADR MLQQSYQNWL VTLDYEMCAR TRQCRRLTVG
ERNSERLQLI FCSATLQSSC LQRVGAFALD RINAYDSVCP LLPITLSEYA IVAEHTDKFD
ALVSLLEFFK GEKVIVFSAS VYRARHILQR LNKLENLPCF EYSSDANLRR RASTLQNFQR
CQGGVLVASD AAARGLHIDA VSAVISFDAP EHFETYLHRA GRTARAGKTG KCITICSTAR
EAQTFIKRVQ RHVPILLSTE LQGSTI