Gene OSTLU_46542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_46542 
Symbol 
ID5003687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp246841 
End bp249907 
Gene Length3067 bp 
Protein Length545 aa 
Translation table 
GC content60% 
IMG OID640419108 
Productpredicted protein 
Protein accessionXP_001419535 
Protein GI145350269 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.893722 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.156385 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AAGGACAACG ATGCTGTCGT CGACGAAGCG AAAGAAGGTA CCGAGGCTGT GAATTCGCCG 
CAAAAGTCTA CGCCAACCAA CTTCACCGAA ATCGTCCTCG TCACTGCGCG GCGTCTTCAT
GTGGTAACGA AGACAGAATC TACGGCACGA CTGTACGAGT TCGCTGTCCA ACGACGCAAT
TCATTGCAAG CTCTGATCGA TGCGATTGAT TCTGGTAGCA CACCAACGGA CATGGAAGAC
CCGAGCGTGC CTGTCGTCGC TAAACTTCAC GCGGAAGTCA CCAAGCTAAC GGCAAAAATT
GCCGAGCTGA AAACAAAGCA CAGCAATTTG GTCGAAAGCG CGCTCGTCGC CGAAGGCCAG
ACGACCGATC CAGAACAACT AAAGCTCGTC AAGGAAACGT ACGAGGAGCT CACCACTACG
ATTGTTCGAT GTGAGAAACT CAAGGCGGAT CACGCCTGTG AAGTGGTCAA TTACCGCAAA
GAGACGAGCC AGCTCGCGGA TGAAATGGGC AAGCGCGACA TTGGCAACGC TGATTCGACA
GAATACATCC TGCGCGTGAT GAGCGCCGTG CTCCAACGTG AAGTGACCGA TATCGATGAA
GCCGAACGCG AAGCCGAAAT CGCCATCGGA GAGCTGAGAA GGTTGCGTTC GGACGTCGCG
AAAACGAAAC GCGCGAAAGT CAGTCGCGGT CGCTCAAAAA AATCCGCGGC GACAAACGCG
ACGGAAGCCG TCGCCGACGC CGACGCCGAC GCCGACGCCG ACGCCGACGT CGAACCTCGC
CAAGAGCTCG AGCCCCCTCG GCCCGAGCCC GAGCCCACCT CTGACGTCAC CATGCTCGTC
GCCGACGCCG TCGCCGACGC CGTAGCCGAC GTCTGAGGTC CCCGTTGTAA AAACATCCCC
ACTCGCCCAC TCGGGCGCAT GCGAACGACC ACCTCTCTGT TCGCGCAACG TTTGACCCTC
ATCAGTGGCC TACGCGCGTT CACCACCACG CGCTCCGCGC CCCGTCGCGC GCTCCACGCG
TCGTCGTCGT CGATCGTTTT CGCCGCGCGC GCGACCCCGG ACGCGAATCG CGACATGTTT
CGCGACGGTC CCGACGCCGG TGGGCGCGGA CAACCGCGCA AGCGCCCTCG AGGCGGTCGA
GGCAACGGCG GCGCGCCCCG TCCGCGCAGC GATGGCGCTC GAACGCACGT GAACGATCGA
CTCGAGAAGC CGTGGGAAGG TCAGGACGAC GCGCCGAGAT CGAGCGCGCC GCCACCGCGA
CCGAATTTGA TGAAAAACAC GCACGGAGGC GCGCTCGCGC AGGCGCAGAT CGTGGAAGTC
GCGACGGGGA ACGCGCCGAG CACGACGGCG GCGTTCGCGA ACATGGGACT CACGGAGGCG
TCGATGCGAG CGATTCACGA GGTGATGGGA TTCACGCACG CGACGGCGGT GCAAGATTCG
ACGCTGCCGC ACATCATGCG AGGATTGGAT GTGCTCGCGC GCGCGAAGAC GGGAAGCGGG
AAAACGGTTG GATTTTTGTT GCCGGCGATC GAACGGTTGG CAAAACAAGG AGCGCCGCGA
AAGGGGGACG TGTCGTGCCT GGTAATTTCG CCGACGAGAG AGTTGGCGTC GCAAATCGGC
GAGGAGGCGA AGAGTTTGTT GACGTATCAT CCGTTCAATT GCCAGGTTGT CTTCGGTGGG
ACGAACATTA ACAGCGAACG AAAGCGATTG ACGTCGCAAG GGGTTGAATT TTTGGTCGCT
ACTCCCGGAC GGTTGATCGA TCACTTTGAG TCGAGCAACT TGGCGCGCGC GTGCCAAAAC
CTCGACGTCT TGGTGCTCGA CGAGGCGGAT CAGCTTCTCG ACATGGGTTT CCGACCGAGC
TTGGAAAAAA TTTTGTCTTA TTTGCCTACG CAGCGACAGA CGTTGTTGTT CTCCGCCACG
GTGCCGAAAA CGGTGCATCA AATCGCTGCC AACGCTTTGC GACCGGGTCA TCAGTACATC
GACTGCGTCG GCGACGACGC ACCGGCGACG AATTTGCAAG TTAAGCAATC GCTCATCGTC
GCCTCGTTTC ACGACCACTT GACGCTGATG ACGCAAGCTA TTGAAGAACA TCAGGCGGAG
GAACCAAATC ACAAGATCAT GGTATTCTTT CCCACCGCTC GTTCGACGCA ACTCGCGTCA
GAAATGTTCG AGGCGTGCGG CAAACCCGTC TTTGAGATTC ACAGCCGCAA GTCTCAGGCT
GTTCGTACGA AGGCGGCGGA TAAGTTCCGC GAAGCCCGCG CGGCGGTCAT GATGTCTTCT
GATGTCACCG CACGCGGTAT GGACTTTCCA GACGTGACGA TGGTTATTCA AATCGGTGTT
CCGTCTGCTC GCGAGCAATA CATTCATCGT CTTGGTCGAA CCGGGCGCGC AGGCAAGACG
GGCAAAGGCT TGCTCATGCT CGATCCCGCG GAGAAGTTTT TCCTCAACGG CGTTCGTGAC
TTACCCATCG AAGTCTTAAA CCCTGCGATT GATTCTCAAG TCGACCAGCG CGTGCGCAAG
GCCATCAGCA GAATCAATCC GGATACCAAG GCGCAAGCAT ACAGCGCATG GTTGGGCTTT
TACAACGGCT CGAGCGGAAA GATGAAGTGG AGCAAGGACG ATTTGATCTT TGCAGCCAAC
AACTACGCCT TGGAGACCTT GCAGTGCGAA AGTTTGCCCG GTCTTTTGAA GAAGACTGTG
GGTAAGATGG GTCTCAAGGG GTTCACCGCG AACTTGAACA TTGTCTCCGA GTTGGGCGGC
GGCGGCGGCG GCGGTCGCGG CGGCGGCGGC GGCGGTCGTG GCGGCGGTGG CGGCGGTCGC
GGTGGCGGCG GCTTCGGTGG TGACCGCGGC AGTGGTGGAT TCGGCGGTGG TGGCGGTGGT
CGCGGTGGTC GCGGCGGTGG TCGTGGTGGC GGCGGTGGTC GCGGCGGCGG TCGCGGCGGC
GGACGCGGCG GTGGCGCCAA CGCGACGAGC ATGAACTGGC AACAGCAAAG CTTCTATTGA
GCGCGGTGTC ACCCTTTTTT CAAATAAATT CGCAGTAGAT TACATTCTTT TATCCGTCGT
GTGGACT
 
Protein sequence
MGLTEASMRA IHEVMGFTHA TAVQDSTLPH IMRGLDVLAR AKTGSGKTVG FLLPAIERLA 
KQGAPRKGDV SCLVISPTRE LASQIGEEAK SLLTYHPFNC QVVFGGTNIN SERKRLTSQG
VEFLVATPGR LIDHFESSNL ARACQNLDVL VLDEADQLLD MGFRPSLEKI LSYLPTQRQT
LLFSATVPKT VHQIAANALR PGHQYIDCVG DDAPATNLQV KQSLIVASFH DHLTLMTQAI
EEHQAEEPNH KIMVFFPTAR STQLASEMFE ACGKPVFEIH SRKSQAVRTK AADKFREARA
AVMMSSDVTA RGMDFPDVTM VIQIGVPSAR EQYIHRLGRT GRAGKTGKGL LMLDPAEKFF
LNGVRDLPIE VLNPAIDSQV DQRVRKAISR INPDTKAQAY SAWLGFYNGS SGKMKWSKDD
LIFAANNYAL ETLQCESLPG LLKKTVGKMG LKGFTANLNI VSELGGGGGG GRGGGGGGRG
GGGGGRGGGG FGGDRGSGGF GGGGGGRGGR GGGRGGGGGR GGGRGGGRGG GANATSMNWQ
QQSFY