Gene OSTLU_18985 details


Gene Information

Locus tag: OSTLU_18985
Symbol: CHR3518
ID: 5006567
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009374
Strand:
Start bp: 172617
End bp: 174533
Gene Length: 1917 bp
Protein Length: 638 aa
Translation table:
GC content: 63%
IMG OID: 640421988
Product: predicted protein
Protein accession: XP_001422675
Protein GI: 145356928
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
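
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical, not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 172617
end_bp = 174533
gene_length_bp = 1917
protein_length_aa = 638

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon, which is not translated.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # 1917 0 638
```

Under these assumptions the span matches the stated gene length, is a whole number of codons, and gives the stated protein length.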


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.000565583
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.824746
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGCGT GGGACGCGTG GGACGACGCC GTCGACGCCG AGCGCGCCGC GCGCGCGGTC 
GCGGACGCGA GCGAGGCGCT GCGACGGCGA ATGCGCGCGG AATCATCGAC GACGACGACG
ACGACGACGA CGACGGCGAC GACGGCGACG ACGGCGCCGA CGACGATCGA GTTCGCGCTT
CGCGAACCTG GGGACGCGTT CGCGATCGCG CGCGCGACGC CGCGCGCGGG CGCGCTCGGG
GAGGCGGCGG CGGCGGCGGC GCGGGACGCG GGCGCGACGC GCGCGAGCGA CGGCGCGTGG
ACGGCGCCGG CGGGCGCGGT GCGCGCGATT CGCGACGCGC TCGTGCGTCG GACGAACGCG
CGAGTGCTGG ACGTGCCGGG GATGGCGCTG CGGTGCGCGG AGGTGCGGTT CGAGGAGGAC
GCGGCGGGGG CGTACGCGAG AGGCGTGCCG AAGGCGCTGG ACGCGAAGAT GTTTGAATTT
CAGAGGACGG GGGTGATGTA CGCGCTGAGG CGACGGGGAC GCGTGTTGAT CGGGGACGAG
ATGGGGCTGG GGAAGACGGT GCAGGCGTGC GCGTTGTTGG CGTGTTATCG CGAGGAGTGC
CCGGCGCTCA TTTTAGTGCC GACTTCGTTG CGCGAGGCTT GGCGAAACGC TTTGCAATCG
TGGCTCGACG TCGCGGACGG CGACGTCGCG TGCGTGGGGG CGGCGAGCGA GGGGTGGAAA
CTCGACGAAG GACGACCGTT CGACATCGTG CCGTACTCGC TCGTCGTGAA GCTTCGCTCG
AAATTGCTCG CGAAGCGGTA TAAAATCGTC GTGTGCGACG AGAGTCATTT TTTAAAGGAT
CGACGCGCGC AGAGAACGCA GGCGGTGATG CCGCTGCTCA AGGATGCCAA TCGCGCGATT
TGTCTGACGG GCACGCCAGC GCTCAGCAGG CCGATCGAGC TGTTCACCCA GCTCGAGGCC
TTGGTGCCGA AAGTCTTCGC GCGATTGAAC GAGTACGGCG CGCGGTACTG CGCAAACGGC
GGGCCGTTCG GCATGTACAC GGGATGTACT CACGCCGACG AACTGCACGT CATGATTTCA
AAGCTTTGTA TGGTGCGTCG TTTGAAAAAG GACGTGCTCA AGGACTTGCC GCCGAAGCAA
CGAACGCAGG TTTGGCTCGC GCTGGAGAAG TCGAGCATGG GCGACGTGCG CCGCATAAAA
TCGCTACTCG ACGAACTGCG TCAGAGAGGC GGGAACGAGC TGGAAGAAAA AAGGCTGCTA
AACGAGCTCT TCTTAGCGAG CGCCAAGGCG AAGACGAAAT CGGTGTGCGA GTACTTGGAG
ACGTTGATCG ACGGGAGTAC GTCGAAATTT TTGTTCTTCG CGCATCACGG CGTCTTGCTC
GACGCCGTGG CTCAGTGTAT GGACGCGAAG AAAGTCAAAA CGATTCGCAT CGACGGATCG
ACGCCAGCGG CGGTGAGAGG CGACTTAGTG AACGCGTTCC AGCGTCGCGA CGACGTTCGC
GTGGCGATTC TCAGCATCAA AGCCGCGGGG ATGGGCTTGA CGCTCACCGC GGCTTCGACG
GTGATCTTCG GCGAAATGGT GTGGACGCCT GGCGACTTGA TTCAGGCTGA GGATAGAGCG
CACAGAATCG GGCAGCAATC GAGCGTTTTG GTGCAGTACC TACACGCCAA GGACACGATC
GACGAAATCA TTTGGCAAAG CATAAAGAAG AAGTTGGATA ATCTCGGCGC GGTGTTGAAC
GGGCAAACGA GCGGAAATCA CCTCGAGACG ACATCGACGA ACGGGAAGTC GCCGAAACGG
CAAAAAGTGC AACCCGTGAT CGACGTCTCC CAAAGGACGC TGACCGAGCT GTTCGCATCT
CAGGCAACGC AGTTATCGTC GCCGGCGGGT GAAGATTCTC AACCCACCGA CGCCTAG
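
A GC content figure such as the one in the Gene Information fields is the fraction of G and C bases in the sequence. The following is a minimal sketch of that calculation, not the database's own method; `gc_content` is a hypothetical helper, and it strips the spaces and line breaks used in the listing above. The example call uses only the first 60 bases, so its result is not expected to equal the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence above (60 bases), as a small example.
first_line = "ATGGACGCGT GGGACGCGTG GGACGACGCC GTCGACGCCG AGCGCGCCGC GCGCGCGGTC"
print(f"{gc_content(first_line):.2f}")
```

Passing the full gene sequence to the same function would give the figure to compare with the GC content field.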
 
Protein sequence
MDAWDAWDDA VDAERAARAV ADASEALRRR MRAESSTTTT TTTTTATTAT TAPTTIEFAL 
REPGDAFAIA RATPRAGALG EAAAAAARDA GATRASDGAW TAPAGAVRAI RDALVRRTNA
RVLDVPGMAL RCAEVRFEED AAGAYARGVP KALDAKMFEF QRTGVMYALR RRGRVLIGDE
MGLGKTVQAC ALLACYREEC PALILVPTSL REAWRNALQS WLDVADGDVA CVGAASEGWK
LDEGRPFDIV PYSLVVKLRS KLLAKRYKIV VCDESHFLKD RRAQRTQAVM PLLKDANRAI
CLTGTPALSR PIELFTQLEA LVPKVFARLN EYGARYCANG GPFGMYTGCT HADELHVMIS
KLCMVRRLKK DVLKDLPPKQ RTQVWLALEK SSMGDVRRIK SLLDELRQRG GNELEEKRLL
NELFLASAKA KTKSVCEYLE TLIDGSTSKF LFFAHHGVLL DAVAQCMDAK KVKTIRIDGS
TPAAVRGDLV NAFQRRDDVR VAILSIKAAG MGLTLTAAST VIFGEMVWTP GDLIQAEDRA
HRIGQQSSVL VQYLHAKDTI DEIIWQSIKK KLDNLGAVLN GQTSGNHLET TSTNGKSPKR
QKVQPVIDVS QRTLTELFAS QATQLSSPAG EDSQPTDA
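
The protein sequence can be related back to the gene sequence by translating codons. The sketch below assumes the standard genetic code (NCBI translation table 1), since the Translation table field in this record is empty. `CODON_TABLE` and `translate` are hypothetical names introduced for illustration.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
# Assumption: the record's Translation table field is empty, so table 1 is used.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGACGCGT GGGACGCGTG GGACGACGCC"))  # MDAWDAWDDA
```

The output matches the first ten residues of the protein sequence listed above, which is consistent with the standard code for this stretch.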