Gene OSTLU_94154 details

Gene Information

Locus tag: OSTLU_94154
Symbol:
ID: 5006891
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009375
Strand:
Start bp: 182258
End bp: 184087
Gene Length: 1830 bp
Protein Length: 609 aa
Translation table:
GC content: 56%
IMG OID: 640422312
Product: predicted protein
Protein accession: XP_001422833
Protein GI: 145357250
COG category: [B] Chromatin structure and dynamics
COG ID: [COG5034] Chromatin remodeling protein, contains PhD zinc finger
TIGRFAM ID:
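
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only an illustration and rests on assumptions the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the gene length includes a single terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 182258
end_bp = 184087
gene_length_bp = 1830
protein_length_aa = 609


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Number of codons minus one, assuming the CDS ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# Both checks hold for this record under the assumptions stated above.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Because the record lists the gene as unspliced, the genomic span and the coding length can be compared directly; for a spliced gene this check would not apply.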


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.000419692
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0261603
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTGGAT GTGATGATTG CGGTGACTGG TATCACTTAA AATGTATCAA TGTGACGCCT 
ACTATGGCAA AGACGATGCA CAATTACATC TGTCCGCCAT GCATCGCGAA GAGTGGCAAG
GCGAGCGCAC TCTCGTTGGA TACGTATCGC TCGGTGCATC GCACCAATCG CCCAAACGTC
ATGCTTTTGC GAGAACTCCT CGGTGAAGCG CAGCGATTTC CTGGAGAAGT GCCAGAAGAA
GCCGTGTTGA ATCAAGTCAT CAATACTCAC GACGCGTGGC GATTGGAAGT ACGAAAGACT
TTGCATAGAA AGAGTATGAA GGAAGCACAC GACAACGCCA TCAGTCAAGC TTCGAAGATT
GCCCAAGAAG CAGCGATGAG AGAATCTGAG GAACGCACAC GACGAATCGC CGAAGCGTCG
CTCGCGGCGA CTGCAGAGAA CCCCAATCCT CCAGCTTCTC TCACGCCATT ACAGCAAGTC
ATGCTGATGG GTCAAGCTGT CACGTTCGCC GCAACGCAGC GCACGCACGC GTTGAACATT
TCGCCGTGTT CGCAGGAGAT TTCAGATGTC GTCACAAAGC AGCTCGGGCA ACTTCAACAG
CTCGCCATGC AACTCGCGAT GTGCCAAGCT CAAATTCACA TGGCTCCAGT TCGGTCTAGG
CCCCACATCC TCCACGTCTT GATGATGCAG CAACAGCATC TAAAGATGAC AAACGAGCAC
GAGGAAGAGC TGGCAAATGT TGGCGCTAGC AACTGGCTCA TGGAAGCGAC TCTGCAACTA
TCTGGAATGT CTGGCATGAT GCCGCCGCCG ACGGCGCAAG ACACCGGCAA AGAACCTTCA
ACATCCACGG GTCATGGTAG TCCAAGCGAT ATGTCTGCTC AGTTGAAGAC GAACGAACCG
ATGATGGTTC CGCAGCAGTT CATGTTGCAA CCGGCCATGC CATCGCTTGT CGGCGACGCG
TCTGGAATGC CCAAGCAAGA GATGGATTAC ACGTATGACA TTCCTCGAAA AGATAATGTC
GATCCAGCGC TGCAAGTGTA CGCGGGCATG AAGGGTGCTC TGGCGATGGA GCTGGAACCT
GTTGAGGAGA CGCTCGCGTT GATGCGCGAA GTGTGCGGTC CGGCGTGGAG AGAACAAGCC
ACGCGGCTCA TGACTGGTGC GCCGTTCCCG AGGTTGATCA AGCTTCACGA GCTCAAGGAA
TCTGCCATGG CTGCTGGTCT TTGCCCTGGT GCCGGCATTG ATCCTTTGGC CGATCGCGCG
CACGCGTTGG AGGTTGCTGG ACAAATCTGG CTCGAACGCG CCGCCGCCGT CGTGCAAGAC
AAAACGATTC CTATCGAGGC GGCGCAGTTG CTGCTTCAAG AGGGTCGATC TTTGCCTTTA
TACTTGAAGG AGGAGCTCGA GGAGTTGGGA GAGCGATGCG AACTGTATTG CGTTTGCCGA
AGCGCGTACG ACGCTCTCAG GCCGATGATT TGCTGCGATC GATGCGATGG TTGGTTTCAT
TACGAGTGCA TCGGCATGCA GTCGCCGGCG CCGGGCGAGG AAGACGAAAA CGCCGAAAAC
GTCAAGTTTG CCTGCCCAGA GTGCTGCGCG GCGCAAGGTA TTCCGTACGT TCCGTTCCGT
CCAGCGCCGA AGGACACGGA CAAAGCGCCG GAGCGAGCCG CGCCTCCGCC GGCTGAAGAA
GCTCCGGAGC CGCCAAAGAC GGAAGAAGAA GCGAAGCCGC CCGCTAAACC AGAACCGGAA
GTCGCGGACA ACAAGAAGAA ACGAAAAACG GTGGAGCCGA CTCCGCCTGC TAATAAAAGT
ACCTCTAGTC GCAGCAGAAG ACGAAAGTAG
 
Protein sequence
MVGCDDCGDW YHLKCINVTP TMAKTMHNYI CPPCIAKSGK ASALSLDTYR SVHRTNRPNV 
MLLRELLGEA QRFPGEVPEE AVLNQVINTH DAWRLEVRKT LHRKSMKEAH DNAISQASKI
AQEAAMRESE ERTRRIAEAS LAATAENPNP PASLTPLQQV MLMGQAVTFA ATQRTHALNI
SPCSQEISDV VTKQLGQLQQ LAMQLAMCQA QIHMAPVRSR PHILHVLMMQ QQHLKMTNEH
EEELANVGAS NWLMEATLQL SGMSGMMPPP TAQDTGKEPS TSTGHGSPSD MSAQLKTNEP
MMVPQQFMLQ PAMPSLVGDA SGMPKQEMDY TYDIPRKDNV DPALQVYAGM KGALAMELEP
VEETLALMRE VCGPAWREQA TRLMTGAPFP RLIKLHELKE SAMAAGLCPG AGIDPLADRA
HALEVAGQIW LERAAAVVQD KTIPIEAAQL LLQEGRSLPL YLKEELEELG ERCELYCVCR
SAYDALRPMI CCDRCDGWFH YECIGMQSPA PGEEDENAEN VKFACPECCA AQGIPYVPFR
PAPKDTDKAP ERAAPPPAEE APEPPKTEEE AKPPAKPEPE VADNKKKRKT VEPTPPANKS
TSSRSRRRK
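
The relationship between the gene sequence and the protein sequence can be sketched in a few lines. This is a minimal illustration, not the method used to produce the record: the Translation table field above is empty, so the standard genetic code is assumed here, and `clean`, `translate`, and `gc_content` are hypothetical helper names. Only the first and last 30 bases of the gene sequence are used as examples.

```python
from itertools import product

# Standard genetic code (an assumption; the record leaves the table unspecified).
# Codons are enumerated in TCAG order to match the amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last 30 bases of the gene sequence shown above.
first_30 = "ATGGTTGGAT GTGATGATTG CGGTGACTGG"
last_30 = "ACCTCTAGTC GCAGCAGAAG ACGAAAGTAG"

print(translate(first_30))  # MVGCDDCGDW, the first ten residues of the protein
print(translate(last_30))   # TSSRSRRRK, the last nine residues before the stop
```

Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field in the record.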