Gene OSTLU_17735 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_17735 
SymbolCHB3501 
ID5005065 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp39424 
End bp42413 
Gene Length2990 bp 
Protein Length902 aa 
Translation table 
GC content60% 
IMG OID640420486 
Productpredicted protein 
Protein accessionXP_001420888 
Protein GI145353152 
COG category[B] Chromatin structure and dynamics
[K] Transcription 
COG ID[COG5259] RSC chromatin remodeling complex subunit RSC8 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.31577 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.344998 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGCGA CGTTGGCGGC GTCCGCGGAT GAGCCAGGGG TGACGCACGT CCTACGCACG 
GATGATTCGG TGAATGACGA CGAGATCGAG GACGGCGCGT TGTGGCGAAT CACGGAGACG
AGAGTGGCGA CGTCGAGTGG AAAGACGGAG CAAAGGTTAC ACTTTTGGTT TTATCCGGAT
TCGTACGACG CCTGGTACGC GGATTCGTCG ATTAGTGGTA AGGGGAAAGC GCCGTGGGCG
CCGGAGACGA GCGTGATTTT GAGAAATAAA GCGGAGAAGA GAAACGACCT GCCGCACAAC
GTGCGCGCGC GATGGTTGCG CGACAGTCAC ACGTTTAACG AGTGGATGAA CGAGCTCGAT
TACGAGTACG ACGTGACGCG CGAAATGTTG AGCGCGCCGC GGCACGCGTG GGATCCATCG
AACGCGCCAC AAAACAAACG CGCGCGTGCG CCCGAAAACT TTGAGGGTGA TGCCGAAATT
CTCGCTCCAG GAGAAGAGGA ACGCGTGCCG TGGGGGTTTG CCGTCACGCG TCGTCGCGTC
GTTCGCGCGC ATCCAGTCGT CGCGGCGGCG GCGAGTCAAG ACGACGGCGC ACTCGTCTTG
CAAGACGACG AAGAACTAAT CACGAAAGAG TTCATGATTG CGAAAACGCT ACGCATGCAG
AATATCTCGG TGGATCAGTT GCCGTGGAAT GCGACGCCGT CAATGCGCAA AGCCGAGCGT
AAAGCCGAGC CTTTAACGGA GTACAGAGTG CCGACGCACA GCGCTTGGTT CAAATGGGGC
GAAGTTCACG CCATCGAAAG ACGTGCGTTG CCAGAATTCT TCGACGATGA TGACACGTGT
CAAAAGTACA TCGCGTGTCG AAACGAAATC ATGAATCAGT TTCGCTTCAA AGGCCAAGAG
GTTACGTTGC ATGAAGTGTC TTCGTCAAGA ACGACAAAAA ATATTGTCGA CGCCGCCGCG
CATCAGAGAA TTTTCTCATT TCTCGAGCAG TGGGGATTGA TTAATTGGCA ATTCACATCC
GGACGTGATG TGATTGACTT GAAACAAAAA CCTCTCGCCG CGTGGCGTCG CATCGTCACT
GGCGAGGATG GCGCGGCGCG TGTCGAGAAG ACGGATCCCT TAGCCGCCTT CAAAGGGACG
TTGTTCGAGT TTTCGAAATG TCGTGCGACG ACTGCGAGTG GTTTACACCC GCTCGAACCG
CAGTCGAGAT ATGCGCCGTC TTCGGAAACG CAACTCGAGC GTCAATCTTT GGATGCGTTG
TTTGCCTCTC ACGACGCGCT GTCAAAGCGT GGGGTCGACG TCAAGTTTGC GTGCAACGCG
TGCGGCGCTG ATTTAAAGAG CACTGGCGTT TTTTACCACG CGTTTCTCAC GCGTGATTTT
GATTTGTGCC CATCGTGCTT TTCCAAAGGC GTGTACCCGC ACGGCCAAGC GAGCGGCGAC
TTTGTCAAGG CAATGTACCC AGACTTTCAC GCCGAAGCCG TCTCGGCGGA CGAAATCGTC
GACGACGCCG AGTGGACGCC GCAGGAGGTC GCCGCCCTGC TCGATGCAAT TTCGCAGTCG
AATGAGTTAA ATTGGAACGA TATTGCTTCT GCGGTCGGGA CAAAGAGCGA GGATGAGTGC
TTGAAGCACT TCGCGCGCAT GCCCATCGAA GACGCCGCGA TTGAAAACAT AGAGCGCGAG
TTACTTGTGC CGCGCGGCGC CATCATCGAT GATGAGGGAG CCAAGATCCT CGATCCTGTG
CCTTTCTCAT TCGCCCCAAA CCCCACGATG GCTCAGCTCG AGTTTTTGGT GAGCATGATC
TCCCCTCGCG TCGCCGCTGC GTCGGCGAAA GCCGCGCTGA CGAAAATCGC GCTCGGCGGG
TCGCTCGACG CCGCCGACCT CAACGTCGAC GGTCTCGCCG CTGCCGCCAT TCAAGCCAAG
ATCCTGGCCC AAGACGAAGA ACACGAAGTT CATCGCATCA TCGCCAGTGC TCTGGACGTC
TTGCTGAAAA AGCTCGAAAT TAAGCTCAGA TTCCTCGGCC GACTGGTCGA CGACGAGCCG
GAGACGGCGA GCCGTCTCGC CAAGCTTCGA GAGGAGTCCG CGCGCAATCG AACGAACGAT
CTGTACACGC GCGACGTGCA ATCCGCGCGA CACAAGGAAC ACATAGCCAC GATTCATCGT
TTACGACAGC AGCTCGCCGG TCTGTCGTCT TAGTCGCCTC GCGCTTGCAG CAACAAAGCC
GTAAGCCGTC ACAACTCGCC TCGCCTCGAA ACATGACCAC CCGGCAGTTG ACGATCCAAA
TCGTCTCCGA CGTCGTGTGA CCTTGGTGTT ACGTCGGCGT GAAGAACCTC GACCGCGCGC
GCGCCGCGCT TCGTCCCGAC GTCGCGTCCT CCCGGGCCGT TTGGCGACCA TTTCAGCTCG
TGAGTTCTCA TGATTGGCGG CGAATGGGCG CCGACGTCGC GCGCGCGGGC GTGAACAAGC
GCTCGTGGTA CAACGAACGA TTCGGCGCCG ATACGGTGGC GACGTTCGAG CCCAGGCTCG
CGAGCGCGTT CGCGAAGGCG GGGATCGAGG GCGCGTACAC GCTCGACGGC AACACCGGCG
ACACGAGACC TGCGCACCGC GTCGCGGCTT ACGCCGAGGA AACGCACGGC CCGGCGGCGC
AGGACGCCTT CATGCGCGCC ATGTTCCACA GATACTTCAT CGAAGCGCTC GCGCCGTGCG
ACGAAGCCGT GATGAGAGAC GCGGCGAGCG CCGCGGGTTT GGACGAAGCG GCGGTTTCCA
AAGTGCTCGC CGACGGCGAG GCGTCGCCGT TCGAGACGGT CGTGGAGGAG CAAATGTCGG
CGACGCGCGC GCGCGTTCGC GGCGTGCCGC ACTTCATCAT CACGTGCGAC GGCGACGGTG
CGTCGCGAAA GATTGAGATC GGCGGCGCGC AACCGCCCGA GGCGTTTTTG GACGCGTTCG
CCGAGCTTTT GGATTTGGAC GCCGACGACG TCGCAGCGAC GAAGTCCTAA
 
Protein sequence
MRATLAASAD EPGVTHVLRT DDSVNDDEIE DGALWRITET RVATSSGKTE QRLHFWFYPD 
SYDAWYADSS ISGKGKAPWA PETSVILRNK AEKRNDLPHN VRARWLRDSH TFNEWMNELD
YEYDVTREML SAPRHAWDPS NAPQNKRARA PENFEGDAEI LAPGEEERVP WGFAVTRRRV
VRAHPVVAAA ASQDDGALVL QDDEELITKE FMIAKTLRMQ NISVDQLPWN ATPSMRKAER
KAEPLTEYRV PTHSAWFKWG EVHAIERRAL PEFFDDDDTC QKYIACRNEI MNQFRFKGQE
VTLHEVSSSR TTKNIVDAAA HQRIFSFLEQ WGLINWQFTS GRDVIDLKQK PLAAWRRIVT
GEDGAARVEK TDPLAAFKGT LFEFSKCRAT TASGLHPLEP QSRYAPSSET QLERQSLDAL
FASHDALSKR GVDVKFACNA CGADLKSTGV FYHAFLTRDF DLCPSCFSKG VYPHGQASGD
FVKAMYPDFH AEAVSADEIV DDAEWTPQEV AALLDAISQS NELNWNDIAS AVGTKSEDEC
LKHFARMPIE DAAIENIERE LLVPRGAIID DEGAKILDPV PFSFAPNPTM AQLEFLVSMI
SPRVAAASAK AALTKIALGG SLDAADLNVD GLAAAAIQAK ILAQDEEHEV HRIIASALDV
LLKKLEIKLR FLGRLVDDEP ETASRLAKLR EESARNRTND LYTRDLVSSH DWRRMGADVA
RAGVNKRSWY NERFGADTVA TFEPRLASAF AKAGIEGAYT LDGNTGDTRP AHRVAAYAEE
THGPAAQDAF MRAMFHRYFI EALAPCDEAV MRDAASAAGL DEAAVSKVLA DGEASPFETV
VEEQMSATRA RVRGVPHFII TCDGDGASRK IEIGGAQPPE AFLDAFAELL DLDADDVAAT
KS