Gene OSTLU_31660 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31660 
Symbol 
ID5001916 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp303579 
End bp305563 
Gene Length1985 bp 
Protein Length440 aa 
Translation table 
GC content63% 
IMG OID640417337 
Productpredicted protein 
Protein accessionXP_001417733 
Protein GI145346517 
COG category[B] Chromatin structure and dynamics 
COG ID[COG5602] Histone deacetylase complex, SIN3 component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0526314 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0219153 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCTCG CGACGGCGCC GCCCGCGGGC AACGTCGCCG ACGCGCTCGC GTACGTCCGC 
GAGGTCCGCG ATCGGTTCGC GCGACAGACG GGAAAGTATC GCGAGTTCCT CGCCGCGATG
CGCGACTTTA AGACTGGAAC GTGCGTCGCG ACGTCGCGCG ACGCGACGCG ACGCGACGCG
ACCGACGCGC GATGACGCGC GAGCGACGAA CGACTGACGA CGACGACGAC GGCGACGCAG
GCTCACGCCC GAGGGCGTGA TCGAGCGCGT GCGACGGTGC CTGCGAGGAC ACGACGATTT
GCTCGACGGC TTTCGAGCGT TTCTGCCCGA GGTGCGGGAC GCGCGAGACG AGACGGACGC
GAGACGAGAC GGAGACGGAG ACGACGCGAA AACGACGACG ATGGCGCGAG ACGACGAAGG
CGCGGTGACT GACGGTGACG CGCGACGCGC GGGTCGGGGC GCAGGGATAC TCGACGGCGA
CGGCGACGGC GAGCGGGACG CGCGCGCGCG GACGACCGGC GACGGCGAGC GGGCGCGGAA
GACCGAGAAA GGCGCCGAGC GCGCGGGAGG TGGCGATGAA GCGCGAATGC GAGGTGCGCG
GGCGACGGCG ACGCGCGCGC GACGCGCGCG CGGCGGCGAA CGCGACGCCG ATCGACGCGG
GGAAAAGTCG GCTGGGGTCG GAGGGACTCT TTGACGAGTC ACGACCCGTC GATCGTTTGA
AACCACAGCC GTCCGACCGA CCGACCTGAT CCGACGGCGC GAACCGGCGG TTGGGGTCGC
GCGGGCGACG CGCGCGGGAC GGCGACGGTT TCGTTCGGTT CGCGAACGCG CGGAAGACGA
TCGAGCGCGT GACTGACTGT GTGCGCGAAC GCGCGCGTGA TAAGGCGACG CAGGAAACGG
ATGAGAATCG AAGCGCGACG TTACTGAGCG GACAAGATTT GCTCAAGCGC ATTCGCGCGC
AGTACGCGTC GGACGACCGT CCGTATCAGG CGTTTTTACA GACGTTGATT AAGTTTAGGA
ACAAGCAGTT CAGCGCGGAA GAGGTGGTGC AGAAGTGTGC GATATTGTTT TATGATCATC
CGCAGTTATT GGAGGGTGAA AATGGGTTGG CCACCTTTGT GCCGAAGACG TGCAAGCCGC
CGACGCGGTC GTCGTGGTTC GAGTGGAGTC CGGCGGTGCA TCATCGCTTC AGCAGCGAGA
GCTTTCGATT GTTCGTGCGC ACGTTGTTAT TGTGTGAGTA TAAAAGTCGC TTAACGCCGC
CGGAGCCGAA GATGCGGCGT TGGCGTCTGA CGGATGGCGA CAGTAGCGAG TCGGATAGCG
ACGACTACAC GTCGTCGGAC GAGAGCGAAA GCGAGAGTTC GACGGTGATA GAAAAAGAGT
CTGGCACGCT TCGACCGGAT CAAATTCCCG ATCGATGGGT GGGTAAAATA CACGACCGAA
CGTTGAGCGC GTTCGAGCGC CTGACTTTAG AGCTCAAACC GAAAGACGAA GAGTATTGCT
GGGCCGACAC CAAGGCGGCG CGCGAAGCCA AGGAGAGCAA GGACGCGATC GAACTTCGGG
GCAAGCGCGG TCGCGGCGAA GAGCGAGATT TCACGACGAC GCCTCGAGAT TCACCATCAC
CGCCGCCGTC GTTTAATAAA CGGCGCGCGA GCGCTCGCAT CGTGGCCAAG GTGAACCGAG
GCATCGTGTA CAACCGCTAT CGCAGCTTGG CGTCCCCCGA CGGTCTCTCG CGCGGTCGCA
TGCGCATGAC GCGATCGATG AGCACGGTGG ACGACACCTT GCCCGCCAAA CCAGATCGCA
AGTTCTTGAT CAATCGTGAC GAAGGCAAGA TGTCCCCGCT GTGCGCCGAG CGCATCGGAT
TACACTCCCT GCCCGCCGTC GCGCTCGAGA AAATCATCGC GTGCTACAGC GCGCAAGAGG
GCGTCGCCGA GCTCAATCAA ATTCGCGAGT CTCTGATCAA ATCGCTCATA AAAAATAATA
GGTAA
 
Protein sequence
MALATAPPAG NVADALAYVR EVRDRFARQT GKYREFLAAM RDFKTGTLTP EGVIERVRRC 
LRGHDDLLDG FRAFLPEETD ENRSATLLSG QDLLKRIRAQ YASDDRPYQA FLQTLIKFRN
KQFSAEEVVQ KCAILFYDHP QLLEGENGLA TFVPKTCKPP TRSSWFEWSP AVHHRFSSES
FRLFVRTLLL CEYKSRLTPP EPKMRRWRLT DGDSSESDSD DYTSSDESES ESSTVIEKES
GTLRPDQIPD RWVGKIHDRT LSAFERLTLE LKPKDEEYCW ADTKAAREAK ESKDAIELRG
KRGRGEERDF TTTPRDSPSP PPSFNKRRAS ARIVAKVNRG IVYNRYRSLA SPDGLSRGRM
RMTRSMSTVD DTLPAKPDRK FLINRDEGKM SPLCAERIGL HSLPAVALEK IIACYSAQEG
VAELNQIRES LIKSLIKNNR