Gene OSTLU_47051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_47051 
SymbolGTC3501 
ID5004920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp333686 
End bp336715 
Gene Length3030 bp 
Protein Length1007 aa 
Translation table 
GC content57% 
IMG OID640420341 
Productpredicted protein 
Protein accessionXP_001420659 
Protein GI145352666 
COG category[B] Chromatin structure and dynamics
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG5406] Nucleosome binding factor SPN, SPT16 subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00853126 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000628668 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GCGCGGATGC GGTGCCTGTA CGAGACGTGG CGCGCGGAGC GAGACGGGGC GTTCGGCGGG 
GCGAGCGCGC TGGTGGTCGG GACGGGGGCG AACAAGGAGG ACGACTTGCG GTACCTGAAG
GCGGTGGCGC TGGAGGTGTG GTTGTTTTCG TACGAGCTGC CGGACACGCT GTTGATGTTC
ACGGAGCGCG GGATGCACGT GGTGGCGGGA GGGAAAAAGG CGGCGCTGAT GGAGAACGCG
CGGGAGGTGC TGAAGGAGGA GTGCGGGTTG GATCTCGCGG TGCACGTCAA GCCGAAGGGC
GAGGACGGCG CGGCGCAGGC GGCGGCCGTC GTCGAGGCGA TTAAGAGCGA GAATCTGGTG
GTTGGGATGG TGATGAAGGA GAAGAACGAG GGTGCGATGA TGCAATACGT GACGAAGGCG
CTCGGGGAGG CTGGGATGGA AATTAAGGAT GTCACGAGCG GCGTGTCGCT CGCCATGGCG
GCCAAGGATG AAAAGGAGCT CGGTTTCGTG AATAAGGCGG TGACGCTAAC GAGCAAGGCT
TTGGGGTTCG CGGTGAAGGA GATGGAGGCC ACGATCGAGG ACGAAAAGAA GTTGACGCAC
GCCAAGTTGT CGGAGATGAC GGAGGATGCG ATTATCGATC CGTCGAGACT CGGTTTGAAA
TTCCCGCCAG AGGACGTGGA TATTTGCTAT CCTCCGATTT TCCAATCTGG TGGCGAGTAC
GACTTAAAAT ACAGCGCAGA GAGCGCGAAC ACGAAGCTTC ACTACGCTTC CCCGCCCGCG
GTGGTGCACA TGTCCGTCGG CGCTCGATAC ACGCAGTATT GCGCGAACGT CGGTCGCACG
TACATGGTTG ATCCGACGCC CGCGCAGGAG GCGACGTACG CCGCTATTCT CGCCGCGCAA
GAGGCGGGTA TCGCCGCTCT CGTCGATGAT GCGACGTGTG CGTCCGTGTA CGAAGCTGTC
AAGTCCTCTC TGACGAGTGC GGAAGGCGTC GACGGCGCGA CGTTAGCTTC AAAGTTGAAC
AAAAATGTCG GCACCGCCAT GGGTCTCGAA TTCCGCGACA TGACTTTTGT GTTGAATGGC
AAGTGCGAAA CCAAAATCAA GGCTGGTATG TTGTTTAATC TCGCCGTCGG TGTGCAAGGC
TTGAAGGAGC CGAGCGCCAA GGAAGGTAGT AAGAATGAAA CGTACGCCGT GATGATCGCC
GACTCTGTCT TGGTGGGCGC CGCGGGCGAG ACGCCGTCAG TGTTGACCAC GAACCCAAAG
GGCGTCAAGG AGATCTCTTA CATCATGAAC GACGATGATG ACGACGACGA CGACGAAGAA
GCCGAGGTCC AAATCAAACA AGGGGGCGTC ATCATGGATG CGAAGACTCG CGCCGAGCAA
TCCGGTCCGA GCTCGGCGGA GGATCGCGAG CGTCGTCAGC GCGCGTTGGC GGACAAAAAG
AATGCCGAAA CGTACAAACG ATTGACGCAA GCGGGCGAAG AAGAGATTCA AAACGCCACT
ATGGGCTCAT CCGCAGAATT TGTCGCGTAC AAGTCCATGC GTGAAGTCCC GACGCCGAAG
AACAAAGAGC TCGTTCTCGC CGTCGACCAA GAGCGCGAAA CCGTCCTCGT GCCGATTTAC
GGTCAGCTCG TGCCTTTTCA CGTCATGTCG GTCAAGTCTG CCTCAGTGAG CCAAGATGCC
GGTGCTGCGT TTATTCGTAT CAACTTTCAG CATCCCACCG GTTCAGGGGC GGTGGCGGTA
CAAAAGTACG CGGCGGCGGC GCGATTTCCG AACTCCATCT TTTTGAAGGA GGTGAGCTTT
CGCAGCACGG ACGCGCGTCA CGCCAACCAC GTGGTGCAAG AGATCCAAGC CTTGCGACGT
AACATCGTGC AACGTGAAAC TGAGCGCGCG CAACGCGCCG ATTTGGTTCG CCAAGAGCGT
CTCGTTCTCT CCTCTGGCCG CGTGCATCGC TTGACGGGTT TGTGGATGCT CCCGACGTTC
GGCGGTCGCG GCGGTCGCAA GGCGGGCACG TTGGAGGCGC ACACGAATGG TATGCGATAT
CTCGGCGCCA AAATGGACGA GCAGGTGGAC ATTATGTACG ATAACATCCG ATTCGCGTTC
TTCCAACCGG CCAAGCAGGA GATTAAGACT TTGATTCATT TCCACTTGAA GAATCCAATC
ATGATCGGCA AAAAGAAGAC GCAAGACGTG CAATTCTACC AAGAAGTCAT GGAGGCTGTG
CAAAACTTAG ACGGCGGGCG TCGTAACATG TACGACCCGG ATGAAATCGA AGACGAACAA
CGCGAGCGCG AGCGTCAAAA GAAAATCCAA AAGGAGTTTA GCCACTTTGC CAAGCGCGTG
CAAGAAATTT GGGAAAAGGA TTTCCCGCAG TTGAATTTGG AGTTTGACTC GCCGTATCAC
GAGCTCGCAT TCCAAGGGGT GGCGTACAAG TCCACGGTGC GCATTCTGCC CACGACGTCG
TGCTTGGTCG AACTCACGGA GTTCCCGCCG CTCGTGCTCG CTTCTAGCGA TATCGAAGTC
GTCAACTTGG AGCGCGTCGG TTTCCATTTG AAGAACTTTG ACATGGCGAT CATCTTCCGC
GATTTCAACC GCGAAGTCCA TCGCATCGAT CAAATCCCGA GCCAATACTT GGAGAACATC
AAGCAGTGGT TGACGACGCT CGATATCAAG TACTACGAAG GTAAAGCCAA CTTGAACTGG
AAGCCGTTAC TTCGACAAAT CAAAGAAGAC CCCGACGGCT GGCTTGAAGC CGGCGGTTGG
GAATTCTTAA ACAACGAAGC CTCCGACGGC GAAGACGAAG AAGACGAGGA AATGAGCGAG
TTCGAACCGA GCGAAGACGA AGACGAAGAC GAGTCCGAAG AAGAGTCCGA ATCCGAAAGC
GTGTACGATT CCGAGGAAGA CGACGAAGAG GAAGAATTGG ACGAGGACGA CGAGGAAGGT
TTGTCTTGGG ACGAGCTCGA GGAAAAGGCC GCGAAAGAGG ATGCCGACGC CAGCGATTCC
GACGAACGGC CTCGAAAGAA GAAGCGATAG
 
Protein sequence
MRCLYETWRA ERDGAFGGAS ALVVGTGANK EDDLRYLKAV ALEVWLFSYE LPDTLLMFTE 
RGMHVVAGGK KAALMENARE VLKEECGLDL AVHVKPKGED GAAQAAAVVE AIKSENLVVG
MVMKEKNEGA MMQYVTKALG EAGMEIKDVT SGVSLAMAAK DEKELGFVNK AVTLTSKALG
FAVKEMEATI EDEKKLTHAK LSEMTEDAII DPSRLGLKFP PEDVDICYPP IFQSGGEYDL
KYSAESANTK LHYASPPAVV HMSVGARYTQ YCANVGRTYM VDPTPAQEAT YAAILAAQEA
GIAALVDDAT CASVYEAVKS SLTSAEGVDG ATLASKLNKN VGTAMGLEFR DMTFVLNGKC
ETKIKAGMLF NLAVGVQGLK EPSAKEGSKN ETYAVMIADS VLVGAAGETP SVLTTNPKGV
KEISYIMNDD DDDDDDEEAE VQIKQGGVIM DAKTRAEQSG PSSAEDRERR QRALADKKNA
ETYKRLTQAG EEEIQNATMG SSAEFVAYKS MREVPTPKNK ELVLAVDQER ETVLVPIYGQ
LVPFHVMSVK SASVSQDAGA AFIRINFQHP TGSGAVAVQK YAAAARFPNS IFLKEVSFRS
TDARHANHVV QEIQALRRNI VQRETERAQR ADLVRQERLV LSSGRVHRLT GLWMLPTFGG
RGGRKAGTLE AHTNGMRYLG AKMDEQVDIM YDNIRFAFFQ PAKQEIKTLI HFHLKNPIMI
GKKKTQDVQF YQEVMEAVQN LDGGRRNMYD PDEIEDEQRE RERQKKIQKE FSHFAKRVQE
IWEKDFPQLN LEFDSPYHEL AFQGVAYKST VRILPTTSCL VELTEFPPLV LASSDIEVVN
LERVGFHLKN FDMAIIFRDF NREVHRIDQI PSQYLENIKQ WLTTLDIKYY EGKANLNWKP
LLRQIKEDPD GWLEAGGWEF LNNEASDGED EEDEEMSEFE PSEDEDEDES EEESESESVY
DSEEDDEEEE LDEDDEEGLS WDELEEKAAK EDADASDSDE RPRKKKR