Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_47051 |
Symbol | GTC3501 |
ID | 5004920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | + |
Start bp | 333686 |
End bp | 336715 |
Gene Length | 3030 bp |
Protein Length | 1007 aa |
Translation table | |
GC content | 57% |
IMG OID | 640420341 |
Product | predicted protein |
Protein accession | XP_001420659 |
Protein GI | 145352666 |
COG category | [B] Chromatin structure and dynamics [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG5406] Nucleosome binding factor SPN, SPT16 subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00853126 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000628668 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GCGCGGATGC GGTGCCTGTA CGAGACGTGG CGCGCGGAGC GAGACGGGGC GTTCGGCGGG GCGAGCGCGC TGGTGGTCGG GACGGGGGCG AACAAGGAGG ACGACTTGCG GTACCTGAAG GCGGTGGCGC TGGAGGTGTG GTTGTTTTCG TACGAGCTGC CGGACACGCT GTTGATGTTC ACGGAGCGCG GGATGCACGT GGTGGCGGGA GGGAAAAAGG CGGCGCTGAT GGAGAACGCG CGGGAGGTGC TGAAGGAGGA GTGCGGGTTG GATCTCGCGG TGCACGTCAA GCCGAAGGGC GAGGACGGCG CGGCGCAGGC GGCGGCCGTC GTCGAGGCGA TTAAGAGCGA GAATCTGGTG GTTGGGATGG TGATGAAGGA GAAGAACGAG GGTGCGATGA TGCAATACGT GACGAAGGCG CTCGGGGAGG CTGGGATGGA AATTAAGGAT GTCACGAGCG GCGTGTCGCT CGCCATGGCG GCCAAGGATG AAAAGGAGCT CGGTTTCGTG AATAAGGCGG TGACGCTAAC GAGCAAGGCT TTGGGGTTCG CGGTGAAGGA GATGGAGGCC ACGATCGAGG ACGAAAAGAA GTTGACGCAC GCCAAGTTGT CGGAGATGAC GGAGGATGCG ATTATCGATC CGTCGAGACT CGGTTTGAAA TTCCCGCCAG AGGACGTGGA TATTTGCTAT CCTCCGATTT TCCAATCTGG TGGCGAGTAC GACTTAAAAT ACAGCGCAGA GAGCGCGAAC ACGAAGCTTC ACTACGCTTC CCCGCCCGCG GTGGTGCACA TGTCCGTCGG CGCTCGATAC ACGCAGTATT GCGCGAACGT CGGTCGCACG TACATGGTTG ATCCGACGCC CGCGCAGGAG GCGACGTACG CCGCTATTCT CGCCGCGCAA GAGGCGGGTA TCGCCGCTCT CGTCGATGAT GCGACGTGTG CGTCCGTGTA CGAAGCTGTC AAGTCCTCTC TGACGAGTGC GGAAGGCGTC GACGGCGCGA CGTTAGCTTC AAAGTTGAAC AAAAATGTCG GCACCGCCAT GGGTCTCGAA TTCCGCGACA TGACTTTTGT GTTGAATGGC AAGTGCGAAA CCAAAATCAA GGCTGGTATG TTGTTTAATC TCGCCGTCGG TGTGCAAGGC TTGAAGGAGC CGAGCGCCAA GGAAGGTAGT AAGAATGAAA CGTACGCCGT GATGATCGCC GACTCTGTCT TGGTGGGCGC CGCGGGCGAG ACGCCGTCAG TGTTGACCAC GAACCCAAAG GGCGTCAAGG AGATCTCTTA CATCATGAAC GACGATGATG ACGACGACGA CGACGAAGAA GCCGAGGTCC AAATCAAACA AGGGGGCGTC ATCATGGATG CGAAGACTCG CGCCGAGCAA TCCGGTCCGA GCTCGGCGGA GGATCGCGAG CGTCGTCAGC GCGCGTTGGC GGACAAAAAG AATGCCGAAA CGTACAAACG ATTGACGCAA GCGGGCGAAG AAGAGATTCA AAACGCCACT ATGGGCTCAT CCGCAGAATT TGTCGCGTAC AAGTCCATGC GTGAAGTCCC GACGCCGAAG AACAAAGAGC TCGTTCTCGC CGTCGACCAA GAGCGCGAAA CCGTCCTCGT GCCGATTTAC GGTCAGCTCG TGCCTTTTCA CGTCATGTCG GTCAAGTCTG CCTCAGTGAG CCAAGATGCC GGTGCTGCGT TTATTCGTAT CAACTTTCAG CATCCCACCG GTTCAGGGGC GGTGGCGGTA CAAAAGTACG CGGCGGCGGC GCGATTTCCG AACTCCATCT TTTTGAAGGA GGTGAGCTTT CGCAGCACGG ACGCGCGTCA CGCCAACCAC GTGGTGCAAG AGATCCAAGC CTTGCGACGT AACATCGTGC AACGTGAAAC TGAGCGCGCG CAACGCGCCG ATTTGGTTCG CCAAGAGCGT CTCGTTCTCT CCTCTGGCCG CGTGCATCGC TTGACGGGTT TGTGGATGCT CCCGACGTTC GGCGGTCGCG GCGGTCGCAA GGCGGGCACG TTGGAGGCGC ACACGAATGG TATGCGATAT CTCGGCGCCA AAATGGACGA GCAGGTGGAC ATTATGTACG ATAACATCCG ATTCGCGTTC TTCCAACCGG CCAAGCAGGA GATTAAGACT TTGATTCATT TCCACTTGAA GAATCCAATC ATGATCGGCA AAAAGAAGAC GCAAGACGTG CAATTCTACC AAGAAGTCAT GGAGGCTGTG CAAAACTTAG ACGGCGGGCG TCGTAACATG TACGACCCGG ATGAAATCGA AGACGAACAA CGCGAGCGCG AGCGTCAAAA GAAAATCCAA AAGGAGTTTA GCCACTTTGC CAAGCGCGTG CAAGAAATTT GGGAAAAGGA TTTCCCGCAG TTGAATTTGG AGTTTGACTC GCCGTATCAC GAGCTCGCAT TCCAAGGGGT GGCGTACAAG TCCACGGTGC GCATTCTGCC CACGACGTCG TGCTTGGTCG AACTCACGGA GTTCCCGCCG CTCGTGCTCG CTTCTAGCGA TATCGAAGTC GTCAACTTGG AGCGCGTCGG TTTCCATTTG AAGAACTTTG ACATGGCGAT CATCTTCCGC GATTTCAACC GCGAAGTCCA TCGCATCGAT CAAATCCCGA GCCAATACTT GGAGAACATC AAGCAGTGGT TGACGACGCT CGATATCAAG TACTACGAAG GTAAAGCCAA CTTGAACTGG AAGCCGTTAC TTCGACAAAT CAAAGAAGAC CCCGACGGCT GGCTTGAAGC CGGCGGTTGG GAATTCTTAA ACAACGAAGC CTCCGACGGC GAAGACGAAG AAGACGAGGA AATGAGCGAG TTCGAACCGA GCGAAGACGA AGACGAAGAC GAGTCCGAAG AAGAGTCCGA ATCCGAAAGC GTGTACGATT CCGAGGAAGA CGACGAAGAG GAAGAATTGG ACGAGGACGA CGAGGAAGGT TTGTCTTGGG ACGAGCTCGA GGAAAAGGCC GCGAAAGAGG ATGCCGACGC CAGCGATTCC GACGAACGGC CTCGAAAGAA GAAGCGATAG
|
Protein sequence | MRCLYETWRA ERDGAFGGAS ALVVGTGANK EDDLRYLKAV ALEVWLFSYE LPDTLLMFTE RGMHVVAGGK KAALMENARE VLKEECGLDL AVHVKPKGED GAAQAAAVVE AIKSENLVVG MVMKEKNEGA MMQYVTKALG EAGMEIKDVT SGVSLAMAAK DEKELGFVNK AVTLTSKALG FAVKEMEATI EDEKKLTHAK LSEMTEDAII DPSRLGLKFP PEDVDICYPP IFQSGGEYDL KYSAESANTK LHYASPPAVV HMSVGARYTQ YCANVGRTYM VDPTPAQEAT YAAILAAQEA GIAALVDDAT CASVYEAVKS SLTSAEGVDG ATLASKLNKN VGTAMGLEFR DMTFVLNGKC ETKIKAGMLF NLAVGVQGLK EPSAKEGSKN ETYAVMIADS VLVGAAGETP SVLTTNPKGV KEISYIMNDD DDDDDDEEAE VQIKQGGVIM DAKTRAEQSG PSSAEDRERR QRALADKKNA ETYKRLTQAG EEEIQNATMG SSAEFVAYKS MREVPTPKNK ELVLAVDQER ETVLVPIYGQ LVPFHVMSVK SASVSQDAGA AFIRINFQHP TGSGAVAVQK YAAAARFPNS IFLKEVSFRS TDARHANHVV QEIQALRRNI VQRETERAQR ADLVRQERLV LSSGRVHRLT GLWMLPTFGG RGGRKAGTLE AHTNGMRYLG AKMDEQVDIM YDNIRFAFFQ PAKQEIKTLI HFHLKNPIMI GKKKTQDVQF YQEVMEAVQN LDGGRRNMYD PDEIEDEQRE RERQKKIQKE FSHFAKRVQE IWEKDFPQLN LEFDSPYHEL AFQGVAYKST VRILPTTSCL VELTEFPPLV LASSDIEVVN LERVGFHLKN FDMAIIFRDF NREVHRIDQI PSQYLENIKQ WLTTLDIKYY EGKANLNWKP LLRQIKEDPD GWLEAGGWEF LNNEASDGED EEDEEMSEFE PSEDEDEDES EEESESESVY DSEEDDEEEE LDEDDEEGLS WDELEEKAAK EDADASDSDE RPRKKKR
|
| |