Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25566 |
Symbol | HMGB3503 |
ID | 5005246 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | - |
Start bp | 672980 |
End bp | 674566 |
Gene Length | 1587 bp |
Protein Length | 338 aa |
Translation table | |
GC content | 63% |
IMG OID | 640420667 |
Product | predicted protein |
Protein accession | XP_001421528 |
Protein GI | 145354514 |
COG category | [B] Chromatin structure and dynamics |
COG ID | [COG5648] Chromatin-associated proteins containing the HMG domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGCGAGGGGC GTTAACTACG GTGCGTTCGC GACGACGCTC GAGACGCGCT TCGGCGTCGC GGGCGACGCG GGGCGTCGCG CGCGACGACG ACGACGACGC GAATCGCGAC GACGATCGGC GACGACGCGA ATCGCGACGC CGACGGGCGC GAATCGCGAC GACGACGGGG CGACGACGAC GACGACGACG ACGCGCGGGA CGACGGCGCG CGACGACGAC GACGAGGGCG ACGAGGACGA GGACGACGAC GGTGGACGCG CGGACGCGCG ACGACGGGCG TCGCGGCGTC GCGCGGCGTC GCGCGCGAGA GAGGATCGGT GCGGGCGCCA ACGGACGAAT GTCGGTTTGA AACGGTTTTT TGAAGCGCGA GGGACTGACG AAGACGCGCG CGACGCGCGC AGGGGAACGC AGACGCGCGC GAACGAACGC GAACGATGGT GACGGTCGGG ATGCAGACGT TGGAAAAGAA ACCGCGAAAG AAACGTCGGG ACAAGGACAA GCCGAAGCGG GCGATGTCGG CGTATTTGGT GTTTTTGAAC AAGCACAGAG ACGCGACGCA AAAGAAGAAT CCGTCGTGGA GCGTCACGGA CGTGACCAAG GAGCTCGCGG GGAAGTGGAA GACGGTGACG CAGAGCGAGC GAGATGAGTG TCAACGCGTT TCCGACGAGG ATAAGGCGCG ATACTACCGC GAGATGCAGA ACTACGTGCC GCTTCCGGAT GAGAAGGAGG AGCCGCCGTT GCGATTCGAC AAGGATGGTA ACCGCAAGCG TCGCAAAAAG GACAAGAACG CCCCGCGCAA GAACCGTTCG AGTTACATCA TTTGGGCGCA AGAATATCGC GACAAGACGT TCAGACCGAA GGCGAACACG CCCGACGCCG TGTCTTTCCG CGATCAAGCC GCGCAGCTCG GCGCGGCGTG GAAGGCTTGC ACGCCGCAGC AACGCAAAAA GTACGACGAC ATGGCGGAGA AGGAAGCGCA GGCGTACGCC ATCAAGCGTG ATGCGTACAA AGCTGAACAA AAGGCCATCG CCCTCGCCGC TCGTGAAGCC AAGCGTCAGC GTTTGCTCGA CGAGAAGCGC GCGTGGGAAG CCACCAAGAT CGCCGCCGCG CAACTCAAGG AGGACAAGAA GAAGGAAAAG AGCGCTCAAC TGTCGCCGGG CAAGGCGCGA CCGTTCGGTG GTCCGTCCTC GAAGGCGGTT CCGAAGAAGG TTGTGCACAA GGAGGCGGAC AACGTCACCA TGGCTAAGAT GCGCAGCGCC GTGCTCGCCG TCGCTCCGAG CGATAACGCG TGGCGCGCCG TCCAGGACGT CTACGGCAGC GCTCCGGACA AAGTCATGGA GGCGTTCAGA AACTTCGTCG ATAACCGTAA GATGACGAAC TTCGTCCCGG ATCACCGAGC TTTCATGAAC GCTGTTCTGG GCTACGACAT GGCGGCGACG TTCATGCGCT AGTCGCGCGA ACGCCAGCTC GCTCGTCACT CGTCCACGCA CTTCTTCATC TCGCGAAAAC CGCGATCACG GAAGACGATG GAACGAAACT CCACGGAAAC TCTCGACGAC CGACTCGACT CGCTTTTAAT AACACAA
|
Protein sequence | MVTVGMQTLE KKPRKKRRDK DKPKRAMSAY LVFLNKHRDA TQKKNPSWSV TDVTKELAGK WKTVTQSERD ECQRVSDEDK ARYYREMQNY VPLPDEKEEP PLRFDKDGNR KRRKKDKNAP RKNRSSYIIW AQEYRDKTFR PKANTPDAVS FRDQAAQLGA AWKACTPQQR KKYDDMAEKE AQAYAIKRDA YKAEQKAIAL AAREAKRQRL LDEKRAWEAT KIAAAQLKED KKKEKSAQLS PGKARPFGGP SSKAVPKKVV HKEADNVTMA KMRSAVLAVA PSDNAWRAVQ DVYGSAPDKV MEAFRNFVDN RKMTNFVPDH RAFMNAVLGY DMAATFMR
|
| |