Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_32993 |
Symbol | HMGB3507 |
ID | 5003401 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 216219 |
End bp | 217291 |
Gene Length | 1073 bp |
Protein Length | 306 aa |
Translation table | |
GC content | 59% |
IMG OID | 640418822 |
Product | predicted protein |
Protein accession | XP_001419099 |
Protein GI | 145349350 |
COG category | [B] Chromatin structure and dynamics |
COG ID | [COG5648] Chromatin-associated proteins containing the HMG domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.161398 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.121587 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGACT CGAAGGTGCG ACGACGACGA CGACGCGAAC GGGTGATTTC GATTTCGGGC GCGCGCGCCA ACGGCTTTTT AAACCGCGCG GCGAATCGGA CGAATTGGAC GCGGGAGGGG GCGATCGCGG GCGCGAGACT GACGAAAACG CGGCGAACGC GCGATAGAAT GCGTTGCGGT TGACGGATGA GAGCTCGCCG GAGCCGCCGA AACGGGCCAT GACGGCGTAT TTGATCTTTT GCAGCAAGCA TCGCGAGCGA GTGATGCGGG AGGTGCACGG AGACGACGGC GCGCGGAAGT TTTCGCGAGA CGAAATGCAG TTAGTGACGA CGCGGTTGGC GGAGATGTGG AATAACATTT CGGAGAAGGA GAAGAAGGAG GTGCAGGCGA AGGCGGCCGC GGCGAAGGCG GAGTATGAGA TGCAGAAGGC TGCGTTTTCG CCGGCGTTGC TGAAGAAGTT GCATCGGTTG AAGAGCAAAC CGAAGGGAAC GGTCGTCGTG GAGGCGCAGG GTGAAAAGCC CGTGCGCGCC AAAACGGCGT ATTTGATCTT CTGTGGTAAG CATCGCGCGG CGGTGATGCG GAAGATTCAT CCGGAACCAG AGGCCAAGTT TACGCGCGCT GAGATGCAGC AAGTCACCAC GGAGTTGGCT GCGTTGTGGA ATAACATCTC CCCGCAGGAG CTCGCCGAAT GCAAGGCGGC GGCGGCGAAA GAGCTCGAGC GATACAAGCA GTTGAAAGCA GAATACCGCC CGCCGGTGTA CGGGCCATCC AAGCGGAACA AAGGCAAGAG CGTGCCGGGC AAGCCCAAGC GCGCTCCCAC CGCGTACCTC ATCTTTGCCG AAGAGTTGCG CGCCAGAATC AGACAGGAGC GACCGCATTT GAAGCACGAC GAAATCTCTC AAAAACTGTC TACGGCCTGG AAGGAGATCG ACGAAGCCTC CAAGAGAATC TTCCAGCAAA AGGCGGACGC AATCAAGGCG GATCTCATGC AAAACATGCC GAGTTCTGTG ATGCTCACGG GCCTGGAACA CTCCTTACCG GAGCCACACT ACAATACGCA CATGTACCCC TAA
|
Protein sequence | MVDSKNALRL TDESSPEPPK RAMTAYLIFC SKHRERVMRE VHGDDGARKF SRDEMQLVTT RLAEMWNNIS EKEKKEVQAK AAAAKAEYEM QKAAFSPALL KKLHRLKSKP KGTVVVEAQG EKPVRAKTAY LIFCGKHRAA VMRKIHPEPE AKFTRAEMQQ VTTELAALWN NISPQELAEC KAAAAKELER YKQLKAEYRP PVYGPSKRNK GKSVPGKPKR APTAYLIFAE ELRARIRQER PHLKHDEISQ KLSTAWKEID EASKRIFQQK ADAIKADLMQ NMPSSVMLTG LEHSLPEPHY NTHMYP
|
| |