Gene Information |
Locus tag | OSTLU_18991 |
Symbol | HMGB3501 |
ID | 5006737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | - |
Start bp | 186780 |
End bp | 188648 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | |
GC content | 62% |
IMG OID | 640422158 |
Product | predicted protein |
Protein accession | XP_001422680 |
Protein GI | 145356938 |
COG category | [B] Chromatin structure and dynamics [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG5165] Nucleosome-binding factor SPN, POB3 subunit [COG5648] Chromatin-associated proteins containing the HMG domain |
TIGRFAM ID | |
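The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 186780
end_bp = 188648

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS has one codon per residue plus one stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1869 0 622
```

Under these assumptions the result agrees with the listed Gene Length (1869 bp) and Protein Length (622 aa).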
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0475639 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.137229 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCGG AGTTTACCGT CGCGGCGCGC GCGCTCGGAC GCGGGAGCGG GACGCAGGGA GAGTTGACGC TGAGCAAGGC GATCGGCGCG CGGTTTCGAG CGGCGACGGG CGCGGGCGAG CAGAAGAAGA CGGAGATCGA GGCGGGGAAG GTGCGCGAGG TGCGATGGAG CGACGCGCCG ACGGGTGGGG TGCTGCGAGT GCGATCGACG GACGGACGGA CGCTCGTGCT GGGAGGGATG GGGACGGAGG ACGCGAAGAA CGCGGCGGAG TACGCGGCGC GCGAGCTGGG GTGCGCGAGC GGCGAGACGA AGATGAACGT GAACGGACGG AACTGGGGCG ACGTCGCGAT CGAGGGGAGC GGGACGGTGT TTGAGGTCGG TGGGAAGACG GCGTTTGAGA TCGATGGGCA GTACATCTCC GAGGCGACGG TCGTCGGGAA GAGCGACGTC GTCTTGCAAT TTCACCACGA CGACACGGCG GCGGAGAAGG ATTCTTTGGT GGAGATGTCG TTTTACGTTC CGCCGGGCAG CGAGACTTGG AAAGGAGACG ATATGGAAGA TCCCGATGAC ACCGCGGCGA AGCGTTTGCA CGCGGCGATC ATGTCGATCG CCGCCGCCGA CGCCGAGGCG GGCGAACCCG TGGCGGAATT CGACGGCGTC TCCATGGTCG TCCCGAGAGG CAAGGTTTCG ATCGAGTTGC ACAACACGCA CATGCGCATG CAATCCTCGA CGCTGGACTT CAAGGTTCAG TACAGCTCCA TCGTTCGCGT GTACTTGCTT CCGAAGCCGC ACTCGAACCA GTCGCACGCG GTCATCGCGC TCGACCCACC GATTCGCAAG GGACAAACGT TTTACCCGCA CATTCTCGCC ATGTTCAACG ACGACGATCA TCTCACGGTG GAACCAAACT TGGCTGCAGA CATGAAAGAC AAGTTCCCCA CCCTCGAGTC GACTTACGAC GGCTCCTCGG GAGAAGTCTT CGTGCGCGTA TTGAAAAACA TGGCTGGCGT CAAGTTGACG AGACAAAGCT TATTCACCGC CTCGGCGGGC GGACACGCCA TCCGCGTGTC GCACAAGGCT GACGTCGGCC TACTGTACCC GCTTGAAAAA GCGTTCTTCT ACTTGCCCAA ACCTCCGCTG TTGTTGCACT ACAGCGAAGT CGATGAGGTT GAGTTTGAGC GTCACAGCGC GCAGGGCGCG ACGAGCGGCA GGACGTTTGA CGTCTCTGTG AATATGAAGA ATGGCTCATC GTACGATTTC CACGGCATTC AACGGTCGGA GTTCCAGAAC TTGGTTAACT TCTTGACCGC CAAACAAGTG AGAATCTCCA ACGTCGACGC CAACGCGCGC GCGGACCAGC TCATCGCCGA AGCTTCGGAC GACGACGACG AAGGCTACGC GCGACGCGGC GACGACGACA GCGAAGAGGA CGAAGACTTC GCCGCGGGAA GCGAGTCCGA CGGCGGCGAG CCCACCGACA GCGATTCCGA CAGCGAAAGC GACGAGGGCG CGAAGAAGTC GAAGAAGTCG CCCAAAGCCA AACGCGCGAA GAAGGATCCG AACGCTCCGA AACGAGGCTT GTCCGCGTAC ATGTTCTTCT CCGCCGCCAA GCGCGCCGAA ATCACCGCCG CTAACCCTTC GTTCGGCGTC ACCGACGTCG CCAAGGCGCT CGGCGAAAAG TGGAAGACGA TCACCGACGA AGAAAAGAGC GTGTACCAGC AACAAGCCGA CGAGGACAAG ATTCGATACG AGCGCGAGAT GGAAGCCTAC CGCGCCGGCG GTTCGCAGCC CAAGGTGGAG 
ATCAAGGACG ACGACGACTC CGACGCCGAC GACGCCGCGC GCGACGTCGA CGCCATGGAC GAAGATTAG
|
Protein sequence | MSAEFTVAAR ALGRGSGTQG ELTLSKAIGA RFRAATGAGE QKKTEIEAGK VREVRWSDAP TGGVLRVRST DGRTLVLGGM GTEDAKNAAE YAARELGCAS GETKMNVNGR NWGDVAIEGS GTVFEVGGKT AFEIDGQYIS EATVVGKSDV VLQFHHDDTA AEKDSLVEMS FYVPPGSETW KGDDMEDPDD TAAKRLHAAI MSIAAADAEA GEPVAEFDGV SMVVPRGKVS IELHNTHMRM QSSTLDFKVQ YSSIVRVYLL PKPHSNQSHA VIALDPPIRK GQTFYPHILA MFNDDDHLTV EPNLAADMKD KFPTLESTYD GSSGEVFVRV LKNMAGVKLT RQSLFTASAG GHAIRVSHKA DVGLLYPLEK AFFYLPKPPL LLHYSEVDEV EFERHSAQGA TSGRTFDVSV NMKNGSSYDF HGIQRSEFQN LVNFLTAKQV RISNVDANAR ADQLIAEASD DDDEGYARRG DDDSEEDEDF AAGSESDGGE PTDSDSDSES DEGAKKSKKS PKAKRAKKDP NAPKRGLSAY MFFSAAKRAE ITAANPSFGV TDVAKALGEK WKTITDEEKS VYQQQADEDK IRYEREMEAY RAGGSQPKVE IKDDDDSDAD DAARDVDAMD ED
|
| |
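The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to strip the spacing, compute a GC fraction, and translate a coding sequence. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field in this record is blank. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first three blocks of the gene sequence are used to keep the example short.

```python
import itertools

# Assumption: standard genetic code (NCBI table 1); the record leaves this field blank.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(itertools.product(BASES, repeat=3), AMINO)
}

def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq: str) -> str:
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First three blocks of the gene sequence, copied from the record.
prefix = clean("ATGAGCGCGG AGTTTACCGT CGCGGCGCGC")
print(translate(prefix))  # MSAEFTVAAR
```

The translated prefix matches the first block of the protein sequence listed above. Running `gc_fraction` on the full cleaned gene sequence would be the way to compare against the GC content field; the prefix alone is not representative of the whole gene.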