Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_33057 |
Symbol | HAG3501 |
ID | 5003090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 321091 |
End bp | 322599 |
Gene Length | 1509 bp |
Protein Length | 447 aa |
Translation table | |
GC content | 58% |
IMG OID | 640418511 |
Product | predicted protein |
Protein accession | XP_001419344 |
Protein GI | 145349859 |
COG category | [B] Chromatin structure and dynamics [K] Transcription |
COG ID | [COG5076] Transcription factor involved in chromatin remodeling, contains bromodomain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCCGCGCCCG CGCGCGTCGA ACGCGCCCGC GCGCTCGAGC GATCGGAAGA AATCTTTTCG AGCGCACGGC GCGCAGGAGA TCGCGCGGAC GCGCTCGCGG GTGGCGTGCG AGCGCGTTGA ATTAGCTCGT CGGGACATCG CGCGGAACGG CGATCGGCGA GCGCGATGCA GCGCGGCACG CGCGCGCGTG AGGACGCGAG CGATCAGGAT GATGTCTCGG GAAGCGCGCG CGGCGTGAAG CACGCGCGCG CGGATGAGAG CGGCGCGAAC GGGTCGAGGG ACGGCGCGGG GGCGGCGGAC GGAACGCCGA ATAAACAAAC GACGACGACG GGCGCGACTG CGGGCGCGAC GGGGGCGACG TCGCCGGTGT CGCCGTCGAG AGGGGCGTAC GCCACGCGAG AATCGCACCT GCGTAAGCAA GAGCGTGATG GGGAGCTGAA GTGGGAGGTG ATTAAAAACG ACGGGAGCGA GGCGAATTCG CGCTTGCTCG TGGCGCTGAA GAACATTTTT AGCAAACAGC TGCCGAATAT GCCCAAGGAG TACATCGTGC GGTTGGTTTT TGACTCCAGA CATTACTCGA TGCTGTGCAT GAAGAATGGG AACGTCATCG GGGGGATCAC GTACAGGCCG TTTCCGAAGC AACGCATGGG TGAGATTGCT TTTTGCGCGG TGAGCGCAAA TGAACAGGTG AAGGGATACG GTACGCGGTT GATGAATCAC ATCAAGGAGT ACGCTAAGGA AAAGGAAAAT ATGACGCACC TGATCACGTT CGCGGATAAC AATGCGGTGG GGTACTTTCA AAAGCAAGGA TTCACAAAGG AAATCATGAT GGAGCGCGAA AAGTGGTACG GGTACATCAA AGAGTACGAC GGTGGGACGA TTATGGAGTG TCAATTAGAT GGGCACGTCT CCTACGTGGA CTTTGTGAAT CAGATCCGAG AACAGCGCAA GGCGGTGGAG GCCAAGGTAC GAGAGATGAG CACGGCGCAC AAAGTCTACC CGGGATTGAA GGACCACTTC AAGCCGAGCG CCGAGGGCAA GTATATCCCC ATCGACGTCA AACACATCAA AGGCTTGAAA GAGGCGAAGT GGGAGGACCC AGGACTACCA AAGTACCGTT TAGTCCACCC AGGATGCGGC GATGGCATAC CCACGAAGGC AAACTTGCAT AAATTCATGA GAGCCATCGT AAACGTGATT CAGGCGCACT CCGATGCGTG GCCGTTTGCC GCACCCGTGA ACCCGCTCGA AGTCACAGAC TACTACGACG TCGTCAAGGA TCCCGTCGAT ATGGAACTCA TCCAAGAACG CGTCTCGGCG GGGAATTACT ACGTCTCTTT GGAGATGTTT TGCGCCGACT TCCGTTTGAT GTTCAACAAC TGTCGCATAT ACAACTCACG CGACACTCCG TATTTCAAAG CGGCCAATCG TCTCGAGGCG TTCTTCGAAT CGAAAATCGC CGCCGGGGTG AATTGGAAAA TTCGCGAGGC ACCTTCTCGA GGTCGATGA
|
Protein sequence | MQRGTRARED ASDQDDVSGS ARGVKHARAD ESGANGSRDG AGAADGTPNK QTTTTGATAG ATGATSPVSP SRGAYATRES HLRKQERDGE LKWEVIKNDG SEANSRLLVA LKNIFSKQLP NMPKEYIVRL VFDSRHYSML CMKNGNVIGG ITYRPFPKQR MGEIAFCAVS ANEQVKGYGT RLMNHIKEYA KEKENMTHLI TFADNNAVGY FQKQGFTKEI MMEREKWYGY IKEYDGGTIM ECQLDGHVSY VDFVNQIREQ RKAVEAKVRE MSTAHKVYPG LKDHFKPSAE GKYIPIDVKH IKGLKEAKWE DPGLPKYRLV HPGCGDGIPT KANLHKFMRA IVNVIQAHSD AWPFAAPVNP LEVTDYYDVV KDPVDMELIQ ERVSAGNYYV SLEMFCADFR LMFNNCRIYN SRDTPYFKAA NRLEAFFESK IAAGVNWKIR EAPSRGR
|
| |