Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_15581 |
Symbol | GTE3502 |
ID | 5002026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 769870 |
End bp | 772449 |
Gene Length | 2580 bp |
Protein Length | 859 aa |
Translation table | |
GC content | 58% |
IMG OID | 640417447 |
Product | predicted protein |
Protein accession | XP_001418103 |
Protein GI | 145347283 |
COG category | [B] Chromatin structure and dynamics [K] Transcription |
COG ID | [COG5076] Transcription factor involved in chromatin remodeling, contains bromodomain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.251833 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCGG TGGCGGCGGC GGCGACGGCG CCAGCGGGGG CGGCGGATGG GGAAAAACCA GACGCGCGCG CGACGGAGGC GGGAGATGGG GCGATGACGG ACGCGCCGAA TACGCCCGCG GAGGATGGGG CGATCACGGA GGCGGCGGCG GCCGCGGGTG CGAACGCGGC GGCGGCGACG TGGCCGGCGA ACGCGGTGTC GGCGGCCCGC GGGGAGATCT TGTTGACGCT GAATCAGTCG ATGAAGGCGC CGAAAGGGTA CGCGTACATG CCGGCGGGTG ATTACGCGGC GTTTGACTCG AATTGGTTGG AGAGTAAGAA ACGCGTGCGT AGACCGTCGG CCAAGGTAGA TGACGGGATG TTCGCGTCGG GCGACGACGA GGGCGGCGGG ATGTTTGCGG CGCCGGGCGA CGATTCGGCA AAGCTTCCAG AAAAACCGGA CGAAGAGTGT ACGGAAGAGG AGCTTGCTTT GCGGAAGCTG CAACGCAAAG AGCAAAAGCG TCGTAAGAAG GAAGAGTTGG CGAAGCGTCT CGCGGCCATC GAGGAGGCCA AGACCAAGGT GGCGGAAATC GAGGCGAAGA AGCAGTTACA ATTCGCCAGG TTACCTCCGC CGCGTCCGGC GTCGGCGCGA CCGATCAAGG CGACGCAACC GTACGATGGT TACGGCGTCT CGTCCTACGG GTCGAAGAAG AAGGGCTCCG CGGCAAATCT TGCGGGCGTC AAGAAGTTAA GGGATGGCCG CGCTGTCACG ACGCAGCGTT TGCAAAAGGA GTACGAAGTT GAGCAGGCGC GCGACGCTCG ACGCAAGGAA ATGATAAGAC GTTGCCGTGA AGTTTTAATA GCGTCGAAGA AGCACAAGTA TCACAAGATT TTCCTTGTGC CGGTCGACCC GAAAAAGCAT GGGGTGCCAG ACTACTTTGA CATCATCAAG AACCCGATGG ATATGGGCAC GGTGAAGACG AAGCTCGACA CCAAGGCATA TCTCAACCCC GCTGAGTTTT GCGCCGACAT GCGCTTAATC TTCTCCAACG GCCTTCTGTA CAACGGCACG GCGTCAGATG CTGGCGTCAT GACTGAAACC GTCCGCCAGT TATTCGAAAC CGCGTGGTTG AATAGCGATC TCGAAGACTA CGTCTCAATC GAAAACGAAA TTCGAGAGCA CGAGGACATT GAAATTCGCA ACACGCCCGC CACGCCCATC AGTCTTGAGG TGGCTGTCCT CGAAGACGCT AGAGCGGAAC TCGAAAAGGT CAGACGCGAG ATTGAGGAAC TGAAGCGCGC CAAGCAAGAA GTTATATACA AGGAAGACGA CGAAGATGAT TGGGACGACG AACCGATGCC GCGACCAAAG TCTCGGTCTC GCGGGGCGGT CAAAAGACAA AGAGACCCAG ATGAAGATCC CGACTTCGAC TTCGATGACG CGCGCATGGA TGAAGACACT TACAAGGAGT ACAAGGAGGA AGGCCGGATT CGAAAGGAAA TCAAGCGAGA GCGTGGATTC AGGAGCGAAG GAGGCTACTC GCGTCCGCGA CCGGCGCCGA CGCCCTCGCG CGATATGACT TTTGATGAAA AGTCTGAGCT CACAATGTTG CTTGGAGAAC TTCCCGAAGA CAAGCAGGAT CGAGTCGTAC AAATTGTTAG TGAAGCGAAA CAAGCTTTGG GCCAAGGGGA GGAAGATGAG ATCGAAATCA ACATCGAAGA GCTTCCCGCG GCCACGCTCT GGAAACTTCA CAAATACGTC AATGGCGTGC TTCGTCCGAA GAAGCGTAAG CTCAACGCCG CTGAGCAGCT TCTTGAGGCG AAAATGCGTG AAGCGCAAGC CGCGCGCGAG CTTGCGGCTG TCGAGCAAAC ATTGCAGCAG GTTCAAGAGA CCGGAGGAAG CTACCAAGAC GTCGTTCATC TCAACAGCGC CGGGCCGTCG GCTCAAAAGC CAGAAAAGAA ACCGGTCGAC GACGACTCCG ATACCGACAC CGATTCCGGA GATTCGGATT CAGACACTTC GAGCATGGAT AGCGCGGAAG GTGCTGGCAG TAATCCGGAG AAGCGCGCCG CCGTCACCGG TGCCGCCGCG CCCGTCGACC AGTCGTCTAA CCAGTTCGCC AAGGAGGCTC CGGGCATCAA ACAAAACGTC TCCAAGGCGG CGGTGAACGT CCAAAATCCG TCAGGTTGGG AAAATCTGGC GAGTTCTGAC GCGCCAGTGA ACGCGAGCGC GCAAGCGGCG ACGCAGGATG CCATTCCGGA TGACTTGTGG AATGAGTTCG AAGCCGCGGC TCAACAAAAG CAACAACTCG ACGAATCCAG AAAAGAGGAC GAGGCGCAGG CGCAAGCCGA GCGCGAGCGA CTGGAAGCGG AAAAGAAAGC AGCTGAAGAG GCGAAAAAAG TTGCCGAAGA AGCCGCAAAG CAAGCCGAGG CCAAGGCGCG AGAAGAGGCG CGCGCCAAGG CGCGAGAGGC GCTCGAAAAC CAAGAGCAAA CCATCGATCT CGAGGAACAA CGCATCGCGA TGAAGCAGTT CGGAGGCGAC GGTATGGGCG GAATGATCGG ATCCATCGAT CCGAGCAAGC TCAAGCAAGG CTCGGAGTGA
|
Protein sequence | MTAVAAAATA PAGAADGEKP DARATEAGDG AMTDAPNTPA EDGAITEAAA AAGANAAAAT WPANAVSAAR GEILLTLNQS MKAPKGYAYM PAGDYAAFDS NWLESKKRVR RPSAKVDDGM FASGDDEGGG MFAAPGDDSA KLPEKPDEEC TEEELALRKL QRKEQKRRKK EELAKRLAAI EEAKTKVAEI EAKKQLQFAR LPPPRPASAR PIKATQPYDG YGVSSYGSKK KGSAANLAGV KKLRDGRAVT TQRLQKEYEV EQARDARRKE MIRRCREVLI ASKKHKYHKI FLVPVDPKKH GVPDYFDIIK NPMDMGTVKT KLDTKAYLNP AEFCADMRLI FSNGLLYNGT ASDAGVMTET VRQLFETAWL NSDLEDYVSI ENEIREHEDI EIRNTPATPI SLEVAVLEDA RAELEKVRRE IEELKRAKQE VIYKEDDEDD WDDEPMPRPK SRSRGAVKRQ RDPDEDPDFD FDDARMDEDT YKEYKEEGRI RKEIKRERGF RSEGGYSRPR PAPTPSRDMT FDEKSELTML LGELPEDKQD RVVQIVSEAK QALGQGEEDE IEINIEELPA ATLWKLHKYV NGVLRPKKRK LNAAEQLLEA KMREAQAARE LAAVEQTLQQ VQETGGSYQD VVHLNSAGPS AQKPEKKPVD DDSDTDTDSG DSDSDTSSMD SAEGAGSNPE KRAAVTGAAA PVDQSSNQFA KEAPGIKQNV SKAAVNVQNP SGWENLASSD APVNASAQAA TQDAIPDDLW NEFEAAAQQK QQLDESRKED EAQAQAERER LEAEKKAAEE AKKVAEEAAK QAEAKAREEA RAKAREALEN QEQTIDLEEQ RIAMKQFGGD GMGGMIGSID PSKLKQGSE
|
| |