Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25826 |
Symbol | PAFD3501 |
ID | 5006528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | - |
Start bp | 63860 |
End bp | 65666 |
Gene Length | 1807 bp |
Protein Length | 576 aa |
Translation table | |
GC content | 61% |
IMG OID | 640421949 |
Product | predicted protein |
Protein accession | XP_001422642 |
Protein GI | 145356861 |
COG category | [K] Transcription |
COG ID | [COG5296] Transcription factor involved in TATA site selection and in elongation by RNA polymerase II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGGGATTCCG GCGCGCGACG TTCCGGCGTT CCGCGTCACG ACGCGTCACG ACGCGACCTT CGACGCCCTC CTCGCGATGG CGTCCGACGA CGAAGCGCGC GATGACGACG ATTTCGACGT CCTCGCCGCG GCCGGGAGCT CCGACGACGA CGCCGACGCC GACGATTTCG ACGATGGGTA CGGATCAGAT TTGATGGGCG ACGCCGCCGA TCGCGCGCGC CTCGCGTCGT TGAACGAACT CGAGCGGGAA GAGATCATGC TCGAACGCGC GGACGCGCGA AAGAGGCTGG AATCGCGACG CGCGATCATC GCGGCGGCGC GATCGAAGGA ATTGGAGCTG GCTCGGAAGA AGACGGGGGC GCGGGCGATT TCGAGTAAAC AGCGGGACGA GTTAGAGGCG CTCGAACGCA TCGCGGAGGC GAAGAAACGA AAAGAACGCA GTCGACGCGT GTTCGGTGAT CTGAGCGAGG ACGAGGAAGA ATTTGGCGGG GCAGCCTCCG AATCGGAAGA CAAAGAAAGG GTCGCGAGAC CTCGGCGGAA GAAACGCGAA ACCCGAGCGG ACGTCGATCT TGAGGCTGAA TACGCGTCGT CCGACGTTCC GGCGACGCGC GATGAGATCG CGTCGGTGAC GGTGAAGAGA CATCAGCTCG AGCAGTGGGT GAACGAACCG TACTTTGAAG CCGCGGTGAC GAATTGTTTG ACGCGCGTCG GCATAGGACT CAACAAGGAA GGGCAAAACG TCTATCGCCT CGTCGAGATC GCGGGCGTCG CGGACGGCAA GTACAAGCAA TACTCGCTCA AAAAGTACGA GTACTTGGCC AACAAACCGC CGACGCAAAA GTGGTTGATT TTGCGCTGGG GAAAGTCTGA AAAGACGTTT CGACTGAGCG AGGTTTCAAA CTCCGACGTG CAAGACGCCG AGTGGAAAGC CTGGGTGGCG CACTGCGCGC AGGCGGGGTG CACCCCCATC ACCGCGCGCG ATGTGAAGAA GTGTCTTCAA GGGTTGGAAG AGGCGAAAAA TTACCGATAC ACGAGCGACG ACGTGACGAA GATTCTCGCG GAGAAGCGCG AAAAGCAAGG GACGCGTCAC AACTTATTGT TTGAAAAGGA GCAAATCAAG GCGGCGATCG CGCACGCCGA GGCGGAAAAC GATTTAGATA AAGTTGAAGA GCTTCGTCAA CGTTTCGACG AGGTCGATCG AGAGATAAAG GACAAATTGC AGCAGAGAAG CGGGTCGACG CAGGACGCCC TCGCGAACAT CAACCGCCGC AACGAAATCC AAAACAGCGA AAAGCTCTCG AAGCGAGCGT CCGAGCAAGT GGCGAAACTC AAGGCGGGCG TGTTGAACAC GGGCTCAGGC GATCCATTCA GTCGTCGCCC GACGCGTTTG ACGACGTATT GGGACATGGG CGCGGGCGCC GCCGAGCGAT CGGCGACGGC GGCGGAAGAA GAAGAAGCCG CGGCGGCGAC CGCCGCGGCG CTGGCGACCG ATGGTGGAGA CGAAGAAGAC GACATCTTTC TCACCGCGGG TGTGGACGCG CAGCAAAGCG AAAAAGAACT CGTAGACGTG CTTCAGCAGG CGCACCGCGA AACCGTCGGC TCGCTCAACA TCGATCTCAG TCGACTCGAC GCGCCCGCGG ACGCGGACGC CCTCAGGCTC AAGACGACGA AGAGTGTGCT CGATCGCGGT GCCGTACTTT TGAAATCCGC CTACGACGCT CAACGCGCCG AGCAGCTCGC GACGCAACCG CCCGGTAAGG CTTTCACCGT CGCCGAATAC TTGGCCGAAT TCGTGGCGCC GACGTGA
|
Protein sequence | MASDDEARDD DDFDVLAAAG SSDDDADADD FDDGYGSDLM GDAADRARLA SLNELEREEI MLERADARKR LESRRAIIAA ARSKELELAR KKTGARAISS KQRDELEALE RIAEAKKRKE RSRRVFGDLS EDEEEFGGAA SESEDKERVA RPRRKKRETR ADVDLEAEYA SSDVPATRDE IASVTVKRHQ LEQWVNEPYF EAAVTNCLTR VGIGLNKEGQ NVYRLVEIAG VADGKYKQYS LKKYEYLANK PPTQKWLILR WGKSEKTFRL SEVSNSDVQD AEWKAWVAHC AQAGCTPITA RDVKKCLQGL EEAKNYRYTS DDVTKILAEK REKQGTRHNL LFEKEQIKAA IAHAEAENDL DKVEELRQRF DEVDREIKDK LQQRSGSTQD ALANINRRNE IQNSEKLSKR ASEQVAKLKA GVLNTGSGDP FSRRPTRLTT YWDMGAGAAE RSATAAEEEE AAAATAAALA TDGGDEEDDI FLTAGVDAQQ SEKELVDVLQ QAHRETVGSL NIDLSRLDAP ADADALRLKT TKSVLDRGAV LLKSAYDAQR AEQLATQPPG KAFTVAEYLA EFVAPT
|
| |