Gene Information |
Locus tag | OSTLU_41510 |
Symbol | |
ID | 5005078 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | - |
Start bp | 93162 |
End bp | 94446 |
Gene Length | 1285 bp |
Protein Length | 383 aa |
Translation table | |
GC content | 61% |
IMG OID | 640420499 |
Product | predicted protein |
Protein accession | XP_001421048 |
Protein GI | 145353498 |
COG category | [K] Transcription |
COG ID | [COG5095] Transcription initiation factor TFIID, subunit TAF6 (also component of histone acetyltransferase SAGA) |
TIGRFAM ID | |
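The Gene Length value is consistent with treating Start bp and End bp as inclusive coordinates. The sketch below shows that arithmetic; the inclusive convention is an assumption inferred from the numbers in this record, and the function name `span_length` is hypothetical rather than part of any database tooling.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the Start bp and End bp fields of this record.
gene_length = span_length(93162, 94446)
print(gene_length)  # 1285
```

Because the gene is marked as spliced, this span is expected to be longer than the coding sequence alone: a 383 aa protein corresponds to 1149 coding bases plus a stop codon.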
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.0135123 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0116799 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCGC CGAGCCGCGT GCACGTCGAT TCCGTGCGCG CGATCGCGGC GACGATCGGC GCGCCGCCCG TGGACGCCGA CGCCGCGCGC GCGCTGGCGA GCGATTGCGA GTACCGGCTG CGACAGGTGT TTCAAGACGC GATGAAATGC ATGCGGGCGT CGAAACGAAC GACGCTCTCG GCCGAGGTGC GCGCGAAGAC GCGCGCGAAG ACGCGATGCG AACGACGACG ATGACGAGAC GCGCGCGCGA TCGACGCGGG ATGGACCGAT GGACGACGGA CTGACGAATC GGACGACGAC GACGTCGATC GAACGCGCGC AGGACGTCAA CGCGGCGCTG CGATTGAGAA ATTGCGAACC GCTGTATGGA TTCGGGGCGG GAACGAGCGA TTATGAATAT AAACAGACGC GCGAGGATCC GGATGTGTTT TACGTGGAGG ACCGAGAGAT CGACATGCGG GAATTGCTGA CGAGGAAGTT GCCGAGACCG CCGATCGAGG TGAACTTGGT GCCGCACTGG TTGGCGGTGG AGGGAGTGCA GCCGATGATT CCAGAGAACC CGATGGTGCC CGCGGCGGAA CCGGTGGCGA TCGAACCTCC GCGTGGGATG AAACGGCCGC GGCCGCGAGC GATGGGGGCG AAAGAGAATG GTGGGGATCC GGACGCGGGA GGATTGTTAC CCGTGGTGTC GCACACGCTG AGCCGGGAAT TGCAGTTTTA TTTCGACAAG GTCACGGCGC TGATTCGACA GGCTGGACGC GCCGATGCGA GCGACCGTGA AGTTGAATTG CTCTCCACCG CGCTGCGATC GCTGAGCGCG GACGTCGGTC TGCACAATTT GATGCCTTAC TTTACGCAGT TCATCACCGA GGAGACGACG CAAAACCTGA GGGATTTGCC GCGGTTGCGA GTGTTGATTC AGATGATTCG GGCGTTAATT TCCAACCCCG ACATAAACGT GGAGCTGTAT TTACACCAGC TCATGCCGAG CGTGGTGACG TGCGTGGTGG CGAAGCGTTT ATGTCAAAAT TTGGACGAAG ACCACTGGTC GCTGCGAGAC GACGCGGCGT ACACCATGGC GTTCATTTGC GGCAAGTTCG GCGACGCCTA TCCAAGCATT CGACCTAGGA TCACGCGCAC GTTGTTGCGG GCGTTATTGG ATACCAAACC AATGACCACG CACTACGGCG CGATTCGAGG CTTGCACGCT CTCGGTCCCA AGGTCGTGCG AGAGACGGTG ATGCCCAACT TGCGCTCGTA CTTGAACACG CTCGAGCCGT TGCTC
|
Protein sequence | MSAPSRVHVD SVRAIAATIG APPVDADAAR ALASDCEYRL RQVFQDAMKC MRASKRTTLS AEDVNAALRL RNCEPLYGFG AGTSDYEYKQ TREDPDVFYV EDREIDMREL LTRKLPRPPI EVNLVPHWLA VEGVQPMIPE NPMVPAAEPV AIEPPRGMKR PRPRAMGAKE NGGDPDAGGL LPVVSHTLSR ELQFYFDKVT ALIRQAGRAD ASDREVELLS TALRSLSADV GLHNLMPYFT QFITEETTQN LRDLPRLRVL IQMIRALISN PDINVELYLH QLMPSVVTCV VAKRLCQNLD EDHWSLRDDA AYTMAFICGK FGDAYPSIRP RITRTLLRAL LDTKPMTTHY GAIRGLHALG PKVVRETVMP NLRSYLNTLE PLL
|
| |
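The sequences above are displayed in space-separated blocks of 10 characters. As a minimal sketch, assuming the blocks are purely a display convention, the spacing can be stripped before computing a value such as GC content. The helper names `normalize_sequence` and `gc_fraction` are hypothetical, and the excerpt is only the first three blocks of the gene sequence, so its GC fraction is not expected to match the whole-gene figure of 61%.

```python
def normalize_sequence(blocked: str) -> str:
    """Remove the whitespace used to display a sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First three blocks of the gene sequence in this record, for illustration only.
excerpt = normalize_sequence("ATGAGCGCGC CGAGCCGCGT GCACGTCGAT")
print(len(excerpt))                    # 30
print(round(gc_fraction(excerpt), 2))  # 0.7
```

Applying the same two helpers to the full Gene sequence field would give the length and GC fraction for the whole gene span.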