Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_119295 |
Symbol | Tdc |
ID | 5000165 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | + |
Start bp | 18103 |
End bp | 21063 |
Gene Length | 2961 bp |
Protein Length | 876 aa |
Translation table | |
GC content | 57% |
IMG OID | 640415586 |
Product | transducin / WD-40 repeat protein, putative |
Protein accession | XP_001416047 |
Protein GI | 145341920 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCACGG CGGCGGCGCG CGCGCTCGCG GCAGTCGTCG ATGCAGAACG GAGCGGTACG TCGAGGGCGC GGCGCGACGC ACACGCGCGA TGTGCGCGCG CGATGAGATG AAGGAATCGC GTTTGGATGC TGATGCCGAT GATTGACGCG ACGAACGATG GATGGCGGAC GCGTCGACGC ATTCGACGAC GACGCACCGC GAACGAATCG ATCGCGATCG ATGCGCTCGT AGACTGACGC GGGTGATGAC GCACACGCAG TGACGAACGT GGCGAAAGGG CGGGTGTTGA ACCGAATGGA GGAATACGTC GACGTCGAGC GTCGCGAAGA CGGCGAAGAC GCGAGCGCGT TGGAGGTGCG ATGATGACGA CGCGCGCGAA GTGACACGAA CCCTCGCGCG TGAAATGACG TCTGATGCGT CGCAAACGGT CCGAGACTGA CGATTCTACG GTGCGGATGC CGACGACGTA CCCAATCTAG GTGGCGCCGA TTACCGTGTA CGAGACAAAA TACGTCGAGG CGCAGTTGCG AATGATCGCC GTGAACGATG CTTTCATATG TTACGGTATT CGAAACGGGT TGATTCGAGT GTTTTCTAGA AAGTCTGGCA ATGTGCGATC GTTGCTTCGC GGACACTCCG ATTCAATCTC GACGCTCAAG TTTTTGGCGA ACACCGACGT GTTGCTCTCG GTGGACGTGC GAGGACGGGT GATGGTTCGC AAGTTGAGTT TACAAGGCGC CGACGATGGG GCCTCGATCG ATGTGAGCGA CGATAACGAT GCGGGATGGA CGATCGGTTC GCAGAATTTG GTAGACTTCG ACTTCGCAGT GGACACCCCG GACGGGCACC TGCCGCCGTC GGCGTGCTGG CTCCCGCTAG ATTCGCGAAA ACCCGGCAAG TGTGAATTCG CCCTGAGCGC GGGGACGTGC GTGGTGTGCT ACGAGTGCTC GCTGATGAAG AAAGGGGATG CAATTGCGGC TGAGATCGAT CTCAACGCCC CGCCAGCCAT GGATGGTTTA CAGGTGATCG AGTTTGATGC CCCGGTGTCT TGCGTGGACA GCGCCCCCGC GGGTCGCAAG CTCGTCGCCG CGAGCGAGGG CCGTGCGTAC GTTTTGGTGA AGGAGGACGG TGAAACGGAG TTTGAGTGCA CGGATACACT TCCATGGACG GTGGAAACCG CCGTCTTCGC CTCGGCTGAA CGCGTAATTC TGGGTAACGA TAATAACTCA GAGTTGATCG TCGTCGACGT GACGCAAGAA ACTCCTTTCG CGGTGCAGCG CGTCGTCTTT AAGTCAGACA GTGACAAACT TTTCAACGTA TGCTTGCAGC ATAACCCCGA AACAGGAATC GTATTGCTTT CGAACACGCG TATGAACACG GTGTACGCTT TACACTTCGA GAAGAGCTTT GATTACATTG CGCGCTTTGA GGCGTCGCAA CCGATTTTGA GCTTTGACTC CCACGTGAAG TACGATGGCG ACGGCAGTGC GACGATGCAA TTGTTTTGCT TACAGACGCA AGCGATTCAA ACCTTGTCCA TGCCGGCTGA AGCATGCATT CCACCAGACG GAGCTCTCGA GGCGCTCGGC TCGTTGGGTA CGCCCAAGAC GAAAGGTTCG CTCGAGCACT CATCTTCAGC CACTCTGCTC ACTCCGGATA TGTTTGGCGG CGCTGAGCTC GCCGGCGACG AGGAGGACGA AGAGGAAGAG CGTGCAGCGG TAGTTCCATC GAAAAAGGCT GCTGCGGCCC AACCAGCGTC GTCCGAAGAA AGCATGGGCG AGTCTGGTGC AGAGGAAGAA GAGGGAGAAG ATGAAGAGTT TGAGTCTGCC AACATGAGTG AGAGCGGCGC TGGTGCGATT CCCGTCGCCG CCGCGTTCGA TATGAAGCAG TTGAGAGCGA CTATTCGTAA CGAAATGCGC GAACTTCTGA ACGAAGTTCG CGATGAGCGT CGTCTGGCTG CAGAAGAGCG CAAGGCAGAC ATGGAACGCG CTCGTAAGGC GAACGAAACC GCGGCAGCCA ACTTGAAGCG AGACGTCACG AATGCTATCG CGGCTCTGTT GCAACAACAT TCCGTGGACA ATGCAAAGAC TTTGGAAGCT GGCATCGGAC GCGCTCAATC TGTTGCTGCA ACAAGCGCGC AAACTGCGAT GAAGTCCATC GTCGGTCCAT CCATTGACGC CGCTGTTCGA GCGCAGATGG AAACCTCCGT CGTGCCAAAA ATGGAAATCG CGTGCTCGAC GATGTTTACG CAAGTCAAGC ACACGTTTGA ACGAGGTATG GCTGACTTGA ACACCGAGCT TCTCGCTGCG AGAGAATCTG CGGCAATTTC TCAAGCGACG CCGTTCGTCT CCGGTTTGCG CCAGGCGACG ACCGAGGTTC GCCAAGCCGC AACAGCTTTG ATGACCGATA TCCCAAATCA AGTCGCGCAA GCCATGGCGA AGGTCACGGC GCGCGCACCT TCGGGCATGG GCGCACCTCC CGGGATGGCC CCAAAGAGCG TGACGCAAGG TAAGACGCTC GCTCAAATCG AACAGCGCTT GGACCCAACT GTGGAGATCA GCAAGTTGCT ACAGGCAAAC CAAATCGACC GCGCGTTCAA TCTGGCGCTC AGCATGAGCA AGGTTGAAGT CGTCATGTGG CTCGTCAACC AAGTTGCGAG TGATCGAATC TTTGGCCAAA CACCGTGTCC GTTGTCGCAA GGTGTGCTCT TGTCGCTTGT TCAGCAGCTT TCGAGCGATT TGACGACGCC AGACGCGCCC AAGAAGCTCG ATTGGATTCG AGATTCTTGC CTCGCCGTCG ATCCGGCGGA TCCCGTGTTG CGGCAACACA TGCGGCCGGT GCTCAGCACG GTGCACCAGT CGCTCATGGC TGCGGCCAAT TCACCGACGA GTGCGCCAGA AGTACGCGCG GGGACACGAT TGTGTATCCA CGTCGTGAAT TCCATGTTAT CTTCTTTGTA A
|
Protein sequence | MSTAAARALA AVVDAERSVT NVAKGRVLNR MEEYVDVERR EDGEDASALE VAPITVYETK YVEAQLRMIA VNDAFICYGI RNGLIRVFSR KSGNVRSLLR GHSDSISTLK FLANTDVLLS VDVRGRVMVR KLSLQGADDG ASIDVSDDND AGWTIGSQNL VDFDFAVDTP DGHLPPSACW LPLDSRKPGK CEFALSAGTC VVCYECSLMK KGDAIAAEID LNAPPAMDGL QVIEFDAPVS CVDSAPAGRK LVAASEGRAY VLVKEDGETE FECTDTLPWT VETAVFASAE RVILGNDNNS ELIVVDVTQE TPFAVQRVVF KSDSDKLFNV CLQHNPETGI VLLSNTRMNT VYALHFEKSF DYIARFEASQ PILSFDSHVK YDGDGSATMQ LFCLQTQAIQ TLSMPAEACI PPDGALEALG SLGTPKTKGS LEHSSSATLL TPDMFGGAEL AGDEEDEEEE RAAVVPSKKA AAAQPASSEE SMGESGAEEE EGEDEEFESA NMSESGAGAI PVAAAFDMKQ LRATIRNEMR ELLNEVRDER RLAAEERKAD MERARKANET AAANLKRDVT NAIAALLQQH SVDNAKTLEA GIGRAQSVAA TSAQTAMKSI VGPSIDAAVR AQMETSVVPK MEIACSTMFT QVKHTFERGM ADLNTELLAA RESAAISQAT PFVSGLRQAT TEVRQAATAL MTDIPNQVAQ AMAKVTARAP SGMGAPPGMA PKSVTQGKTL AQIEQRLDPT VEISKLLQAN QIDRAFNLAL SMSKVEVVMW LVNQVASDRI FGQTPCPLSQ GVLLSLVQQL SSDLTTPDAP KKLDWIRDSC LAVDPADPVL RQHMRPVLST VHQSLMAAAN SPTSAPEVRA GTRLCIHVVN SMLSSL
|
| |