Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_29260 |
Symbol | |
ID | 7203024 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 772638 |
End bp | 775129 |
Gene Length | 2492 bp |
Protein Length | 684 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | transketolase |
Protein accession | XP_002182454 |
Protein GI | 219124318 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.029788 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTTGTTTCT CCTTACAATC TCATAATTTT TCAATCGCGA ACAGCCAATC GCGATAAGAA AAGACAATTC ACCACTCCAA AGTGACAAGA AGCAACACCA GTCATAATCA TACTAGCACC AACAGCATGT CGGAATTGGA ACAACTTTGC GTCAACACGA TCCGTGCCGT CTCGGCCGAT CAGCCGCAAG CGGCGAACTC CGGTCATCCA GGAGCTCCCA TGGGTTGCGC TCCCATGGCG CATCTTCTTT GGGGTGAAAC CATGATGTAC TCGTCCAAGG ACCCAGCTTG GATCAACCGC GATCGCTTTG TGTTGAGTAA CGGACACGCC TGTGCGCTGC AGTACACCAT GCTTCACCTC ACTGGGTACA ACTTGACGGC CAAGGATCTT TCCGAGTTCC GCAAGATCGA ATCCAAGACG CCAGGGCATC CGGAGTGGTA CGAAACGGAA GGTACGTTCA AAAACGCAAG CTTGCAGGCT GGAATAAGCG CATGAGCGAT ACAGAGAATG CTGAGCATGC GCCGCAACTA TGTAGTAGTC GGATGAAAGG CTTGTTCCTT TTTTGTCCCA TGATTGATGC ATACACATAT TCACCATCGC TTGTTTCCTT GCGGTAGGTA TCGAAGTCTG CACGGGACCC TTGGGTCAGG GAATCTCCAA CGCCGTGGGT TTGGCCATCG CCGAGCGTCA CCTAGCTGCG ACATACAACG AAGACAACTA CCCTATCTTC GATCATCACA CCTATGTCAT CTGCGGAGAC GGTTGCTTAC AGGAAGGCGT CTCGGCAGAA GCCTCTTCCT TGGCCGGACA CCTTGGTCTC GGACGAGTCA TTGTACTCTA TGATGACAAC AACATCACCA TTGATGGATC TACCGAGCTA AGCTTCACTG AGGATGTCCT CAAGCGCTAC GAAGCCTACG GATGGCACAC TCAGACCGTC GACGAAGTGG TGGAATCGCT GGACGCTCTG CGCACGGCCA TCAAGAACGC GCAGGAAGTC ACCGACAAGC CCTCTCTCAT CAAGGTCAAG ACATTGATCG GACAGGGAAG TCCCACTAAA CAGGGCCACC ATTCGGCGCA CGGGGCTCCC TTGGGTACCG ATGATCTGGC TGCGGCCAAG AAAGCGTGGG GCTTGCCCGG GGACAAAATG TTCTACGTGC CTCCGGAAGT ACAGGAATAT TTCGATAAGG CTGCCGCTAT CGGCGATGCC AAACGAATCG CGTGGGAAGA GTTGTTTGCC AAGTTCTCGG CAGAATTTCC CGATAAGGCG GCCGAAATCT CGCGCCGCTT CGCAGGCAAG CTCCCTGATG GGCTACTCGA CAAGCTGCCC AAGCATGAGT TTGGCAAGGA CAAGGAAGTG GCAACGCGAA AGTCCAGTCA AATGTGTCTT GAGGCTATCG CGCCGCACAT GCCGGAACTC ATTGGTGGGT CGGCCGATCT GACTCCATCA AACTTGACGG ACTACAAGGA TGTAGTCGAC TTCCAGAAGG ACTCCTACAT GGGACGGTAC CTGCGTTTCG GTATTCGTGA GCATGGTATG GTTGCTATTA CCAACGGTAT CTTTGCTCAC GGTGGTCTCC GCCCCTATTG TGCGACCTTT TTGGTCTTTG TCGGCTACTG CATAGGTTCG GTCCGCCTTT CGGCTTTGAG TCAGTTCCCG ATCTTGTTCA TCATGACGCA CGATTCCATT GGCTTGGGCG AGGATGGTCC AACTCATCAG CCGATCGAAA CCTTGGAAAG TCTTAGATCC ATGCCTAACA TCAACGTGTA CCGTCCAGCG GACACGAACG AAACTGCTGG TGCGTACCAT GTTGCCTTGA CGAGCAACAA GACGCCAACC GTCATTTGTT GCTCGCGCAC TACGACCAAG GCTTTACAGG TTTCGTCTAT CGAGTTGGCC GGCAAAGGAG GGTATGTTGC TGTCGAGACG GAGAACCCGG ACTTGATTTT GGTAGCCAGT GGCTCGGAGG TGGGACCGTG TGTCGATGCG GCCAAGAAGC TTACCGCGGA AGGTATCGCC ACCCGTGTGG TTTCCATGCC GTGCCAAGAA GTCTTTTTGG CTCAGCCGGC ATCGTACCAA CGAGTCGTGC TTCCTGGAAC TGTTCCGACG TTGTCTGTGG AAGCGGCTTC CCCTCACGGT TGGCATCGGT TTAGTCACTC TCAGATTGCG ATGACCGAGT ACGGGTGCAG CGGTGCGAGT GCGGATGTTT TCAAGAAGTT TGGGTTTACG GTCGAGAACA TTTCCAACAA AGGGAAAGAG TTGGTCGACT TCTACAAGCA GGCTGGCAAT ATTCCCGACC TGAACAACCG TCCCGTCTTT AGTGCCATGA ATGGCGGTGC GCACTAAAGA AGCGTCTTTG GATGAACCTT GGTGCGGTCC CTTATCCCAC TTTGCCTGGC GATTGCGACT GTACTTCGTA ATCCATTCAG CTTCCCGTTG GACTTAAATA TATATAAAAG GCTGTAAACA AAATAAGAAA CGCTTAGCCA TT
|
Protein sequence | MSELEQLCVN TIRAVSADQP QAANSGHPGA PMGCAPMAHL LWGETMMYSS KDPAWINRDR FVLSNGHACA LQYTMLHLTG YNLTAKDLSE FRKIESKTPG HPEWYETEGI EVCTGPLGQG ISNAVGLAIA ERHLAATYNE DNYPIFDHHT YVICGDGCLQ EGVSAEASSL AGHLGLGRVI VLYDDNNITI DGSTELSFTE DVLKRYEAYG WHTQTVDEVV ESLDALRTAI KNAQEVTDKP SLIKVKTLIG QGSPTKQGHH SAHGAPLGTD DLAAAKKAWG LPGDKMFYVP PEVQEYFDKA AAIGDAKRIA WEELFAKFSA EFPDKAAEIS RRFAGKLPDG LLDKLPKHEF GKDKEVATRK SSQMCLEAIA PHMPELIGGS ADLTPSNLTD YKDVVDFQKD SYMGRYLRFG IREHGMVAIT NGIFAHGGLR PYCATFLVFV GYCIGSVRLS ALSQFPILFI MTHDSIGLGE DGPTHQPIET LESLRSMPNI NVYRPADTNE TAGAYHVALT SNKTPTVICC SRTTTKALQV SSIELAGKGG YVAVETENPD LILVASGSEV GPCVDAAKKL TAEGIATRVV SMPCQEVFLA QPASYQRVVL PGTVPTLSVE AASPHGWHRF SHSQIAMTEY GCSGASADVF KKFGFTVENI SNKGKELVDF YKQAGNIPDL NNRPVFSAMN GGAH
|
| |