Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_29633 |
Symbol | |
ID | 7194765 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | - |
Start bp | 61680 |
End bp | 65055 |
Gene Length | 3376 bp |
Protein Length | 949 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183150 |
Protein GI | 219125779 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCGATCGACC GAATCGTTGT TGCTTCCTTT CGTACATCTT CAAGGACTCG CTGTAACAGC CAATCCTTGT CCATTGTATT TAACAACACC CAGTCTTTCG GTTGTCACTC TCGATTGTGT AACCATTCAG ACAAGGCAGG CACTCTACGG CCAACATGGT AGAGCACGCA GCACCTTCCG CCTTGACCGA CCAGAAAGTC GCCATTGTCG GTGGCGGAGT AGCGGGATTG TCCGCTGCTT GGCACTTGTC CGTCAACACC GGTGCACACG TCCAACTCTT CGAAGCGGAA TCCCGTCTCG GCGGGCACGC CTACACAACC AACGTGGACG GGGTCGACGT GGATATTGGC TTTATGGTGT ACAACGAAAC CAATTATCCC AACATGGTGG AGTGGTTCCG GACGTTGGGT GTGACACAGG AAGACTCGGA TATGAGTCTT TCCGTCAGCC TCGACGGGGG TGACACGGTC GAATGGAGTT CCGACGGACT CGACGGTCTC TTTGCGAATC GCCGACAGCT CGTTAGCCCC CCGTTCTACC GTTTCCTCAA GGACATGATC CGTTTCAATC AGCAAGCCGC CAATATTCTG CTCCTCACCG ACGACGATCC CCGCAAACAC GTCACCACGG CACAGTATCT CCGCGAACAC GGATATTCAA CAGAATTTGC CAAATTCTAC GTATTTCCCA TGATGGCGGC GCTATGGAGT GCCAGTATGG AGGACGTCTT GCGATTCCCC GCCGCCCAGC TCATTGGATT CTTGTGCAAT CACAAAATGC TCCAACTGTT CGATCGACCG CAGTGGAAGA CGGTCGCCGG AAGGTCGCAG CAGTACACCA ACCTGGTACA GAGTATTCTC GGCTTCGAAG CGGTCCATTT GGACACGCCC GTTCACAAGG TGGAAAAACT AGAGGACCAA ACCTACCGTC TCGTTACTTT GCGGAAAGAG GACGGCGAGC ACGGATCAGC CACCGAAGTC TCCCTCGGCG TGTTTGATCA AGTAGTCTTT GCCTGTCATC CACCCACGGC CCACGACATT CTGCAACGCA GCACTAGTGT GAGCAACAAT CCCACCAATA AGGAGCACCA AGATCACCAG CTTCTCCTCC AACTTTTGGC ACAGATTGAA TACGCCGACA ACGTTGTGTA CGTCCATTCT GATCCGTCAC TCATGCCGAA ACGTCGTCAC GCATGGGCGT CCTGGAATTG TCTCGGACGC AGTCAACACA TGCTACCCTT CCATTCCACC AGTAGCAAAA AAGGCGAAGC CTTTGAAGGT GCCGAATCGG GATTTGGTAA CGTTGCACAC TCTGCACTAC CCGAAGACCA AGATTCATTC CACGAAAAAA CAGCAGTACC AGCACAGCCT ACGCTGGAAG GCATCCACGG TCGCATGAAG GCAGTCTTTG TGACCTACTG GTTGAATCGG CTGCAAAATC TTGAAACGGA CCGCGACATT TTCGTATCAC TCAATCCCCA CCACGCGCCG GAACCAGCCT TGACGCACCA GCGGGTCATT CTCGCACACC CTCAGTTCAA CAGTAAGACG CTCCAAGCCA GAGAAAAACT CGGCGCTCTC CAGGGAAAGG ACGGTCTCTG GTTTTGTGGT GCCTGGTCCG GGTACGGGTT TCACGAAGAC GGATGTCGCG ATGGATTTCG AGTCGCCACG GCCATGTCCG CTGTTCCCTT ACCGTGGGTG ACGGAGGCGC GGGCTACTGA CGAGACTCCT AGTACACCAG ATGCACTCAT CTTGCCACCA CCGGATCTCT CGAGCAATCA CACCCGAATG ACCTTGTGGC AAGCCCTGTA CCAACGTGTC ACGTACGATT TGCCCGTCGC AATCTGCCGA CAACTTTCCT TTTATTTCAT GAAGCAGGCC GTCCAAATGG GTCGCTTGCG TCTGAAGTTT AACGATGGAT CCGTGGTTAG CTTTGGCGAC GGCACACCGT GTGGCTGTGA CACTAGTGAC GTGACAATTC GCGTATTTGA TCCATGGTTC TTTGTCAAAC TGGCGACCGA ATACGACCTG GGTCTCGCAC GATCGTACAT GGCTGGGCAT TTCATTGTGG AGCCTCTCGA AAAGACGTCA TCCTACCATC CTGTCATTCG CCCGGAACAT GCGTCCGAGG AAGCAACCAT CACTCTGGGT GATCCGATTG GCTTGACCCG TCTCTTTTTG CTGCTGATCG GGAACCGTGA TGACAATGCT GCAAAAGCAC ACATACCTCG TCGAGCGGGT CGGGGGCACA AGTACGCCAA CGCGTTGTCC AACGCATCGG GATTGGTATT GGCTCAGATG GGTTCCTTCG TCAATTACCT TCGCTACAAA TTGACAATGG ACAATTCCGA ACGAGGGGGT AGCCTAAAAA ACATCCACGC GCACTACGAT CTTTCCAACG ATCTATTCAA GACTTTCTTG GACAAGGAGA CACTTATGTA TTCGTCGGCG ATTTACGATG CCGTGCCTGC TCCACGCCCG CACTCTGGAC TCGTCTTTCG CGGGTCCCTC GAAGAAGCAC AGTGGCGTAA GTTGGATACA CTGTTGGATC GTGCCCAGAT TCAGCCCGGA CAAACGGTCC TGGACATTGG CTTTGGTTGG GGCGGATTGT CTATTCATGC CGCCAAAAAG TACGGATGCA AAGTGACTGG TATAACGCTT TCAGTGGAGC AAAAAGCACT TGCCGAAAAG CGCGTCAAAG AAGAAGGTAT TGAGTCCCTC ATTACGTTCG AAGTCGTGGA TTATCGGACC TTTTGCGCGC GCAAGAGCAA CTGCGGTATG TTTGACCGTG TGCTGAGCTG CGAAATGATT GAAGCTGTTG GACACGGTCA TTTGGTAGAA TTCTTCTGGG CCGTCGAACA GGTCCTGTGT CGTGACGGCG TCCTTGTTAT GGAGGCTATT ACGACACCAG AAGAACGATA TGAGAACTAT TTACGTTCGA CCGACTTTAT CAACACCATA ATCTTTCCTG GCTCCTGCTG CCCTTCCTTA CACGCGCTCG TAGACGCCGC CTACCGAGGA TCGACGTTGA CGCTAGAGCA CGTCGATAAC ATTGGACTGC ACTACGCCCA AACTTTGGCT GAGTGGCGTC GTCGTTTCAA CGCCGAAGAA CCCTTTGTGC GCCAGCTTGG TTTTGACGAT GTCTTTTTAC GGGCCTGGAA CTATTACCTG ACATACTGCG AAGCGGGTTT CTTTTCACAA ACGGAGAATT GCTTGATTTT GGTCTTTGCC CGACCAGGAT GCAAGGCATT GACGGCTTTG TGCGAGACGC GGTCAGTGGT GCAAGCATCA CCTTTTAGCG ACAAGGAGAT CGAGACTTTT GTGGCCGAAT GCAAATAGGC AATTGGCAAG TAAAATGATC GAACATATAG CTTCGA
|
Protein sequence | MVEHAAPSAL TDQKVAIVGG GVAGLSAAWH LSVNTGAHVQ LFEAESRLGG HAYTTNVDGV DVDIGFMVYN ETNYPNMVEW FRTLGVTQED SDMSLSVSLD GGDTVEWSSD GLDGLFANRR QLVSPPFYRF LKDMIRFNQQ AANILLLTDD DPRKHVTTAQ YLREHGYSTE FAKFYVFPMM AALWSASMED VLRFPAAQLI GFLCNHKMLQ LFDRPQWKTV AGRSQQYTNL VQSILGFEAV HLDTPVHKVE KLEDQTYRLV TLRKEDGEHG SATEVSLGVF DQVVFACHPP TAHDILQRST SVSNNPTNKE HQDHQLLLQL LAQIEYADNV VYVHSDPSLM PKRRHAWASW NCIHGRMKAV FVTYWLNRLQ NLETDRDIFV SLNPHHAPEP ALTHQRVILA HPQFNSKTLQ AREKLGALQG KDGLWFCGAW SGYGFHEDGC RDGFRVATAI NHTRMTLWQA LYQRVTYDLP VAICRQLSFY FMKQAVQMGR LRLKFNDGSV VSFGDGTPCG CDTSDVTIRV FDPWFFVKLA TEYDLGLARS YMAGHFIEAT ITLGDPIGLT RLFLLLIGNR DDNAAKAHIP RRAGRGHKYA NALSNASGLV LAQMGSFVNY LRYKLTMDNS ERGGSLKNIH AHYDLSNDLF KTFLDKETLM YSSAIYDAVP APRPHSGLVF RGSLEEAQWR KLDTLLDRAQ IQPGQTVLDI GFGWGGLSIH AAKKYGCKVT GITLSVEQKA LAEKRVKEEG IESLITFEVV DYRTFCARKS NCGMFDRVLS CEMIEAVGHG HLVEFFWAVE QVLCRDGVLV MEAITTPEER YENYLRSTDF INTIIFPGSC CPSLHALVDA AYRGSTLTLE HVDNIGLHYA QTLAEWRRRF NAEEPFVRQL GFDDVFLRAW NYYLTYCEAG FFSQTENCLI LVFARPGCKA LTALCETRSV VQASPFSDKE IETFVAECK
|
| |