Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_29967 |
Symbol | |
ID | 7195178 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 650375 |
End bp | 653608 |
Gene Length | 3234 bp |
Protein Length | 871 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183400 |
Protein GI | 219126303 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.497541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGTCTTGGT CCTCCTACGC ATTCCGGTCC TTTCCAGTTA GTCGGTCGAC CTTCTTTCTT GTCTGCTACG TCTGTTCGTG TGTGCCAAAA GAGGTCCTTT GCAATAATTA ACCATGGCGG ACTTGACAAG TATCCTGTTG GCTGTTGCCT CGTCTCCTGG TAGGTTTCGT CGAGAGAAAG CAGAGAGGAA TCGGCGTAGG GTCTCGTCAC GGAACCAGAT GACTCTCGTT GGCCACGTTT CCTTGTAGAA AGAATGTCTG TTTGTTTATT TGGTTGTCTT TGTTTACCAA TCCCAAATCT CACCTCACTT TGCCTTTGTT TGGGAATTTT ACTGCGTAGA CCGATCGGAA GAAAAACTAC TCGAAGAATA TATGCAGTCG AACTACAGTG AGTTCTGTTT AGCTTTGGCC AAGCTGCTCG CTACGGAGGG CGCACCCTTT GCAGCACGTC AAATGGCGGC ACTTCAGCTC AAAAATACGG TCCACGCCAA ATCTGCCGAG ATACTGCAGG AAAAGCACAA TCGCTGGAAG GCCACCGACG CCACACATCG AGCCGCCGTC AAGGAGTGTC TCCTTGCAGC GATGCGTTCC GGTGTACCAA AGGTGCCGCA TTTTGCCGCC GTCACTGCCG CGGAATTCGC TTCCATCGAA CTGCCTTTTA ACGAATGGCC ACAGTTCATC GCAACGCTCA TGGAAAACGT CACTTCGCAT GCACCGGAGC CCATCAAGAT TGCCTCGTTG GAATGCCTCG GATTCACCTG TGAAAGCATT GTAATTATGG AGGAACTAAT GGGAGACAAT TTCGTTCCCG AATTGGCCTC TTCCACCGTC GACACCATGC TGACAACGAT TGTGAACGGA GTGCAATCGA ATCAGACGGA TGCGATGCGT CTCGTCGCTC TGACAGCATT GAAAAACTCG CTCGGCTTTG TCCGTCACAA CATGGAACGC AAGCAAGAAC GAGATTTCAT TTTTCAAGCC ATGTGTGAAG CCACCAAGTC CAGCGATGCA CAGGTTCGGG CTCTCGCTTT TGCCTGTCTA GACCATACCG CCGAACTATA CTACGACACC CTACCGGACT ATATGACCGT CATTTTTGAG CTCACCACCA ACGCGATTCG ATCAAACGAC GAAGAGGAAA CCGTTCAAAT GAATGCCATG GAATTGTGGA CCGCCATCGC CAGCACAGAG CAGACTCTGG TGGACCAAGA CCAGGATGCG GCCGAGAGAG GGCAGCCCCT GGATCGACCT CCGTGTCCCA AATACACACT CGCTGCTATG GAAGCGTTGG TTCCGCTATT GCTAGTCATG CTTGCCAAAC AAGAAGACGC ACCCGAGGAC GACTCCTGGG GTTTGCAAGA GTCTGCCGGG GTGTGTTTAG AGACAATCTC GCAAACTGTT GAAGGATCAA TTGTTCCACA CGTCATTCCC TTTGTCACGC AACATATCCA GTCGGAAGAA TGGCGCTACC GCGATGCGGC TATCGTAGCT TTTTCCTCCA TCATGGATGG TCCCAGTACC GAGGAGCTGG CCATATACGT GAACCAGTCC ATTCCGGTTC TACTCCGTGC ATTTTCAGAT TCGAATGAGA TGGTCCGCGA CTCTGCCACA CACTGCATCT CCACCGTTTG TCGCCTCCAC ATGATTGCTG TCGACCGAGA TATAGTGCAT TCCATTATCA AAGGTCTAAT CGAGAAGCTA CGTGACTCTC CTCGTGTTGC TGCCAAAGCG TGCACGGCCC TCTTCAATAT TGCCACATCC TTCAAAAGCC CCGAACCCGA GCCGACGTCA CTTTTGTCGG AACCCATGTT ACCCCTTTTG CAAGCATTGT TACAAACCAG CGAGCGCCAA GATGCCACGG AATGCCATTT GCGTGTCGGT GCTATATCGG CAGCCAATGA CCTCGTCGCA GCCGCACCCT CGGACACCAC ACCCATCCTC GCCGAATTCC TGCCGGTTAT CATCGCACGG TACGAAGCGA CAATGCACGC ACAAGTATTA GGCAACGAAG AAAAGGAAGA GAAGGAACAA GCCCTGGGCT TGTTTAGTTC GCTTATTTCA GTCCTGTTTC AACGGCTCGA AAAACATGAC GTGCTTGCGT ATGTGGACAA GGTCATGGAA TTGTTGCTCC AGGGATTGCA ATTGCGGAAC GCCTCGTGTC ACGAAGAATT TTGGCTGGCG ATTGGATCAA TTGCCGGGAC GATGGAGGGC GAATTTATTG TAGGTATTTG ATGTAGTTGC AGCGTGTCCT ACCTTTGCTT TCCTTTGACA AGTTTTCGTG GCTGACCTCT TTTGCATCAT TTTCAGAAAT ACATGCAAGC GCTGAGCCCA GCCCTACTGA CGAGTTTGCG CGACTTCCAC GCCAAGACTC TCTGTATTGT ATCTATAGGA GTTGTCGTCG ATATTTGCTC CGCGATTGGT GATAAAATTC AACCGTATTG CGACGGTATT ATGAGCGCGT TAGTGGACTG TTTGAAAGAT TCCGTCATTC AACGAGATGT TAAACCTGTG GTATTTTCCT GTTTTGGCGA CATTGCCATG TCAGTGGGCG GTGCATTCCA ACCATACCTT CAAGTATCGA CGATGCTTCT ATTCCAAGCC TCACAACAGC AAGCACCACC GGATGACGAA GACTTAATCC TTTTTGTAAA TTCGCTTCGG CTAGGCATTC TGGAAGCCTA TTCGGGTATC ATCATGGGTC TCGCTGACGG CAACGCCCTC CAAAGTTTTA CACCCAGTGT GCCCAACATT GTACAATTCG TGCAAGTCTT AGCGGCCGAT TCGACCAAAG ATATCTATGT GTTGGAAAAG TCGGTGGCTC TTTTGGGCGA TGTGGCGCAA CAAATGGGTA GCATTCCTCA AATTCGGGAA CAATTAAACC AACATTTCGT TTCAAAGCTG TTGCAAGAAG CTCTCAACTC CAATGATGAA ACCACCGTCG ACTCGGCTAA CTGGGCGGGA AACTTGATCA AGCAACTCAT TCGAGGCAAC GCTTAGTACA CACACGGTTG TGGAAACTAG GGCGTCCTTT TGGACCACGC CGGTCGCCAT GGGACGAGAT ATGATTTCGT TCCGTGGCCA CCGATGAAAT AGTGAAACAT GACAAAATGG ACAAGATGGA GTGTACTTAT TTTGACACCC TGTTTTCTCC TCTTTTTCGA AATGAGCTAT CGAAAATCGT ATATAAATGG AATCGCATTA GAAAATTTTT AACGAATTGA CTGTGATTAT TACC
|
Protein sequence | MADLTSILLA VASSPDRSEE KLLEEYMQSN YSEFCLALAK LLATEGAPFA ARQMAALQLK NTVHAKSAEI LQEKHNRWKA TDATHRAAVK ECLLAAMRSG VPKVPHFAAV TAAEFASIEL PFNEWPQFIA TLMENVTSHA PEPIKIASLE CLGFTCESIV IMEELMGDNF VPELASSTVD TMLTTIVNGV QSNQTDAMRL VALTALKNSL GFVRHNMERK QERDFIFQAM CEATKSSDAQ VRALAFACLD HTAELYYDTL PDYMTVIFEL TTNAIRSNDE EETVQMNAME LWTAIASTEQ TLVDQDQDAA ERGQPLDRPP CPKYTLAAME ALVPLLLVML AKQEDAPEDD SWGLQESAGV CLETISQTVE GSIVPHVIPF VTQHIQSEEW RYRDAAIVAF SSIMDGPSTE ELAIYVNQSI PVLLRAFSDS NEMVRDSATH CISTVCRLHM IAVDRDIVHS IIKGLIEKLR DSPRVAAKAC TALFNIATSF KSPEPEPTSL LSEPMLPLLQ ALLQTSERQD ATECHLRVGA ISAANDLVAA APSDTTPILA EFLPVIIARY EATMHAQVLG NEEKEEKEQA LGLFSSLISV LFQRLEKHDV LAYVDKVMEL LLQGLQLRNA SCHEEFWLAI GSIAGTMEGE FIKYMQALSP ALLTSLRDFH AKTLCIVSIG VVVDICSAIG DKIQPYCDGI MSALVDCLKD SVIQRDVKPV VFSCFGDIAM SVGGAFQPYL QVSTMLLFQA SQQQAPPDDE DLILFVNSLR LGILEAYSGI IMGLADGNAL QSFTPSVPNI VQFVQVLAAD STKDIYVLEK SVALLGDVAQ QMGSIPQIRE QLNQHFVSKL LQEALNSNDE TTVDSANWAG NLIKQLIRGN A
|
| |