Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49971 |
Symbol | |
ID | 7198654 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | + |
Start bp | 465389 |
End bp | 469010 |
Gene Length | 3622 bp |
Protein Length | 957 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184710 |
Protein GI | 219129048 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGATGCAGTA CGTAGTCGTA GGACTAAGTG TAAGTGTTAT ATCTCTCACT GTCAATCCAT CATAACTGCA AAGGAAGCTT CCACTGGTTC CTGTTCGACT CGTTGCCCAA CAAAAGCAAG GCCGTATGTG TGCGCCCCAG AACTCGGCTT TTTCTGTTAG TTCACTGGCA GAGTCTTTGG GGAAGCGGAA GCGACTCTCG TACAAAGCTT GCGTCCTTGC AACGGTTATT TGTCTTGGAA CCGCAACTGG TGGGTGTTGC GCGTTTCATC CCGTCGTTTT GTCACGGGTA TCCACACGTC GACGTGTTAG TTCATCGTCG CCTTCGTTGC CATCCGTACG GATACAGCAG CAATATCATC GAGACTGTCG CCATCAACAT GGACTAGTGC TCTTTGCGTC TATACCCGTC GTTCCCAACG CAGCTTCGAT AGCGACTGGC AACAGCACGA ATATCGATAC CAAGACAACA AGCCATCTAC AACCGAGAAC ACAACCACCA AACCCACTCT CTCGGACTAC TAAGAAGGAG TCTTCGTCAA AGCAACTCAC GACAGCGTTG GCATTTTTGA CGGGAGTTGC CGACGTTTTT CTCATTCGCA AATACCAAAC CTTTTGTACC ATGATGACGG GAAACACCAT GTGGATGATG AAAGCCGCGA CGGAGTGCCA GTTTAGTCTG GTTGGCTACT ACGTCGCCGT TATCTCGTCT TACATAGCCG GTCTCATCAT TTTTCGCAAG GCCGAAATGA CGTGGAAGAC GCAAAGTCTC GGCCGATTTT GTGCACCTTT TGTTACCATC AGCTTTCTAC TGGCCGACTT TTGGTCCTCC CGAAACGCTG CAATTCGGTG GCCCGCCGCC ACTTTGCTGA GCGCCTCCTA TGGTATCATA AACTCGGTAG GGTCGGAAAT GGCGGGGTCT TTGGCGTTTG TCGTCACCGG GCACATGACT AAACTTACTC ACGTGTTGAC GGACCGCTTT TCCAAACAAG CCGGAAACAA ACCCATCGCC GACAAGGATA AGTCCACACT CTTACAATCA TCACTAATCA TAGTGGGGTT CGCGGCCGGA GCGTTCGTCG CGTGTGCGCT ATTGTTCAAA CGTCCGCACC TTCTCGATCA ATGGGGAGCA TTTACGGCTT TAGGAATCCT CTACGGCTCA CTCTTTGTCT GGCAAGATCG AGAGTCGATC CAGGCTTGGT GGCTGGCGCA CAGATCACCA ACGTAGCTGC ACAATATGCC AAGCCGTGCA ACGATGCATC GCTGACGCGG TAAGGGTTTT TTGATCGTTA CGTTTTGTGT TGGTGGCCGA GTGGTTTCTA TTTTGGAGCC ACACGGATTG ACTCTTCTCT CGGCTAGCTT ACTATTTTCC CGGTATGCTT CACTCGTGCA TTGATAGCGA TGTTCCCTCT TTTCACGCTG CTTCGCCACG AGTAGAAAGG AAGGGATAGC CACTGCTTAA AGGAGGATTA TTTGTAACAG TAAGTCCAAA GCGGTTTTTC GGGGCCGTCG TAAACCTAAT GTAAAAATGA CAGTCGGCCT TGGCCCGTGG TAATCGTAAA CTGCGACTGG TCGTTGGGAA CGTTGTGAGT CCCGGGTGTC CGGGTTCCTG TCATGTCCGT CGTTGGGCTA ATTGTAACAA AATAATCTAG GCTCCACAAA ACGGCATCAC TTTCAACTTG AAATAACGAG CGCGGGCATC ACGGAGAAAA TTTCCGAATT TTGCGAGCAC GTCAAACCTC TGACTAGTCC ATTGTGAAAA CAGGAGGAGA AGGGTTCCGT TCTGTCTGGA CCATCGCGAC CGAGACGATG CAATCTCGAT GCTCGGTCAT GTCGCAAGCG ATTCCAATGG ATCGGCTAGT AACGGAAAGA GCAATAACTC CCAAACGGAC TCGATTGGAC CTCCACCGAT CCGAGGCGTA TACAGATCGG AGCGCCATCA GTTGCTTCGA GCTAAATTCT GGAACTCCAC CTGCGAAGAG GTCTACCCTT CGTTGGCGTA AAGGTGCTAG ATCTTGTACT TCACCTTCAT ATACGCTGCT CAATGTACTA AGCTTGCTGC TTACGCTACT GGGACAGTGC AACTATACTT CTGCCGCCAG TAACTTGCCG CCATGTTTGC CCAAGATCAA CTCTGGTCTA TCCATCATGA CAATACGGGG AGGTGCGCGG ATATCCTCTA GCTTCCAGGG AACAAAAGCA TTTACTCGGC CCGCGGTAAA GTCCCGCATG TCGACTTTGG AAGCCAGGTC TCCACCTTTG CAAGATTCTT TATCACGTTC AAAATTGCTT TTGATTCGAC TCATGTTCCT GACGTACTAC GGATCCTTAG GCACAATCAT GCCATATCTT CCCGTCTACT ATCATCACTT GGGACACGGC GGACAAATTA TCGGTTTGTT GGGCGCCGTC AAACCCTTTA CCACCTTCTT GGTTGCTCCT CTTTGGGGTT TGATTGCCGA CCAAACACAA AAGCCGTTCG TCATCCTGAA CATTACTTTT TTGGTATCCT TGGTCGGTCA ACTGCTTGTG GGTGTTCGTC ACGAAGCGCT GTATATCACG TTTATGGTGT TTCTCACGGC CGTTTTTAAT GCCCCTGTCA AGTCATTGAT CGATTCCATG GTTATGGAGC ACATTCCGGA GCAGTCGAGC TATGGCCGGC TTCGGTTGTG GGGTCAGATG GGATTTGGCG TGGCGAGTTC GTGCGTCGGT ATTTTGTTGT CCAAGAGCAA GCATGTACCG TGGCCGGACA CCAACGACTT CTCGTTATCC ACCGAGAATA CCCTTGCACG GCTTCCCTCC TTCCTGCAGA AGTTGGTGCA ATTTACCGAT AAATGGTGGC GTTCGATGAC GGGGTACAAG CTACTGTTTT TGACGTACGC TGCCCTTTCT GCACCAACTT GGTTTTGCAT TCAAGCATTC CGACAAATGG ATGAAAAAAG CAAACGAGTA GCGAAGAAAT CCAGAAAAAG AGAAGAGACT ACCAAAGTAG GCGAAGGTTT GCTACTCTTG CTCCAGAACG CCGATGCCCT TCTTTTCTTT TCTTTGGTTC TAGTCGTGGG TATTTCGAGC GGAGTAATTG AGAATTTTGC CTACGTTCGG ATGCGTGAAG TCGGTGGAAC GGGTAAACAA ATGGGATTGA GTAGGCTCGT CAGTAGTTTG GCCGGTGCTC CAATGTTTTG GTTTTCGGGA CCTTTGACAG AAACGCTGGG AGCCGACCGT GTGATTGTGC TCTCGCTACT CAGCTACGTG ACGCGATTTC TTATCTATGC TTTCATGCGT AATCCATATC ACGGCCTCCC AGCAGAAGCG TTGCGTGGCG TGACATTTGC GGCGTTTTGG TCCACAGCGA CAATTTACGC TCATCGAGTG TCGCCACCGG GACTGCACGC TACCATGCTT ATGTTTCTGA ATGCAATATA CGGAGGACTT GGACAGTCGG TGGGTGCCAT CATCGGGGGT AAAATGCAGC ATCGCTTTGG CACGGTGAAA ACTTTCCTGT ACTCGGCGGG GGTTGATCTT GTGTTCGTAT GCGGTGTGGT GGCGTATTTA AATATCCGGC AGGATTCCAG CTTTAAGAAT CCCAAGCCGA TCGTAGCCCG AAAGAGAGGA AAACAGAGTT GA
|
Protein sequence | MCAPQNSAFS VSSLAESLGK RKRLSYKACV LATVICLGTA TGGCCAFHPV VLSRVSTRRR VSSSSPSLPS VRIQQQYHRD CRHQHGLVLF ASIPVVPNAA SIATGNSTNI DTKTTSHLQP RTQPPNPLSR TTKKESSSKQ LTTALAFLTG VADVFLIRKY QTFCTMMTGN TMWMMKAATE CQFSLVGYYV AVISSYIAGL IIFRKAEMTW KTQSLGRFCA PFVTISFLLA DFWSSRNAAI RWPAATLLSA SYGIINSVGS EMAGSLAFVV TGHMTKLTHV LTDRFSKQAG NKPIADKDKS TLLQSSLIIV GFAAGAFVAC ALLFKRPHLL DQWGAFTALG ILYGSLFVWQ DRESIQACCT ICQAVQRCIA DASALARGNR KLRLVVGNVS IVKTGGEGFR SVWTIATETM QSRCSVMSQA IPMDRLVTER AITPKRTRLD LHRSEAYTDR SAISCFELNS GTPPAKRSTL RWRKGARSCT SPSYTLLNVL SLLLTLLGQC NYTSAASNLP PCLPKINSGL SIMTIRGGTI MPYLPVYYHH LGHGGQIIGL LGAVKPFTTF LVAPLWGLIA DQTQKPFVIL NITFLVSLVG QLLVGVRHEA LYITFMVFLT AVFNAPVKSL IDSMVMEHIP EQSSYGRLRL WGQMGFGVAS SCVGILLSKS KHVPWPDTND FSLSTENTLA RLPSFLQKLV QFTDKWWRSM TGYKLLFLTY AALSAPTWFC IQAFRQMDEK SKRVAKKSRK REETTKVGEG LLLLLQNADA LLFFSLVLVV GISSGVIENF AYVRMREVGG TGKQMGLSRL VSSLAGAPMF WFSGPLTETL GADRVIVLSL LSYVTRFLIY AFMRNPYHGL PAEALRGVTF AAFWSTATIY AHRVSPPGLH ATMLMFLNAI YGGLGQSVGA IIGGKMQHRF GTVKTFLYSA GVDLVFVCGV VAYLNIRQDS SFKNPKPIVA RKRGKQS
|
| |