Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46361 |
Symbol | |
ID | 7201629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 117231 |
End bp | 119186 |
Gene Length | 1956 bp |
Protein Length | 594 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180758 |
Protein GI | 219120020 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.299005 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCTTTGACTG TGAATCTCAA TCGGCAAACG ACGTCTGTGG AGAAAGTGAC GGTACGAATT CTTTCTACAT GAATAAACAG TGTTGTCGAT ACTACTAGCA AGGATCTTCT AAGCCGACAA AGCCGGAATC AGAACTTCTC TGGTGAGACA ATTGTAGATG AGACGAAAAA AATGTCTGGA GACTGCGTAC CACCACCCTC CACCATTTCT TTTCCGTCCG TGCCATCATC ATCGTCCCGT TCCGATTTGG CCCCTCCTTC TTCCGCCCAT ACTCCACAAG ACGCCAATGC CATCGGCAGC AACACACCAT CGCTCCTGTT TTCCCCGCCT CGTCAAGCAT CTTCACAATC GTTTTCCTTC GTGACTCGCC CACGATTTTT ATACGCGACC GTTTTCGTCT GGATCAGCGT GACCGGTGGA CGTTTTCTGG CTCCATTTTT GCGACACCAG GCCGGACTAG ATGATGGACA GATTGGAACC GTCCTAGCTC TGCAAACACT ACTGACAACT TTGACGGGAG GATACGGGGG AGCTTGGGCC GATCGGCGCG AAATGATGTA TCCTCATCGT GGTCGAGCGC AGGTTCTTGG TGTCGGTGTG ACAATGGGAA CTATCTCGTT TGTGTTGCAC AGCGGCATCA ACTGGAACAT TTCTACCAGT CGCAGTGGCT CCAGAATTAA CGACGATGGT GACGAGGGCA ACGACCACAA GTACGAAAGG TATTTGCATT ATGCGCTGCA ATGCGGATTT GCCCTTGCCA CTTCCTTAGT TTTTCCCGTA TTGGACGGCA TGTGTTTGGC ATATTTACAA GCTGCTGCCT TACCAAAGCA AGCCTACGGA AGGGAGCGCT TGTACGGCGC CATTACTTGG GCGGTGACAA ATCTCTGTCT AGCACCCTTG CTAGATTGTA TAGGATTTGT GATCCTGTAC TGGCTGAGCT GCCTTTCCTG TTTGGCCGTG TTGGTGAGCA TAGTCGTCTA TGTACAGGCA CAACAGCAGG TTTCACGACA ACTTCTTAAG CAAAAGAGCC AAAATATAGC AGTTGAGGAG GTGACCCACG ACGAAGATGA CGAATTTTGG GGAGGTCGCG ATCGTATCCG CGACAGCCCA ACCGATATCG GAAATACATT CGCGGATGCG GACAACTCCC ATTGCAGCCA AAGTTCGTCG GGTCTGCATA CAGCTAGCAC GGCTGACCGA CGACTAACAA CGATACAATT GTGTCGATCC TTGTACGGCA CTACCTTTGG ATTCGCCTTT TTGGTTGCCG TCCTGTCTCT CGCATCGGGA CAAGCGATCG TAGACAGTTT GAGCTTTCTA TACTTTGAAA CTTTGGGAAG CTCGTACATG ACAATGGGAT TCATGATCTT ACTTACGGTA GCGTTTGAAA TCCCCATTTT CCACGTTGCC CCCAAATTAT TGGAGTACGC TGGTGCTGGT GGACTACTCT TGTTGGGTGG TGCCTGTTAC GTGACCCGCA CAATTGGATA TTCCTTCATT CCACAAGGCA AAGTTGGATG GGTTTTGTGG TTGGAACCAC TGCACGGAAT CACGTACGCG TGTAGTCAAA CCGCCACGGT CGACTTCGTA GCCCAGCTCT TACCAGACGC CGGTTACGAA GCGACCGGAC AAGGTTTGGT ATCAGTGACA CGGGGTGTTG GATCAATGTT GGGATTGTGG CTAGGTGGTA CGGCTCAAAA CATATTTGGT GCCCGCATCG TATATCGAAT TGCGTCAGCC GTCGTCTTAA CAGGATCCAG TATTTTTGCG TTGACATTAC TCGGGATGAC CCATTCCGTA ACCCACGTAT CTCGGAGCCA TTACATGTTG TCGCAACTCG ATGGTGATGA CGACTTGGAT GTAGGCAAGA GCGATTTGGA ATTGACAGCG GTACAATCGA ACGACGATTC GTCCAGTCAA AGCGAGGAAA AAAATTCGGT AGGATACAGC AAGTAG
|
Protein sequence | MSGDCVPPPS TISFPSVPSS SSRSDLAPPS SAHTPQDANA IGSNTPSLLF SPPRQASSQS FSFVTRPRFL YATVFVWISV TGGRFLAPFL RHQAGLDDGQ IGTVLALQTL LTTLTGGYGG AWADRREMMY PHRGRAQVLG VGVTMGTISF VLHSGINWNI STSRSGSRIN DDGDEGNDHK YERYLHYALQ CGFALATSLV FPVLDGMCLA YLQAAALPKQ AYGRERLYGA ITWAVTNLCL APLLDCIGFV ILYWLSCLSC LAVLVSIVVY VQAQQQVSRQ LLKQKSQNIA VEEVTHDEDD EFWGGRDRIR DSPTDIGNTF ADADNSHCSQ SSSGLHTAST ADRRLTTIQL CRSLYGTTFG FAFLVAVLSL ASGQAIVDSL SFLYFETLGS SYMTMGFMIL LTVAFEIPIF HVAPKLLEYA GAGGLLLLGG ACYVTRTIGY SFIPQGKVGW VLWLEPLHGI TYACSQTATV DFVAQLLPDA GYEATGQGLV SVTRGVGSML GLWLGGTAQN IFGARIVYRI ASAVVLTGSS IFALTLLGMT HSVTHVSRSH YMLSQLDGDD DLDVGKSDLE LTAVQSNDDS SSQSEEKNSV GYSK
|
| |