Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47702 |
Symbol | |
ID | 7202707 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 577252 |
End bp | 580530 |
Gene Length | 3279 bp |
Protein Length | 994 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182092 |
Protein GI | 219123563 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTGACAGTG AGCAGGATTT TGAAAAGGTC TTGTCCCGGA CGAGTAGTTG TCGTCTTTCT TCTTCCTACA TGGAAATTAT ATTCTTACGT TGCTACTCTT CCAAAAGTTG CCAATCCTTT GGGAGGCGTG CCGCAATTGA CAGCCGCGGG TAGCGAACGC TTTTTTCACG GATTCTCGCG AGAAGCTCGT TACGTCTTTC CACTTCCGGC ACGATGCTAT CGTCGCTGCG CTACTCCTCC GCGAGATGTC GGGTGTCACT GGCATGTCGT TACCAAACTA AGCCATGCGC TGGCAATCCT CGGTGTGCAA TTCCTATTCA AAGTAAATCT GCCGTGGGCT TCTGGAAATG TGACACTTTT GGAGACCAGA GAGCATTGCC AATTCGAAGC ACCCGATGCG TGTCGACCGA TACCAATACT CCTTCAATAG ATCCCACCAC TTTGTCGACA ATCGACCACG TGACGAAAGA CACTAATCCT CGCACTGCGA CGGAAGATTT GTACGAAAAA CTCGAGCATA CATCCCTGTC GCTCTTGACT CAAACTCCCA TTTCTCTCTC GCACCAAACC GACATTGTCA AAGTTTTGGA AGCTTGGAAG CTTCTGTTAC CCAAACTTCA AGAGCTGGAT GAAATACCTG ATAGAACGCT GCCTACGAAT CAATGCAGTG CGGCGATATC CCGTATGCAG GACCTTTACG AACTTTGGAA ACGCCGACCA ATTGTCACTA ACCGCCCCTT TCAAATCATG CTGGAAGTGT TCGCGCACAC GCCGACGATC AGCGACAACA GTAATGCGCG TGGTATGGCT GCACTCCAGG TTCTGGAAGA CTGGAACCGA TCGTTTATGG GGGACATGGA GCTAGAACCG CGCCGTACAG ATTATCACCT TGTCCTTCAC GCCTTTGGCA ATCAACCCCT ATCTTTGAGT TCCGCATACA TTGAGCCAAC TACTGCCGCC CAACCCGGCG AAGTCGCGCA GGAAATTATG ACCCAACTCG TGGCCTGGGG TGTCACCATG AAACCCACTG CCGAAACCTA CCAATTGGCG ATAAGGTGTT TGACAAGGGG CATCCAGCAA CTGACGACAT CCTGGGATAA CGAACGTGAC ACCGAGGTAA TGTCGAACGT TGAGCAGCTT CAACACACGG TGCGTGAGTA TGCGTCGCGT CTTATCAACT TAACGGTATC CTCTGGCTCT TCATCGTCCA TGACCATCTG GCTGGGGCTT TCTGACGCCT TCCAAGCGAT GCATGTACCG GTGGATCGGT TTGAAGAGGA CGGCCCCAAA AGAATTCAAG ACGCGGATTG GTATCGGCAA ACTTTACCGG TATGGAAGCA GGCACTAATC GAGAGCCGAA GAGCATTGGA CCGGAGAATA TTGCGTGAAA ATATGGATCG CACGTGTAAA GCTCTTCTGT TACTGCACGA GCGGTACTTG GAACCAGCAG AGCAAATCAC GGCAATACGA GATACGCTGG AAGAACTCAA GGGGCTGTAC ACAAGCTTGC CGTCGGCTGA GCACTACTGT ATTGCCATAA ATAGTCTCAC TTCGATGAAG GAGTCGGTGG CAAAAAAATT CGCTCTAGAC CTGGCAGTTG GCATGGAAAA GCAACACCGG CGTAAATTCG ATTCCAGCCA AAGCGATGGA TGTTTGGACT CGGAAGAGAT GGTGAAGTCG TGGAATCTAC TTATGACGGT GTACATGCAA GTAGGAGAGC TGTCTAAAGT ACTCGATCTC TGGAAAGAAA TGACGGTCAA TAAAATACCA CGGAACAGCT TAAGCTTGTT CCTGGTCCTT AAGGTCCTGG CCCAGCAAGA AACATCGCAG TCAGCCAATC AGGCGACATC GATCCTCTTC AAATATCTAT CCATGGACCG CAAACCCTTT GAACCTACCG CTGAGCATTT TCGGTGCGTC ATGATGGCAT GGTGCAACAG TCGAGATCGC CACGCAGCCT CTCAGTGCCA ACGCGTCTTT GATCGGATGC TACAATACAG TGCCGAGTCG CGAAAACAAT CGAGAAGCGA CGGCACAGAA AAGCCGACAC TAGAGCCACA CGCACTTCAT TTTACTGCAC TGATTAAGAC ATTGGGATAC AGTCGACGAC CCGATGCTGC CAAGAAAGTG TCGGCTTTGC TCGTTGAAAT GTTGGAGTTG GGTCTAGATC CTGATTTGCA AACGTATACG TCTTTTTTCT CGGCGTTGTC ACATACGAAG AGTCTTCAAG GCGCAGAGGA AGCCCAGGAA TGGTACGACA AACTGCAAAA GGAATGCTCC AAGGTAGTAC TAAATGTCCA CTGTTATGCG TCGGTTATGT TCGCCTGGAC CAAAAGCGGC GCCGTCGATG CACCTGAACG ATGTCGAGCT ATCTTTGACG AATTATGGAG GGCTTACTTA TCTACCCCTG CGTCCGAAGA AGGGGATCTC CGTCCAACTA GCGCCGTATA CCGCGCTTTG ATGGAAGCCT GGGCTAGTAG TGGACGAGCG CAGGCACCAG ACGAGGTAGA TGCGCTACTG TCCTTAATGG AGAAAAAGGC CGAGCAGGGC TTGATCGATC CCCCTGACAA AAAAGTATAC GCTTTGGTTA TGGCAACTCA TTGGAAACAC AATGACCGCA ACGCTGTAGC AAAAGTACAA GACGTATATA GTCGCATGAC TGCGAGTTAC GAAATGGGAA ATATTGCGGC TAAACCAGAC GCTCACTGTC AGACTATATT GATGAATGCT TGGGCGAAGA GCGACGTCCC CGAGCGGGCA AAGATTGTGT TGGATCTTCT CCGGGAAATG TTTCAAGCTT ATAGCCAAGG TGACTTGGAT ATGCAACCCA ATGCCTATGC CTTGGCGGCC GTGCTGAACG CCTGTGCTTT TGTCGATAAG GACAATGAGA CTCTTCGACG ACAAGCCGTG CAAATCGCTC TGACTGCGTT TAATGACTTC TCAAACAGTG AGTTAGAGGG GACGAACCCC TTTATCTACT GCTATCTTTT TCGAGTACTT GGCCATCAAG TCGATGACAT GGTCGAAAGA ACGCGTTTGG CCAGTGTTAT CTTTCAACGT GAGTCGATCG TGAAAACCTA GCTATTTTCT CGTCGCTCCT CTCTTACGTT CTCACACGTT GGCTCTTCCG TCGCAACAGG TTGCTGCCAG GAAGGATTCG TCGATGACCA AGTCATCAAA ATGATGAGAC GCTATGTTCC CGTATTGTAC AAAAAGATTC CATTGGATGG CAAGAACAAA CCTCGTTTGC CTATCGGCTG GACGCGTCAA CTGGACTAG
|
Protein sequence | MLSSLRYSSA RCRVSLACRY QTKPCAGNPR CAIPIQSKSA VGFWKCDTFG DQRALPIRST RCVSTDTNTP SIDPTTLSTI DHVTKDTNPR TATEDLYEKL EHTSLSLLTQ TPISLSHQTD IVKVLEAWKL LLPKLQELDE IPDRTLPTNQ CSAAISRMQD LYELWKRRPI VTNRPFQIML EVFAHTPTIS DNSNARGMAA LQVLEDWNRS FMGDMELEPR RTDYHLVLHA FGNQPLSLSS AYIEPTTAAQ PGEVAQEIMT QLVAWGVTMK PTAETYQLAI RCLTRGIQQL TTSWDNERDT EVMSNVEQLQ HTVREYASRL INLTVSSGSS SSMTIWLGLS DAFQAMHVPV DRFEEDGPKR IQDADWYRQT LPVWKQALIE SRRALDRRIL RENMDRTCKA LLLLHERYLE PAEQITAIRD TLEELKGLYT SLPSAEHYCI AINSLTSMKE SVAKKFALDL AVGMEKQHRR KFDSSQSDGC LDSEEMVKSW NLLMTVYMQV GELSKVLDLW KEMTVNKIPR NSLSLFLVLK VLAQQETSQS ANQATSILFK YLSMDRKPFE PTAEHFRCVM MAWCNSRDRH AASQCQRVFD RMLQYSAESR KQSRSDGTEK PTLEPHALHF TALIKTLGYS RRPDAAKKVS ALLVEMLELG LDPDLQTYTS FFSALSHTKS LQGAEEAQEW YDKLQKECSK VVLNVHCYAS VMFAWTKSGA VDAPERCRAI FDELWRAYLS TPASEEGDLR PTSAVYRALM EAWASSGRAQ APDEVDALLS LMEKKAEQGL IDPPDKKVYA LVMATHWKHN DRNAVAKVQD VYSRMTASYE MGNIAAKPDA HCQTILMNAW AKSDVPERAK IVLDLLREMF QAYSQGDLDM QPNAYALAAV LNACAFVDKD NETLRRQAVQ IALTAFNDFS NSELEGTNPF IYCYLFRVLG HQVDDMVERT RLASVIFQRC CQEGFVDDQV IKMMRRYVPV LYKKIPLDGK NKPRLPIGWT RQLD
|
| |