Gene Information |
Locus tag | PHATRDRAFT_45536 |
Symbol | |
ID | 7200729 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 490993 |
End bp | 494195 |
Gene Length | 3203 bp |
Protein Length | 839 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179657 |
Protein GI | 219117735 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.607338 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAACAAGCGT CTCACAGTCA GTCCTGTTAG AGGACACCTA TAAGGACTCC TGCGTCGATG GTCGGCACGT CAGCTTTATG CCATCACTAG ATTAAACCTA CTTCAAATCG TATAACGATA AGGATGGATG ACGGTACGCT AGCCTTACGA CAAGAATCCA AACAGAATAT GCCCTTATTG AGGTCAATAG CGTGGGTATC TTCGAGCACG CCATTTTCGC TTTGTTCCTT GATCGTTCTA TTCGCAGTGT CATTGACTTT GGACAATACA GAGGCCTTTG TATTGCCAGA CCATATTCCC CACCCACCGA GTTCTCAATC GTTATTGTTT GCCTCGCAAG AGATTGAGCA CTCAAGAAGA ACATGGGCTC AGCGCAAATC ACCTCGTGAG AAAGTGACCC TGATTCCGGT CGATACCAAA TGGTCTGGTG GTAACAAACT CGAAAAAACT GGCCACAGTA GCCGAGCTGG ATCCCCGTCT AGATTCGACC AGCTTCCGTC ATCATCAACA ACAACAATAG TATCATCACA AAGGCGCCAA AAAACGAAAC CCATGCCTGT GACCGGGTAT GATGCTCAAT CCATCGAGGT ACACTATGAC CGACGACCTT TGGAGGTTGG TTGGCGCTTA AACTCACTCG GTATTCCACT GCTGGGTACG TCGTTCATTA TTGTGATCGT GGAATGTTGC CTTCATGGTC GAAGAATTCT GCTCAAACTC ACCTTCGTAT TCGCTCTCAT ATCTAGGTTG GTACATGCGA TTATTGCTGG ATAGAGCAAT GGGTCTAGAC GGTGATGAAA ATGTGCAGCG GAAGCGCGGA CAGGAGCTGC GCGAGCATCT GATTCGCTCC AAATCAGTGG CTCTCATCAA GTCGGGGCAG GCGGCCTCAC TCCGACCTGA CTTGATTCAG AATCGTTTTT GGGCTGAAGA ACTCGGTAAA CTTGTAGATG CAGTTGGATC ATTTTCGGAT TTACAGGCTA TGAAAATTAT GCGTAATGAA TTGCGTGATA TAAGGCCGCG TTTAGATGTG ACCCGAACAT CCTGGCAAAC CGCATCAAGA GCTCGTCGGA GGAAAGGTCG TATGAATAGG GTTGAAAAAA TGGTAGAGGC GGATGATGTT CTGAACTTGT TTGAATTTTA TAACGAGAAT CTAGCGGTGG CGTCTGCATC CATTGGGCAA GTGTATAAGG CTCGAATCAG AAGTGGGCCT CAATTGGAAG CTGCGATAGG GCCTGAGCAA GCTGCTAAGT GGGGAGGGAA GGTTGTGGCT ATTAAAGTAC AGCGTCCAGA TGTGGAAGCT TCGGCTTCTT TAGATATGTA CCTACTGCGA CGAACAGCAA TGTGGCTCAG TAAAATACGA GGAGGCGACC TACCGAAAGT TGCGGACTGC TTCGGAATGC AGCTCTTTGG AGAACTGGAC TATGTTCGGG AAGCCAACAA CTGTGACAGG TTTCGAGAAC TGTACGAAGG TTGGAGCGAC ATAAAAGTAC CCGCTGCTTG CTCGGCGTTT ACCCGAAGAC GGGTCCTAGT AATGGAATGG ATAGACGGGG TAAAGGGGCC CTGGGACGGG CAAAGGGGTA TCGATATGGT TCGCATTGGG TTGCGGTGCT CGGTAGATCA GCTCATGACG ACTGGGCTTT TTCATGCCGA TCCCCATCGC GGCAATATGC TGAGCACTCC GGATGGGCGG CTGGCCTTGA TAGACTTTGG GATGATGGCA GACATAGATG AGAAGGATCG ATACGGGCTC TTTGGTCTAG TGATTGGTCT GCAGAACAAG GACTTGGCCC TTGTCACCGA 
AAATCTGTTG GAGGTAAGAA ATACAAAGAG TTTTGGAAGC TGTTGTAGTC ACAAGGGCGT TTCTCACCAA CCTATTGCTA TAGTTGGGGT TCTTGAAAGA TACGACCCAG ATTGATCAAC TAATTCCTCG GTTGAGGGCT GCTCTAATGA ATGCCACAGG CGGTAGTGGA AAAGCCTCGG ACGTAAACTT TGCGCGTCTT CAAGCAGAGC TGGATGATAT TAGTCGAGAG AATGTTCTGC GCTTCTCAAC GCCCCCCTTT TTCACGGTGA TCATTCGAAG CTTGACCATT CTCGAGGGCG TCGCGTTAAG TGTTGACCCT GCATTCAAGC TTGTTCGTGG AGCTTATCCG TATGTCCTGC GACAGCTACT TTCTCCCGAG GATCAAGTCC GTATGCCAGC AGCGTTGCAG AAACTTTTGA AACGCCTGCT CACCGTCAAC GGGGAAGAAC GCGAGATAGA TTGGGAGCGC TTACGTGATT TTCTTAGACT CGCTCAAAAG GCGGCAAGGA AATACGACCC CTCAATGAGT GAAGTAGACG ACAAAGCGTC GCTTTCTCGG CAGACGATTG AACTGTTTGT CCAATTTTTG ACCAGTAGAG CAGGTTTGTG GCAGAGATCC ATCGTCTTAT CTGTGTTGAC ATACATTCTT TTTAACTCAT CAGTTTGATT CATTTGCAGG CATCTTCCTG AAGAAGCCCT TGGTCCATGA ACTCGCCGAA GCTATTGATG GCATGGCCAG TATTGGCGAA GGCAACTTGT ACCGCATGTC TCGGGGGCTG TTACCCGCTC TACCTGGTAT GAACGGACCC GTGAATTCTC GCCGTATGGA TGAGATCTCC ATGATGCTGA ATACCTTTGA AGATGCGCTT GTGATGGAGA ATAACGACGG GGGTAGCCGC GCCCGAATGG AAGCTATTAT GGAGCTCTTC CGGGAAGTTT CCGCCGCGCT GGGGGATGAA CGGCTGCGTC AAGATGCTGG CCCGTTGTTG GTAGAACTAC AATCGGTGAT CCAGATGGTT GCTGTCGAAG TGCTGGAGAT TCGTGGGTCT CGAGCTATGC GATCCATCCT CCGCGTCTAA CTTACATTTT AAAATTGCTA GAGCTATTGT CATGATGACA CATTTATCCA AGACCTCATT ACCGACTCTC CCTCGGCTGC TCCTGAGCAT ATACGTGGGC ATACGACACG TATGGCTGGA TTGCATCAGG AACATCCTTT GGTGCCGCCT CCGGAGCTAG CGACCTTAAT TCGTCCCTTG ACTCCAGAAC GAAGAACCCG ATGTCTGTGT TAGGTAACAA AGACGGAACC CCTCCTCCGA CTACCACGGT TTCCCACTAG TTTTCTTCAA AAGTTTCGCA ACTCGTGCAA TAGAAGCTTT TTTCCTTTCC TTTTGGTACG TTC
Protein sequence | MDDGTLALRQ ESKQNMPLLR SIAWVSSSTP FSLCSLIVLF AVSLTLDNTE AFVLPDHIPH PPSSQSLLFA SQEIEHSRRT WAQRKSPREK VTLIPVDTKW SGGNKLEKTG HSSRAGSPSR FDQLPSSSTT TIVSSQRRQK TKPMPVTGYD AQSIEVHYDR RPLEVGWRLN SLGIPLLGWY MRLLLDRAMG LDGDENVQRK RGQELREHLI RSKSVALIKS GQAASLRPDL IQNRFWAEEL GKLVDAVGSF SDLQAMKIMR NELRDIRPRL DVTRTSWQTA SRARRRKGRM NRVEKMVEAD DVLNLFEFYN ENLAVASASI GQVYKARIRS GPQLEAAIGP EQAAKWGGKV VAIKVQRPDV EASASLDMYL LRRTAMWLSK IRGGDLPKVA DCFGMQLFGE LDYVREANNC DRFRELYEGW SDIKVPAACS AFTRRRVLVM EWIDGVKGPW DGQRGIDMVR IGLRCSVDQL MTTGLFHADP HRGNMLSTPD GRLALIDFGM MADIDEKDRY GLFGLVIGLQ NKDLALVTEN LLELGFLKDT TQIDQLIPRL RAALMNATGG SGKASDVNFA RLQAELDDIS RENVLRFSTP PFFTVIIRSL TILEGVALSV DPAFKLVRGA YPYVLRQLLS PEDQVRMPAA LQKLLKRLLT VNGEEREIDW ERLRDFLRLA QKAARKYDPS MSEVDDKASL SRQTIELFVQ FLTSRAGIFL KKPLVHELAE AIDGMASIGE GNLYRMSRGL LPALPGMNGP VNSRRMDEIS MMLNTFEDAL VMENNDGGSR ARMEAIMELF REVSAALGDE RLRQDAGPLL VELQSVIQMV AVEVLEIRGS RAMRSILRV
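
The length and GC fields above can be sanity-checked against the coordinates and the sequence text. The following is a minimal sketch, not the database's own method: it assumes Start bp and End bp are 1-based and inclusive (which is consistent with the reported 3203 bp), and the helper names `gene_length` and `gc_fraction` are hypothetical. How the record's GC content figure was actually computed or rounded is not stated here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of a sequence.

    Spaces, line breaks and any other characters are ignored, so the
    10-base groups used in the record can be pasted in unchanged.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc_count = sum(1 for b in bases if b in "GC")
    return gc_count / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp fields of this record.
    print(gene_length(490993, 494195))
    # Only the first two 10-base groups of the gene sequence, for illustration.
    print(gc_fraction("CAACAAGCGT CTCACAGTCA"))
```

Pasting the full Gene sequence field into `gc_fraction` gives a value that can be compared with the GC content field, keeping in mind that the record reports a rounded percentage.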