Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_46869 |
Symbol | |
ID | 7204719 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 668917 |
End bp | 672931 |
Gene Length | 4015 bp |
Protein Length | 956 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185761 |
Protein GI | 219121059 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0965928 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGACTGTGAA AACATGTTGC TTCCATTTCT ACCTATTGGA CTCGGGATCG GCAACAAGAA AAGTAGCTAC ACGCAAGGAG CCTTTTTCTT GGAGCCGAGT AGTTTCCTCC AAATCTTGTT TAGCCTGGAT TGATACCATG ACAACGACTC CGGAGAATCC GCCAGTCCCA CGAACAGAAG CGCAGAAAGA TTCAGTGATC TTTGCTACAC CCGAGATTGT TGAGGATCCG CACCGGAACG ATCCTGTACT GCATATCGCC AGCGATAGCA AGCCTAAGAC AACCAAAACG AATGTCACTC AGAATGCGGA AGAAAATTTA ACAGCTACAA CCGGTGGTGA AGGTAAAGGA TTCTCGGCAA GGTGGACCGG TGCCATCAAT ACTATTCACG GTCCCATAAT GAAAGGCTTG CTCTGGACGA GCGGAACTGC GGCGCGCAAT CCCCGCAAAA CGGTGGCCGG CGTAACGTTC TTGTCGTTCT TCCTTTTGAT TGTAGGATTT TTGACTAATT TTTCGGTCGA CGTGGATGAA GACGTCCTGT GGACCCCAGA AAATGCTCGT CCCGTGAAAC AAGGCGCCTG GATATCGACT GCAGGCTACC CGGAAGAAAA TCGGGATATG CTACTCTTTT TCCACGCCAA TACAGAGAAT GTCCTTGGGC AAGCGCAGAC TCGCAAAGTA TTTACTGCAC TCGATACGGT TCGCAACTTG CCCGGCTATA GAGCCTTGTG CGCCCAAAGT AGCTACGTGG ATCCACGAAC GGGACAGCGC ACGTGTGAAA TTTCCGGAGC GACTGCGTTC TGGAACGACA CAGCTGCCCT CTTTGAATCC CAAGTTGACA ATGATAACGA TGCGATTCAA CAACTGTCAG CGCCGGTCTA TCCCGATGGC ACGCCCGTTT CGGACAATGA TATTTTTGGG AAACCACTTC GTAACGAAGT ATCAAGGACG TTGGAGTCAG CTCAAGCCTA TGTTCTTGTC ATAGGACTGC CTGACACAGA GGAAACGGAG GACTTTGAAC TGGAAGCGCT TGACGCGATC CTCGACCTGC GGAATAACAT CTGGAATGGA AATGAATTCA ATGTGGAAGT GACCACCCAA CGATCTTTTC CCGACGAGTG AGTAAGCAAG AGAGATGCCA ACAGTCCGTT TTGTGATCAA CGTCTTACAT CGATATCTCA TCCTGGGCGC TTCACAGATT CACTCGAGGT ATCGTGCGGG ACATTCCACT TGTACCAACT GTCTTTATCA TCATGTCGAT TTTTACCTGT ATTGTTTTTG CCAAGAGGGA CAAAGTCCGT TCACGCAGCT TGCTGGGGCT CAGCGCGACA ATTTGTGTCC TATTGAGCAT AATGAGTGGT TACGGGCTTT TGTTCATAGC CGGTGTCCCG TTCACAAGCA TGACCCAGAT TCTTCCCTTT ATTATTTTCG GCATTGGTCT GGACGATGCT TTCATTGTAT CGGGTGCCTA CGAACGTACG GATCCAGCCA AGAATCCCGT GGACCGGATT GAGGCAACCA TTGAAGATGT TGGAGCCAGC ATTACGTTGA CAACAATTAC TTCGACTTTT GCCTTTGGTC TCGGTGCAAG CTCGGACGTT CCTTCAGTGT ACTGGCTATG CTACTACGCG TTCCCTACTG TTATGCTGGT GTTTCTCTAC CAAATTACGT TTTTTGTAGC AACTATTGTG TTGGACGAAG AGCGGATTTG TGCCAACCGT CGAGATTGCT GCATCTGGGT GACCGTTCAA AAACGAGAAG GTACGGACGA GGATGTTGTT TCAGATGTCT CTCATACTGA TATTGCGGAA ACTGTCAACG ACACGCGAGT ATCTCCTGTA GATTATTGGA TGGGGGTGTA CTCTCGCCAA CTTCTACGTC CTGCCGTAAA GGTTTTTGTG GTGCTTTTCT TTTGTGGTCT TCTCGGTGCG TGTGCCTACA GTGCCACCAA GTTGACGCAG GAATTCAAGT TTACTGAGGT TCTTCCCGAT GATTCGTACT TGTCCGCCTT TCAGTTTGCC TTTGACGACA ATACGTTTCG TTCCGCCGTC GCCCCTTACG CTTACTTTCG ATTCGTCGAT CAGAGCGATC CAATGATTCA GGCACAAATG GAAAGCTACG TCAACGAACT CGTGTCAATT TCAGCTATCG AAGAACAGCC CAGCTTCTTT TGGCTCCGTG ACTTTCAACA GTACAGAAAT GAGACTGGAC TTACCAACAC CAACAATACA ATGTTTGCTA GCCAAGTACA GTCGTTTCTC TCCACGGACG TTTTCGCAGC GCTCTACCAA GATGACATCG TCCTGGACGA TACCGGCAAC ATTGTGACTT CTCGAGTTCG CCTTAATATG GACAATGTTG ATCTAGAAGA CGTCAACGAA CAAATAAAGG CCCTCGACGA TCAAGCCGCG GTAACGGCGG CCCAGCCTGT AAACCAGCAA GGCGCTGATT GGTCGTTCTT TTCCTACGAC AGTATCTTTA ACATTTGGGA GTTTTACGCC GCATCGGTCG ATGAAGTTAT ATTTACGACG GTGATGGGCA TTTCGGCCGT CACTGTGCTG ACTCTTTTAT TTGTTCCGCA CTGGACAGCC GCACTGTTCA TCTTACCCAT TATTTGCGTC CTGTACATAG ATTTGTTGGG AGTGATGCAA TGGGCTGGCG TCCACATCAA CGCCGTGAGT TACATTACCT TGGTCATGTC GATTGGATTG ACGGTGGATT TTATTTTGCA TGTGCTGCTG CGATACTATG AGTCGCCAGG AAACCGTGAA GAAAAGACTT TGTATACGCT GCAGACCATG GGAACGTCCG TGTTGATCGG TGGCGTTTCC ACTTTCTTGG GAACACTGCC CTTGGCTTTC AGTACGAGTG AAATCTTTTA CACAGTTTTT GTCTCTTTTA TTGGCCTCGT TACGTTGGGT TGCGGGCACG GATTGATTCT GCTGCCGGTG CTATTGTCAA CAATTGGACC AGAGGACCAA ATTTGGGAAG TAGAGAACCG CGCTCCAGCG ACCGAAATCC ACGAAGCCTT GGTCACTTTT CCGGCGAGTG AAGAGGACAA GGTAGGAAAG GAATTGGATA CCGAGTGAGC GCGTGCGAGC TAGTCCAGTT GTGGGCGCTA GGTTTGAGAA TATTTGTATA AATTAAACAC GACAGACAGA GACACGGGCC ATGAATAATG CCAGCATCCT ACAGTAGCGA TACCCCCATG TTTGCGTGTA ATTTTGTTGT CACTGCGCTT CAGTGAAATC GGCTACACGG AAGGAATCCG AAAGACGTTC AGTGTATACG CCACGTAGGT ATAGTACCCC GCAATACTGG TCAATTCCAC CAAAACCGAG TCTTTCCCGT CCAAGGCTTT CTTAGTGGCA GCGTAGGTTT CCTCGGACAC GTTGTACGTG TCCAACAGTT CGGCGGTAAA GGTGGCAATC GCCCGGTCTC GATCCGTCGT CAAGAATGGC CAAAGCTTGC GTTGCACCGC AGCGACACTA AAGTCGTCAT CGCGTGGAAT GGCGTCGACA AGTTCCCGTG ACCATCCCGC CTTGAGGGCT TCCCCCACGT GTATATCGAA TTCGGCGTGC GACCGTGTTT TGGCGCCCGT GAGGAGTATC ACGAGTTCGG ATTCGGCGAA TGTAAGGGAC GTGCCGTACC GACACGCCCG TCCCAGTGCC TGGGCCGGCT GTGCAATGTC CGGGACGGCC AACCAGGGAC CAAAGGGACC GGACAATCCC GTGCGGGGCC GGGATTTAAG AATGCCGTGA CGAATTTTCT TTTGTTGGCG AGTCAGTTGT TCGACGGGTG GTCCCGTGTA GCGCGGAACC AAGTCGCGGG ACGTGTTCTC CCAGAAGGTA GACGACTGGT GGATGAATTG AGAGGGTATT GTCAAGAGGT AGGCACCGTA GATGGCGACG ACGCCGACAC TGGCAGTTGT AGCCCACGAA CCCATGGCGA GACGTCTTTT GGGCGTGTGA AGGGCTCGCT GGTCTACAAG GGTGATGACC AAACTTTTTT TACTC
|
Protein sequence | MTTTPENPPV PRTEAQKDSV IFATPEIVED PHRNDPVLHI ASDSKPKTTK TNVTQNAEEN LTATTGGEGK GFSARWTGAI NTIHGPIMKG LLWTSGTAAR NPRKTVAGVT FLSFFLLIVG FLTNFSVDVD EDVLWTPENA RPVKQGAWIS TAGYPEENRD MLLFFHANTE NVLGQAQTRK VFTALDTVRN LPGYRALCAQ SSYVDPRTGQ RTCEISGATA FWNDTAALFE SQVDNDNDAI QQLSAPVYPD GTPVSDNDIF GKPLRNEVSR TLESAQAYVL VIGLPDTEET EDFELEALDA ILDLRNNIWN GNEFNVEVTT QRSFPDEFTR GIVRDIPLVP TVFIIMSIFT CIVFAKRDKV RSRSLLGLSA TICVLLSIMS GYGLLFIAGV PFTSMTQILP FIIFGIGLDD AFIVSGAYER TDPAKNPVDR IEATIEDVGA SITLTTITST FAFGLGASSD VPSVYWLCYY AFPTVMLVFL YQITFFVATI VLDEERICAN RRDCCIWVTV QKREGTDEDV VSDVSHTDIA ETVNDTRVSP VDYWMGVYSR QLLRPAVKVF VVLFFCGLLG ACAYSATKLT QEFKFTEVLP DDSYLSAFQF AFDDNTFRSA VAPYAYFRFV DQSDPMIQAQ MESYVNELVS ISAIEEQPSF FWLRDFQQYR NETGLTNTNN TMFASQVQSF LSTDVFAALY QDDIVLDDTG NIVTSRVRLN MDNVDLEDVN EQIKALDDQA AVTAAQPVNQ QGADWSFFSY DSIFNIWEFY AASVDEVIFT TVMGISAVTV LTLLFVPHWT AALFILPIIC VLYIDLLGVM QWAGVHINAV SYITLVMSIG LTVDFILHVL LRYYESPGNR EEKTLYTLQT MGTSVLIGGV STFLGTLPLA FSTSEIFYTV FVSFIGLVTL GCGHGLILLP VLLSTIGPED QIWEVENRAP ATEIHEALVT FPASEEDKVG KELDTE
|
| |