Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50614 |
Symbol | |
ID | 7199448 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011700 |
Strand | + |
Start bp | 86298 |
End bp | 90042 |
Gene Length | 3745 bp |
Protein Length | 1123 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185564 |
Protein GI | 219130843 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTCTTTTCG ACAACAACAT TGTCCTTCAA CTATCCCTGA AACGTAATGA GAAAACGGAT GACGACTGAG CACCATTGTC TGTCGTGATC CCTGATCGCC TGACCGAGAG TCTCGTTTTC CCTCGATTCG AGGGCGTCTC GGTGCGGAAC TTGAAGCTTG CGAGCGCTTG CCGCGTCAAA ACTATGCCTG ACGTCGAACA GCATCGTAGA CAGTCGAGTA ACGCTGAACT GTCTTGGTCT GAAATCAGCC AGCTTACGGA TGAGAATTGG CAGTATATAT CGGAGGATAG CCGGAGTGGA ATTGTGGAAA TCTCCCGGGC GGGTAATCTG ACTTTCTCTT CACTCGATTT GTACGGGCGC GAAAAGGAAA TATCCACTTT AAAGGAAGCG TTCGATCGTT CCCGTTCATC GCGAGAATTG CTCTTAATCT CTGGACAATC TGGAAGCGGG AAGTCCGCTT TGGCATCGTG CGTGGAGCAG CCTGTTCGTC AAGCCAAAGG CTTTTATTTG AAAGGAAAAT TCGACCTCAA TCAACAAAGC CAGCCCTTTT CGGCCATCGT AAGCGCCTGC TCTCGCTTGT GTGAAGAATT GTTGCAACAG GAAGAGGGAC TTTCATGTCC TGGCCGAGAA CAAAATAGCG GTTCTCGAGA AGAACGATCC TCGAAGCAAT GGCAGTCTGT ATTTTCTTTA GACGAGATCA GAACTAGAAT GCACGAAGAA ATGGCAAACG AGGTTGAAGA CCTTATTTTC GTAGTGCCCG GTTTAGCAGA TGTGGTGGGT ATCGATGTGA TTCAGCCTTT GAATGATAGT ACAAAGGATG ACCCACTGGA AGCAAGAAAT CGGCTCAATG CTGCTTTTCG AAAGTTCATT CGGGTTCTTG GTTGTTTCGG ACCGATAGCG ATGTTTCTCG ATGATATGCA ATGGGCTGAT CTCGCCAGTT TGGAGTTGAT TGAATTTCTC TTCACAGACC CAGAGAAGAC GGGCCTGCTT ATTCTGGGAT GTTATCGGGA TACTGGGTTG GACGACACCA ATGGTTATCA AAAGCTACTA CGGAAGCTAA AGCGCAGAGA CGAAACCACT ATTACTGAAA TAGAAGTCCG AAACCTTTCC GTTCGGTATG TCAATGAAAT ATTATCTGAC TTGCTTCGCA CAGAAAAGGG GCGAACACTC CCTCTTGCTG AGCTAGTTCA TCGAAAAACA CATGGAAACG CTTTTTTTGC TATTCAGTTC ATTAAATCTC TGGTTGATAG CCGGATTTTG AAACCCTCTG ATAGTTCGGC ATGGGCGTGG GATCTAGAAC AAGCCGAAAA AAGCACGGAA CCAACTTCCA ACGTGATCGA CCTTATGAAG AGCAAACTGA AGAAGCTGCC GTATGACGTT TGCTTAACTC TGCAGCTCAT GGCATGCCTC GGATCAACTT TCACTTTCCG AGTCTTTCAT CTAATTGTCG ACGAATTTTA CAACAACCCC TTGCATGTGG ATCCAACCAA ACTAGCTGTT CAACCTCTCG ATCAGAGAAA GCCAGAGGAA TGTTTACAGC TCTGCGTAGA GCAGGGTTTG ATTGTTGCTC ATCGGCGGCA TTCATATCGC TGGATTCACG ACAAACTTCA AGAAGCTGCC CTCTCCCTTC TACCGAGCGA TAGTCTTCCG ATAGTTCAAT TTCGAATTGG CAATCTTTTG CGGCAAAACC TGTCTTCCGT AGAGATAGAA TCATCCGTGT TTGTTGTTGC GGCATTACTC AACGAAGGCT CAGAGGCACT TTTGACGGAT GAACAGAGGA TACAGATCGC TCATATTGAT CTGGTTGCTG GCAAGAAAGC CATTGAATCT TCGGCATTCG TTTCCGCCAA GTACTATCTC GAAAAGGGAG TAGAGCTTCT TCCAGTAGGA TCTTGGAATG AACATTACCG TTTAACTTTG AATATCTATT CGACTGCCGC TGAGGCTGAA TACTGTAACG GAAACGCAGT CATGGTTGAA CGATACTGTG ATGAGGTCCT CAAGCAGATA CACCGGCCTC TCCTTGATCG TCTTCCTGCC TATAAAGTTC TAATTGAAAC CACCGGAGCG CAAGGAGATC ACAAAAAAGC TGCTCACCTT ACTTTAGATG TTCTGTCGCA GCTTGGATGT CCGTTTCCAA AGTATGCAGT ACTGATACAA GCCTATGCTG GCCTTCAAAA AGCTAGATTA TTCCAACGGA GACTTTCGAC AGAGAAGGTC AATGGAATGC GGCAAATGAC GGATATCAAA GATATCTGGA TCATGTCTTT ACTTGACAAA ATGTTTGCTT TTGCTTATGT TGGACGCTTG CCGAATATTT TGTTGATGTC TATTCTCAAA AGCTTGCAAT GGACACTAAA GAAGGGACTG AGCGATTTTG CACCTTCTAC CTTCGCACGC GTAGGGCTAG TTTTTGCGGC ATTCCTCAAC GATCCCAAGA CGAGCGAAAT GTATGCTGAG CATTGCATGT CGCTTTTGGT CCGGACTTCG TCGCGGAAAG CAAAAGTCAA GGCGTCAATG ATTGTGCAAT CGTTTGTTTT CCATTATTTG CGACCTTTAA GCTCAACTAC CCGACCGCTG ACACACTCTT ACGAATTAGG AATGAAGATT GGTGCCATTG ATGACGCCAT GTGGTGCTTT TTCTTCGCTC AAGAAACAAA GGTTCACTGC GGCGTCTCAC TCAATAGCAT TGCCGACGAG CTAAGCGTGG CAATTAAGCA AATGCAAGAC CTAAAGCAAG TTAAACAGGA GGAACTCTGC TACATACTCG AACGCGTGGT GTTGGACATG ACTGGAAATG CCCGAGATGC CCATCTGCTG TCAGCCAAGT ATTTGGCGCA GGATGAACTA TTTGCTAGGC TACGACAAAC CGAGGACACA ACAATGAAAA TGTATTTTTT GAGGAATCGC ATGGCCGTCG CATTTCTGTT TGAGAGGTAC GATCTCCTCA TGGAACTACT GGAGGACACG GGCTATCAAA AATTTATAGA AAAGGCACAG CCTGGAGTTT TTGCCGTGCA GTCCATGACA TTCCGAAATG CGTTAGCTTG TGTATCGGTG TTTCATCGGA CTGGAGATCG CCACTATTTG AGATTGGCCA ACAGACTTGC TTGGAAAGTC AAGAAAGCTG CGAAACAAAA TGTAGGTTTC GCTAGTCACT GTCAGTCTTG GTCACCTGTA GTATGCCTTC GTCAATTTGG AACTCATTGC TCATTTGTGT CTGTTCAGAA TCCCAACCTT TTTCACTACG ATGCTCTCCT GGACGCTGAG TTTGCAGCCA CCAACGGCAA GCATGCGGCC GCACTAAAAC ATTTTGGCGC CGCGATTCTG TTGGCCGGAA GCCGGCGCTT CCAGAACGAC CAAGCCTTGA TATACGAACG CTTTGGAGAA TACAACGACC GTGAAGGTCA AAAGGACGAT GCTCGGTATA GTCTGCGACT AGCGATCGAA TCATATCAGG TCTGGGGTGC CCGTGGTAAA GCCAATCAAA TTCGGCTTAA GCACGCGGGG CTGTTGACGC CTCCAGCTGA GATTGAGGTA GGAGATTTCA ACCCTGACTT CCTTACTCTT CGCCACAAAT CCTTAGACAC TGCGCATGTG AGGGGGACAG CACAAGTGGA GAAGCCTTTA GGTGCAGCGT AAGAGCCGAG ATGCCCTCAA TGACTGGCAT CGTTGCCTCC CTTTGCCTAT CGTCACGCCG GTGACCTACA TCCAGCGTGG CATCTGTTGC GTGATAGTAA ATTAG
|
Protein sequence | MPDVEQHRRQ SSNAELSWSE ISQLTDENWQ YISEDSRSGI VEISRAGNLT FSSLDLYGRE KEISTLKEAF DRSRSSRELL LISGQSGSGK SALASCVEQP VRQAKGFYLK GKFDLNQQSQ PFSAIVSACS RLCEELLQQE EGLSCPGREQ NSGSREERSS KQWQSVFSLD EIRTRMHEEM ANEVEDLIFV VPGLADVVGI DVIQPLNDST KDDPLEARNR LNAAFRKFIR VLGCFGPIAM FLDDMQWADL ASLELIEFLF TDPEKTGLLI LGCYRDTGLD DTNGYQKLLR KLKRRDETTI TEIEVRNLSV RYVNEILSDL LRTEKGRTLP LAELVHRKTH GNAFFAIQFI KSLVDSRILK PSDSSAWAWD LEQAEKSTEP TSNVIDLMKS KLKKLPYDVC LTLQLMACLG STFTFRVFHL IVDEFYNNPL HVDPTKLAVQ PLDQRKPEEC LQLCVEQGLI VAHRRHSYRW IHDKLQEAAL SLLPSDSLPI VQFRIGNLLR QNLSSVEIES SVFVVAALLN EGSEALLTDE QRIQIAHIDL VAGKKAIESS AFVSAKYYLE KGVELLPVGS WNEHYRLTLN IYSTAAEAEY CNGNAVMVER YCDEVLKQIH RPLLDRLPAY KVLIETTGAQ GDHKKAAHLT LDVLSQLGCP FPKYAVLIQA YAGLQKARLF QRRLSTEKVN GMRQMTDIKD IWIMSLLDKM FAFAYVGRLP NILLMSILKS LQWTLKKGLS DFAPSTFARV GLVFAAFLND PKTSEMYAEH CMSLLVRTSS RKAKVKASMI VQSFVFHYLR PLSSTTRPLT HSYELGMKIG AIDDAMWCFF FAQETKVHCG VSLNSIADEL SVAIKQMQDL KQVKQEELCY ILERVVLDMT GNARDAHLLS AKYLAQDELF ARLRQTEDTT MKMYFLRNRM AVAFLFERYD LLMELLEDTG YQKFIEKAQP GVFAVQSMTF RNALACVSVF HRTGDRHYLR LANRLAWKVK KAAKQNNPNL FHYDALLDAE FAATNGKHAA ALKHFGAAIL LAGSRRFQND QALIYERFGE YNDREGQKDD ARYSLRLAIE SYQVWGARGK ANQIRLKHAG LLTPPAEIEV GDFNPDFLTL RHKSLDTAHV RGTAQVEKPL GAA
|
| |