Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50218 |
Symbol | |
ID | 7199084 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 18412 |
End bp | 23383 |
Gene Length | 4972 bp |
Protein Length | 1307 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185106 |
Protein GI | 219129881 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0197489 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCAATA GTATCCACCT TGCCTCTAGG GAAGACGCAA GAGGAGACCA GGATGAGAGA CACTATTTCG ATGGACCTCT ATTGGATATT GAGGATGATG AATACGATGT AGAGGATGAG CCCCTCGCTG AAGTCGATAA TCCTACCACG GATGCTGGTC CTAAGCCATC CTACATGGAG CCTTCCTACG AAAGGGCCAG CGCCGTTTCG TTTTCGCGTC TGTCTCAACA AATGGAGTAT CTTTATCGAC TCAAAACACA AAATGTAGTC CATCCACCAG ACAAGGTCTC AAAACTAAAA CGACTTTTAC CGCCGGCCTT AATTTCAAAA ATCTCACGCC CCACCAAGAC CAGCGATCCG CCACAATCTC TGTTCCCCAT TCTTCGTCTG CTATTACCCG AACGAGATGG TTCACGCCGA ATTTTCACCA AAGAAAACAC ATTGGCAAAG GCTTACGGTC GAGCTTTTGG ATTACCTCCA GCTAGTACCG ACTACCAGAA ATTGTTGTAT TACTCCGATC CACATATCGT TGGATCCAAG TCCACTGGAA CAGGGGACTT CTCTGAAGTT TTGCGGCAAG TTTTGGAAAA ACGTATATCA CTTCAGAAAA ATGGATCAGG TGTCACGGTG GGACAGGTGA ATGCACTCCT GGATGAGCTG GCGGTAATTG GGAAGCCGCC AACGAGATCT AATCATGACT GGAGAAACCG CGAAGCTGAG CTGAACGACT TCCAATTGAA ACCCAAGAAT TCGGCGTCCA AAGCAGAACT CCAAGCAAAC TGGGTCACTA AGCTGCATTG TCTCGGCTTA ACTCCATTAG AGCACAAATG GATTATTCGT ATCATCTTGG AGAGGCTACA GATTACGTTG GGCTCGACGG TCATTCTTGC ATGGTACCAT CCGCTGGCCC CGTCGTTTTG GAGCGCACAC AACAGTTTAA AAGCCGTCTG CAACAAGCTT TGTCAACCTG GCGTCTTTGA GTTGCGTCAC AACCTCGCTG AAGATGGCGG AGACGACGAT GAAGCACCGT CAACTTTGGC GCCTTATTTC AAAAGCACTC ATTTGACACA CGTGCAACTA GGTAATCCTT TTTCGCCAAT GGCTTCCGAA CGAACCGGAT TTCATACTGT GTTGTCAGAT ATGTCATCTC GGCATAGAGA TTTTACTGGT GAAGAAAGGT GCTACAGGTA TGAGGCAGTG ATCAGGAGCT TGGCGTTTAA ATTTCCTACG TTTGCAATCG AAACAAAACT AGATGGTGAG CGCCTGCTCG TGCATTATTT CCGAAATGGG ACCGTTAAGC TTCATACTCG GCAAGGCAAC TGGTACAGGT ACGCAAACCA CACCCATTGA ATTGACAAAC AACCAAATCT ATTGACAAAC TGATTGCACT CCCCACAGTG AATTGTATTC GCCGGTACTT GGGCCAGCTC TCCGTCGAGC AGTGGGAAAG CACGATTTGG ATCTCATTTT AGATGGCGAA GTGTTAGCAT GGGACAACGA GCGCCACGAA ACTGTTCCTT TTGGCAACAA CAGAACAATT GCGAAGATGC GGTACAGTTG GATGCGAAAG GCTGGAATAC TGGATGAGCG CGATATCAAT CTGCACAACG GCGAGAACGA CGTCAAGCGG ATAAACTCAT CGAATTCATG GAATACTCAC AACACACGCT GTGACGAAGT AGGCGAGGAA TGTTGGCTCC AGTTTATCGC TTTCGATGTC CTTTATATCA ATGGTCCTCA CGCCCTTGAT TTTTTATCGA AGACCGTCTC ATCTTTTGCC TTAGCAAAGA AACAAAGTGG GTCTATTGTT GACCTTGAAT GCCTTGAAAG GAAAAAGATT CTATATGAAT TGATTGACGA GCAAGATAAC GAGGTCGAGA TAGTGCCGAC GGTGGTAGTG CGACCAAACG GGCAAACCTC TCCCGGACAT GATTATTTCA GACTCTCCAA TCCTACAATG GAATGCGGAT TTCCTGCTCG CGATTTGGAT TCAATTGCAC GAATGTTCCA CCAGCCCAAG AATGATCTAG GCACAATCGA TGCTCAGAGA AGACACGGAT TGAGTGACGA ACTAATGCGC AAGGCTCGAG CTCAGGCTGT AGAAAGCCAC TATCGAAAAG TCGTTGAGGA CATGAGGCTA GAAGGCCTCC TTTTCAAGGA CTTGTCCACT CCATACATTC TCGACAAAAT TTCTCGAAGT TTTGGATACT GGCGCAAGTT CAAACCCGAT TATTTCAATG GATCCGTTGC AAGCGATCTC GATGTTGTAA TTATCGGCGC ATACTTTGCC TCCGGACTTC GCCTCTCCGG CAAACCATCG AGCTTCCTTT GTGCATGTAC AGATTCTTGT GATTCTGAAT ACTATTTTCC CATGTGCAAA GTCAACGCCG GTTCTATGGA TAGAAACTCG TTTTCTCAAC TTTTGTACGA AACTGGCTTC CGCGCGCCAT GCGATCGACC TGATCCAAAT GCGGAGGAGA GCAAGATGGA ATATGGACGT TGGTTTCGTG AGGAGGACCA TGGAAAAGCG CTTCCTGATT TCATTACCGA TCGGTCTCAT CAGAACAATA GCCGGGACGG TAGATGGCGG TTTAGAAAAG AAGACTACCC AGACATATGG ATCAACCCAT CTGACTCATG TGTTCTTACT TTGAACGCGG GAGACATAGT GAGCAGCGAG GCGTTTCCAT CCGGTCTTAC CCTACGCTTT CCAAGAATTG TTAAGGTACG CATAGGTGCG GATACGAAGG ATCCCACAGA AGTCGAAAGC GATCGGAGTC TTCAAAAAAT TTTGCAACAA GTTTTGAAAG AAAGATGTGA AAGTAAAGGT GAAGCGAATG GAATCTGCTC AACGAAATCG TTCACCAGAG CCAAAGCGCA GCAGTTCTGC CGTTTTCTTA CCGAGGAGGA GTATGTCCAG GGAAAGAATA GAAAAAATTC TACCGGCCGA TCGAAGACGC TTCTCCTAGT GCCCACAAGT CAACAGGTGA TGCTTTCAGA AAGCAAGGTC CTCACTGGGT TGTCGTTCCA TGTACTGGAT GGAAGGTACT CAATAGATTC GGACGACGTC GCAGTCGATG AAGCAACAGA AGAGGGCTGG CTAGAAAAAG CCTTGTCAGT CAGAAGCAGC AATGACGTCA AACACTTTAT AGCAAGACAC AACGGAAAAG TCTTGCTGAA TCCAGATTCA AGCATGTTTG TGATAGGAGG CAGTGAAACT GATGCGCGCG TTATCACATA CATCCAAGCT ATTAATCGGG CGAGAACGTC TCTAGACAGG CGGACGAAAG AAACTGACAG ATATCGTCGA ATTGCTGGGT CCGCCGGTGT ACTGCGGTGG ACGTTCGTTT TTTCTTTCGT CCAAAGTCAT TTAGGAGACG CGGATGTATC AACAAGTGCT TTAATGTTCA ACCCTACAAT GCTCGACTTT CTTAAAAGAG CGCAAATTGA AGCAGAGGAA CTGGAATTTA TGGCTGATTT GTCTTCCAAA AGCTCTCTTC GTAGGGCTTT AGGTTTGACG TCGAAAAGAA AACGAACGGG ATCCGACAGT TTTTTCTTAG GTGATTGGAG AGAGAGGGTA TTTTCCTGCC TTCCTACGAT GGAACGTTGT ATTCTATCTT GCCGCGTCCA ATCAGTCGGC CCCTTTAAAT CAGTGAAGGC TGACACTCCA TCTAACGTCA TACCAAGCGT TGTCTGTCCT GTCACTGTTA ATGACGACAC GATTGAGTCT GTACTTCCGC TCGCTCGAAT GATGGGGGCT GTTGTGGTGC GAGAGCCACG AAAAGATCTG ACCCATGTTG TCTGTCATAT GATTGTTCAT GACATAGTTA AACACAAAAG AGGCATGTCA CTAGATCTTT TCGAAAACCG AGAACAAGGT ACAGCTTTCA AGAGAATCCT AGATGATCTT CAAATCAAGC ACAACTTAAG CCATGATATT CTCTTCGTAA CGCCAAATTG GATCCGCGCT CAGTGGCCAA GTCACGCACG TTGAGCTATT CGGATATAAA TTAGAAAGAA TCACCTCAGC ATAGTTAGAC AAAGACGCTA CTCGTTAGGC TACGATGTGC TGAACTATCC GTGATACTTT CTCTCACATG CCCTTAGTTT CTTTGGCAGT TCGCTAGACA TGAGCATTAG TGATTCGATC ACAGCCTTGT CTTCTCCTAT GCAAACATGT TCTGTATTGC GTCGGTCATC GGTGCAATCC TTCGCGCATG ACAGTTTGAG TCTTTTCTTT GGTGGTCTTG ACTGTGAATA AATGTGCTGC TCCTCACTTG TGGAAAACTT ATCATACACT TCGTCCCCAC TGGAAGCTTT ACTACTTTCT GAAACGAAGG GATTTAATCG CAGGTTGTCT CGAGGAACAA AGCGATACTG CTCGTCCACC ATTTCGCCAT CTCGAATAGG ACGCAGACAG AGCAAAAGCT CTATCACTTC TTCTCTGCTT GATGTCATGC TTTCGACTTC TGCGGATGCA TAAAGAGATT TCGCTTCTAA TGACGCCTTC GAGGTCGACC TCGATTTCGT TTCCTCCCGT GAAGAAGCGT CGCTCTCCTT ATTTAATTCC GCTGTTCGGA TCGACGAAGT AACCTCACAC CATAGCGTTG TCAAATCGTT CTTAATTAGA CAGACATTGC AAGTGGGAGC AATCGGTTTT GGTCGTCCTG TTGTAAAATG TGAGAGATTT CGATGCCTCG TCCACGGGAA GCAATATTTC AAATGGTTAT TCTTACCGTT GGACTCGGAT GAGTCACTGG AAGACGATGT CTCATCTCTT GAGCCGTTCG ATTCTCGGTA ACCAGAGTCA TCCCCTGATG AAGAGGGTTC TGGTCTATGT TGAAGAGAAG ACAATCTAGC TGATGCGTTG TTCGCCGTTA CATCAGCGCC GATCACATCA TCTTTGAAAG CTGTTCCTAA TTGCTCAACC AATTCCTGGT CTTTCATTTT CTTATTGTGC CACCGAACAT TC
|
Protein sequence | MPNSIHLASR EDARGDQDER HYFDGPLLDI EDDEYDVEDE PLAEVDNPTT DAGPKPSYME PSYERASAVS FSRLSQQMEY LYRLKTQNVV HPPDKVSKLK RLLPPALISK ISRPTKTSDP PQSLFPILRL LLPERDGSRR IFTKENTLAK AYGRAFGLPP ASTDYQKLLY YSDPHIVGSK STGTGDFSEV LRQVLEKRIS LQKNGSGVTV GQVNALLDEL AVIGKPPTRS NHDWRNREAE LNDFQLKPKN SASKAELQAN WVTKLHCLGL TPLEHKWIIR IILERLQITL GSTVILAWYH PLAPSFWSAH NSLKAVCNKL CQPGVFELRH NLAEDGGDDD EAPSTLAPYF KSTHLTHVQL GNPFSPMASE RTGFHTVLSD MSSRHRDFTG EERCYRYEAV IRSLAFKFPT FAIETKLDGE RLLVHYFRNG TVKLHTRQGN WYSELYSPVL GPALRRAVGK HDLDLILDGE VLAWDNERHE TVPFGNNRTI AKMRYSWMRK AGILDERDIN LHNGENDVKR INSSNSWNTH NTRCDEVGEE CWLQFIAFDV LYINGPHALD FLSKTVSSFA LAKKQSGSIV DLECLERKKI LYELIDEQDN EVEIVPTVVV RPNGQTSPGH DYFRLSNPTM ECGFPARDLD SIARMFHQPK NDLGTIDAQR RHGLSDELMR KARAQAVESH YRKVVEDMRL EGLLFKDLST PYILDKISRS FGYWRKFKPD YFNGSVASDL DVVIIGAYFA SGLRLSGKPS SFLCACTDSC DSEYYFPMCK VNAGSMDRNS FSQLLYETGF RAPCDRPDPN AEESKMEYGR WFREEDHGKA LPDFITDRSH QNNSRDGRWR FRKEDYPDIW INPSDSCVLT LNAGDIVSSE AFPSGLTLRF PRIVKVRIGA DTKDPTEVES DRSLQKILQQ VLKERCESKG EANGICSTKS FTRAKAQQFC RFLTEEEYVQ GKNRKNSTGR SKTLLLVPTS QQVMLSESKV LTGLSFHVLD GRYSIDSDDV AVDEATEEGW LEKALSVRSS NDVKHFIARH NGKVLLNPDS SMFVIGGSET DARVITYIQA INRARTSLDR RTKETDRYRR IAGSAGVLRW TFVFSFVQSH LGDADVSTSA LMFNPTMLDF LKRAQIEAEE LEFMADLSSK SSLRRALGLT SKRKRTGSDS FFLGDWRERV FSCLPTMERC ILSCRVQSVG PFKSVKADTP SNVIPSVVCP VTVNDDTIES VLPLARMMGA VVVREPRKDL THVVCHMIVH DIVKHKRGMS LDLFENREQG TAFKRILDDL QIKHNLSHDI LFVTPNWIRA QWPSHAR
|
| |