Gene Information |
Locus tag | PHATRDRAFT_46132 |
Symbol | |
ID | 7201355 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 361326 |
End bp | 364358 |
Gene Length | 3033 bp |
Protein Length | 1007 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180622 |
Protein GI | 219119737 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GGAAGCGCCA TGGCGACCGT ATCGCCCCCA CCTAGTGGGC AAGTACTGCA GCTCGTCTCG GCCCTTTCCA ATCCAGAAAA TCACAATGTT CATGTCCAAT CAATCCGCGC CCGAGACGAG GCGCTCTCGG CCTCGGCTGA ATCTTACGGA AATCTTTGCT ATCAGTTGGC TCTCGTGCTC GTAGGCAGTG ATAATGCGGC TGAGATGACA GCGCACATAA ATCCATCCGA GCTAGATTCA TGGAGACAAG CCGATCCTTC CACGGTGCTG AGGCTACAGC AAGATATGTC CATGTGGATT CCATTCGGAC AAATGGCAGG TCTAGTGCTC AAGAATGCTC TCTTGAGGCC ACCAATTTTG CAAGGACGTC AGTCGCTTTC CATCCAACCA CCAAGCTCGG ATCTTCTTAA AGAAGCGCTG TTACAAGCCC TCGGATGTCA GCACTCTGAG CTCCGAGCGG TTGCAAGTTC TGTGATAGCC ACTTCTGCGG TTTCGGCAGA TAGTGTCCAA CCGGGGCTAT GTGTTCGCGC GTGGCCTCAG CTGATACCTG CTTTGATTGC AAACTTACAG AAGACAGAGA ACGCTGCCTT AATGGAAGGG TCGCTTGCTA CAATTCGAAA AATGATGGAA GATGGACCGA CCGAGTTGAC GCAGGAAGAA TTGGATAGTT TGATTCCTGT GCTGATTCGA TTCCTTTCAT GCAACAGTGA ATTCTGTAAA GTTGCTGCTC TGCAATCGCT CACAGCCTGT CTCTCCGACA ACGTTATGCC GAGCGCTTTG GTCCTGTACT TCAACGACTA TCTCGGCGGA CTAAGCGCCC TTTCTACGGA TCCTAGCGCC TCGATTCGAA AGTGGGTGTG CCGTTCGATT GTCACGCTGT TACAACTCCG AACCGAATAT ATTCAACCCC ACCTCCAGGC CGTCAGCCAA TTTATGCTCA CGAGTACGGC AGATCGTCAT CACGATGCAG TGGCATTAGA AGCTTGCGAG TTTTGGTTCA CGTTTGCAAC CTTGGACGAA GATGTGTGCA CACCAGCGAT GGTCGAAACA ATTGGAGGAG TTCTACCTAA ACTGATTCCC ATTCTTTTGG AGAACATGGT ATACCTTCCC GAGCAACAGA TTGAGCTCCA GGCAAGAAAC GAGATTGACC AACAGGAAGG ATACAACGGG ATGAGTACAA TCAAGCCCGT ATTTCATCGC AGCCGGGCAA AACATGTGGG CGGTCCCGAT GAAAGTAGCG ACGACGATGA TGGCTATGAT CAAGACGATG AGGATGATGG CGAGTTTGAC GACGACAATA ATGAATGGAC GTTACGTAAG TGCGCCGCAG CAAGTCTTGA CTCTCTGTCC AGTCTGTTTG GTGCCGATTC TATCCTCCCA AGCTTACTAC CCGCTTTGCA AAATGGACTC TCCAGCTCAT GTCCGTGGGT ACAGGAGGCG TCTATCCTGG CACTTGGGGC AGTCGCAGAA GGTTGTCGGG ATGCTTTGAA TGTACACATG TCTCAAATGC ACCTGTATCT AGTAAATCAT CTTGCGGCTC CTGAATCTCC CAGTACTTTA CCTCAAGTAA AATGTATTGC CGCGTGGACG ATAGGACGGT TTGCTTCATG GGCCGTAGAG CAAGTTCAAA CCGGAGCCCA AGGTCATCTA CTGGCGCACA TGACAGAGGT ATTTCTGACT CGCTTGAGCG ATAGGAACCG GAGGGTCCAA ATTTCTTGCT GCTCTGCATT CGGCGTCATT ATCGAATCAG CAGGGGATCT GATGACGCCA TATTTGTCGC ACATTTACTA CGGCCTTGTC 
TCAGCCTTGT CACGCTACCA GGGCCGGAGT CTCTTAATGA TCTTCGATGT GGTTGGAATA ATTGCTGACT GCTGTGGCCC ATCAATCGCT GAAGGGGATC TGCCATCGAT CTACGTCCCC CCATTGTTGC AGATGTGGAG CGGTTTAGCC AAGAACGACC CCACCGACCG GACGTTGCTA CCACTCATGG AGAGCCTGGC AAGTGTAGCC ATGACTTCGG GAATGAACTA TCAGCCCTAC TCGCTCGAGT CATTTGACAA CGCGATGGGT ATCATCGAAG CAGTTCAGCT AATTCTTACT GCTTCTGGCG AAAAACTGGA ACACGAAGAA GAGGCGGACC CCATTGTTTG CGCAACGGAT CTCTTGGACG GATTGGTCGA AGGTCTCGGA GAGAGTTTTC CATCGCTGGT TTCAAGTAGC CGACGATACG GGCAGCATTT TCTTCCGGTA CTTCTGGCAC TTTGCAAACA TGATATTCCC GGCGTGCGAA TGAGCGCTAT CGCTTTGGTC GGCGACCTGG CTCGCAGCTC CCCGGCTTTA CTGGAACAGG CATTGCCAGA GCTTCTGAAA GAACTCGTTG CAAATATGGA TCCGGTACAA CCGTCTGTGA GTACAAATGC AGTCTGGGCA CTGGGCGAAA TTTGCGTTCG ATGCGAACGA AATTCCTCGC CTCTGGAAGC TGTTGTGCCT GATCTTGTTC AGAATCTCAT TGCATTGTTG ATGGGCAATG GTATTGAGCG GAACGGCAGG GGATCGGATA TTCCCGGCAT CGCTGAAAAT GCAGCAGCAT GTGCCGGGCG ACTCGCCAAG GTTAACCCCC AGTTTCTTGC GCCTGACCTC CCTCGATTTT TGCTCGGATG GTGTGACGGG ATGGCAAAAA TTGTGGACCC CAAAGAGAGG CGTGACGCAT TCCAAGGATT TGTTGCTGCT ATCTACGCCA ATCCCCAGGC ATTTCAGACA TCTTCCGCAA CCGTTTCTGA TGCGATCGCA TCCATCATTT TTGCTATCGT GACTTGGCAT ATGCCAGCGG AAATACCAGA GCAATCAGTT GTCCTTCTAA ATGGAGACTA CAAATTCCGT CCGTTCCCCG CTAACGAGCC GGAACTTGGC GAAGCCCTTT TTAAACTCAT CTCAGACCTA AAGACATCCG TCGATGAGAC GACATGGAGA GCAGTTCAGC AAGGACTGCC GGTGAATATT CGACGTCTCC TCCGCGAGTT TTATAACATG TAG
Protein sequence | MATVSPPPSG QVLQLVSALS NPENHNVHVQ SIRARDEALS ASAESYGNLC YQLALVLVGS DNAAEMTAHI NPSELDSWRQ ADPSTVLRLQ QDMSMWIPFG QMAGLVLKNA LLRPPILQGR QSLSIQPPSS DLLKEALLQA LGCQHSELRA VASSVIATSA VSADSVQPGL CVRAWPQLIP ALIANLQKTE NAALMEGSLA TIRKMMEDGP TELTQEELDS LIPVLIRFLS CNSEFCKVAA LQSLTACLSD NVMPSALVLY FNDYLGGLSA LSTDPSASIR KWVCRSIVTL LQLRTEYIQP HLQAVSQFML TSTADRHHDA VALEACEFWF TFATLDEDVC TPAMVETIGG VLPKLIPILL ENMVYLPEQQ IELQARNEID QQEGYNGMST IKPVFHRSRA KHVGGPDESS DDDDGYDQDD EDDGEFDDDN NEWTLRKCAA ASLDSLSSLF GADSILPSLL PALQNGLSSS CPWVQEASIL ALGAVAEGCR DALNVHMSQM HLYLVNHLAA PESPSTLPQV KCIAAWTIGR FASWAVEQVQ TGAQGHLLAH MTEVFLTRLS DRNRRVQISC CSAFGVIIES AGDLMTPYLS HIYYGLVSAL SRYQGRSLLM IFDVVGIIAD CCGPSIAEGD LPSIYVPPLL QMWSGLAKND PTDRTLLPLM ESLASVAMTS GMNYQPYSLE SFDNAMGIIE AVQLILTASG EKLEHEEEAD PIVCATDLLD GLVEGLGESF PSLVSSSRRY GQHFLPVLLA LCKHDIPGVR MSAIALVGDL ARSSPALLEQ ALPELLKELV ANMDPVQPSV STNAVWALGE ICVRCERNSS PLEAVVPDLV QNLIALLMGN GIERNGRGSD IPGIAENAAA CAGRLAKVNP QFLAPDLPRF LLGWCDGMAK IVDPKERRDA FQGFVAAIYA NPQAFQTSSA TVSDAIASII FAIVTWHMPA EIPEQSVVLL NGDYKFRPFP ANEPELGEAL FKLISDLKTS VDETTWRAVQ QGLPVNIRRL LREFYNM
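
The coordinates, lengths and sequences in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the standard genetic code (NCBI translation table 1), because the "Translation table" field above is blank, and it assumes the coding region starts at the first ATG of the listed gene sequence. The helper names `span_length` and `translate_from_first_atg` are hypothetical and introduced only for illustration. Only the first 40 bases of the gene sequence are used, to keep the example short.

```python
from itertools import product

# ASSUMPTION: standard genetic code (NCBI translation table 1); the record
# leaves "Translation table" blank, so this choice is not confirmed by it.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def span_length(start_bp, end_bp):
    """Length in bp of an inclusive, 1-based coordinate span."""
    return end_bp - start_bp + 1


def translate_from_first_atg(dna):
    """Translate from the first ATG until a stop codon or the end of input.

    Spaces (the 10-base grouping used in the record) are ignored.
    Codons not in the table (e.g. containing N) are rendered as 'X'.
    """
    seq = dna.replace(" ", "").upper()
    start = seq.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


# Values copied from the record above.
gene_length = span_length(361326, 364358)

# First 40 bases of the listed gene sequence.
gene_prefix = "GGAAGCGCCA TGGCGACCGT ATCGCCCCCA CCTAGTGGGC"
protein_prefix = translate_from_first_atg(gene_prefix)

# The first ATG sits 9 bases into the listed sequence; the remaining bases
# form whole codons, the last of which is the stop codon.
upstream = gene_prefix.replace(" ", "").find("ATG")
residues = (gene_length - upstream) // 3 - 1

print(gene_length, protein_prefix, residues)
```

Under these assumptions the span length agrees with the listed gene length of 3033 bp, the translated prefix agrees with the first ten residues of the listed protein sequence (MATVSPPPSG), and the codon count after the 9 upstream bases agrees with the listed protein length of 1007 aa.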