Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44005 |
Symbol | |
ID | 7203978 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 669627 |
End bp | 673216 |
Gene Length | 3590 bp |
Protein Length | 1135 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186393 |
Protein GI | 219113619 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.861173 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGATCAC CACGCCGTCT CATCTTTGAG CCACTTCCTC CGCCCCCAAC CAGTCCAGAC ATGGACCACG ACGCATCTCC TTTTCTTTCA CAAAGGTACA CGACTAGCAG GCGCCGAACG CACACTTTCC GACGAGTCCA ACGTGTTTCT CGCCACGCAG CTCTAACCAC GAAACAACAT CTATGGGATT CCAATGATGC AAGCATTGAC GCTGCCGTCG CAGCTGTCGA TCAGTGGGAA TCTGCCTACA ACGCGTTAAG AGGTCTGGTC GTTGCCGGTG TTCAGTCTGC GAAAGGTGTC TATGGCGGTC TCAAGGAAGG GGCGGGAAAA ATTGAAAACG GTGTGTTGCT ACCTGTTAGA GATTGGATTA TCCTACCAGC CTTCTTTGGT GTTGAACACG CAACGGCGGA AACAGCGAAG TTTCTTCAAA GTGAAGCGGC GCATCAGCTC GCGGGTCAGT CACTTGAGCT CGTTAAAAAG GTTCCGATCG TAGGGGACAA CGTGCTCGCA CCTGCTATGT ATTTTTCGGT TGGACTCGTT CAGCGAACCT GGGAAATTGT ACAGTACCCA ATTCCATCAA AACAACAAGT TCGAGGCTCG GTCGAGCTTG CTTTAAATGG TACAAAATGG GCGCTTTCGA CTGTTGGACG TGAAGTTTAT TTGTATTTTA AACGTGCGGA TGCCAATATC ACTCGCACAT TAATGCATAC GCAATGGAAA GTGCTAGGAA GTGGCCCCTA TGCAACGCTT GATAAGCTGA ACAAGAGTGA AGTCATCAAT CATTTATGTG AACGATACTT CAGTCTTGAG GATGCAATAT CACGATATGA GCTGGCGGCT CACATTAAAA GCCATAATGT ACCATTGTAC CATGATCTTG TCGTGTCAGG TTTGTTGAAG GATAGAGGAG GGGGCATTAC AGAAGACGAT GAATGGCTCA GTTCATGGCC TGTATATCGA CATCTTGAAG ATCCATTTCT GATCCCTGAG GAAGAGAAAC TTTCCGGCAT CGAAATATCG TCTCTTTGGT TCCGTTTGCC CTATCTGAAC GGAAAGCGAC CTTCTGGTAG GGATCAGCGC TGGGTTTGTT TCGGTCGCAT TGAGCAGAAT ATCCTAGAGG ATCGGTACCG CCATGTGATT AGAGAGGGTG TTACGGTTTT AGGCATTGGT GAAGAAAGGA AAGCTGGTGC GATGGAACCA AGCGGTTTGG TTAATGAGGC GGACAATCAG ATTTCTGCCA ATCATACTAC AAATAATTTG TCAAATGTCA CGTCTGCGGA GACCAATTGC GTAGAGTATC CCACCATCGC AAAATGGTAC GTGCCTAATG CTCAAACCGA TGTCTTTTTA GATCAGAGGC GTCATACTGT GTCGCTGTTC CTTTGCTGTC CAAAATGCCG AAACGAAATC GCAAGGTCCA TCCCGCCACT GGTGGCGAAA GAATACGGGG AGCTCTGCGA CTCTTGCAAT GAACAGGATG CCCAAATGCC ATCCGTTTCT ACTCTACTAG CTCCACCGCC AATTGGCGCA GTCATGCGTC CAACTTTTTG GAGGTTTCAT GGTCCGGGGG ACGAAGTTCG AAGAGCAGCA TGGATACTTG ATACTCCGAG GCACGGTCTG CAGCCTTTTG ATGAGGAGGC GCAATCTGTC TTGGAAGATG CCTATCTTTT CTTAAAATGG ATGTCCGTTC GCCAAGAGTT TCGTCATGAG TCTGATTTAG ACAGCGCTCT TCTAACAGGT ACATGGAGCT CTCGATACTT CGAAGGGGTA GTCCTTTGCG GCTCAAAACG AACTCATTTT GTTTGCTTTG TTTTCAAATA GTGGAGGTAC CTTGCCCTGA TGGGACGGAT AGACTTATCC AATTTAGTTC CTTGACTCAA GCTACAGCAA TTCAAAAAGG TATGCTGCCA GCCGTTGCCA TCTTCAAGAG AAGGGTATAT CGGGGAGCCT GGTTGCAAAA TTCTGCTGCA GCTCTATTAG AAACAAAGAC CCAAAAGCAG GAACTAGTGT CAGTTCAGGA ATCTATTTTG CAGGCAGTTC AAGATAATGG TGCTCTCGGC GAGACAATAG TACCCGATGC AGCGCTGCGG ACCGTTCTCT CTCCTGGCAA ACGTCAAGAT GAAAGCCGTC TCGTGCTCCA CACGAGCGGC GACGATCTTG CCGTTCCTCC CAGTAGACTT TGGGAGGAAG GATCATCTCC CGTAATGGAG AGCCATCCTC AGGCCGATGT CGATCACCTC ATCCTCGTCG TTCATGGAAT TGGGGAAATG CTACGTTCAA TAGATGTTTT TGGTCTTGCA ATGCCCAATC TCTCTTCGAT TGTGGATTGT TGTGGTTTTC TGCGGAAGAA TCACTCCGAA GTCCAAGATG CCCACTTTGC TCAGATGTAC CCAACGGCTG ACGCCACTTC AAGAGCCTCG ACTGGTAGAG TGGAGTATCT TCCTATTGAA TGGCATGAAT CTTTTTCTCT TTTATCTCAA CGACGGTCGA CTTCGGAAGC TACACCCAAA CACAATGTTA TGATCAAAGA CATCTCATTG CGAACGATTC CGAACATGCG AGAGTTCGCG AATGACACTC TGATGGATGT TTTATATTTC ATGAGTCCAG AACACCACGA TATGATCATG TCCATCGTAA CAAATGAAAT GAATGTTGTT GTTGAAAAAT TTGCTGCCTT AGCTGGTTTC TCTGGACGAG TATCTTTAAT TGGGCACTCC TTAGGATCCA TTATTTCATG GGACATCCTT GCTAATCAGT CGCTGGACAT ATTGGGGGAA AGTGCCAAGC AATCCTTACA TGGTGTGCCT TCAATTGAAA CCTTTGGCGG CACGGGGTTT TCTAACTACG GTAGTGCTAC TTCAGTTGGA CATGACGCAC CAGAGGTGAC GCAGCAGGCG ACTCGATTTG AAGGATTGAA GCCCTATCCA AAGCTCAGAT TTGCGGTTGA CAATTTTTTC TTACTTGGTT CTCCGGTTGC CGTCTTTTTG ATGATACGAA ATCAACGAAA GCCTTTGTGT GAGAATTATT TTCTTTCTGG GTGCAATCGG GTATTTAACA TCTTCCATCC ATATGATCCT GTCGCTTATC GAGTGGAGCC CTGTATTGAT CCCAGAAACG CGGACTTCGA GCCTACCATT ATGAAACATT GGAATGGTGG CTTTCGCGTT CAGTACCAGA CCAAGCGGCT TTGGAAAAAG TTTGTTGACT CAACTTGGAA GACACAGCAG AGTGTTGTTG AGGCATTTGA AGCAAGCATG GCTGGAATGG GTCTGCTTGA TGCGACAACA GACACATTCA ACGACGACGA TACTTCCGCC TCAGAAATAA GTTCAGACGA TAATCGAAGT ACCGCCAATG TCATCGCTGG AAAGTTAAAT CAAGGAAGGC GTATTGATTA TATGCTCCAA GAGAAAGAGA TCGAGACAGC CAATGAGTAC GTTGCCGCAC TGGCGGCTCA CAGCTCTTAC TGGATTGAAA AGGATCTTTC TTTGTTCGTT GCACGCCAGA TTTACCTCAG CACTCTTGAA CAATCAGCAG AGGCTGCTGA AGCCAGTTTG TGGGAGTCTA TTGGGAGTAA CTCTGTGTAG
|
Protein sequence | MGSPRRLIFE PLPPPPTSPD MDHDASPFLS QRYTTSRRRT HTFRRVQRVS RHAALTTKQH LWDSNDASID AAVAAVDQWE SAYNALRGLV VAGVQSAKGV YGGLKEGAGK IENGVLLPVR DWIILPAFFG VEHATAETAK FLQSEAAHQL AGQSLELVKK VPIVGDNVLA PAMYFSVGLV QRTWEIVQYP IPSKQQVRGS VELALNGTKW ALSTVGREVY LYFKRADANI TRTLMHTQWK VLGSGPYATL DKLNKSEVIN HLCERYFSLE DAISRYELAA HIKSHNVPLY HDLVVSGLLK DRGGGITEDD EWLSSWPVYR HLEDPFLIPE EEKLSGIEIS SLWFRLPYLN GKRPSGRDQR WVCFGRIEQN ILEDRYRHVI REGVTVLGIG EERKAGAMEP SGLVNEADNQ ISANHTTNNL SNVTSAETNC VEYPTIAKWS IPPLVAKEYG ELCDSCNEQD AQMPSVSTLL APPPIGAVMR PTFWRFHGPG DEVRRAAWIL DTPRHGLQPF DEEAQSVLED AYLFLKWMSV RQEFRHESDL DSALLTVEVP CPDGTDRLIQ FSSLTQATAI QKGMLPAVAI FKRRVYRGAW LQNSAAALLE TKTQKQELVS VQESILQAVQ DNGALGETIV PDAALRTVLS PGKRQDESRL VLHTSGDDLA VPPSRLWEEG SSPVMESHPQ ADVDHLILVV HGIGEMLRSI DVFGLAMPNL SSIVDCCGFL RKNHSEVQDA HFAQMYPTAD ATSRASTGRV EYLPIEWHES FSLLSQRRST SEATPKHNVM IKDISLRTIP NMREFANDTL MDVLYFMSPE HHDMIMSIVT NEMNVVVEKF AALAGFSGRV SLIGHSLGSI ISWDILANQS LDILGESAKQ SLHGVPSIET FGGTGFSNYG SATSVGHDAP EVTQQATRFE GLKPYPKLRF AVDNFFLLGS PVAVFLMIRN QRKPLCENYF LSGCNRVFNI FHPYDPVAYR VEPCIDPRNA DFEPTIMKHW NGGFRVQYQT KRLWKKFVDS TWKTQQSVVE AFEASMAGMG LLDATTDTFN DDDTSASEIS SDDNRSTANV IAGKLNQGRR IDYMLQEKEI ETANEYVAAL AAHSSYWIEK DLSLFVARQI YLSTLEQSAE AAEASLWESI GSNSV
|
| |