Gene Information |
Locus tag | PHATR_54178 |
Symbol | |
ID | 7204223 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 752089 |
End bp | 755096 |
Gene Length | 3008 bp |
Protein Length | 863 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | glycoprotein precursor |
Protein accession | XP_002186122 |
Protein GI | 219113077 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.148187 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGTCA GTTTTAATCT CGGGGTGACG GCTTTTTTCA CCTACCCTCA CCCAGATGAT GTAGAAACCG AATCCCTTGG TTTCTCAGAG CGATCGATGG ATGAGGATAT GTTTCATGAG GCCGAAAAAG GAGCTGACGA TATTTGGGCA TTATCGAACG CAAATGAGGA CGAAATAAGG TGGGTTTCTG GCTTACATTG TCTTTGATTT ACAGTTAGTA AACTTCCTTC ATCCTCTCAC GATAATCCCT TACTTTTAGA AGCGGGGTTG GAAGTTTCGG GAATTTGGCG ATGAAGGGGT TTGGTGCATT GGGGAATGCT GAACCATTAG GGGAGCATGA GAACTTTTTC GATATGATCG TAGATAATGA CAACAATGGC GCTTCGGGAG ATCCACCCGA CTTGGCTCCT TCTACTGGGT CAGATATGGC TGGGCGAGAC GTATCTCAAA ATACAATGTT TGTTGCTAAC GATGGTGTAA AAAAACTTGC CTCAGCTGTA GATGATGTCG CCAAAAGGCT CCTTGAGACA AACAACGACT CCGACGATGC TCCAGTAGCG TCAAGGCCTG ATCTATGGAA TGAAATAAGC AACAGTTTTC GCTTGGAATC TTTGAACACA AGTTTCAACG GAAGTTTTGC AAACATAGCA AGTGAGAGTC TTCAAGAGTC ACGAAACTTT GTGGGCGCGT ATTGGATGGG AAGTGATCAG TAAGTATGGC TGTTTGAAAC TGACTGTGAG TCTACGATGA ATTATTCATC TTCGTCCCTA ATGAATTTTT CACCGTGTTC GTGCAGGCAT AGCGCGATTA TAGCCCATAT GGCTACAGAG GCAGCCAAAG TCGCGGCAAA GCAAACCGCT TCTGGACTAA CAACTGCGGT GTTGACGACG AATAGCGCCG GTGTATCTAT TTTTTTGAAG ATAATAGTTG GTGTGAGCTT CTTCTTCTTT TCGGCTGGGA TCATTTTTAC CGGCGTATCG CTTCGCGACA GAATAGGACC AGAGGTTGTG GCCCAAGGCC TTAAGATACC TACAAATTTG ACCGAAAAGT TGAACAAGAC ATTGCCCTCA CGGCCAACAA GATCTCCAAC GATGTACCCT GCTGAGACCT ATTTTCCAAC ATCTTCACTT GGACCTTCCG TGCAGCAGGT GGATTCACGC TTTCCATCTC CTTCTATCAC CTCCCAACCC TCTATTTCAG GGGTTCCTAC TCAAAGTCAG TCGACTACAA ACAATTATAC TACATTAATG CCTACTCCAA GCACCAATCC AGAAAACAAT CAGGTAGTAG GTACAAACTT ACCAACGGTA ACGATGAGTC CGCTCGGACT TCAGCCAACT CGGCTGCCCA CTGTCCGCCA TTCATCGGCA GTGGAGATGG GAAGTCTTCC TCCAACAGGA CGACTCGTCT CGAATTCAAA CAGCCCGTCG ACATTAGAGC CGATAGCGAC CGAATCAACA CCAATGAATC CAAGCAATGC TCCCTCTGAC GAAGCGGCTG AAGCGATAGT ACCAACGACT TTGCCAGCCA TGAAGCCCTC TTCCTTACCA ATAGGCGAGA AAACCCTGGT GCCAAATGCA AGACCAAATA CGGTAAAAAC CCACGTTCCA GCAGCATTCA CAAGCACCGT TCCCTCAACT TCAACGCCCG GGGAAAACCT ACTATCCAAT GACAATCCAA ACAATACGCC TTCTCCGTCA TTGCAACCCA ACAAGTCATC ACCCGGCGAG CAAAGCAAAT CTCCATCAAA TCTCATTAAC ACATTGAGTC CTTCCCAAAG AGATCAACCT ATTTCGGTAC 
CAAACGCGCA GGACAGCCGG ACACCCTCTA TCGTGACCCC CAGCGGGTCG CCAAACGATT CCTCGTCAAA CAATACCCGA CTCGGGTCCT AGCTCCAGCT CTGCACCCAA CGCAGCCGCA AATGAGCCAT CTAGCATGCC GAGCCAGTCG CCATTACAAA GTCCGACTGA AAATCCTTCC GCCGCCCCAG ACTCGGGGCC TAGCTCCAGC TCTGCACCCA ACGCAGCCGC AAATGAGCCA TCTAGCACGC CGAGCCAGTC GCCATTACAA AGTCCGACTG AAAGTCCTTC CGCCGCCCCA GACTCGGGGC CTAGCTCCAG CTCTGCACCC AACGCAGCCG CAAATGAGCC ATCTAGCACG CCGAGCCAGT CGCCATTACA AAGTCCGACT GAAAGTCCTT CCGCCGCCCC AGACTCAGGG CCTAGCTCCA GCTCTGCACC CAACGCAGCC GCAAATGAGC CATCTAGCAC GCCGAGCCAG TCGCCATTAC AAAGTCCGAC TGAAAGTCCT TCCGCCGCCC CAGACTCAGG GCCTAGCTCC AGCTCTGCAC CCAACGCAGC CGCAAATGAG CCATCTAGCA CGCCGAGCCA GTCGCCATTA CAAAGTCCGA CTGAAAGTCC TTCCGCCGCC CCAGACTCAG GGCCTAGCTC CAGCTCTGCA CCCAACGCAG CCGCAAATGA GCCATCTAGC ACGCCGAGCC AGTCGCCATT ACAAAGTCCG ACTGAAAGTC CTTCCGCCGC CCCAGACTCA GGGCCTAGCT CCAGCTCTGC ACCCAACGCA GCCGCAAATG AGCCATCTAG CATGCCGAGC CAGTCGCCAT TTCAAAGTCC GAGTGAAAGT CCTTCCGCCG CCCCAGACTC AGGGCCTAGC TCCAGCTCTG CACCCAACGC AGCCGCAAAT GAGCCATCTA GCATGCCGAG CCAGTCGCCA TTACAAAGTC CGAGTGAAAG TCCTTCCGCC GCCCCAGACT TAGGTCCTAG CTCCAGCTCT GCGCCCAATG CAGCCGCAAA TGAGCCATCT AGCACGCCGA GCCAGTCGCC ATTACAAAGT CCGAGTGAAA GTCCTTCCGC CGCCCCAGAC TTAGGGCCTA GCTCCAGCTC TGCGCCCAAT GCAGCCGCAA ATGAGCCATC TAGCACGCCG AGCCAGTCGC CATTACAAAG TCCGAGTGAA AGTCCTTCCG CCGCCCCAGA CTCAGGGCCT AGCTCCAG
|
Protein sequence | MTVSFNLGVT AFFTYPHPDD VETESLGFSE RSMDEDMFHE AEKGADDIWA LSNANEDEIR SGVGSFGNLA MKGFGALGNA EPLGEHENFF DMIVDNDNNG ASGDPPDLAP STGSDMAGRD VSQNTMFVAN DGVKKLASAV DDVAKRLLET NNDSDDAPVA SRPDLWNEIS NSFRLESLNT SFNGSFANIA SESLQESRNF VGAYWMGSDQ HSAIIAHMAT EAAKVAAKQT ASGLTTAVLT TNSAGVSIFL KIIVGVSFFF FSAGIIFTGV SLRDRIGPEV VAQGLKIPTN LTEKLNKTLP SRPTRSPTMY PAETYFPTSS LGPSVQQVDS RFPSPSITSQ PSISGVPTQS QSTTNNYTTL MPTPSTNPEN NQVVGTNLPT VTMSPLGLQP TRLPTVRHSS AVEMGSLPPT GRLVSNSNSP STLEPIATES TPMNPSNAPS DEAAEAIVPT TLPAMKPSSL PIGEKTLVPN ARPNTVKTHV PAAFTSTVPS TSTPGENLLS NDNPNNTPSP SLQPNKSSPG EQSKSPSNLI NTLSPSQRDQ PISVPNAQDS RTPSIVTPSG SPNDSSSSSA PNAAANEPSS MPSQSPLQSP TENPSAAPDS GPSSSSAPNA AANEPSSTPS QSPLQSPTES PSAAPDSGPS SSSAPNAAAN EPSSTPSQSP LQSPTESPSA APDSGPSSSS APNAAANEPS STPSQSPLQS PTESPSAAPD SGPSSSSAPN AAANEPSSTP SQSPLQSPTE SPSAAPDSGP SSSSAPNAAA NEPSSTPSQS PLQSPTESPS AAPDSGPSSS SAPNAAANEP SSMPSQSPFQ SPSESPSAAP DLGPSSSSAP NAAANEPSST PSQSPLQSPS ESPSAAPDSG PSS
|
| |
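The coordinate and sequence fields above can be cross-checked with a few lines of code. The following is a minimal sketch, not part of the source record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates (which is consistent with the listed Gene Length of 3008 bp) and that the sequence fields are grouped into space-separated 10-letter blocks as displayed.

```python
# Hypothetical helpers for sanity-checking a gene record like the one above.
# Assumption: Start bp / End bp are 1-based and inclusive.


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def join_blocks(grouped: str) -> str:
    """Remove the whitespace that splits a sequence into 10-letter blocks."""
    return "".join(grouped.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # Values copied from the Start bp and End bp fields of the record.
    print(gene_length(752089, 755096))  # 3008

    # First three blocks of the Gene sequence field, used only as a demo input.
    first_blocks = "ATGACGGTCA GTTTTAATCT CGGGGTGACG"
    seq = join_blocks(first_blocks)
    print(len(seq), gc_fraction(seq))  # 30 0.5
```

To compare against the GC content field, the complete Gene sequence value would have to be passed through `join_blocks` first; the three-block input here only illustrates the calculation.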