Gene Information |
Locus tag | PHATRDRAFT_46016 |
Symbol | |
ID | 7201071 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 971322 |
End bp | 973148 |
Gene Length | 1827 bp |
Protein Length | 491 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180356 |
Protein GI | 219119179 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGCCGTATG GGAACTGTGT AGCGCATACT CGAAAGCTTG GCTCATATTT TCGTCCCATT ACAAGTTGTG TTCTTCTCTC TGACCTAGTG ACTCTAACAT TGGAGGGGTA TGTCATCTAC GCAAGAAGTT GGAGAGATTT TTGGGGCGGG CAAAGGCCCA GCTCACGCCA TGATGAGATC GCCGACTGTA CTTATTGCCT CCATTGGCCT GTGGGGCATG AACGTCTTTT TCTTTCGAAT TTTTGGAATC GACTACGTTA AGGTGCTTAA ACATGATCTC TGGCTCATCG ATGGAAACGG TGGAACGCCG ACGATCTTAT CATCGTCGGC AATACCAACA CAAGGTTTTG CACATAGGCC TCATCAACTT CCTCCTTCGT CCCAAGAGGA ATTGCTCGTA CGGCAAAGGA GGCTTGGAGA CGACGCATAC TCTTCCGACG AATCTCAATC GGACGAGGTC GCAGATGATG ATTCAGCTGT CGAAGTCACT TTGATCGTCG ATGGGAATGC TGTTACTTGG GGTCGTCTCG TCGGGCTTTC CCTCTGCCTG TTAATATTGC TGCATTCGAC GTACTAGTAA GTGTCCCTGC CATTTCATGA TATCATCCTT CAAAAGTTAA CTCAAGGTGC TCGCTATTTC AATTCCAGTT GCTGGATTGA TATGTGGGGT GGCGGGTCTA TAGGTGCCGT CTTTGCGTTT TACGCCACTG TTACCATCGC AATAGTCTTT CCTTTACCAA GCACACGTTG GTTGCGCAAA GCGACGGTCT TGGTGCTCCA ACGAGCGTTT GAGCTCGTCA ATCCGCGTTG TTCGTGTGTA AGCTTGGAAC AGAACACATG TCCCAGGCCA ATACCTTTCG TGGACGTATT TTTCGCTGAC GCCATGTGTA GTCTTTCCAA AGTCCTTTTT GATTGGGGAA TGCTTATGCA TATGGCTTCC CACTATCCGT ACCCCGTCCC CAAAGACATT CACCACATTG TCATCCCCAG TGTCTTTGCT GCCATTCCTT TTTTAATTCG TGCCCGCCAA TGCCTTGTCA TGTACACTGT CGGCCGGTTG CGGAACGATG CGCACCGTGC AGCTCACCTT TGGAACGCGC TGAAGTACTC TACATCAGTT TTCCCGCTGT GTTTGTCGGC CTACCAGAAG ACTGTCTCAG CCAAACGAGC ACTAGAATTG GAGCCGTATT TGATTGGGCT GGTCATTATT AATTCGACCT ACGCTTTATA TTGGGACATT GTAATGGACT GGGGTTTTTT CAAAAACCCT GGAGCAGCTT GTGTTGGCGG TATCTATCCG ATGGATCAAA ACCGCCCAAA ATCGTGTGGA CACGCTATTT TACGGCCCCG ACTTCGATTT GGTGTCGCAA TGTCCGTCTT AATTCTGACG GCCGATACCA TTTTGCGTTT CAGTTGGTTA CTTCGGTTTT ATCATACCAT CTTTCCCAGC GGTGATTCTT TCGCGATGTG CACCCAGTTT TTGGAAGTAT TTAGACGTGC CATGTGGAAC TTGCTGCGGA TCGAGTGGGA GAACCTAAAG CAGTCGACCA CCCCGCAGCC TAATTCCAAA ACAAAGGACG AAGAAATGGT CAAATTCTTA CCTAAAAGCG GGACCATTCC ACGAAAGATT GAAACGTCGG AGGTGAAAGA CGCCTGACCT CGATTGCAAT TCACTCTTCT TCACGAGGTC TCAGAGCAGT ATGAGGATGT TTTCCGTTGT TGCGGATGAG GCGTATCTTG AAGCATCTTG CCCAATTTGG GGGCAAAGTG TGAGCTCAGT GGCCATCTCA AAGCAATCTG 
TAGCGCTAGA GAGACGTACT TGCTTGT
|
Protein sequence | MSSTQEVGEI FGAGKGPAHA MMRSPTVLIA SIGLWGMNVF FFRIFGIDYV KVLKHDLWLI DGNGGTPTIL SSSAIPTQGF AHRPHQLPPS SQEELLVRQR RLGDDAYSSD ESQSDEVADD DSAVEVTLIV DGNAVTWGRL VGLSLCLLIL LHSTYYCWID MWGGGSIGAV FAFYATVTIA IVFPLPSTRW LRKATVLVLQ RAFELVNPRC SCVSLEQNTC PRPIPFVDVF FADAMCSLSK VLFDWGMLMH MASHYPYPVP KDIHHIVIPS VFAAIPFLIR ARQCLVMYTV GRLRNDAHRA AHLWNALKYS TSVFPLCLSA YQKTVSAKRA LELEPYLIGL VIINSTYALY WDIVMDWGFF KNPGAACVGG IYPMDQNRPK SCGHAILRPR LRFGVAMSVL ILTADTILRF SWLLRFYHTI FPSGDSFAMC TQFLEVFRRA MWNLLRIEWE NLKQSTTPQP NSKTKDEEMV KFLPKSGTIP RKIETSEVKD A