Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_21513 |
Symbol | |
ID | 7202384 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 255069 |
End bp | 258441 |
Gene Length | 3373 bp |
Protein Length | 1035 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181518 |
Protein GI | 219122368 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.275387 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCCTTTTTC GACCAGAAAT TGGTTGTGTC ATCGCCTTCT GCGCTGTTGT TGCCCGACAG CATAGTAGAA GTCAACTCTG CTTGCTGTGA AAGAAACAAT TGTCTTCCTC GGACAGAGCA CTTTCTGGCT TTGATCGCCT CACGCACGCT CAAGCCATCT TACTATCCAC TCCCGTGTCA TGCCCAGCAT CCTCCGCAAG CCTTCCTCGT CATCGTCTCG TCCCCGTAGG AATAGTATGG TGACAATGTG GCTTTATTTT TCCACCGGCC GGACGGCTCT GGCGTTGGTA CCAACTATTA ATTTTAGCGG TGCACGGAGC CTTTCGCGAG AAATCCCTTC ATTTGGTGCG GCTTTTGCGA GGACATCTCA TCTTTCGACC CGCCGGTCTC GAGTCGGCCA AGCCTTTATG TCCACCACCG CCGACCCTGA TACATCGAAA ACGACCGAAA CCGACAGAGC CTACGATATA ATCAACAAGG AGCGTGGACT AGACAAATAC GAGCCTGCTT CCTTCGAGTC CGACATTTAT CGTTGGTGGG AAACTGCGGG TTGCTTCCAA CCCGACGCCA AGCAAAAGGC GATAGACGAC AACGACGACA GCACATCCAC CGCACCGTAC GTCCTTCCCA TGCCCCCGCC TAACGTGACG GGGCGCTTGC ACATGGGGCA CGCCATTTTT GTCGCCCTCC AAGACGTTCT GGCCCGCTTT CACCGCATGC GCGGTCGACC CGTGCTGTGG TTGCCCGGCA CCGATCACGC CGGTATCGCT ACGCAACTGC AAGTCGAAAA ACTTTTGATT GCGGAAGGAA CAACGCGAGA AGAAGTGGGT CGCGACGAGT TTTTGCGACG CGTTTGGATG TACAAGGAAG AACAGGGAGG ATTCATAACG TCGCAATTGC GGTCACTAGG GGCGTCGGCG GATTGGAGTC GGGAGCGCTT CACGATGGAT GACGATTTGT CGCAGGCAGT TGTCGAAGCC TTTTGTCGCC TACACGAGAA AGGTCTTGTA TACCGTGGGG AATACATGGT CAACTGGGCA CCTTTACTTC AGACAGCCGT TAGTGACTTG GAAGTAGAAT ACAGCGAGGA GGAAGGTAAA TTGTACTACT TCAAGTATAT GGTTGAAGGC AGTGAAGGTA CGTAAAGACA AGTGCAAAAT GTCTACAAAA TTAGCTGCTA TCTCACAAGT ATTTGGGTTT CAGAATTTAT ACCAGTCGCT ACGACGAGGC CCGAAACCAT TTGTGGAGAC ACGGCTGTTT GCGTGCACCC CGAGGATGAG CGGTATAAGC ATCTAGTTGG AAAAGCTTTG GTGGTACCAA TGAGCGGAGG TCGTACCGTG CCCGTGATTG CTGACGAGTA CGTGGATATG GAGTTCGGGA CGGGAGCGCT CAAAATCACT CCAGGTCACG ATCCCAACGA CTATACCCTC GGTAAAAAGT TTGATTTGCC GATTATCAAC ATAATGAATA AGGACGGTTC GATGAACGCC AATGCGGGCC AGTATGATGG TCTCGATCGC TTCGAGTGTC GTCAACAGTT GTGGACCGAC ATGGAAACCG AAGGTCTCGT AATAAAGGCC GACCCGCACA CGCAACGAGT TCCGCGATCG CAACGCGGCG GAGAGATAAT TGAACCTTTG GTAAGCAGCC AGTGGTTCGT CAAAACAGAA GGGATGGGCG CCAAAGCTCT GAAAGCTGTG GAAGATGGTG ACATCAAGAT AGTTCCGCAG CGCTTCGATA AAATTTGGAA TAATTGGTTG ACCGACATTC ACGATTGGTG CGTATCACGA CAGCTATGGT GGGGTCACCG TATTCCGGTC TGGTATGTTG GCGAGACAGG CGAAGACGAG TTTATAGTGG CGCGGAACGA GAAGGAAGCT CGTGAAAAGG CGGTGGCAAA TGGTCACTCC GCAGACGTTG TACTCCGACA AGAGGAAGAT GTGCTCGACA CGTGGTTCAG CTCAGGCCTG TGGCCGTTTG CGACGGTTGG CTGGCCTCAA AACGAAGGAG TCAAGGGTTC GGATTTTGAT CGCTTTTTCC CTGCTTCTTG TTTAGAAACA GGCTACGACA TCATCTTCTT TTGGGTAGCT CGTATGGTCA TGATGGGTAT TGAGCTCACC GGGAAGAGTC CATTCAGTGT GGTGTATTTG CACGGCCTTG TCCGTGCCGC TGACGGAAGT AAAATGTCCA AAACCAAAGG CAATGTGCTG GATCCTTTAG ATACTGTTGC TGAATTCGGC GCTGACAGTT TACGCTACTC TTTAGTTACG GGTGTTACTC CCGGACAAGA TATTCCGTTG AACATGGAAA AGATTGAAGC GAATAGAAAT TTTGCCAACA AGCTCTGGAA TTGTTGTAAG TTTGTTACGG GAAACGCACT CAAAGATCTT TCAGACGAGG ACTTGGCAAG TCTGGCCGTA TCCGGTCCAA TCGAGCAGGA AGAGTTCGAT AGCCTTTTGC TACCGGAGCG ATATATCATC TCAAAGTGCC ACACTTTGGT AGCAAGCGTT ACACAAGACA TTGAGAAATA TCAACTCGGA GCTGCCGGTA GCAAAGTATA CGAATTTTTG TGGGATCAGT TTGCCGACTG GTACATTGAA ATTTCCAAGA CTCGCTTGTA CGAGGGCGCC GGTGGGGGTG ACAATATTGA GGAAGCACAA GCCGCTCGTC GAGTTTTGGT GTATGTTTTG GACACCAGTT TGCGTCTGCT ACATCCCTAC ATGCCGTACG TAACCGAACA GTTGTGGCAC CACTTGCCTC GTGCCGACGC TGGCCCGGAC CAAGCTGCAC ACGCACTCAT GTTGGCGAAC TGGCCGCAAA TGAACGACAA CGTGCTGACC ACGAGCGAGG CCGCTGTGGC CCAATTTGAA TCTTTCCAGG CATTGACCCG AAGCGTGCGC AATGCCCGCG CTGAATATAA CGTGGAACCG GGCAAACGTA TTGCTGCTGT GATCGTGGCG CGCGGCAAAT TGAAACAAGC GATTGAAAAA GAGCTCAAAT CGCTCATTGC ATTGGCGAAA CTGGATCCGG AACAAACGCT AATTTACGAA GCAGGGTCGG AAGAAGCGAG ACAGGCAACG CAGGTGGAAT CAGTCCAAGT CGTAGTCCAG GACGGTGTAG AAGCCTTTCT GCCGCTGTCG GGATTAATCG ATCCGGAAAA GGAACGTTTG CGTCTCGAGA AACGCCGCGA GAAGCTGGAG AAGGAAATCC AAAAACTTGC AGGGCGCTTG CAGTCAAAAG GATTCGTGGA CAAGGCCCCC GCCGATGTTG TGGAGAAGGC CCAGGCAGAA CTGGCCGAGC TGGAGGATCA AGCTGGTAAG GTACAAGCTA GCTTGGAGAC TCTGACCCAA TAGTAAAAAC AGATTTTTAC TCC
|
Protein sequence | MPSILRKPSS SSSRPRRNSM VTMWLYFSTG RTALALVPTI NFSGARSLSR EIPSFGAAFA RTSHLSTRRS RVGQAFMSTT ADPDTSKTTE TDRAYDIINK ERGLDKYEPA SFESDIYRWW ETAGCFQPDA KQKAIDDNDD STSTAPYVLP MPPPNVTGRL HMGHAIFVAL QDVLARFHRM RGRPVLWLPG TDHAGIATQL QVEKLLIAEG TTREEVGRDE FLRRVWMYKE EQGGFITSQL RSLGASADWS RERFTMDDDL SQAVVEAFCR LHEKGLVYRG EYMVNWAPLL QTAVSDLEVE YSEEEGKLYY FKYMVEGSEE FIPVATTRPE TICGDTAVCV HPEDERYKHL VGKALVVPMS GGRTVPVIAD EYVDMEFGTG ALKITPGHDP NDYTLGKKFD LPIINIMNKD GSMNANAGQY DGLDRFECRQ QLWTDMETEG LVIKADPHTQ RVPRSQRGGE IIEPLVSSQW FVKTEGMGAK ALKAVEDGDI KIVPQRFDKI WNNWLTDIHD WCVSRQLWWG HRIPVWYVGE TGEDEFIVAR NEKEAREKAV ANGHSADVVL RQEEDVLDTW FSSGLWPFAT VGWPQNEGVK GSDFDRFFPA SCLETGYDII FFWVARMVMM GIELTGKSPF SVVYLHGLVR AADGSKMSKT KGNVLDPLDT VAEFGADSLR YSLVTGVTPG QDIPLNMEKI EANRNFANKL WNCCKFVTGN ALKDLSDEDL ASLAVSGPIE QEEFDSLLLP ERYIISKCHT LVASVTQDIE KYQLGAAGSK VYEFLWDQFA DWYIEISKTR LYEGAGGGDN IEEAQAARRV LVYVLDTSLR LLHPYMPYVT EQLWHHLPRA DAGPDQAAHA LMLANWPQMN DNVLTTSEAA VAQFESFQAL TRSVRNARAE YNVEPGKRIA AVIVARGKLK QAIEKELKSL IALAKLDPEQ TLIYEAGSEE ARQATQVESV QVVVQDGVEA FLPLSGLIDP EKERLRLEKR REKLEKEIQK LAGRLQSKGF VDKAPADVVE KAQAELAELE DQAGKVQASL ETLTQ
|
| |