Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50081 |
Symbol | |
ID | 7198679 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 336901 |
End bp | 338869 |
Gene Length | 1969 bp |
Protein Length | 569 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184865 |
Protein GI | 219129374 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTGTGTAC GTAATTCACA GTCACTCCCC ACCCAAAGGA CGTTTACAGT TCCACTGGAC ACACCACCGC GACGTTGCCA ACGGAACCGT GGATTCCCAA ACCCACCACC AACTACGAAC CCATGGCATC GTTCGGAATC ACCGCGGGCT CACGGACGTT CCTCCTTCCC ATCCCACCGG TGCGGCTCAT TCCGTGGCGT CGTGTCACGG AAACATTCAC GTCGACGACT GCGTCGGCAC ACGTGTCGTG GACGCATCCC GTTTGGTTTT GCGTGGCGGG ACGGCGCGGG ACGTCGACCT ATTCCTTGCG CCATTCGACC ACCGCTCCTT CGACGTGGGC CCGCAACGGC GCCCGATTGT ACGCACGGCA CGTGCTGGCT GCCGCCGCAG TCGCCGCGGC GACGGCGCGG CGGCGGACGC GTCTGGAGCA ACTCGTGGTG GTCGAGCCCC CGTCGGATTG GTTGCGGTGG GAAAAGAAAT CCTGGGCCGA CGAATTGCTC GCTAGTCTCG GACGGAATAA ACTGGAGCGG TTCGGGTCGG CCACGCGACG GATTGCCTCC TTGTTGGTAC TGGCCGCGCC TTTGACGCTC CTCGTGCCCT TGTCGTGCGT GTCGGAACGT GCCACGGCCT GGTCCTGGGC CTACGCGTTA TGGAGTATTG AACAGGCCGG ACCGACCTAC ATCAAGTTGG TGCAGTGGGC CACCACCCGA CAAGATCTGT TCTCGCCCGA GTTTTGTCAA TATTTTGGAA AACTCCGCGA TGAGACCACC GGACACGCCT GGCAAGCCAC GGTGGACACG CTGTTGGAGG ATTTGGGCAT TGGCGCCGAT TTTCTGCAAC TCGAAACGAA ACCCATCGGC TCCGGATGCA TCGCACAAGT CTACAAGGGA AAGTTGACGC AACCCTCCGG TCCCTATCCC GTTGGTACCG ACATTGCCGT CAAAGTACAA CATCCCGGAA TATGGGACAA GGTCTGCGTC GATTTTTACA TACTCGGCAA AGCCGCGGCC TGGTTGGAAC GCATACCCTA CTTGAATTTG TCCTACCTTA GTTTGGCCGA CAGTGTCCGA CAGTTTCGGG ACATTATGCT CCCGCAACTC GACTTGACCC TGGAAGCCAA TCATTTGCAA CGCTTCAATC GAGATTTTCG GGACGACGAT CGGGTGGCCT TCCCGGAACC CTTGAAGGAA CTTACCACCA CCCGGGTCCT CACGGAAACC TTTTGTCACG GGACTCCCAT TCTGGAATAC ACCAAGGCCC CTCCCAAGGT CCGCGAGGAA CTGGCCTATC TCGGACTCTC CACCACCCTC AAAATGATCT TTCTACACGA CTTTTTACAC GGTGACTTGC ATCCCGGTAC GTACGAGTGT GTCTGTGTCG TATGTTGTGT GTGTGTGCAT TTGTATTGGT ACAGTATATA TTGGTATAGT GTGTGTGTGT GTGTGTGTGT ACGCGTGTGG GCGCTTCTCA TTCCACGTCT CGCTTCGTTG CAGGCAACAT TCTCGTCAGT AACACCCCCA AGGGCGACAT TAAGCTGAAT CTGCTCGATT GTGGATTGGT GGTGGAAATG GGTCCGGAAC AACACATCAA CTTGGTCAAA ATCTTGGGCG CCTTTACGCG TCGCGATGGT CGTTTGGCGG GACAGCTCAT GGTGGACACC AGTAGTCACT GCCAGGCCAG TCCGTTGGAC GTCGAACTCT TCGTCAACGG CATTGAACGA ATAATTTTGG ACGACGCCAA GAACAATTTT GTCGAAAACG TGGGGGACTA CATTACGGAT ATCTGTTACA TGGCCTGCGT ACGCAAGGTG AAACTGGAAG CTTCCTTTAT CAACGCGGCG TTGGCGATTG AGATTATTGA AGGCATTGCC CAACAGCTAC ATCCGCAAAT CGTCGTGACG AAAGAAGCAC TGCCACTCAT CGTCAAGGCG GAAATGATGC ACCGGTTGCC CAAGTTTTCT CTCTGGTAA
|
Protein sequence | MASFGITAGS RTFLLPIPPV RLIPWRRVTE TFTSTTASAH VSWTHPVWFC VAGRRGTSTY SLRHSTTAPS TWARNGARLY ARHVLAAAAV AAATARRRTR LEQLVVVEPP SDWLRWEKKS WADELLASLG RNKLERFGSA TRRIASLLVL AAPLTLLVPL SCVSERATAW SWAYALWSIE QAGPTYIKLV QWATTRQDLF SPEFCQYFGK LRDETTGHAW QATVDTLLED LGIGADFLQL ETKPIGSGCI AQVYKGKLTQ PSGPYPVGTD IAVKVQHPGI WDKVCVDFYI LGKAAAWLER IPYLNLSYLS LADSVRQFRD IMLPQLDLTL EANHLQRFNR DFRDDDRVAF PEPLKELTTT RVLTETFCHG TPILEYTKAP PKVREELAYL GLSTTLKMIF LHDFLHGDLH PGNILVSNTP KGDIKLNLLD CGLVVEMGPE QHINLVKILG AFTRRDGRLA GQLMVDTSSH CQASPLDVEL FVNGIERIIL DDAKNNFVEN VGDYITDICY MACVRKVKLE ASFINAALAI EIIEGIAQQL HPQIVVTKEA LPLIVKAEMM HRLPKFSLW
|
| |