Gene Information |
Locus tag | PHATRDRAFT_12740 |
Symbol | |
ID | 7201359 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 382498 |
End bp | 383962 |
Gene Length | 1465 bp |
Protein Length | 459 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180426 |
Protein GI | 219119326 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.60383 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGCTCTACG GAGTGCCGAT TTCCGTGAAA GAACATTTGG CTTTGCGGGG TTCGTACTCT ACTGGAGGTC TCGCCTGTCG ATTGAACCAG AAAGATACTA AAGATTCCTT GATTGTGCAA GTAATTCGCT CCGCTGGGGC CATTCCTATG TGCAGTGGAA ATGTACCTCA GATCATGATG CTTCCGGAAA CGTACAATCG AATCTGGGGA CGCTCTCGGA ATCCTTGGGA TTTGTGCCGC TCCACGGGTG GATCGTCCGG CGGCGACGCC GCTCTCGTGG CCGCCAGGTG CGTGCCTTTG GCTATTGGCA GCGATGTGGC TGGTTCTATA CGCATTCCGG CTTCGTTTTG CGGTATTGTG GGATTCAAGC CGACGGCATA TCGCGTGTCG GGCAAGGGAA ATATGAAGGC TCGCAAAAAC AATCGCTCCG GAACGAGCGC CGTTATTCCC GTCGTGTGTG GACCTCTCGC CCGCACTGTC GACGACTGCG CACAATTTAT GAAAGCCGTT CTAGTACCAG AAATGTTCCA AGGTGACAGC AGCGTTCCGC CCCTACCGTT CGACGTGGAT TCGTACCAAA GTAAGGCGAA GCTAAAGATT GGCTACTTTG ATACCGATGG TTGGTTTGAA CCTTGTTTGA CTTCCAAACG CGCCGTTCGA GAAGCGATCG ATGCATTGAC CAAAGCAGGA CACACGTGCG TGCCGTTTAA ACTCCCGACC GACGGATGGA TCAGTTACGG TCTACTGGTG GCCATCAATG CTGCCGAAGG AAATTTTCGT TCTTTTGTTG AGGCTTTGGA AGGGGAGCAA ATGATTTCCG AGTATGATAC ATTACACCAA GCGAGCAACC TTCCGAACTT ACTCAAGCCG GTCATCATGG CTTTGATTGA TAAACGCCGG GGACACTTGC TGAAGCAAGG CCGAAATGGT GGTGTACCCG TTTGGGATCT GTGGCAGTCG GTCGCAAAAG TTCTCGAACT TCGTCAGAAA TGGGACAACG CGGTACGAGA GGCGGGCTTG GATGCAATCG TCCATCCTGC CATGCCGATT CCAGCAATCC AACACGGCTT GTCGGGTAAA CTTACAGCTA GCTGTTCGTA CATGTTTCTA GCGAATCTCC TACAGGTAAG AATGGCACTT TATATGTGTT GTTTTTTGTT CGTGAGAGTT CTCTGATCAA GTGCGTGCTC GTTCTCATTT TCTCGCTTAG TGGCCTAGCG GAGCTTTGCC CGTTACAACT GTCCGTGCGG ATGAAGCGCA CTATCGAAGG AAAGATATGC CGGCTGACCA ACGAGACATT ATCTCCAGGA TTGTGGCTCA AGTTATGCAA GGAAGCGAAG GAATGCCAAT CAGCGTATGC GTCATGAGTC CGTCCTATCG CGATGAAACA TGTTTGCGAG TTATGAAGGA AATTGAAAAA ACTATTCCCT TTCAAGAAGA ACCGAAAGCT TTTCTGAAAG CTTAA
|
Protein sequence | PLYGVPISVK EHLALRGSYS TGGLACRLNQ KDTKDSLIVQ VIRSAGAIPM CSGNVPQIMM LPETYNRIWG RSRNPWDLCR STGGSSGGDA ALVAARCVPL AIGSDVAGSI RIPASFCGIV GFKPTAYRVS GKGNMKARKN NRSGTSAVIP VVCGPLARTV DDCAQFMKAV LVPEMFQGDS SVPPLPFDVD SYQSKAKLKI GYFDTDGWFE PCLTSKRAVR EAIDALTKAG HTCVPFKLPT DGWISYGLLV AINAAEGNFR SFVEALEGEQ MISEYDTLHQ ASNLPNLLKP VIMALIDKRR GHLLKQGRNG GVPVWDLWQS VAKVLELRQK WDNAVREAGL DAIVHPAMPI PAIQHGLSGK LTASCSYMFL ANLLQWPSGA LPVTTVRADE AHYRRKDMPA DQRDIISRIV AQVMQGSEGM PISVCVMSPS YRDETCLRVM KEIEKTIPFQ EEPKAFLKA
|
| |
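The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence includes the terminal stop codon (the sequence shown ends in TAA). The function names are hypothetical. Under those assumptions, the bases not accounted for by the 459 codons plus the stop codon would be non-coding, which fits the record's "Is gene spliced | Yes" entry.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def implied_noncoding_bp(gene_len: int, protein_len: int, has_stop: bool = True) -> int:
    """Bases left over after 3 bases per residue (plus an optional stop codon).

    Assumption: the gene sequence covers the coding region from the first
    listed codon through the stop codon, so any remainder is intronic.
    """
    coding_bp = 3 * protein_len + (3 if has_stop else 0)
    return gene_len - coding_bp


def residue_count(spaced_sequence: str) -> int:
    """Count symbols in a sequence printed in space-separated blocks."""
    return sum(len(block) for block in spaced_sequence.split())


# Values taken from the record above.
length = gene_length(382498, 383962)
print(length)                             # 1465, matching "Gene Length"
print(implied_noncoding_bp(length, 459))  # 85 bases not explained by codons
```

`residue_count` can be applied to the Gene sequence or Protein sequence fields as printed, since it ignores the spaces between the 10-character blocks.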