Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37808 |
Symbol | |
ID | 7202629 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 123615 |
End bp | 125171 |
Gene Length | 1557 bp |
Protein Length | 453 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181849 |
Protein GI | 219123058 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGTAACG CGTGTGATTT GCAACTCGAT TGGACCGTAA ATGGCGAAGA AAAGGACGCC GGTACGAATA GAACGAGAAA GACAGCAAAG CCTGCCCGTC GCCGCCGACG GCAAGTTGAA GCTACCCAAG ACAATTGCAA CAGCGCCAAC GAAGACACTG GAAGTCAAGG AAAAGCAGAA GTTGGGCAAG TTGGGCCGAT GCATCTCAGA AAGCATACGG AAGCTCGGAC AAGGCTACCG AGAAAAAACT CCGAAAAATT CGGAACAGAA ATAAAACATG CGGACATCGA CATTGACCAA AAGAGCAGAT GTCAATCTTT GCGACATGTA CAGGAGAAGG TCGTCTACCA CCACTTTGGT ACGGACGCAA TGAACCTATT GAATGTTGAA ACTGCTGATA TTCCGCAGCC TGAAGCACCT AACCATGTCG TCGTAAAAAT TCACGTATGT ACGATAGCCT ATTGAAGCTC CTTTGAAATG GTTCTGTATT ACTAATCCTT CTGCTTCTGT GCGTTTTTTA TCACAGGCCT CAACTGTATC CCTTGACGAT TGCATTTTGC GAAGGGGATT TTGTTTTGAT GTTACATCTC CTATTTCTCT TCCTGTCACG CCAGGTATGG ATTTTGTTGG TAAGGTTGTA GCATGCGGTT CCCAGGTAGA CGACTTTAAA AACGGCGAGT GGGTAGCCGG TCTGGTTCGT ACGGGGGGTA ACGCTCGCTT CATCAGTGTC GCTCAATGCA GTTTGGTACC GGTACCAAAA ATGCTTGACT CTGCCGAAGC GGTCTGCATG GTATCTACCT ATACTGCGGC CTATCAGACG ATGAGAGTAG TTGCAGATCG GAATACAGTA TTTTCGATGC AAGGAAAGAA AGTTCTTATC GTCGGGGGCA TGCATAATGT TGGTCAAGCT CTCATTGAAC TCTGCACGAA AGCTAAGGCT GAAATTTTTG CGACAGCTCC CGACCGACGG CATAGCTACA TTAGAAATAT GCTCGGTGCC ACTCCTCTAC CAGAGAGTGC TTCCGAATGG TCAACTATCG TAGATGGAGA AATGGACTAC GTCTTGGATG GCGTCTGCGA GGATGGCCTG GTTCCAGCAT TGCAAGCATT AAAACAAAAT GGGAAGCTTG CTTGTTTCGG ACATTCGTCT ATTCTTCGGG AGGAGATGGG CTTGTTCGGT ACACCACTTT CGGCTCGCTT CAATCGGTGG CGAGGAGACT TTGTCTCTGG TGGGAAGAGA ATCGATCTTT GGGAGAGCCA CCAAGCCGAT CCAGAACTGT ACAAGGTAAG TCCTCGACCG TGCCGAGTGG CGCGTCTAGA TTCAGAATGA TGCTCACCAT TTCCGTGTCA CTTTAGAAAA ATTTGCAATC ACTATTTCAA CTCCTGAAAT ATCAAAAGAT CAGGCCGCAC ATCGCCAAGC GAGTCATGTT GTCGGATGTG GCCGCCGTGC ATGCCCGATT GGAAAACGGA GACATCAGGG GTATTGTTGT CTGTCAACCG TGGAAAACAA GTCGTCCTGG AGCCTTTCAA AAATCCAACC TGGAAGAGGA AAACTAG
|
Protein sequence | MCNACDLQLD WTVNGEEKDA GTNRTRKTAK PARRRRRQVE ATQDNCNSAN EDTGSQGKAE VGQVGPMHLR KHTEARTRLP RKNSEKFGTE IKHADIDIDQ KSRCQSLRHV QEKVVYHHFG TDAMNLLNVE TADIPQPEAP NHVVVKIHAS TVSLDDCILR RGFCFDVTSP ISLPVTPGMD FVGKVVACGS QVDDFKNGEW VAGLVRTGGN ARFISVAQCS LVPVPKMLDS AEAVCMVSTY TAAYQTMRVV ADRNTVFSMQ GKKVLIVGGM HNVGQALIEL CTKAKAEIFA TAPDRRHSYI RNMLGATPLP ESASEWSTIV DGEMDYVLDG VCEDGLVPAL QALKQNGKLA CFGHSSILRE EMGLFGTPLS ARFNRWRGDF VSGGKRIDLW ESHQADPELY KIRPHIAKRV MLSDVAAVHA RLENGDIRGI VVCQPWKTSR PGAFQKSNLE EEN
|
| |