Gene Information |
Locus tag | PHATRDRAFT_36253 |
Symbol | |
ID | 7201389 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 922936 |
End bp | 923985 |
Gene Length | 1050 bp |
Protein Length | 315 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180540 |
Protein GI | 219119566 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTCTT ACCGGAATAG TGATGGATTG GCTCCTTCTA GGAAACGGTT ACATTTTCGG GCCTTTGCGG ATAAGAACTC TTTTCTTTTT GGTATGATCC TGGTAGTAAG CTTGGCGAGA GCCTTTCCTA CGCTTGGTGC CAACGGTAGT TTCCTTCGTC CGGAGCTAGT GATCGCCCAG TTTGGCGTCT CTTTTATCTT TCTGTTGATG GGATTATCAA TGTAAGTTTC TCAGGTTAGC CAAGCTTTGT CAAACATAAA GCTTATCTCT CTTATTCAGT TCTGCACTTT CGGAGTCTGG CCGTTTTTAG TGGGAATTCC GCTGACAAAA GCATGTACAT GGCTTCTCCC AAATGCTTTG CCAAAGCCCT TGCTTGATGG CTTACTTATT CTATCATGTT TGCCAACAAC TGTGAATATG TGCGTCATAT TGACTTCGGC GGCGGGCGGC AATGTCGCAT CGTCGGTCTG CAACGCTGTA CTTAGCAATT TGATGGGGAT AGTTGTGACT CCCGCATTGC TCTTCCACTT TTTTGGATCC AGCATCCAGT TACCCTTTCT GGAGATGTGT CTCAAACTCT GTGGCAAAAT ATTGGTGCCG GTTGCTCTTG GACAATTGCT GCGCTCTACT ATCGCTGGCG AATTCTCTCA GAAATACTCG GGATTCTTCA AGCGGTCTCA GGAGTTTGCC TTGTTGAGCA TTCTGTGGAA CTCGTTCTGT ACAGCTTTTG TACAGGGTTT CGGAATCGAA ATTCGTCATA GCCTTACGCT ACTTGCGCTT TTGCCCACTT TGCACGTTGC GTCGCTTGCT ACCTTGTTCA AAATCTTTTC GATACCTGCT ATTGGCCTAT CTCGCAGCGA TGTTATTGCT GGCATGTTTG TGGCTTCGCA CAAGACGTTG GCGTTTGGTT TGCCGCTCAT TAGTACGGTT TTTGCGGGGG ATGTCAACCT TGCGGCCTAC TCTGCCCCTA TCATGTTATT GTTCCCCCTG CAACTCATTA TTGGTTCCTT GCTAGTCCCT CAACTAGAAA TTTACACTGC AAAGTTATAG
|
Protein sequence | MASYRNSDGL APSRKRLHFR AFADKNSFLF GMILVFPSSG ASDRPVWRLF YLSVDGIINF CTFGVWPFLV GIPLTKACTW LLPNALPKPL LDGLLILSCL PTTVNMCVIL TSAAGGNVAS SVCNAVLSNL MGIVVTPALL FHFFGSSIQL PFLEMCLKLC GKILVPVALG QLLRSTIAGE FSQKYSGFFK RSQEFALLSI LWNSFCTAFV QGFGIEIRHS LTLLALLPTL HVASLATLFK IFSIPAIGLS RSDVIAGMFV ASHKTLAFGL PLISTVFAGD VNLAAYSAPI MLLFPLQLII GSLLVPQLEI YTAKL
|
| |
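The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record: the helper names (`span_length`, `coding_length`, `clean_sequence`, `gc_fraction`) are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the reported protein length is assumed to exclude the stop codon.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with a stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a sequence (whitespace is ignored)."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Values copied from the record above.
gene_len = span_length(922936, 923985)   # Start bp, End bp
cds_len = coding_length(315)             # Protein Length in aa, plus stop codon
non_coding = gene_len - cds_len          # bases in the span not accounted for by the CDS

print(gene_len, cds_len, non_coding)     # prints "1050 948 102"
```

Under these assumptions the span length matches the reported gene length of 1050 bp, and the 102 bp not accounted for by the coding sequence are consistent with the record marking the gene as spliced. `gc_fraction` can be applied to the gene sequence field as pasted, since it ignores the spacing between blocks.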