Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46383 |
Symbol | |
ID | 7201642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 172191 |
End bp | 173637 |
Gene Length | 1447 bp |
Protein Length | 454 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180771 |
Protein GI | 219120048 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.372126 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAAAGCCTT CGAATCAATT GGGTGCATCA GGGACTATCA AACAATTCTT TTGCTTGCTG TTTTCTTAGA TAGGCTGGTG AGATGCGTCA ACTACGAAAT ATAATACTTG TAGCCTCTTC GGCTTTGTGG GCAGGCCAAC AAATCTGCTT TGCATTCGCG CCGTTGTGCT GTCAAGCCCG TCGTTGGTCA ACCGCCCTAC CTAACAAAGG TCAGCGAAGA CAATTGGAAG TTTACTGCGC CCCTCCAACA ACAAGCACTC AAATTCAGGA AAGAGCACCA CTTTCCGATG CTGCCACTTG GCACCGCGAA AGGCGAAGGC AAATGTTAAA GAAGTATGGA CAACAGATTG CGCCTCTGGA GCGACAAGCC TCCAGCCAAG ATGTGGCTGT CCCTCTGCTC GCTTTAGCGA ACTTATCGCT ATTAGGAATG TCAATTTGGA GTGGATCACT GCCAATTGCT GGTGTTGTTG CTTTGGCTGC TTTTCCTGGG TCCATGTTCT CCCTATGGCA GTTGCAAATT CTGCATGATG TGTTACATGG TTCTTTGCTG AAAAAGGGAG TGAGTTCGTT TTGGGGAATT AAAAGGAAAA CACTACAGGA CCAGATTCTG TTTTGGGGGT CTATGCCCTC CGTATTTGGA TACTACCTTT ATTTAAAGTT CGGCCACTTG TCGCACCACA AGAATGTTGG AGATCCTCAC CAAGCTAGCC TATCGCAGTT GTTTGCTTCC GATCAAGTCG ACTTCGAAGA CGGGGACGTA CTCTTTGTAG CACACCGTAT GAATTTGAAG GGCGATATTG GCCCGGTGTT CAATATGCCC TTTGGCAAAA AGATCAAAAT GTCTATAAGC AAAAGCGGTT TTAACTCTTG GAGACAAGGC CATGCCATGT GGAATGCGAT CATGTTTACC GCGTCTTTCA TGTACGAACG TCTCATGCTC CTTCTGAATG ATGCAATAGT TGCTGGTACC GGCTACAACT TGTTCTTTCC CAATAAGCCT CAAATCTTTC ACGATGAATG TGCCAAGTAT GCTCGATGGG CAACTGCGCT TCGTGCAAGC TTGTGGATCT TCGCTGGATG GCAATCGTTA TTATTTCTCT ACCTGGCAGA GACGCTGTGG TCCATTCCAC CACACCCGGC TTGTGCAATG TTTGTGACAA ATCATCCATC GTCAAAAGAT GGAGAATCTG GGAAGTGCAT TCCGTCACAA TCAACATATG CCGGTGCCTG GTATTCAATC TTTACGTTGG GAACAAACTA TCATTGTGAA CACCATGACT TTCCAACCAT TCCATTGCAC AAACTAGGCG AGCTAAGAGA AATCGCGCCC GAGTTTTATC GTCATGGGTC CAACGATAAC TTGGCACAGG TAATGATCAA AGCCTTTGAC GATCCTGATT TTTACGCATG TATGGATACT GGAATTGGAT CAACTAAGCA AAATTAG
|
Protein sequence | MRQLRNIILV ASSALWAGQQ ICFAFAPLCC QARRWSTALP NKGQRRQLEV YCAPPTTSTQ IQERAPLSDA ATWHRERRRQ MLKKYGQQIA PLERQASSQD VAVPLLALAN LSLLGMSIWS GSLPIAGVVA LAAFPGSMFS LWQLQILHDV LHGSLLKKGV SSFWGIKRKT LQDQILFWGS MPSVFGYYLY LKFGHLSHHK NVGDPHQASL SQLFASDQVD FEDGDVLFVA HRMNLKGDIG PVFNMPFGKK IKMSISKSGF NSWRQGHAMW NAIMFTASFM YERLMLLLND AIVAGTGYNL FFPNKPQIFH DECAKYARWA TALRASLWIF AGWQSLLFLY LAETLWSIPP HPACAMFVTN HPSSKDGESG KCIPSQSTYA GAWYSIFTLG TNYHCEHHDF PTIPLHKLGE LREIAPEFYR HGSNDNLAQV MIKAFDDPDF YACMDTGIGS TKQN
|
| |