Gene Information |
Locus tag | PHATR_46687 |
Symbol | |
ID | 7204417 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 192141 |
End bp | 194240 |
Gene Length | 2100 bp |
Protein Length | 626 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185649 |
Protein GI | 219120835 |
COG category | |
COG ID | |
TIGRFAM ID | |
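
The coordinate fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption that agrees with the listed Gene Length of 2100 bp). The function names `span_length` and `min_coding_length` are hypothetical helpers, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def min_coding_length(protein_aa: int) -> int:
    """Bases needed to encode a protein of the given length plus one stop codon."""
    return 3 * (protein_aa + 1)


# Values taken from the record above.
gene_len = span_length(192141, 194240)
cds_len = min_coding_length(626)

print(gene_len)            # 2100
print(cds_len)             # 1881
print(gene_len - cds_len)  # 219
```

The 219 bp difference is the part of the genomic span that does not encode the 626 aa protein. The record marks the gene as spliced but does not say how those bases are distributed.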
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAACACGAAA TCCCTCACTT CCTGAAACTA CAGGCGAAAC CTCGGACGAA TTCACATACG ATGTCCAAAA AAAGTTTCGC AGTTTGTGCT CTTCTCGCGC TCATCGTCGC TTGCCAGCAT TCCTTTCGGG TCACTGGATT TACTATTCCT CGTCAAGTTC CAGTCCGGTT TCGTACAGCT TCGGCTCCAT TCAGATCTAG AGCAGATGCA TTCGATCGTT CCATTCCTCA ATCGGCATCC CTTGCGACGG AAGGTATAAC GAGCTCGTCC CTCGGCCAAG TTGTCCTGCA AAAGATGGAG TCGTCTCGGG ACAAGGCTCT TGAATTCGCC GACATGTTTG CATTGACTCC GGCAGAAGCT GCCTTTTATA GCCTCTTTGC CGCTATTCGC GAAGCTGTTC CACTCGGTCT TAAAGGACAA CCGTTTGTTT TGCGACACGC AGAAATTGAA AAGGCCATGC ACCAAGAATC CCAATGGCCT GGATTCTTTA CCATGAAGGA TTTGGAAAAG GCCCTCAGCG ATGACTTTCT AGATGCAGCG CGAGGCTCAA CCGATAATCG CAAAGGATGG CAGGTAGGCA CAGTTTCCCC AAACACCAGC CGCTTCTGAT TGTTTCAAGT ATTGGCTCAC AATTAGCTGC CTTTCTAGAT CACGGATGTT TCGGTTCCAC GGGGGCAATC GTTCGAAGAA GCTCGCATGA CTTTTCAGGA CGTGCAAGCG GCATTGGAAA AAGGGACCGT CATCTTCAAC GCTGCCGGGG CGCACATCCC CAAATTAGCG GGTCCATCGT TGGCCTGTAC AGAATCCACT CTCCTGCCAT GCGCTCTGAA CTTGTACGTG ACGGATGCGG GCAAGCGCAC AAGTGCTCCG CCGCATACGG ACAAACAGGA TGTAGCGGTG GTTCAAACCT CAGGGCGCAA GCACTGGAAA GTATATTCTC CGCCTAATCC GGCCATGAAA CCAACCGTTG ATATCTTTGC CCGTGGGAAG GGCGATGACA GTTTGCCTCT ATACATACTG GAATCCGATT TGGGCTGTCA ATTGCTGTTG GAGACAACGC TAAACCCGGG CGATGTAATG TTTGTCCCAG CGGCCTTCCC GCATACAACG AGCACCGTGA CTGAGGACGA TAGCACACAC GCCGATAAGA CAAGTATTCA TTTGACACTG GGCATTGACC ATCACATTTG GGAGTTAGAT TATTTGTGTT GCCGTCGCTT GGCTCTCCGG CGGGCCAATG TGAAAGATAC TGCGCTCGGC CAAACAGGTG AAGAAGATAG TCCGTATATA GGGGCCGCCA ACGAAGTTAC AGCGCCGTTA ATCAACGATC TGTTTGCAGA GTTGCCGCTA GGTTTGCTTG GTGGTGTCGA CTACGCGGCT CCTGTTATAG AACACGTTGC CGCCGAATTG GAACGTATCA GCCGCGAAGT GGACGAAACA ACTGCGTCCG CTGTTGGTGC CTCCGTATGG CGCGAAGCAG TGGAAAGGCT TCGAACTCAA GGGATGGAAT TGTTGGACAT TCATCGCGAC ATGTATCTAG CTGCCCTAGA GGAGGGTCAA ATTCGAGACG AGGAAGCCGC TATGACAGCA CACTTGGGCC AAGCGGTCCG TCGCGCGATG ACTCCGGAAC GCATGCAGCG GCTTAGTCTT TTCCGCGTGA AACGATATTT TGATCAAATT GACGCGAGCA AGAAAGCTCT TCAGCAGTGG TCTTATGCTC GCGTCGAGCC AGAGGGCGAG GGGGATTTGG CCGATAACTG GGCTTTGACC ATGCCTGTAA AAGTTGGGGA CCAGGTAGAA 
GCCGATCTTG GAGGGGCTTT TTTCCCAGCT ACAGTCAGTC GAGCTTCGGG AGGCACCTTT GACGTAAATT TTTTTGACGG TGATCGAGAA ACAGGCTTGG AACGAAACCA AATCAAGCTA CTGGCTCCTC CAGCACCACA AGGTGATATA AACACCTCCA ACATGACACC GAAACAATTG AAACGATGGA AAAAGCAACA AGAGAAAACG AAGTAAAGCA CCAGCTAAAG ACTTAAAAAT AGGAAGAATG TAACATGTCT CAGGCTCTTG GGTTTGCACC CAATTGCAAG ACGCTCAAAA
|
Protein sequence | MSKKSFAVCA LLALIVACQH SFRVTGFTIP RQVPVRFRTA SAPFRSRADA FDRSIPQSAS LATEGITSSS LGQVVLQKME SSRDKALEFA DMFALTPAEA AFYSLFAAIR EAVPLGLKGQ PFVLRHAEIE KAMHQESQWP GFFTMKDLEK ALSDDFLDAA RGSTDNRKGW QITDVSVPRG QSFEEARMTF QDVQAALEKG TVIFNAAGAH IPKLAGPSLA CTESTLLPCA LNLYVTDAGK RTSAPPHTDK QDVAVVQTSG RKHWKVYSPP NPAMKPTVDI FARGKGDDSL PLYILESDLG CQLLLETTLN PGDVMFVPAA FPHTTSTVTE DDSTHADKTS IHLTLGIDHH IWELDYLCCR RLALRRANVK DTALGQTGEE DSPYIGAANE VTAPLINDLF AELPLGLLGG VDYAAPVIEH VAAELERISR EVDETTASAV GASVWREAVE RLRTQGMELL DIHRDMYLAA LEEGQIRDEE AAMTAHLGQA VRRAMTPERM QRLSLFRVKR YFDQIDASKK ALQQWSYARV EPEGEGDLAD NWALTMPVKV GDQVEADLGG AFFPATVSRA SGGTFDVNFF DGDRETGLER NQIKLLAPPA PQGDINTSNM TPKQLKRWKK QQEKTK
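
The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a blocked string could be normalized and its GC fraction computed is shown below. The helpers `clean_sequence` and `gc_fraction` are hypothetical names, and the short input string is a toy example rather than data from this record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace (spaces, newlines) from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Toy input, not taken from the record.
example = "ATGC GCGC AT"
seq = clean_sequence(example)

print(seq)               # ATGCGCGCAT
print(len(seq))          # 10
print(gc_fraction(seq))  # 0.6
```

Applied to the gene sequence field, the same two steps would give a value to compare against the GC content listed in the record, which is rounded to a whole percent.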