Gene Information |
Locus tag | PHATRDRAFT_14180 |
Symbol | |
ID | 7202815 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 251945 |
End bp | 253872 |
Gene Length | 1928 bp |
Protein Length | 545 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182036 |
Protein GI | 219123447 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.364389 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTATACGTAC CACCCGTCCT GGCCAAATGG CTACGACCTC ATCAACGAGA AGGTGTACAG TTTATCTACG AATGTGTCAT GGGTCTCAAA GACTTCAATG GTCACGGTTG GTACGTAACT TTGGGGCACG CTCGGACGAC TTGCTGCTTT ACCCTGGCCT CGGCGGCCCT GGACTGACAC CCTCTATTTT CTTCCGTTTG TCCTTGTTCG GGACAGTATT CTAGCCGATG ACATGGGTCT CGGCAAAACC CTACAGTCCG TCACACTCAT TCATACGCTG CTCAAAACTG GCATCACTGC CAACGGAGCC CCGACCGCGA AACGGGTCAT TGTGGTTTGT CCCTGCAGTC TCGTCAAAAA TTGGGAAAAC GAATTCGTCA AATGGCTCGG ACCGGGGGTA GTCAAAACGT TGGCAATCGC CGAAGCCGAC CGCAAAACCG TCGAACGCAA CCTGGATACC TTTGTGCGTA CCAAAATATT CAACGTGATG ATTGCCAGCT ACGAGTGCAT CCGGACCCAC GTCGGGCGTC TGTCCAAACA CGCCGACTGC TGCGATTTGC TCGTTTGCGA CGAAGCACAC CGCCTCAAAA ACAGCGACAA CCAAACCTCC CGTGCCCTCA ATTCCCTGCC GGTCCGACGA CGGGTGCTTC TGACCGGAAC GCCCATGCAG AACGACTTGC AAGAGTTCTA CGCCATGGTG GACTTTACCA ATCCCGGCAT ACTGGGAACG CCGGAAGAAT TCCGCCGCAA AACACTCTTC CCGATTCTGC GCGGACGCGA GCCCGACGCC TCCGACGCCC AAAAACACAA AATGATGCAA ATTCAAAACG ATATGAGTCG GATCGTCAAC GATTTTATTC TACGCCGGGT CAATACGCTC AACGCACAGC ACCTACCACC GAAACTCGTA CAAGTTGTCT GTTGCAATCT GACCGAAATT CAACAAAATA TGTACCAACA TCTCGTCAAC AGCAAAGACA TGCAACACGT TTTGGACGGC AAGCAAGTCA ACTGTCTTTC ATCTATTCAA ATGCTAATGA AGCTCGCCAA TCATCCCAGC CTGGTGGCAG AAGAAGACAA AAGCTTTGCG GTGGGCCCAA GTAACAAACG GGGTGGTAAA GTAGTCAAGT ACACGGACGA GGACGACGAC AAGGCTTCCA TGGCCGCACC GGGAGCGGAC GGCATTGCCA AATTTCTCCC GTATGTTCCC GGTGAAGGAG GCGGCCGACG GGGAGACTTT GCTCCCGTAC GCCCCGAATG GTCCGGTAAA ATGTTTGTTC TCTACCGCCT TATGAAGGAA ATGCGCAAAC CGGGCAACGG TAACGATAAG ATTGTCATTG TGAGCAATTA CACGCAAACG CTGGATCTCA TTGGACGCAT GTGTCGCGAA AATTCCTGGG GCTTTTGCCG TCTAGACGGA TCCATCACGA TGAAAAAGCG ACAGAAAATG TGTGACGAAT TCAACGACCC CAATTCGCCC CTCGTCGCTT TTTTGCTTTC CAGCAAAGCC GGTGGATGGT ACGTACATAA ACGTTCTCGT CTGTACTTGT CGTCCCATAC ATTTTCAGCG GAAGATTTGC TCACGGTTTT CTTGATGTTA CTCTATCGCC ACCAGTGGTT TGAACTTGAT TGGGGGCAAC CGTTTGGTCT TGTTTGATCC CGACTGGAAT CCTGCGGTGG ATAAGCAGGC TGCAGCTCGA TGCTGGCGTG ACGGCCAGAA GAAACGGTGC TTTACGTATC GCTTTTTGGC AACCGGAACT GTGGAAGAAA AGATTTTCCA GCGCCAGCTC TCAAAAGAGG 
GACTGCAGTC CGTGGTCGAC GACAAAGAAC AGACCAATCA GCTATCAACC AAAGACCTCA AGAATTTGTT CAAACTACGA ACGGGAACGC CTTCTGATAC TCACGACAAG CTCCGATGTG AACGGTGC
|
Protein sequence | VYVPPVLAKW LRPHQREGVQ FIYECVMGLK DFNGHGCILA DDMGLGKTLQ SVTLIHTLLK TGITANGAPT AKRVIVVCPC SLVKNWENEF VKWLGPGVVK TLAIAEADRK TVERNLDTFV RTKIFNVMIA SYECIRTHVG RLSKHADCCD LLVCDEAHRL KNSDNQTSRA LNSLPVRRRV LLTGTPMQND LQEFYAMVDF TNPGILGTPE EFRRKTLFPI LRGREPDASD AQKHKMMQIQ NDMSRIVNDF ILRRVNTLNA QHLPPKLVQV VCCNLTEIQQ NMYQHLVNSK DMQHVLDGKQ VNCLSSIQML MKLANHPSLA SMAAPGADGI AKFLPYVPGE GGGRRGDFAP VRPEWSGKMF VLYRLMKEMR KPGNGNDKIV IVSNYTQTLD LIGRMCRENS WGFCRLDGSI TMKKRQKMCD EFNDPNSPLV AFLLSSKAGG CGLNLIGGNR LVLFDPDWNP AVDKQAAARC WRDGQKKRCF TYRFLATGTV EEKIFQRQLS KEGLQSVVDD KEQTNQLSTK DLKNLFKLRT GTPSDTHDKL RCERC