Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_28652 |
Symbol | |
ID | 7202486 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 282385 |
End bp | 283525 |
Gene Length | 1141 bp |
Protein Length | 314 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181691 |
Protein GI | 219122725 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.105937 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCAAAGTTG GAGCCGAAAT GCCGAAAGGG AGGCCCTCGT GGTTCAAAGT CCCGGCGCCT TCACAAGGTA CGCAATCTAC TTATAGCACT GCGTCGTGTA AATGTAACGC GCAACACCAC CGTCCTCAGC TTTGCCTTTC TGATTTACGT TACTTTATTT CATCGCATCT AGCCAAAGAC TCACGCTACG CACAAGTGAA AGACAGCTTA CAAAAGCTAG ACCTCCATAC CGTTTGTGAA GAAGCGCAAT GCCCGAACAT TGGCGAATGC TGGAACGGAG GTACGGGTAC GATCATGTTG CTTGGTGACA CATGTACGCG TGGCTGTATG TTTTGTGCCG TCAACACGGA CAGCAAGCCC CCTCCACCGG ATCCATTCGA ACCTTTTAAA ACGGCGGAAG CCGTGGTGGC GTGGGGCGTC GACTACATCG TGCTGACGTC GGTGGACCGG GATGATATTG CCGACGGTGG AGCACAGCAT TTTGCCCAAA CCGTCCAGCT CTTGAAACAG AACAAACCAA ATTTGTTGGT TGAATGCCTG GTTTCGGATT TTCAAGGCAT GTTGGATTCA GTAGAGACAC TGGCACTGTC TGGATTAGAC GTGTACGCGC ACAATGTGGA GACGGTGGAA CGATTGCAAC CCTTCGTGCG CGACGCTCGT GCTAATTATC AACAATCCCT TTCGACCCTG CAACACGCCA AAGTGGTCAA ACCTGAGCTG TATACCAAAA CTTCGATAAT GTTGGGCTTG GGTGAAACGG AGGAGGAAGT AACGCAGACA ATGACAGATT TGCGAGCCAT TGGCGTGGAT GTGGTGACCT TTGGCCAATA TCTGCGTCCA ACCGAACACC ATTTGTCTGT TGTCGAGTAC GTGACGCCGG AAAAGTTTGA CCACTATCGC CAAGTCGGAG AAAACATGGG CTTTAAATAT GTGGCTTCTG GCCCAATGGT TCGTAGCTCG TACAAGGCGG GAGAGTTTTA TCTGGAACAC ATGATCAAGA AGGAGCGAAC GGAAGCAGCA ATAGATATCG GCGAAAATGC ATGACATCTC TTGATACTTG TGTTGATTTA AAGCCTGATC TCTGGTCTTG TTATTAAACT CATGATAGCT TAACATATTG ATAACTTTGA AAAGTCGTTG G
|
Protein sequence | MPKGRPSWFK VPAPSQGTQS TYSTASYSRY AQVKDSLQKL DLHTVCEEAQ CPNIGECWNG GTGTIMLLGD TCTRGCMFCA VNTDSKPPPP DPFEPFKTAE AVVAWGVDYI VLTSVDRDDI ADGGAQHFAQ TVQLLKQNKP NLLVECLVSD FQGMLDSVET LALSGLDVYA HNVETVERLQ PFVRDARANY QQSLSTLQHA KVVKPELYTK TSIMLGLGET EEEVTQTMTD LRAIGVDVVT FGQYLRPTEH HLSVVEYVTP EKFDHYRQVG ENMGFKYVAS GPMVRSSYKA GEFYLEHMIK KERTEAAIDI GENA
|
| |