Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_20120 |
Symbol | |
ID | 7200453 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 874406 |
End bp | 875534 |
Gene Length | 1129 bp |
Protein Length | 339 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179931 |
Protein GI | 219118307 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AATCCCCGCG ACCGCGTCAT CAAGTACGCC GCGCTCTTTC TCCTCGTCGC CCAAATGGTA GGGCTCGTCC TACTCATGCG CTACTCCCGC ACCCAACATG ACGATACTCA ACCGCTCTAC TTGGCATCCA CTGCGGTCTT TCTTATGGAA GTTATGAAAC TCGTTATTTG TGTCGGTGTC ATTGCCGTCC AGACTAAATC GGGGGTGCTG CACGAACTCT ACACTCACAC CATCGGATCC CCTTTGGAAC TGCTCAAACT GACCGTGCCC TCCTTGCTGT ATACCGTACA GAATAATCTA CTATATCTGG CGCTGACGAA CTTGGACGCG GCTACGTACC AAGTGTGCTA CCAACTCAAA ATTCTTACCA CGGCTCTCTT CAGTGCGCTT CTCTTGCAAC GCAAGTTCTC CACCATGAAG TGGTTGTCGC TGGTTGTTCT TACGATTGGA GTTGCTATCG TTCAGCTTTC CGGCAGCGGT GACCAACATT CGGAACAAGA CAGCAAGGCC GCGACTGACG CTGTCGATGA TACTAATGGA ACCGCGGCGG CCCACACGCG TTGGGTGGGA CTCGTGGCCG TACTGTGCGC GGCATGTACC TCAGGCTTTT CTGGCGTCTA CTTTGAAAAA ATCCTTAAAG GATCCCGGAC GTCTCTCTGG ATCCGCAACG TCCAAATGGG ATTGTCCTCC ATCGTAATTG CGTACTTGAC GGTTTACGTC AAGGATGCCG AGGCCATTCG GACGCAAGGT TTCTGGGGCG GCTACAACAC TCTCGTGTGG ACCGTCGTCA CGGTCCAAGC CGTCGGCGGC CTCATCGTGG CTACCGTCGT CAAGTACGCC GACAACGTAC TCAAAGTCTT TGCTACCAGC TTTAGTATCG TCGTGAGCTG CATCGTGTCG GCGTTCCTGT TCGACTTTCA TCCGTCCGTA TCCTTTCTCG TCGGCGCGAG CCTGGTGGTC ACGGCCACCG TTATGTACAG CTCACCCGAG ACCCGGACAC GCAAAACGCG GCGAAGACCC GTTTTACCTA TTCACCATCG TAATAATACT GCAACCACCA AAAGTCGGGT CTAATCATTG CTGCTTAATG AAAAATTGAA GTTAAGTGGC ATTGCGTGTT AAATGACAA
|
Protein sequence | MVGLVLLMRY SRTQHDDTQP LYLASTAVFL MEVMKLVICV GVIAVQTKSG VLHELYTHTI GSPLELLKLT VPSLLYTVQN NLLYLALTNL DAATYQVCYQ LKILTTALFS ALLLQRKFST MKWLSLVVLT IGVAIVQLSG SGDQHSEQDS KAATDAVDDT NGTAAAHTRW VGLVAVLCAA CTSGFSGVYF EKILKGSRTS LWIRNVQMGL SSIVIAYLTV YVKDAEAIRT QGFWGGYNTL VWTVVTVQAV GGLIVATVVK YADNVLKVFA TSFSIVVSCI VSAFLFDFHP SVSFLVGASL VVTATVMYSS PETRTRKTRR RPVLPIHHRN NTATTKSRV
|
| |