Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45947 |
Symbol | |
ID | 7200829 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 749920 |
End bp | 751656 |
Gene Length | 1737 bp |
Protein Length | 403 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180306 |
Protein GI | 219119079 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATTCATCAC CGGAACAAGT CCTGATCCTC TTCGGAAGGT AGGGAAGAAG AAGGTAGGCA GCGGGATAGA GACCGACAGT CCTTGTGCAA ATACACAACA CCCCTCAAAC ACAAGAGTCG TTTCCCACAC TCACAACCAA TATCTTTGTC CAACAGTAAG CAAATCCAAC AACACTGTAG TCGCTGTAGC AACCCGATCG AGGAATTCAT CATACCCTAC ACACTCCATG AGTCTTCCGT TTGCCCAAAA GGTGTACATC GTGGCGGCCA AGCGTACGCC CTTCGGAGCC TTTGGCGGTG CACTCAAGTC CGTCAGTGCG ACGGACTTGG GTGCGCACGC GACCAAGACA GCACTGGCGT CCGGCAACGT GGATCCGTCC CTCGTCGACG CCGTCTACTT TGGCAACGTC ATTCAATCCA GTCCGGATGC GGCCTACCTC GCCCGACACG TGGGACTGCA AGCCCAGTGT CCAATTGCCA CACCCGCTCT CACTATCAAC CGACTCTGCG GATCCGGATT CGAAACCGTC GTGCAAGCCG CCAACGGAAT ACGACTCGGC GAATCGCACG TTGCCGTAGC GGGAGGTACG GAAAACATGT CCGCCGCGCC CCTTACCTTG GACGGAAACG TGGCGCGATG GGGTGGAGTC AAATTAGGAC ACGGGATGAA GCTGGGAGAC GCTCTCTGGG ATGGACTCAC CGATAGTCTC GCGCAAACAC CCATGGGACA GACGGCCGAA AATCTAGCCA CCCAGTACAA CATTTCCCGA GCCGTAAGTA ATCGTGTAGG AGAATCCGGC ACCTCGTGAG AGCCTCTGGT TCCATACGGG ACGGACCGGC ACGGTGACAA TGCATACATA CATACACGTT GAGTCTCACC GGCATGAATT TTCGTTCACG GCAGGAATGC GACGAATACG CCATTCGCAG TCAACAAACG TGGGGTGCGG CACAACAGGC CGGATTGTTC GACGCCGAAA TGGCTCCCAT GGAATTACCC GGCCGCAAAG GCACCACCAC GGTCGTGGAC ACGGATGAGC ATCCCCGCGT CGATACCCTG CTCGAAAAGA TTGCCAGCCT CCGTCCCGTC TTTTCCAAAA CTGGCGTCGT TACGGCCGCC AACGCCTCGG GCATTTGCGA CGGAGCGGGG GCCGTCATTC TCGCTTCCGA ACAAGCCGTG CTCGAACACA ACCTCACGCC GCTCTGCCGG GTCGTCTCGT ACGGAATTAC CGGATGTGAA CCTACCGTCA TGGGTATCGG TCCCGTGGAG GCCATTCGGC AGGCCTTGCA CCGAGCCAAT CTCAAGTTGG CCGACATGGA TCGGATCGAA ATCAACGAAG CCTTTGCCGC CCAAGTCCTG GCATGTGCCA AGGAACTAGG CCTCGATTGG GACAAGACTA ATCTGCACGG TGGCGCTATT TCACTCGGAC ATCCCTTGGG CGCGTCCGGT TCCCGCATCG TCGCTCACCT TGCCCACGAG TTTGCCACCA ATTCCGCGGC ACAGTACCAC ATTGGCAGTG CCTGCATCGG AGGAGGGCAA GGTATCGCTG TTCTCATGGA GCGAGTGTAG TTGGCGAATA CGACCGTGTA TGCAAAGTTG ATTGTCAGGA AATTTACCAG GCATGTCACG GAGTATACAG ACGAACGGGC ACTGTACAGC CCATACTTGG CTAGTAGGTA GGTAGGTGTT TAATCAAACA CAAAGCTGTA AGTAAAACTG AATGTGAACG TGGATCG
|
Protein sequence | MSLPFAQKVY IVAAKRTPFG AFGGALKSVS ATDLGAHATK TALASGNVDP SLVDAVYFGN VIQSSPDAAY LARHVGLQAQ CPIATPALTI NRLCGSGFET VVQAANGIRL GESHVAVAGG TENMSAAPLT LDGNVARWGG VKLGHGMKLG DALWDGLTDS LAQTPMGQTA ENLATQYNIS RAECDEYAIR SQQTWGAAQQ AGLFDAEMAP MELPGRKGTT TVVDTDEHPR VDTLLEKIAS LRPVFSKTGV VTAANASGIC DGAGAVILAS EQAVLEHNLT PLCRVVSYGI TGCEPTVMGI GPVEAIRQAL HRANLKLADM DRIEINEAFA AQVLACAKEL GLDWDKTNLH GGAISLGHPL GASGSRIVAH LAHEFATNSA AQYHIGSACI GGGQGIAVLM ERV
|
| |