Gene Information |
Locus tag | PHATRDRAFT_47737 |
Symbol | |
ID | 7202727 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 687269 |
End bp | 688668 |
Gene Length | 1400 bp |
Protein Length | 361 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181956 |
Protein GI | 219123281 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTTGTTGGGG GAGCATGGCG TCCGAAGGAG ATATTGACAC GGCTGCATCA GCTTCTGGTA CTACCGAGGA TCCCGACTCT GGCGAAGTGA TTGGTTCTGC CGCTCTGCGT CCCGTCTTCT TGGGGAATTT GGTCCCGAAT TACAGTACGG ATGAAGTGAC GACGCTCTTC GAACGGCCCA TGCTGCCAGC CGCTGCTGCG GAAGGCGCGT ATCGCCCCAT TCCCGTGGAT CGGATCGATC TCAAGCGGGG GTACTGCTTT GTCTTTCTCA AGGATGCTGC TACGCAAGCG GACAAGGAAC AGGCCGAGCG ATTTGTGTCC GACATCAACG GCATGTGAGT CTGTCTACCC GTTTTTTCGC AACGTACTTT CTACGTATTC TGACTACCGG AAGTTGCCTT GTGCGAATCT TTCTCTCGCT CCCGCACCTA TATCATCGTG TCTTCCCTCT CTACCCTCTC CTCCGTTCGG GCACACTCTA CTACGACACG TGGGAGAAGC GCGCCGTCTC CGGGCGCCGT CCGTCGACGC AAACTCGTCC CACCCAACCA ATCCGTGACT GACCTCTACG TTTTCCCTTT TTTCTTTCTT TCTTTCTGTT CTACCAATCG TCGCAGGCAA ATCGCCAACG TCTCCAACTC TCTCCGTGCC GAGTTTGCTC GTGGCGATGG TCGGGTCAAA CGCAAGGAAG ACGAACGGCG CAAGAACATT GCGCCCTCCG AAACTCTCTT TGTCGTTAAC TTTCACGAGG AAACCACCAA AAAGGAAGAC TTGCAAATGC TGTTCGAGCC GTTCGGGGAA CTCGTGCGCA TTGATTTGAA ACGCAATTAC GCCTTTGTGC AATTCAAAAC TATTGCCGAA GCGACCAAGG CCAAGGAAAC GACCAACGGA GGCAAGTTGG ATCAGTCCGT GTTGACGGTA GAGTACGTGG CTCGCGAACG CAACATGAAC GGTGGGGGCG GCGGCGGTAT GGATCGCCGC GATCGGGATC GTCGGGGACG CGACTACCGT GATCGCTACG ATGACCGTCG GGGTCCTCCC AACCGGGGCA TGCCCCCGCC TCCCTACATG GACGATCGAC GCGGCGGTTA CGACCGTCGC GGAGATCGAT ACGATCGTCC CGGATACGAT CGGATCGACG ACCGATACCG ACGGGAGCGT AGTCCCCCCG GCTATCGGGG ACGCCCGCGG TCCCGCTCGC GCAGTCCACC CCGACACTAC CGTTCGCGCA GTCCACCGCT GCGCTACGAC GAGCGCTACG ACGATCACCG CCGCCGTCCC CCGAGCCCGC CCCCCGCGGC GGATTACCGA GACCGCCGTG GTGGGGCTCC GAGTCCGGAT CGGGATTATC GTGGGGACCG TGACCGTGGC TACCGCTCGT AAACAAGTCG GACCAGTGCC
|
Protein sequence | MASEGDIDTA ASASGTTEDP DSGEVIGSAA LRPVFLGNLV PNYSTDEVTT LFERPMLPAA AAEGAYRPIP VDRIDLKRGY CFVFLKDAAT QADKEQAERF VSDINGMQIA NVSNSLRAEF ARGDGRVKRK EDERRKNIAP SETLFVVNFH EETTKKEDLQ MLFEPFGELV RIDLKRNYAF VQFKTIAEAT KAKETTNGGK LDQSVLTVEY VARERNMNGG GGGGMDRRDR DRRGRDYRDR YDDRRGPPNR GMPPPPYMDD RRGGYDRRGD RYDRPGYDRI DDRYRRERSP PGYRGRPRSR SRSPPRHYRS RSPPLRYDER YDDHRRRPPS PPPAADYRDR RGGAPSPDRD YRGDRDRGYR S