Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_1946 |
Symbol | |
ID | 7204757 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 852725 |
End bp | 853882 |
Gene Length | 1158 bp |
Protein Length | 386 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185797 |
Protein GI | 219121134 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.48638 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAAGCATTG GCTATCTGGG ATCCTTGGCG ATTGCCGTCA ACAGTCTCGC CGGTCCGGCT GTCCTCCAGC TTCCTTTTCA GTACCAACGC TCAGGGTTAA TCCCTACAAC GATCTGTCTC GTTTTCGTAT CGATTCTATC CTACTTTGTA AGCCTCCATA TGGCAAACGT GGTTTCCCAG GTTCCAGGAA ATCACAAGTT TAAGCACTGC ATCGAATTTT CCGATCCATT TTCTATCTTT TGGGGTCCCC GCGCTTTCCA AATTACGCAA GTACTTTTCT TTCTGTGCAC GACCTGTCTC AACGTTGCCG CTATTGTTGA TACGGCTGAA GTCGTTGATT CCTTTCTGGG ATTGCGGTTC GAGTCCGTCG GCTTCAATGC ACAGACATTG ACTTTGCAAA CATGGTCGCA TGGACCATGT TCCCGGAAAG AGGTTAAGCT AGGATTATGT GACCCTTTCG GAGATGCCGA TGTGTACGGT GACTACCTCT TGACTCTTGG TTACCTACTG ACGGCACTGG TCTTCGTCCC GGTCTGTCTA ATGGATTTGA AAGAAAACAC ATCCTGGCAA ATCTTTGGGT TTGCAGTTTT GACAGCAACG TCCATTTACT TTTGTGCCAC TTTTTCCACC TACGAGTTGT CTCTCAAACA CGTTTCGTTG TGGGGATCTT CCTGGAAAGG CATGCTGGGA GTCATTCTGT TCAATTTCGC ACTCGTCCTG GCCATTCCAG CCTGGTTGCA CGAGAAAAAG GAGACCGTTA ATGTCAACCA GGTTATTCAA CATTCCACTG CCATGTCAAC GGGCTTGTAT ATTTGCGTCG GCATTCTGGG AGCCTTTGCG ATTCCGAAGG TGAACGTGAA TATGCTTAGT CCGATGGTTT CAGGCGCTTT CGGTTCCGGC ATTCAAATCG CCGGCTCGGT TTTTGCGTTC TTCATTATCG GCCTCGATAT ACCACTATTT TCCGTTCTGA CTCGGTACAA TTTGACACAT TCGGGAATGT GTAGCGAACG AACGGCAAAC ATCTTTGTGG TGTGGTTGCC CTGGTTGACC TCATGGTTGT GGTATCAAGG TGACGCTATC GGGAGTTTAT TGAGCTGGGG CGGCGTGCTA TTAACATCGG CCGTGGCGTT TCTGCTACCT CTGTATTTGG CGTTGCGT
|
Protein sequence | KSIGYLGSLA IAVNSLAGPA VLQLPFQYQR SGLIPTTICL VFVSILSYFV SLHMANVVSQ VPGNHKFKHC IEFSDPFSIF WGPRAFQITQ VLFFLCTTCL NVAAIVDTAE VVDSFLGLRF ESVGFNAQTL TLQTWSHGPC SRKEVKLGLC DPFGDADVYG DYLLTLGYLL TALVFVPVCL MDLKENTSWQ IFGFAVLTAT SIYFCATFST YELSLKHVSL WGSSWKGMLG VILFNFALVL AIPAWLHEKK ETVNVNQVIQ HSTAMSTGLY ICVGILGAFA IPKVNVNMLS PMVSGAFGSG IQIAGSVFAF FIIGLDIPLF SVLTRYNLTH SGMCSERTAN IFVVWLPWLT SWLWYQGDAI GSLLSWGGVL LTSAVAFLLP LYLALR
|
| |