Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_46908 |
Symbol | |
ID | 7204449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 773284 |
End bp | 775641 |
Gene Length | 2358 bp |
Protein Length | 708 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185949 |
Protein GI | 219121451 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGACGCATTA TTCGTTCCAA AATAAGTTGT CCGCAAGGCA ATTGGTGCTT CCGTTTCGTC CATTCCATCC ACTCATACCT TTCTTCTGTG TTGCTAACGC ATCACACGTT GCTGATTCGG TATCACCCGA CGGAGTTCCG TTAGATACGA TGGAATGGCC CTACTGGACC TCCGCATCGC CACAAAAACC TTACCCCTCC TCGTTTCCAC CGTCCCGCGA CCTGCCGCCG CCTCCGTCCG ACTTTCCCAA CGATCAGGAA CAATCCAGTA GCGGTGGAGA TGATGCTATT GAAGGAACGT GGCTTAACAT CCGCCGCTTG AGCAACGTAT CAGAAGATGA CGAGGACGAC TGGTCGGAAG GTCCCCCGGC CAGCGGATAC GGAACGGACG CCTACGACGA CGGCTCTCGC AAGCAATCGC TCCGGCGATG CGGGTTTCGC TGCTCGCGAT ACCGACTCGC ACTAGCAACC CTGATTGTAC TCGCCATTGC TGCTGGCGGT GTCTTTATTG GCGCCATTGT CACCAAACGA AAAGACGACG GACCCGCCGC AACGAGTGTC GAGACGAATC CGAATACCCC GACCACGGCG CCCACCGTGC GAGTTACGTC GTCGCCCTCC GTCACCAACA GCGGCAACGG CAATAGTACT ACTGCTGGTG CATTGCAAGC CACTGTACAA CTCTCACTGG ACAGTGTGGA ACAACTCCTC TCCGACGAAG GGAAGGAATT TATGGGACTT GCCACCCTGG AATTCTTACG AGCGTCGAAC GCTCTCAACG ACGACGCCAA CACGACGCTA CAGCCGGTGT CCTACCGTTC CGTTACTGTA ACGGACCAAG CATTGATCAC GGGCACCGCC ACCGCCGCCG CTATTGAAGG ACCCGTCCGT ACTCGTGGAC AACAACGACA GTTGCAAAGG CCTACCAGTC TTCGTGTCGA GCTCTCGGTA CAGGGTGTAG CAGCGTTAGC CGACAATACA ACAAACCTCT CCGATGGGGC CTTCGGGTCG CAAATCGCCA CCGCTATCGC TACCAACCCA GAAGTTTATC GATCCGTGTT ACAGGCGGAT GGCAACGAAG TGTTTCAAAA TCTGACGACC ATTGGAGTGC GCGTTGCGGA AGACACCCTC GCTGCTGCCG TACCCAGTTC ATCGCCTATG AGTGCTTCGG AATCACCTTC GATTCTGGCC AGCAGCAATA GTCCCACGGT AACGTTATCT GCGGTACCCA CAAACTCTGT TAGCACGTCC CCCACACTTG TTCCGGAACC CAGTGCTTCG CCGAGCACTT TGCCTAGCGT TGCTGTAGCT GCTTCTACGC CGCCGAGTTT ACTGCCTGTG ACAGTAGCTC CGACCAACCT AGTCGTGACA GTGTCCCCCA CGGCCATTCC GCAAGCCCTC CCTGCTTCCC CGACAGCATC TCCTTCCATA CCCGGAACCA CGGCAGGCAC TCCGCGGGTG TGCAACGGCG TGGCCGGACT TTGTGACGTT CGCGTTAACG ATGCCGTCTT TGGGATGGTC CACAACGCCA TGTCGACGCA ACCCGACAAT TTTCTGTCCT TCAACCACGA AGACACCCTC GAAGACGCCC TGACGGCTGG TTTTCGAGGG ATCAATGTGG ACGTAGGAAT ATGCGATGGA CAAATTGTGC TTTTTCACGC GTTTTGCTTT CTGGGAACGC GAGACGTGGT CGACACCTTT TCGAATATTC ATAATTTCTT GACGCAAAAT CCCAACGAAG TACTGATTGT GTCGTTGCAA ATTGAATTAG TGGATCTACA GCAGTTGGCG AATCTTTTGG GAGGCGTTCC AGGCTTGACG GATCGATTCT ACGACCACGC TTTGGGAGCG GACTGGCCCA CCCTAGGCGA ACTGATCGAC GCCGGCACGA ATATTGTTCT GTTTCATTAC AACGGCCCAA GCTGCGATCA AGTAGTCTGT CCACCAGCCT TTTTGGATTA CTTTCGGTTC GTAGTCGAGA CGGAATTTAA TTTTCAAAGC TTAGCCGAGA TCCGAGACCA GTTCAACTCG TGCGCACTGG ATCGGGGCAG CTCGGGCTTT CGGAATTTCT ACGGCGTCAA CGTGTTCATC ACGTTACCCA ATGCCGCCGC GGCGGACGTT CTCAACACGG TAGGCTTTTT GAATCCGCAC ATATCGGCGT GCGAGGCCCA AACGTCCAAC CAGGTTAATC TGGTTTTGGT AGACTTTTGG AAACGTGGGA ATGTGTTGGA TTACGTGCGA TTACGGAATT TGCAACGCGT TGCCGTCCTA AACTAGTCTG GGACAACACC AGCCTAACCT GCTCACGTGT GTCTACCTAA TTGTGTAAAT AAAAACGGAA ACGTACTTTT TTGAATTC
|
Protein sequence | MEWPYWTSAS PQKPYPSSFP PSRDLPPPPS DFPNDQEQSS SGGDDAIEGT WLNIRRLSNV SEDDEDDWSE GPPASGYGTD AYDDGSRKQS LRRCGFRCSR YRLALATLIV LAIAAGGVFI GAIVTKRKDD GPAATSVETN PNTPTTAPTV RVTSSPSVTN SGNGNSTTAG ALQATVQLSL DSVEQLLSDE GKEFMGLATL EFLRASNALN DDANTTLQPV SYRSVTVTDQ ALITGTATAA AIEGPVRTRG QQRQLQRPTS LRVELSVQGV AALADNTTNL SDGAFGSQIA TAIATNPEVY RSVLQADGNE VFQNLTTIGV RVAEDTLAAA VPSSSPMSAS ESPSILASSN SPTVTLSAVP TNSVSTSPTL VPEPSASPST LPSVAVAAST PPSLLPVTVA PTNLVVTVSP TAIPQALPAS PTASPSIPGT TAGTPRVCNG VAGLCDVRVN DAVFGMVHNA MSTQPDNFLS FNHEDTLEDA LTAGFRGINV DVGICDGQIV LFHAFCFLGT RDVVDTFSNI HNFLTQNPNE VLIVSLQIEL VDLQQLANLL GGVPGLTDRF YDHALGADWP TLGELIDAGT NIVLFHYNGP SCDQVVCPPA FLDYFRFVVE TEFNFQSLAE IRDQFNSCAL DRGSSGFRNF YGVNVFITLP NAAAADVLNT VGFLNPHISA CEAQTSNQVN LVLVDFWKRG NVLDYVRLRN LQRVAVLN
|
| |