Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47941 |
Symbol | |
ID | 7203130 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 508097 |
End bp | 509817 |
Gene Length | 1721 bp |
Protein Length | 381 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182406 |
Protein GI | 219124218 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGAGATATAC TACCGCTGAG AGTGGTGATG TTCCCTTTAC GCTCCACCTT AACTGTAAAG CATTTGTCGC ATCGCGTACA CACTAGACAG GAAGTTGCTT GTGTACAAAA CCTTTGTTGT TCGACCCATT TACGGAGGCC TTTTCAGTTG TTCATGGTAG TTGCCGCTCC ATAGTGCGCA GAGCGCACGT TGAGGAGATG TCACGATAAC GCCCCGTTGG TGCAGGTTGA CTCCTTGATC TATAGTACTT ATCCGTGACG AGTTGCGTGT TTTTTGCGGA GTTGCTCTCT TTTTTGGGCT GTTCAAAGGT TCACCGTGTG CGGGTCTCCA AACTTTGCTT TACACGCCAT GATGAGTAAA CACTTTTACT TGTTTTCATT TTCGTTGTTG AGTGACATTC TTTGTTTGGA TCTCGACAAC AAGAATGGAA AGGTCTGGTT CAGTCGAAGA CTTTCCCCCT ACTTTCGTGG TAAGTATCCA ACGCGCGTGG CGTGGCTTCC GCAAATCAAC ATATTGTATA TTGTTGGCGT GGTTTGGCTC ATTGACATTT TGTTTCGTCC TGACTTGTGT TTCAGTCGTA TCGCTCTGAG TACCCTCGAG AAGAATCTTA CCGCTGCATG ATGAACAACC GAACTCCCAA TTCAGTCTCA CCTCCAGACT TTTTGAAGAT TCCTTGCTAT CGCCCGGTGC CGAGGAAAGG TTTCTCGTCG CATCATGATA GCTCATGTCA TCCTGGTTTT TATGCCGGGA ATCATTCAGC TAATGTACCG CACTACGCAA CCCAGGCAGC ACCATCATTC GATTCGACGG GTAGTAGTCA CGGAGGCCAC TATACTCCTC CTCCTCCTCC TCCACCAGTA CTCAGCAACA TCAATCCACA TTCTCATAGC TATCCTCCAC CGTACCACGG CGGTTACCAA TACTACGCTC CCTGGCCCAA CACGCCACCA CCCGAATACG TGACAGACAT TCAACCGGAA GATGTCCTTT CGGGACGCGG CGGCGCCACC AATTCGCACT CTGGTAACAG AGCCTTTCGT ACTCTCGTGA AAGATTTTCA GGAGCGATAT CTGAAAGCCA AGAAACGAGA CAAGCCGTCG GTGGCTTCGC TTGTCGTGGA ACTGGTTCGT CAAAAGGGCG GCCGCTTTCT CCGTAGGATG GGCACCGATT CCGATGGCCA GGTTTTGTGG ATAGACATTG GCGATGAAAG AGCTCGTGAG AAAACATGTC AGGCCCTGCG GGAAGGTGCC CCTTTGTTGC GTCGATCGAG GCACACACCG AGATCATTCG ACGACGTGGT GGATGCAAAA CTGCACGATT CAATAAAAGA GAATGACAGT TTTGAGACGC CGTCGAGCAC GGTACGTACC ACTCCAGCAA GAAATCTCGG CCACGGGATG GTACAGTCTT CCATCGTCCG TGTCGTCCAA GACAATGAAA ACTGGATGAA AGGGAGTATT TTTTCATCCA GCAAAGACCA CGACATTAAT GACGGTCCCA TTGTGATTCG ACCGATGCGT CGGCTATTGC ATCGTCGGTC AGTTGCTCCA ATCCCTTTGG ATCAACTATC TCCACAAGAT CGGGATTTAT ATCTGCGAGA CTTTTTGCCG CCGTGTCCGT CAATAGGCAA GCAGAGCAAT ATTGCTGCGG AGCCCACGGC TTCGCCTTCG CACCATCCCG TGGAGTACGT GGAGAAACCA AACCCTCGGG CTACTATATA G
|
Protein sequence | MERSGSVEDF PPTFVSYRSE YPREESYRCM MNNRTPNSVS PPDFLKIPCY RPVPRKGFSS HHDSSCHPGF YAGNHSANVP HYATQAAPSF DSTGSSHGGH YTPPPPPPPV LSNINPHSHS YPPPYHGGYQ YYAPWPNTPP PEYVTDIQPE DVLSGRGGAT NSHSGNRAFR TLVKDFQERY LKAKKRDKPS VASLVVELVR QKGGRFLRRM GTDSDGQVLW IDIGDERARE KTCQALREGA PLLRRSRHTP RSFDDVVDAK LHDSIKENDS FETPSSTSSI VRVVQDNENW MKGSIFSSSK DHDINDGPIV IRPMRRLLHR RSVAPIPLDQ LSPQDRDLYL RDFLPPCPSI GKQSNIAAEP TASPSHHPVE YVEKPNPRAT I
|
| |