Gene Information |
Locus tag | PHATRDRAFT_35540 |
Symbol | |
ID | 7200781 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 153813 |
End bp | 155477 |
Gene Length | 1665 bp |
Protein Length | 481 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179985 |
Protein GI | 219118425 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
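The length fields in this record follow from its coordinates: with 1-based inclusive positions, 155477 - 153813 + 1 = 1665 bp. The Python sketch below reproduces that arithmetic. The function names are illustrative and are not part of the database. The non-coding estimate assumes the gene span runs from the start codon through the stop codon with no UTR, which the record does not state explicitly.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return protein_aa * 3 + (3 if include_stop else 0)


# Values copied from the record: Start bp, End bp, Protein Length.
length = gene_length(153813, 155477)
cds = coding_length(481)

# Assumption: the gene span is start codon to stop codon, so the
# remainder is sequence removed by splicing ("Is gene spliced | Yes").
non_coding = length - cds

print(length, cds, non_coding)
```

Under that assumption, the difference between the gene length and the coding length is the total size of the spliced-out sequence.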
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0300403 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCTCTT CTTCTGTTCT TATCGGGATC TTCTTTTCTC TTATCGGCGT TTACGCAGAG GAAATTTCTC TGGAGCCCAG AGAGTCCTTT GGCAAGCAGA AGACACGAAA TGCTCAGGTA ATCCTCGCTA ATTATTCTGA AGCGGAGATG CTTGGCTTCA AGATCCAGCA CCAAGTGCAC AGCCAGGCCG ACCTATTTGT CCAGCAGCTA GTAGGAAAGT TGCTACTTGT CAAGAATGAG ATGATATCAC TTGAGAATCG CGACCTGGCC AATCTCGATA TCTTTACTTC TTACGCCAGT GTTGTCGCCG ACGGCACAGA GAAGAACGCC GCGTCCTCAC TGTTTGTGAG CAGACAGGAT CCGGCTATAA CGATTGCACT GGAATCCGAA GGCAACTTGA GGGAAGCCGT ACGCCTTGAC CCTGAAATCG GTAAAACAAT ATCCATTTCA CGCATTGATT CCCGAAAGGC GGATCGATTT GTTACGATCA CTGCAGAAGA CTTCGACCAA GACAAACTTG CTAGTTTTGA AGTAGAAGAC AGAGTAGCTC CATTAGCACA TCAACTCAGA AGCTCACACA AGAGCGGAAC AAGCAGAGAG AGGTCTCTCC AAGCCTGTCA GCTTTCGGTC CCAGTCCCGG AAGCAAAATT ATTCCCCGTG ACTCAACGGA GTCATGTTGC TTCGCCCCGC TCGATTCTCA AGAACTCGCT TCTATGAATA TTTCGTCAGA GACGAAACGG ACAGAAAACT CCGTTAACGA CCTTTCAGTG AATCAATCGG TCGAGAACGA TAATGTTGTA TCGGCTCACG CGACGACCAG AGAACCCAAA ACAGAATACA AGTATGAAAC ACGTGAAAGT CAGACTTTGG AGGGACAATG GAATATTATT GGAAAAAAGG GAAAGTCCAC GATTTCCCAC GATATCGGCA CGAAATCGTC TTTTAGTAGC TATAATCTGA ATTAACGACA GTTGTAAACC ATCTTTCGTT ACGCCTTGAG GCACTATTCT ATTGATTCTA CACTGGAGAA GATTGCTTTC TTCTGCCTTT GTACATAGCC ACTGGAATAT TAAGTTGTCA AAGTCATCGG TGCCTGCTCC GAATATGGTT TTGACGTAAT CGAAGTTGCT GTTGTGGTGG ATTCCCTTCT CTGTGCTGCC GTAGGTGGAA CTGAAGTAGC GGCTTCCACC GTTGCACAGT CTGTCATTGC AGGCGCCAGC CAATTCTACG AGGTTGACGG ACTTTGCAAG AAACTCCGCA TTTCGTATTT GGAAATTCAC TGCAATGCTG GTACCGATCC TATCGCGCCT TTGCTTCAAC GAGCAGGAAA CTCTGACATT TGTAATACCG ACGCAAATGG ATTATTACAA AACTTCGTGC GTTACACAGC AGATCAAGGT ATTGCCGCGG ACTTGAACCT TCTGTTTCAC GGCAAGTCAT TTACTGGCTC GCCATCGATC GGCTGCGCTT TCATTGGAGC GCTCTGTCGC ATTGATGGTG CGGATTCTGG AGTCAATGAG ATCACCTTTA CATCCGACCC CGTGTCCCGG GCCAAATTGG TCGCCCACGA GGTGGGCCAT ATCCTAAGTG CTGTTCATAT CAATGATCAA CGGGACATTA TGTTTCAATC AGTGTGCCCG ACTTGCAACA TGTTTGGAGC CATGA
|
Protein sequence | MISSSVLIGI FFSLIGVYAE EISLEPRESF GKQKTRNAQV ILANYSEAEM LGFKIQHQVH SQADLFVQQL VGKLLLVKNE MISLENRDLA NLDIFTSYAS VVADGTEKNA ASSLFVSRQD PAITIALESE GNLREAVRLD PEIGKTISIS RIDSRKADRF VTITAEDFDQ DKLASFEVED RLTQERNKQR EVSPSLSAFG PSPGSKIIPR DSTESCCFAP LDSQELASMN ISSETKRTEN SVNDLSVNQS VENDNVVSAH ATTREPKTEY KYETRESQTL EGQWNIIGKK GKSTISHDIG TKSSFIVKVI GACSEYGFDV IEVAVVVDSL LCAAVGGTEV AASTVAQSVI AGASQFYEVD GLCKKLRISY LEIHCNAGTD PIAPLLQRAG NSDICNTDAN GLLQNFVRYT ADQGIAADLN LLFHGKSFTG SPSIGCAFIG ALCRIDGADS GVNEITFTSD PVSRAKLVAH ECARLATCLE P
|
| |
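The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below is one hedged way to do that in Python. The helper names are hypothetical, and the short input is only the first three blocks of the gene sequence, not the full record. The full sequence can be pasted in the same way to compare against the reported gene length and GC content.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence from the record (illustration only).
first_blocks = "ATGATCTCTT CTTCTGTTCT TATCGGGATC"
seq = clean_sequence(first_blocks)

print(len(seq), round(gc_percent(seq), 1))
```

The same `clean_sequence` helper works on the protein sequence, where only the length check is meaningful.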