Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41071 |
Symbol | |
ID | 7198953 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 283358 |
End bp | 284469 |
Gene Length | 1112 bp |
Protein Length | 342 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185004 |
Protein GI | 219129666 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.762931 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCCCT CTCTGTATCT ACCAGCTCTT CTGTTTTTTT ATTCTATTAA AGCAGAGGCG TGGACAAGAG CGGCTCGGAT AAGTCGATAC AACGCTCGGC CTATTTCGAC GGGTGCTCTG CAAGCGGCAA GAAGCAATGG TTTTCCCAAA AAGCGCGGCC CACAACAATC CAAGGCATCT CCTAAATCCG ACGATAAACC GATTTATTCC ATGCCTGCTC TTTACGATTT AGCGTTTGGA TACAGATCTT ACGAAGACGA AGTCACATTT TTGTTACAAA CACATGAACG CTGCACAGGT CAGACACCAG CAAACATTCT GGAAATGGCC GCTGGCCCTG CTCGGCACGC GCTAACGGCC CTTCAACTAC ACGATGCGGT ACAGTCTGCG ACATGCGTCG ACCTTTCACC GGAAATGGCA GCTTATGCAA AAGCAATCGC AGACGAGGAG CTACCTGAAA ATAGAAAGAA TATGTTCACG TATATCGTGG ACGATATGCG CTACTTCCAG CTCGAAGATG CTTCGCAAAC TTTTGATACA GCGTGGATAT TACTTGGCTC TCTTCAGCAC TTGACTAAGA ATGAAGATGT TATTGCTTGT TTGACTCTTG TCAATAGACA TCTGGAGACT GGTGGGACAT TAATTGTTGA ATTGCCACAT CCACGAGAAA CATTTTCTAT GGTGGAATGT ACTCGAAATG GTTGGGAGGT ACCCCTCGAA GATGAGAACG GCGAAGAATC CGGCGAGCTT AGAATCGTTT GGGGAGACGA AGACGATGAG TTCAATTCCA TTACTCAAGT ACGGAACTTT ACCGTTGCGA TGGAATTGAC TGGTGTCGCT GAGACGGACA AACTTCAGAA CGTCCGCGAA GTGGTGAGTG CAATCAAATT TTTGCTCGCA AAATCAATAT CTCCATCGCG ATGGTGTCTC AGGTTGTAAA TTATTGCATT TTGCAGGTTC CTCTGCGTCT CTTTACTGCT CAAGAGATTG ATGCTCTAGC ACGATGTGCT GGCTTCGAGA TGGTTGAAAT GTACGGCTCT TTGGACGAGG AAGTCAAAGT AGATGACGAA GACCTTGCCT TTCGATTGGT GTCGGTTCTG CGAAAACTGT AA
|
Protein sequence | MTPSLYLPAL LFFYSIKAEA WTRAARISRY NARPISTGAL QAARSNGFPK KRGPQQSKAS PKSDDKPIYS MPALYDLAFG YRSYEDEVTF LLQTHERCTG QTPANILEMA AGPARHALTA LQLHDAVQSA TCVDLSPEMA AYAKAIADEE LPENRKNMFT YIVDDMRYFQ LEDASQTFDT AWILLGSLQH LTKNEDVIAC LTLVNRHLET GGTLIVELPH PRETFSMVEC TRNGWEVPLE DENGEESGEL RIVWGDEDDE FNSITQVRNF TVAMELTGVA ETDKLQNVRE VVPLRLFTAQ EIDALARCAG FEMVEMYGSL DEEVKVDDED LAFRLVSVLR KL
|
| |