Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_32339 |
Symbol | |
ID | 7196925 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 2239258 |
End bp | 2241705 |
Gene Length | 2448 bp |
Protein Length | 666 aa |
Translation table | |
GC content | 43% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176939 |
Protein GI | 219110375 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000123305 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCCTC ATGAAGTAAA AAAGAGTATT GCCTACATCA ATGGCCACGA AGTCCAAATT ACGCTGGATG CAGCCAAGAG CAGTGCACCA CCTAGTGTTA TTGGAAAAAT GCTCCATTTA CAAACTGGCA ACATCTTGTC GGACTCTTCT TTGCAACACA TAAGGATACA GTCGGGAAAA GAGCAAGAAA CAGTGTTGCT CAAGGATGAA GAATTCATGA CGCAGGCAGA CAAGCTTTTG TCCTATCTTG AGAACACTCC TGAGATCAGC TTTTGTGCTA TATACGATGA CCCAGATTCA CCTCTCTTCA CAGTTTACAA GCAGAGAGCA AAAAAAGACC GCAGGTGTCT ACCCACAAGT ATTAGAGTTA ACTCCGGGGT ATCTGCTGAA ACGGGAATTC TTGACAACGC TACACTAGAT GCAATGGATC CCAATGGAGA ACTTGATGAC TATGTTGACC GCACACGTCG CGCATTCAAG CTCCTTGGCT CACAAAAAAT GCTGTTAGGA GTTGCCTGGA CAAATAATGA GAGCAGAAAT GTATTCTCTC GTTTTTCGGA GATAATGGTA GCAGATGTGA CGGAAGGCAC AAACAACGCC AAGCGACCGC TATTTCTATT CTCTGGGAAG ACGTCAAACC AAAACACTTT CACAGCACTG TGGGCTTTTT TACCACAGCA GGCTCGTTGG GCCTTCCATT GGGTGTGGAC TTGATGCATA CCGCAACTAC TCCCAAAGTA AGGAATTCAA CAAATTCGCC TGACAATAAC AGATGGAGAT CCAAAAGAGT ACGGGACCTT TGTAGATGCA ATACCAACTT TCTACCCTCT TTGCGAACAC AAGCTTTGTC ACTGGCATCT ACTGTATCGC AGTAATCTCA TGAAGGTGCA GACTGGAAAA TGTGGAGTTA AAGCTACTAT TCTATTCCGT GTAGTTGTTC TTTGGATTGA GAGCTGGATG ACCAAAATTG AGACACAAGA GGAATACGAA CTTTCTAAAA GGCTCTTGGC TGATTGGCTT GCAACCCCCG AAGCTATTGA TGTCAAATTG GGTGGTATGG GGCAAACTAT TGTATCGCAA ATTAATGCGT ACATGACACT GTCACTTTTT CCTCATGAAC AGCGCTGGGC TAGATATCGC TATTTATACA CACGAGCATT CAACACATCT GCAAGCTCGT ATGCCGAGGC AGAAAATAGT GCTTTAAAAC GACGGGGCGA CGGGGTCAGG CCAAGCTTTT CCGTACCAAA AGCAACTCAG GTTATAAATG AAGGGACACA AATTAGGTCA AAGAAGAGGC ATCAAAAAGC TGTTTACAAT TTAAATGCTG CCAAGACGAG AAAGCCTGCC TACTACGCAA ACATTGGGGA TTTAGTGGAT TACATTCAAG ATTCTCTTTC CAAAGATTTT GAAGCAGCTG CTTCATTTGT GCTCTTCCGT CCAAATGCAG ACCAGTTTTG GGTCAAGCAA GCCACTCGCA AAAGCAAAAA CACGGACATT CGGAAAATCA ACGACAGTAG CTATTACAAG TATATGATTC CGCAGTTTGA ACGCACACGA ATTGTGGAGC TTGTTAATAT TGATGGTACA TTCTATTTGG TGTGTAGCTG CGGAAAATTT CAGCGACAAG CTTCCCCATG TGCCCATCTT TACAAGGTTC TTGGTCAATC ACCCACATCA ACCGATGTCT CTGTACGCTG GACAAAGCAC TGGGATGTGT ATTTGCACCG AAGTGGCCAC AGTGACCTGT CAAAGCATTT GGAAGACCTG TACAAACAGG AGCGACCAGG TCCAGTATTT GTTGATAGTG GTCAGTGGGT GATCGGAAAA GGTGAAAAAG GGTCATATTT TTTCGAAACT TCGCTTCCGT ACAAGCCCCC TGTCATACGA GATTTTAATC AATGGGCAGT GTCTTCGCAA ACGACTGGAG CTGATTTGAG TGGGACCAAA AATACCACAA ATATGTATTT TTCGAGTGGA ATGGTGCAAG AATCAACAAG CCTGTCCAGA GAGCATGCAT TCCAGGATTC ATTGCATGAA AATTCCACAA ATTCGCAAAA TTCGGATTGT TTTGATGATA CACAGGCAGT GACAACGGAA AGTGTATCTT CTACAAAAAG AATGTTTGGC TCAAGTGCTT TTGTGCACAA TTCCATTTCT ACCAGGAAAT GTCCAAGCTG GCAGGATTTG ATATGGAAGC TGCTGAGTCG ATGAATAAAG CTATGCAAGA AGCATTGGAA AAGGTACAAG CCAATGTTGC AAAAAGGGCT GGGAAGATGG ATTACACTAT AGGACCAGCC ATTACAAAGG ACCATGTAGG TCTAAGATTG AAGCCAAGCT ACAGTCCAAA GAAGAGGAGG AAACCAAACT TACAAAGGAA ATGAAATAAT TTTACTTACA ATTACTGATT TACATTAAAC AATACAGGCG TTCTTCAATA GTCTGTAA
|
Protein sequence | MEPHEVKKSI AYINGHEVQI TLDAAKSSAP PSVIGKMLHL QTGNILSDSS LQHIRIQSGK EQETVLLKDE EFMTQADKLL SYLENTPEIS FCAIYDDPDS PLFTVYKQRA KKDRRCLPTS IRVNSGVSAE TGILDNATLD AMDPNGELDD YVDRTRRAFK LLGSQKMLLG VAWTNNESRN VFSRFSEIMV ADVTEGTNNA KRPLFLFSGK TRLVGPSIGC GLDAYRNYSQ NGDPKEYGTF VDAIPTFYPL CEHKLCHWHL LYRSNLMKVQ TGKCGVKATI LFRVVVLWIE SWMTKIETQE EYELSKRLLA DWLATPEAID VKLGGMGQTI VSQINAYMTL SLFPHEQRWA RYRYLYTRAF NTSASSYAEA ENSALKRRGD GVRPSFSVPK ATQVINEGTQ IRSKKRHQKA VYNLNAAKTR KPAYYANIGD LVDYIQDSLS KDFEAAASFV LFRPNADQFW VKQATRKSKN TDIRKINDSS YYKYMIPQFE RTRIVELVNI DGTFYLVCSC GKFQRQASPC AHLYKVLGQS PTSTDVSVRW TKHWDVYLHR SGHSDLSKHL EDLYKQERPG PVFVDSGQWV IGKGSDNGKC IFYKKNVWLK CFCAQFHFYQ EMSKLAGFDM EAAESMNKAM QEALEKVQAN VAKRAGKMDY TIGPAITKDH AFFNSL
|
| |