Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_55198 |
Symbol | |
ID | 7199249 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | - |
Start bp | 50287 |
End bp | 53582 |
Gene Length | 3296 bp |
Protein Length | 894 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | STT3 subunit-like protein |
Protein accession | XP_002185420 |
Protein GI | 219130538 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.835655 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCTTCGCAT CCGTATTGCC TTCGTCGTTG GTCACGCTTT TCTCTCGATC GATATCTCTT TCTCTCGATC GATACCTGCT TGTTTGCTTG TAGCGATTGT GAGGATTAAT TGGGCTTGTC AGTGTTGGTC ATACTGGGGT TACACTGTCC GCACAGGATG TCCCCGTCGA CGATTTCCTC CACGCGTCCG CCGGAACCCT CGTCCGACCT TTTGTCGTGG GTTTTGCTCG GAGGGACGGC TTACGCCGTG TATGTGATCC TGCACACGGC GTATCGCATC CGTATGGGTG CCATTGACGA TTTTGGCCTC GTCATTCACG AATTTGACCC CTGGGTACGT ACGGGTCAAT GGGCCTCGAG TTGCGCTCCA GACATGGCGC GACGGACTGT CCAAGCGCTA TTACCGACCG GCTTTGTTAC GGTGACGCTA ATCATGAATG GCACCGCATA GCTCACTCTC TTCTACTGCT TCTTTGCGTT TTTGTGGTCG AACAGTTCAA TTACCGCGCG ACGGAATATT TGTACTACAA CGGCATCAAG GACTTTTTCC AATGGTTCGA TTACATGAGT TGGTATCCAT TGGGACGTCC CGTTGGTACC ACCATTTATC CCGGTATGCA GTTTACCGCC GTCGCTATTA AGCGGTACTT GCTGGATAGT GTTATGAGTT TGAACGACAT TTGTTGTTAC ATACCCGTTT GGTTCGGCGT GATGGCCTCG TTTGTTACCG GCTCCATCGC CTACGAAGTT TCTATTCCCC ACAATACCCA TTCCTCCCTG CTCGGATTCA TTTGGGACAT TTACAAGGGT CAAAAGAGGA CGTACCAAGG TACCGACGTC GATCGCAGCG GCTCGACACC CCTACTCTTG GGATTCTGGT CTCCCGCTAT TGGCTGCTCG ATTTTCGCCA TGGCCATGAT GGCCATCGTT CCTGCCCATT TGATGCGCTC CATGGGCGGT GGATACGACA ACGAAAGCAT CGCCGTATTT GCCATGGTAC TCACCTTTTA TTGCTGGGTA CGCTCCCTGC GCAGTACTTC CGGTAGTGCC CACGCACAAA CCTACGTCGC ATGGAGCGTC GCCACGGGGT TAGCTTACTT TTACATGGTC GCCGCCTGGG GAGGCTACGT CTTTGTTCTC AACCTAATTG GCGTACACGC GGCCTTTCTC GTCCTCGCCG GACGCTTTTC TACCCAGACC TACGTCGCGT ACACGTTGTT TTACAGTATC GGTACCGCCT TGGCCGTTCA AATTCCGGTG GTGGGATGGG CACCACTCAA GTCACTCGAG CAACTCGGAC CGGGTGCCGT CTTTTTGGGA TACCAGCTTT TGTATGTTTG TGAAGTCCTC CGGAAACGGC AAAATTTGAC CCGGGCACAG GCGTGGAAAC TGCGTGTACA AATGTGTGCC ATCGCTGGGG CGCTTGTCAT GTTTGCTGCC TTTTTTCTAG CCCCCAAGGG ATACTTTGGT CCCTTGTCAT CGCGGGTTCG TGGATTGTTC GTTGCGCATA CCAAGACGGG CAATCCGCTG GTGGATTCTG TCGCTGAACA CCAGGCCGCT TCGTCGCGGG CCTATTTTCA ATACTTGCAT CACGTTTGTT CCCTGGCACC CGTGGGTTAC ATACTCGTGT TTTTCAACTT GAGCGACGCC AGTTCCTTTC TGATCGTTTG GGCGACGGCA GCCTATTTCT TTTCCCACAA AATGGTTCGT CTCATTCTAC TGACGGCGCC CATTGGGTCG ATTCTTGGTG GTATTACCGC TGGTCGTCTC TTTACCTGGT GCCTGCACCA GTGGTGGGAC ATCGTGGATG ATGATGAGGC CAATAAAGGT GAGTCACCCG CCGTGGTGAC TAAAACAACC AAGAAGGACG GCAAGGTCAC TATGAAAGCA ACCAAGGAGA AAAAGAAAAA GATTCCCAAA ACTGTCGGCA AATCCGATAA GTATACTTCG GAAAGCTTTT CTTCGTTTGA AGGATTAGCC GCCATTCAAG AAACAGCCAG ATTAGCTTTG AATACTACGG AAGGCATCCT TGTCCGTCGA AGCGTGGCGC TTGTCCTTTT GCTCATCGGA TACTTTCTGG GTGGAAGCTT TAATAATTAC AGTTGGCGGC TGAGTCAAGA CCTTTCCAAC CCGACCATCA TTATGCGAGC GCGCCTCCGT GATGGTCAAC TAGTCATGAT TGACGATTAC CGTGAAGCCT ATTGGTGGTT GAAAGACAAC ACTCCGGAAG ACTCCCGCAT CATGGCGTGG TGGGACTATG GTTACCAGAT TGCCGGTATT GCGAATCGGA CAAGCATCGC GGATGGAAAT ACGTGGAATC ATGAGCACAT TGCGCTACTC GGAAAAGCGT TGACGACCGG TGTCGAGGAG GGATACGAGA TTGCTCGTCA TTGGGCCGAT TACGTTCTGT TGTGGACCGG CGGTGGCGGT GACGACTTGG CCAAATCTCC GCATTTGGCT CGCATCGCCA ACTCGGTTTA TCGTGATCAC TGTCCCGACG ATCCTACCTG TCGGGCCTTT GGCTTTGTGG ATCGCGAGGG AACCCCGTCG GCCATGATGA AGCGCAGCTT TTTGTTCAAT CTTCACGGCC ATCAAATCAA GCCGGAAGCC AATGCACCCG CGGACAAGTT CCAGGAAGTT TTCCGGTCCA AGTACGGCAA GGTGCGGATC TTCAAGATCC TCGGAGTCTC GCAGGAATCC AAGGAATGGG TTGCCGATCC ATCCAATCGT ATCTGTGACG CTCCTGGTTC ATGGTTTTGT CGCGGTCAGT ACCCACCGGG ATTGAGTCGA GTTCTGGAAG GCAAAAAAGA CTTTTCACAA TTAGAAGATT TCAACCGGGG CGATCGGGAT GAAGAATACA CGCGTCGGTA TTTCGAAGAT CTGAAGGACC CGGACAGCGC CCGGAGAAAG GCCATGGCCA AAGAAATTGA ACGCAACAAA GAACAGGTCG ACGCCGAAGT ACAGGAGAAG AAACATGTCT CGGTTGACGA TATATATAAC ACTTGGGAAA ACACCGACGA CACAACACGC ATGTGGAACT TAATCAATTC AAACGCCGTT GAGGAATTAA AGGCATGGCT CGAAGCGGAG CCGCATAAAG CATATGTGAG GTCAGAGGAT GGGAGAGGAC CTATGTGGTG GGCTTTTGAG AAGCGCAACG AGGATGTAAC CAAGCTTCTC ATGAAGGCAG GAGTTCCTTA TACGGATCGT GACGGCAGCG GAAAAACCCC ATTGGATTTG CAACAGGGAG GCTAGATTAA ATTTTGCTTC GTTCGCTTTA AAATCGATTT TACAAC
|
Protein sequence | MSPSTISSTR PPEPSSDLLS WVLLGGTAYA VYVILHTAYR IRMGAIDDFG LVIHEFDPWF NYRATEYLYY NGIKDFFQWF DYMSWYPLGR PVGTTIYPGM QFTAVAIKRY LLDSVMSLND ICCYIPVWFG VMASGSTPLL LGFWSPAIGC SIFAMAMMAI VPAHLMRSMG GGYDNESIAV FAMVLTFYCW VRSLRSTSGS AHAQTYVAWS VATGLAYFYM VAAWGGYVFV LNLIGVHAAF LVLAGRFSTQ TYVAYTLFYS IGTALAVQIP VVGWAPLKSL EQLGPGAVFL GYQLLYVCEV LRKRQNLTRA QAWKLRVQMC AIAGALVMFA AFFLAPKGYF GPLSSRVRGL FVAHTKTGNP LVDSVAEHQA ASSRAYFQYL HHVCSLAPVG YILVFFNLSD ASSFLIVWAT AAYFFSHKMV RLILLTAPIG SILGGITAGP TKEKKKKIPK TVGKSDKYTS ESFSSFEGLA AIQETARLAL NTTEGILVRR SVALVLLLIG YFLGGSFNNY SWRLSQDLSN PTIIMRARLR DGQLVMIDDY REAYWWLKDN TPEDSRIMAW WDYGYQIAGI ANRTSIADGN TWNHEHIALL GKALTTGVEE GYEIARHWAD YVLLWTGGGG DDLAKSPHLA RIANSVYRDH CPDDPTCRAF GFVDREGTPS AMMKRSFLFN LHGHQIKPEA NAPADKFQEV FRSKYGKVRI FKILGVSQES KEWVADPSNR ICDAPGSWFC RGQYPPGLSR VLEGKKDFSQ LEDFNRGDRD EEYTRRYFED LKDPDSARRK AMAKEIERNK EQVDAEVQEK KHVSVDDIYN TWENTDDTTR MWNLINSNAV EELKAWLEAE PHKAYVRSED GRGPMWWAFE KRNEDVTKLL MKAGVPYTDR DGSGKTPLDL QQGG
|
| |