Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43585 |
Symbol | |
ID | 7197315 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 882047 |
End bp | 885579 |
Gene Length | 3533 bp |
Protein Length | 1105 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177709 |
Protein GI | 219111915 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGTCGG ATAGAAAAGA ATACACAAAG AACGACGAGG AGCAAGGGGA CCAGGCGCAA GCTTTGCCTC CTTCCGAATA CTCGACGACC GAACGGAATC CACCGACACC AAATTGCGCC CTGCGTCAGG ACTTGTGCTC CGAAAACAGC TCATTAGTGT CCGACCTTAT TCAATCCGCT GCTCGGTCCG TCTTGGAAGA CGAGCACTAT GAGCAGGAAA GGAACGGTTC GTCGTGCTCG GGCTCTTACA CCGACATCGA CGGCGGTCGA AGAGGATACC ATACCTCTGG AATGACACCG AATTCACGGA AAAAGGGTCC ACTATCTCCA ACCTTGCAAT CCTTACAAAC TAAAACGCTA CCCGCCATGC CCGATGAACA AGATCGCAAG CGTTTCGTGG TGCGTACTTT TCCACATTCC ATTGTGCTCC CCTACAGGTG CAGGTTCTGA TCGTTTGGGG AGATTTTGGC TTACAAAAAC TCGGATCAAG TGTATTTCTC ATATACTTTT TGAGTTTGTC TCCTTTTTTG TTTCTAAAAT AGGGTTGTTT AGCAGCAGTT TTGGCGTCTC TTTACGATTT CGATGTGGTT GATGACGACG AAGACTTGTC TCAAGCCGAT AAAGTCCTGA GCATGGCATA CCTTGATAGA AGTGAGCAAG ACGACGAGGA CGAGCATAGT CTTAGTCCGA CCAAAGCAAG TCGTAACAAT ACAAAAGACA GCAGCAGCTT ATATTCGCGC TCAGTCGACG GTTTTGACAC ACCAGCCAGA GCGCAACAAT ACCGATCGAT GATTTCGTCT CGATCATCCA TGCAAATCGA TCGGGGCACC CGGGATCAGC TTCAAAAGGC TCGATCACGC CATCGCAAAC GGCGGTACGA TATATTGTCG GATCTTCTTT TGGCGTCAGG AGACTATTTG CAACTCGAAA GTGGCCAGGT CAAGGCTTTT TTACCTATGC TAGCCAAACT CCTGGTGCCG AACAGTGATA AAAAAGAGGC ATCGCATGCA TCTCAACTTT CAGCTCAACA TGGTCAGCGG CCGCACTCCA ATAGTAGCAA CACGAGCAGC AATTTGGCTG CTATGGACAA TCAGGAAAAG ACTATATTGC AACGCTCAGG GATCTCTGGC GCCAATGTGA ATGGCTTTAC AAATTCCGAA GAAATGGTGC ATCTTGAATT GGATGACATC GAGTATTTGC GGCCGTTCTT GGAATCACTG ACTCCTGGTG CCGGGTTGCG ATGTGTTGCG TTACTGCTCC TTCAGTATTT ATTGCTGCAC AGCCGTCAAA CGGGCTATGA CGCTCGTGTA CGACATGCCA TAAAAACGCT TGGTGTCCTG GTTCTGGTTC ATGACATGCA ACATGACCCC GTTGATGTGT ACATTGATGA TGAATTGAAA AAGCCTTCTT CTTCACCAAG GACTCGACGA CATCGAGGTC ATTTGAAGAC GTCGCATCCC GATTTGGTCG TACTGGCCAC CCGCAAGTTT GAATCGCTTG AACACTTTAT TGCAGCAAAA CTAATCGTGT TGTCACGTGA ACAGCAGGCA CATAAAGTTC ATAGGGGCGC CCGGAGTGCT GGTGCTCGCT CGTCTCAGAC TCAACAGACA CCAGCATCAA AAGGCCTGAC CCGGGAACAG TGGATGCGAG GGATTAAGAT TGGTGGCACG GCGATGGCAG CCGGTACCTT ATTTGCAATA ACGGGAGGAC TCGCCGCTCC AGGTATTGCC GCAGGGGTTG CGGCGATTGC GGGAGGGACG GCAGTGACAG CGGCCGCCGC GGCTGTCTTA ACAAGTACGG CAGCTGTGAC GACAATCTTT GGAGTGGGGG GAGGAGGATT GGCAGCGTAC AAAATGCAGC GGCGGACACA AGGTTTAACC GAGTTCGAAT TTCGTAAAGA AACTGGAAAG GCAAGTCGGG AGAAAGAGGG TCAAATAGAC ACAGTAGACG CTGAGCTGTT TAGTACAATC TGCATATCAG GTTGGCTCCG GGACAAATTC GATTTTCAAC GACCTTGGGG GGTCTCCCCA TCACGACCTG AGTTGACTGA TCGACAAGAG CTGTTGGAAA GATTCTACAC GATCCATAGT CCATCGCATA TATCACGTTG TGCCAAAATT TTGGACCATT GGAAAGGTGA AGAAAAGGAT CTTTGGGGTT TGCTCAGGCA AAAGTACGGG CAAGATCCAG ACCATTTATT TCCTTTGGAG AAAGGTCCTC GATTACACGC CTCGTTGACT CTTGAGCAGA AGGAGGTCAT AGATCAGTTG TTTGTAGAGC TGGGATACAC GCCCAAATCT CTGGACGAAA TAAAAACGCA GCCTACGCCT TTCGAAAGAA TTAGGAAGGG CTGGAATAAA CAAGCCGCTG GACCTCGACG CGATGAAAAT TTATCTACTT CACACATTCC TGTCGGTCCT GCACATCGAT CTCTTGCAGA TTCCTTACAA AGTCCTGAGA GTGTCGAGAC ATACGTTGGG TCTAGAGCTG AGGTTACATC GTCGGGATTT GAGAGCTTTT CTACTGCGCT GTCAATGCTT CCACCGGACA AGCGATCAGA TGAGTCGACA GAGAAAGTTG AGTTGCCAAG GCACATTGCT ACTGTTTGGG ACTATCCATC TATATATGGA GGGGAGCAGT ATACGGTACA ATGGGAAAGT GAACTGCTGA CTGAGTTGTG CGACTCTGTC AATGACCTTG CGCGAGATTT GGTAAGCGGT GGAACCGCTC AGATCTTAAA GCATACTGCT TTGTCAACGC TAATATCGGC CTTTGCTTGG CCGTACGCGC TTGTAAACGC CGCAAACATG ATTGATGGGA CGTGGACGCT AGCAGTTGAA CGATCCGATG AAGCGGGGAG AGAGTTGGCC AGAAGCTTGC TCCTCAGCCG GGCAGGCCAT CGTCCTGTTA CTCTCGTAGG ATTCTCCTTT GGCGCACGAG CAATCTATTC TTGCTTGAAA GAGCTCGCTC GCCTTCAGGA AAAATGGGAA GATTTTTGTG AAGACGAGGA TTCCTCTCGG AGCGGAAAAG TGTTGCAAAA CCAATCAGTC GCCGATTTAG AGTTAGACGA ATCAAACAAG GACTATTTCA GGTACATGCG AGAGCCGGCA AGCATAGTTG AAGATGTGGT ACTAATGGGA CTTCCAAACC ATCTTAGCTT ATCTTCTTGG AAGGCATGTC GCCAAGTTGT GGCCGGGAGG CTTATCAACT GCTTTTCTCA GAAGGATTTG ATCCTTTCAC TGATGTTTCA ATTCAAAAGG CTCGGGCTTA AGCCGGTATG TGGAACTTGT CCAGTTAACG TACCTGGGGT GGAGAATATT GATGTATCCG ATTTGGTATC CGGTCACCAG GATTACACTC TCGTTAACGG AGATATTTTG AAACGCGTGA GGCATTGTCA ACCTTTTCGA TCCAGGCACA CTCGTATATT TGTGCCGGAA GTCGCTGCAT CAAGCATGTA AATAAAAACT CTTGATGGAG TCGAGGCTCA GCGAAAAGTG CAAGTTTTAT TTCAGAGTAT TAAGCGAATC AAT
|
Protein sequence | MESDRKEYTK NDEEQGDQAQ ALPPSEYSTT ERNPPTPNCA LRQDLCSENS SLVSDLIQSA ARSVLEDEHY EQERNGSSCS GSYTDIDGGR RGYHTSGMTP NSRKKGPLSP TLQSLQTKTL PAMPDEQDRK RFVGCLAAVL ASLYDFDVVD DDEDLSQADK VLSMAYLDRS EQDDEDEHSL SPTKASRNNT KDSSSLYSRS VDGFDTPARA QQYRSMISSR SSMQIDRGTR DQLQKARSRH RKRRYDILSD LLLASGDYLQ LESGQVKAFL PMLAKLLVPN SDKKEASHAS QLSAQHGQRP HSNSSNTSSN LAAMDNQEKT ILQRSGISGA NVNGFTNSEE MVHLELDDIE YLRPFLESLT PGAGLRCVAL LLLQYLLLHS RQTGYDARVR HAIKTLGVLV LVHDMQHDPV DVYIDDELKK PSSSPRTRRH RGHLKTSHPD LVVLATRKFE SLEHFIAAKL IVLSREQQAH KVHRGARSAG ARSSQTQQTP ASKGLTREQW MRGIKIGGTA MAAGTLFAIT GGLAAPGIAA GVAAIAGGTA VTAAAAAVLT STAAVTTIFG VGGGGLAAYK MQRRTQGLTE FEFRKETGKA SREKEGQIDT VDAELFSTIC ISGWLRDKFD FQRPWGVSPS RPELTDRQEL LERFYTIHSP SHISRCAKIL DHWKGEEKDL WGLLRQKYGQ DPDHLFPLEK GPRLHASLTL EQKEVIDQLF VELGYTPKSL DEIKTQPTPF ERIRKGWNKQ AAGPRRDENL STSHIPVGPA HRSLADSLQS PESVETYVGS RAEVTSSGFE SFSTALSMLP PDKRSDESTE KVELPRHIAT VWDYPSIYGG EQYTVQWESE LLTELCDSVN DLARDLVSGG TAQILKHTAL STLISAFAWP YALVNAANMI DGTWTLAVER SDEAGRELAR SLLLSRAGHR PVTLVGFSFG ARAIYSCLKE LARLQEKWED FCEDEDSSRS GKVLQNQSVA DLELDESNKD YFRYMREPAS IVEDVVLMGL PNHLSLSSWK ACRQVVAGRL INCFSQKDLI LSLMFQFKRL GLKPVCGTCP VNVPGVENID VSDLVSGHQD YTLVNGDILK RVRHCQPFRS RHTRIFVPEV AASSM
|
| |