Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47853 |
Symbol | |
ID | 7203078 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 240966 |
End bp | 245693 |
Gene Length | 4728 bp |
Protein Length | 1314 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182354 |
Protein GI | 219124109 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCGGA GTGACCTGAC CGTAACTGAC CTCCAAGCAG TAAAATTGAG TCGATTATTG GAGGTCCAAA ACACAACCTC GCTGTCTGTT TCGCCACCAT CATTCCCAAA CAAGCAACTC AGCCAAGCGT AACAACTCTG CGTCCGTCGC AAAAGCTCTA CGTTCCATCA ATACATGTAC TGCATTTGCT GAGCCATCAC TAACAGTAAA ATACGCGAAA CGGGAATTGT TTTCTGAGCT TTGCCTGGAT TGCTTCCAGT ATGATACGCC GAAGAGGTCA ATCAGTCGTG TTCGAAGACG TGGGCGGTAT TGTCCACAAA ACGGCGCAAG AAAAGGCGGT AGCGTCCGCA TCAAAGCGAG GCTCAGATGG GGGAGGGAAG ATAGTTGACG ATGTTCAGGG TGTTCCTTTT GAAAACTCGA CAGCATCATC ATCGCAACTC AATTTATTTT CACGAAACAA AGCGCTTGTC TTTCTTATCC TAATTGTACT CGTTGGTGTT TTTGCCAGCA GTCTCTTTCT GGTTCTTGGT GTTCGATCCG CCAAAGAAGA AGTGGAAGAC AACTTTGTTC GGCAAGCGTC TGATGTGGTT CAACAAACCG AACGAGTTTG GGAGGATTAC CAAACACTCG CCATGTGGAT GCATCAGGCA TGCTACGAAC GGCAAATCAC ACGTCCCAAG TTTCGTGAAA TCCAAACGTA TATGAACGCT ACAGGTTTGG AATTTCAGGC TGTGTGTTGC GCACACAACA TCAGCTCACC GACGGAGAGG GCTGCGGCCG AAGCGGAGGC CAGCGCTTAT TATTCGGAGA ACCACCCTGA CATGGTTTAC GAGGGTATTA CGGGTCTGGA ACCGGATCCA GAAACGGGGC TTTTTTCACC ACGCTACCGC TCGCAACAAC CATTTTATTT TCCTGTTCGA TTCATCGAAC CGCTGGAAGG GAATGAGGCT GCTGTGGACT TTGATCTCTA CTCCGCTGAG GGTCGAGCGA GAACGATTGA TGCCGCAATA TCCACCGGAA AATCGGCTCT GACGCCACGA CTTCGTCTCG TGCAAGAGTC CGATCCGAGT GCCTACGGTG TCATACTCAT GCATCCCGGC ATTCCAGTCT CCCCGGAATT CAAATGGCCT TTTGATCTGG CGACAGTGGT GATTCGCATT CCGTCTCTTT TGAGCCGAGC AATTCGGTTT GAGTCGCAGA GCATCGCCGT CTACCTGTTC GACTCAACCG CAAAAAACTC CGATCCCGAG TTTATGGGAG GGGCTGCAGT AAATATGGTC GAAAATAAAC CGTCAGTGTC TTTTCTCGCG GAAACATCAC TAGCGGATCA ACGGAAACGG GGCCGTACCT ATGAAGACAC GATTGAGATG GTTTCCAATC AGTGGACAAT GATCGTCATT CCCCTGGATG GAATGTACGA AGAGAATTAT ACCTTTGTCA TTATTGGAAC CTTAATGATT TTTCTTTCGT GTGCACTGAT CGGGTTTTTG TCTGTGCATC AATTCGCCGC GTGGCGCATA TGAGTAAGGT CAAGTCAGAG ACTGAAGCTG AAAAGGCTGC TCTTCTGGTT GATAACGCCA AACAGTCTGC CAAAGCCGAG CGAGAACTTA ACGACTTTAT CGCGCACGAA GTCAGAAATC CTTTGTCAGC CGCCATGTCA GCCCTGAGCT TTGTTTCGAT GGAAGTGGAT ACCGATCCTC CTTTAGCTTC TGAGGAAGCT CGACAATCGG TGCAAGAAGA CCTCGATATC ATTAAAGGAA GTCTGCACTT CATCAACGAC TTACTCAGAA GTATGCTAGA TATGCATCGT GCGGCTAGTA ACCAATTAGT GATAGCGATG TCTCCAACCG ATATCAAGAA AGATGTCTTT GAACCTGTGG CTGCAATGAT TTACAATCGT AGAGAAAACT TTGAGGTTCT CATCGATTGT GCGGACAATT TTACGATTTT GGCGGATCGT TTGAGATTAA CTCAAGTCAT TCTGAATTTG GCAAGAAACT CGGCAAAATT TGTCACTACG GGGTACGTTC GGCTACGAGC AGGAGCTGTT GGCGAAGATA GAATTGTGAT TTCAGTGGAA GATTCCGGTC CAGGTATTCC AAGTGAGAAG CGAGGCATCC TATTTTCCAA GTTTCAGGAG AGTCTCGATT CCTTAAATCA GGGTACTGGT ATAGGTCTTT GTCTGTGCAA ACACTTGACT GAACTTATGA AGGGAGACTT GGCATTGGAC GAGACCTTTG ATAGCGGTAT TCCACACTGT CCTGGAACAC GCTTTGAGGT GAACCTTCGT ACGTCGATTG TAGATCTTGA TTCTAATTCA CTTGACTTAT ACGAGAAAAC GTCAAGGGAG GGAAGCCGCT TGCTTACCAA CAGTTTGCGA ACAGCCACTA CGTCAACGAA ATCACAGTCG ATGTCAAATT TCGTTCCTCA CAATGAAGCA GTCGCGACCT CCATTCACCA GCTTCCTCAG ACCATATCTG TTCTATTTGT CGACGATAAT CTTGTCCTTC GAAAGTTATT TTCGAGAGCC ATTAAAAGAG TAGAACCTGG TTGGATTGTA CAGGACGCCG CAAGCGGAGA AGCAGCACTG ACTATGGTAG AAAGCGACAC TTTTGATCTT ATTTTTGTTG ATCAATATAT GGCAAGCATG GAGAAGCAGC TTTTGGGAAC GGAGACAGTC GCAGCATTAC GCAGTAAAGG AGTAAAATCA AGAATATGTG GCCTTTCGGC TAACGACGTT GAGGTACAGT TTGTCACCGC AGGGGCAGAC TTTTTCCTGC TCAAGCCCAT ATCGTCGGAC AAAGATATTT TGACTGCTGA CTTACATCAT ATCCTATACG GGGCAAGGCA GTGGAAAGGC GAGGCAGGCT CTTCAAATGG AGACAGTGAT ATCGGAACTG CCTCAACAAA GACACCAGGC TCTGTCCTTG GAAGCGACGA CATGGTGTGA TCCATTTTTC ATTTGGATTC TATCGCAGTA ACGAAAGATT GTTCGTTGCG AAAAGCATGC ATGCGATTAT GCACACTAGC GTTGGTGTCT CCTTGGGGAG ACTTTACAGA TGACATTTAA TCGGAAATGA TAACTTCACA ATCAGAAAAC ATCTCGGTTG AGATTGCCGA AAAATCCCAA TCCGGCCAAT CTTTTTTTGA ATTTCTTTGG CTGGATTGTG TCGACCCGTA GAGTCCTTGA CATCTTATCA CCGAATTTAG AGGTATTAAC TGTAACTTCA CAGTCAATAG TCACGCTGGA AAACGGTCAA ATTGGAAAGC TACGAAAAAC TTGATTTTCT TTTCTGACGT TAACAGTAAA ACAGAGTTTT ATGGATTTTT CGCTCGCGTT CTGATTCGAT GTATTTCGCT TGAAAGTGAG TCCATGCTTG TTGGCTCGGT ATTTGCAGTT AGTCAATTTG ATGCTCACGC CACCTGTGTC GTCCCATTGT GTCCATATTC ATCCCTTTTC GACGCGCTAG TGCGCGGTAT AGGCGCAATT TTCACAGCTA TTAACCAAAC ATGAGTCACA CAGATAACAA TGAGAGCTTG ACGGAAGCCG AACGAAACCT TTCTTCACCA GGTAAAGCCC CTCTGTACAA GTTTGTCATG ACGGGAGGTC CTTGCGGAGG TAAAACAACG GCTCTTGCTC GAGTTTTCAA CTTTTTGCGC GAACGAGGAT TCGAAGTGAT TACCTGTCCC GAAGCCTACA CATTGTTGAT GTCTAACGGG ATGTCGGTAG ATTTCTTCTC GACACCCGGA ATGGGACGGA TCATCCAAAG TACTGTGCTG GATGTGCAAC TTAATCTAGA AGACAATGTA GCTCGAGTTT TGAAAGCGCG TGGGAAACCT GGCATCATCC TATGCGACCG AGGATCGATG GATGGTACGG TGTACGTGAC TAAGGAAGAG TTCCAAAAGG TTATGCAGGA ACGCGACACG GATGTTGTGC AGTTGCGCGA TAATCGCTAC GACGCTATTT TCCATCTCGT TACGGCAGCA GACGGCGCGG AACACGCCTA CACGTTGGAC AATAACAAAG TGCGCACCGA AAATGTGGAA GAAGCGATCG AAGTGGATCG CAGGACTCAG AAGGCATGGG TAGGACATCC TCATCTGTAC GTGCTTGACA ACGCGACTGA CTTTGAAGGC AAAATGAACC GCTTGATTGA TGTGATTAGT AATCTGGTCG GTCTACCATC TAATCTCAAG CGACGATCGG CCAAGTTCTT ACTAAAATCC ATGCCAGACA CCTATTCATT CCCGCCCGAT ATTGATCATC AAACCTTCGA AGTGGAAAAG GTTTACGTAC AACAAACTGG CCAAAAGTAC GATTATGCCT TTGTTCGTCG CCGCAGCAAC GTAGACGCGG ACAATAACTT GTTGGGTAGT GTCTATCAGC TCACGACGGT TCAACGTTTC GAAGAAGAAG TTATCGAACA GAAGCGTATA ATCAGTCAAC GTGAGTATGC TGCATTTTAC ATGACACGGT GTCCAGAACG ACACGTGGTT CGTCAAAAGC GAATTAGTTT CATTTACAAG CAACAGAGCT TTGTGATACA CATTTACGAA GAGCCAGTCT CGGACATATG TATTCTGCAC GCGCAAGTCG AAGCGTCCAA GGAAAAGGTG GACTTACCAC CATTCATTGA CGTAAACAGA ATACTGCTCA ATAGCAAACC GGATGAGGAG AAATACGGGG CATTCAGCCT GTCGCTTATA AACGGTTGTG GCATGTAG
|
Protein sequence | MSRSDLTVTD LQAVKLSRLL EVQNTTSLSV SPPSFPNKQL SQAMIRRRGQ SVVFEDVGGI VHKTAQEKAV ASASKRGSDG GGKIVDDVQG VPFENSTASS SQLNLFSRNK ALVFLILIVL VGVFASSLFL VLGVRSAKEE VEDNFVRQAS DVVQQTERVW EDYQTLAMWM HQACYERQIT RPKFREIQTY MNATGLEFQA VCCAHNISSP TERAAAEAEA SAYYSENHPD MVYEGITGLE PDPETGLFSP RYRSQQPFYF PVRFIEPLEG NEAAVDFDLY SAEGRARTID AAISTGKSAL TPRLRLVQES DPSAYGVILM HPGIPVSPEF KWPFDLATVV IRIPSLLSRA IRFESQSIAV YLFDSTAKNS DPEFMGGAAV NMVENKPSVS FLAETSLADQ RKRGRTYEDT IEMVSNQWTM IVIPLDGIKV KSETEAEKAA LLVDNAKQSA KAERELNDFI AHEVRNPLSA AMSALSFVSM EVDTDPPLAS EEARQSVQED LDIIKGSLHF INDLLRSMLD MHRAASNQLV IAMSPTDIKK DVFEPVAAMI YNRRENFEVL IDCADNFTIL ADRLRLTQVI LNLARNSAKF VTTGYVRLRA GAVGEDRIVI SVEDSGPGIP SEKRGILFSK FQESLDSLNQ GTGIGLCLCK HLTELMKGDL ALDETFDSGI PHCPGTRFEV NLRTSIVDLD SNSLDLYEKT SREGSRLLTN SLRTATTSTK SQSMSNFVPH NEAVATSIHQ LPQTISVLFV DDNLVLRKLF SRAIKRVEPG WIVQDAASGE AALTMVESDT FDLIFVDQYM ASMEKQLLGT ETVAALRSKG VKSRICGLSA NDVEVQFVTA GADFFLLKPI SSDKDILTAD LHHILYGARQ WKGEAGSSNG DINSHAGKRS NWKATKNLIF FSDVNSKTEF YGFFARVLIR CISLENNNES LTEAERNLSS PGKAPLYKFV MTGGPCGGKT TALARVFNFL RERGFEVITC PEAYTLLMSN GMSVDFFSTP GMGRIIQSTV LDVQLNLEDN VARVLKARGK PGIILCDRGS MDGTVYVTKE EFQKVMQERD TDVVQLRDNR YDAIFHLVTA ADGAEHAYTL DNNKVRTENV EEAIEVDRRT QKAWVGHPHL YVLDNATDFE GKMNRLIDVI SNLVGLPSNL KRRSAKFLLK SMPDTYSFPP DIDHQTFEVE KVYVQQTGQK YDYAFVRRRS NVDADNNLLG SVYQLTTVQR FEEEVIEQKR IISQQRHVVR QKRISFIYKQ QSFVIHIYEE PVSDICILHA QVEASKEKVD LPPFIDVNRI LLNSKPDEEK YGAFSLSLIN GCGM
|
| |