Gene Information |
Locus tag | PHATR_46954 |
Symbol | |
ID | 7204770 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 915814 |
End bp | 918346 |
Gene Length | 2533 bp |
Protein Length | 720 aa |
Translation table | |
GC content | 44% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185808 |
Protein GI | 219121157 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0187117 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCAACGGAGA CGGAGACAGA AGTTCTTTAC GGAGAGCAAC GGAAGCTTTG ACGCGCAAAA GGCAGTGCGG CGTGGTGGCG GTAACGGCGA CGGAAGTGCG GTTTTGGGTT TCCCTGTGTT GCAACGAAGA CGGAGACGGA AGTACAGTTT TGGGGTTCCG GGGCGACAGT GATTGCAACA ATGGCATCTT CCCTTGCAAA ATTTACAGAT GCATTGTCCC AGCTCCTAAC AACGGAGTCA TTGCAAAAGG CAATTGATGA ATTGAAAAAT GACCTTCAAT CTTGGGATGC TCTAATTGCT TTTCCAGTTG TTCTTCCAGT ATTACCTGGT ATGCCAGCAA CACTCTCCTA TTGCAACAAC CTTCTGGAGT ACACAAAATC AAAGTACAAA GGCCAGAAAG TATACTTTTG CCAGAGCAAA TACCCTTTAC ATGATGGCTG GGAAGTATTG AAGAATGATC TTGAAAGTGC AGCACTAGAG CAAGGATCCG CAATCATTTC AAATGGTGGT GGCACCGGCA ATGCCCGGAA CAGTAAAAAG TTGATATGTA AATGTGGTCG AGTCTACCAA AGAGGGCTGA CAGACAAGGA TAAAGTGAAA ACCTTCCGAA ATGGCTCCCT CCACAGTGAC AGAAAAAATT CCCGAGGGCC TAATGGCAAG AGAATGCCAA GAAGAACAAG CAGCATGCGT CCAGACAATG CAAATTCTGT TTGTCGTATG AGTTTTACTA TCAAATGGGA CAAAAACGGT TATTACTTGT ATTGCGGATA CGGGAACATC AAACACACCA ACCATGTCAA GCTGGCAGGA AAGCATATGA CATTTCCAAA GCGTCTTGTA GCAGATGCGG AACGGGATGT TGTTGAGTCT ATTGGTGCTG CAAAAGCACT TCATGCTGTG GCTCGCAATG TCCACTTTCA AAGAACAGGA CGCCTCTTGA GCCATTCACA AGTTCGCTAC ATGCACAACA TGGCAGAGAA GATTACCATG GCATTGTTGG TGGATGGAAG TTCGATCAAC ATTGATGCCT TAAGTGATTC TGAGCGAATG TTTGAACATT TTAAGTTGCA CGGAGTGAAC TATTCACTGC TGTACAACCA TGTGCCAACA GAGTTGGCAA GTAACAGCAC AAGTACTAGT TGTCAGGCTA CGGTGGATAA CAACTCCTTG ACTTCTTGTT TGCTGGGTAT GACAGATGGT GAGTCTGTGT CACTCACAAG TCCAAGCTCT TTAGTTAGCG AGACCCACAA TGTAAATGAT TGGTTGGGTT CATCAACAGC AGTGAAGCTT CCCCCAAAAG AAAATGAGGA TATGCTTAGA TTTGCCAATG AACACCGAGA TTCATACAAG CTTGAAAACA GCCAGAAGCT AATGATTGGA TGTGCTTGGA TTCATCCAAA AGAAAAGCGC CTATTCCAAT TGTTTCCAGA AGTTGTTCAT GTGGATTCCA CTGCAGATAC AAACAAGGAA GGCCGTCCTC TTTTGACAAT GACAGGCCGT GATTCAAGTG GAAAGCACTT CACTATACTT CGTGTGTTCC TTCCAAATGA GCGAGCTTTT GTTTTCCGTT GGTTATTCCG TCTTGTTCTG CCCTCACTTC TTGGGACAAA ATGGACAAAG AGAATAAAGG TCATTATAAC AGATGGAGAT TCACAAGAGA CCTCCCAGCT TGACATTGCC ATAGCTTTGC TGTTTCCTGA TGTTCTTCGA GTCCGATGTG GTTGGCATAT TGTTGATCGA GGCTGGAAAA GAAAATGCCC AGGTAACTTG TGGTCAAAGC AGACTTGTGT TGCATTGTAT 
TTCTGTCTAC TGCTTTCTTA TAATTGCTCT TTTCTATGCA AAACATAGGG TATCAGTTGG TAGATGTATG TAACAGAAAT AAATTTGAGG TAGTCACTAA GATTATCAAG AGTTGGCTGT ATTCATGGAT GCAGCCATGG CAGTGTGAGA CAGAGAAGGA GTACAATATC TCCAAGTCCC TTTTGTTTGC ATACATGCAG TCATTGGAAG TTGTGTCTGT CATGGGTCAA CAAAATGCAT CAAGGATCTT CCAATTTATC CGAGAGAACG TTGAACCACA CGAATCACAT TACTGTTTTC ACAAGCGTAA GACTGTTCGA CACTATGACA CATTTGTGAA CTCAGCTCAT GAGGGCACAA ACAATGGCTT AAAGAATTCT GCTGCTCCTG TGTTGCCACA GTACACCATT GACCAATCTG CGTCAGTGCT CACTAAGAAT GCCAGTGTCA ATACATTGGC TAAAGAGCTT GCTGCTGGCC AACAAAGTAT TCGTCACAAA CTTTGGTCAG ACCTGCCAAC TTCCAAGCAT CTAAATACTT TGGGTGAATC TCTAATTGTT AAAGAGTGGA AACTACATAA AGCCTACGGC CTTGAACGTG TGGCACTTGA CACCTTCCAT GTGAAACATT GCAACCCAAA ACCAGTAATG GTGAAGCAAA GTGCCCCCAT CCCCCTCTTT CAGCGGGTAC GTCTTGTAAA AATCTGTAGG CGGCGGCTCT ATTGCTCTTG TGA
|
Protein sequence | MASSLAKFTD ALSQLLTTES LQKAIDELKN DLQSWDALIA FPVVLPVLPG MPATLSYCNN LLEYTKSKYK GQKVYFCQSK YPLHDGWEVL KNDLESAALE QGSAIISNGG GTGNARNSKK LICKCGRVYQ RGLTDKDKVK TFRNGSLHSD RKNSRGPNGK RMPRRTSSMR PDNANSVCRM SFTIKWDKNG YYLYCGYGNI KHTNHVKLAG KHMTFPKRLV ADAERDVVES IGAAKALHAV ARNVHFQRTG RLLSHSQVRY MHNMAEKITM ALLVDGSSIN IDALSDSERM FEHFKLHGVN YSLLYNHVPT ELASNSTSTS CQATVDNNSL TSCLLGMTDA VKLPPKENED MLRFANEHRD SYKLENSQKL MIGCAWIHPK EKRLFQLFPE VVHVDSTADT NKEGRPLLTM TGRDSSGKHF TILRVFLPNE RAFVFRWLFR LVLPSLLGTK WTKRIKVIIT DGDSQETSQL DIAIALLFPD VLRVRCGWHI VDRGWKRKCP GYQLVDVCNR NKFEVVTKII KSWLYSWMQP WQCETEKEYN ISKSLLFAYM QSLEVVSVMG QQNASRIFQF IRENVEPHES HYCFHKRKTV RHYDTFVNSA HEGTNNGLKN SAAPVLPQYT IDQSASVLTK NASVNTLAKE LAAGQQSIRH KLWSDLPTSK HLNTLGESLI VKEWKLHKAY GLERVALDTF HVKHCNPKPV MVKQSAPIPL FQRAAALLLL
|
| |