Gene Information |
Locus tag | PHATRDRAFT_49066 |
Symbol | |
ID | 7195429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 479828 |
End bp | 483159 |
Gene Length | 3332 bp |
Protein Length | 1012 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183615 |
Protein GI | 219126754 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
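The Start bp, End bp and Gene Length rows above are consistent with each other if the coordinates are 1-based and inclusive. That convention is an assumption, since the record does not state it. A minimal sketch of the arithmetic follows; the function name `gene_length` is hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(479828, 483159))  # 3332, matching the Gene Length row
```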
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCGTATCTGT TTACTCGGTT CGTTCGCACC CCCACTCACA GTCAACCTCG GGTCTTCCTT TCCTACCTTG CCGTGTCTGG ATCGCTGGAA TACGTGACCA TCACAGTCAT TCTCTGCTGT TCGCTCACTC TGCTGTTCAC TGTCCGCTGG ATCGATCGAC AACACAGCAT TGCGGTAGGT ACACGCGTAG GCTGCATCGG TGTTCCTCGA CGGCTACCAA CAGTCTCCCA CGTTCCTTTC CTTACAATCA ATCAATCAAT CAGCCGCCTT GGTAGTTCGT ACACATACAT ACAGCATGGG CAATATGGTT TCCAACGAAA GAGGAGCTAC GGGCACCGGT ACCAGTACAG CCATCCCCAC CCGGACCAAC CAGAAAAAAG CCTCGGCACA AGCCGTTCAA CCACCTCCCT TGTGGAACGA ATCCTCCAAC AAACAGTCCT TCCACCTCGA TTCACCCGGC TCGGACTTGT CCAATCCTTC GCAGTTCCGA CACCGCCACC GTCACACCGT CAGCAACAGC AATATTCATG GTCAAGCGCA AACACCGCTT CGACTGCGGC GTCGGGCCTC CAAGGTTGTC AGCAAGTCAC ACATCGCCAC CATGGAATCC TCGCTTCCGT ACGGTGATTA CTCCAAACGG ACCAGTTTGC CCCCGCCTTC CCAGTTTTAT TTTGGCGCTG GCTTTGGCAA CGTCGATAAC GCCAGCGCCA GCTTTCTCGA GGATCGCGAC GAGCAAGACT TGCTGGACCC AAGCGCTCCC GAAATGGCCG TCTTCCGCCG ATTCCGGTTG GCGGCCAACA CTGCCGGGGT GGGACGAATG GAAGAGGACG CCGCTAGGCG TCGGGAACAG ATTCTGACCT CGCGCCAGCA ACGCAAAATT TGGTTCAAAT CGCGTCGCAA GGCTTTGCAG CAGAAGGCTC GTCGTGCCAA AACTATGCTT TCCGCTTGTT GGGAAGAGCG ACGGGGTATG TGCATCGTCA TTCCCGACGT GCAGCACCCG CAAGATGGGC GTCCCGAGGC GGCATCCTAC CCACCACACG TTGCGGTAGA CTCCATCACC ACGGACCCAC TACGCCGGGT TTCGTCTACC TCTCGTCGCG TTTCCGAGGA ACCCCCCATC GTCGACGGTA CTGGTCCGAT GCAGTTTGAT CCGCCGCAAG TCCGCGTGCT GGATGAGAAT CAGCAGTCTC GCAACTTTGA TTCGTTTTAC GACGAACTGA CCGTCACGGC GGCGCCTTTT GCGCCCGTGT GGAAGGAGAC AAAGCAGCCC AAACACGACG AACACTCACG GGTACCCGTC GCTGCCGAGG TTCCATTTTA TTTGTACGAT CCCCAACGTA TTTTGGAAGG AATCGTCGAC TCCTCCGAGT CTCCGACGAC CGCCACAACG TCGACGAAGC TCCAGCAATC GCGCCGTCGT ACAAGTGCGA GGTACGTGGA TCGTTCGGCG TCCCCCGTGG TCGCACTCTT TTCCGACGAC GACGAAGACC AGTACGATCG CATCGGTGAT GATGACTACA CCACCAAGAG CATCGAGACC AGTGACAAAT ACAAGACCGA GAAAGACGAC GATAATGACA CCAACAAGAC TCCATCTTCA TCCAAAACCC TGTCAAATCG GTCGTCACCC TTTGACGAAG ACTCGGTATC GCAAGAACCA GCGGAAACGC CCAACGACGA AAACGCTTCC GTTGTGACTC CGTCGACGGT CCAGGACCAG CCGATAGCGG TGCGTCGATC CTCTACACAG TCTTGTTTGG ACCGTGAAAG CATTGAAGCC AACGTCCAGC AACAAATAGA 
CCAGCAACAG AAGGGCGTCG CCGTCACGCA GAAACACCAA ACGCGACACC GCCCGAGTCT TGCAGAACGG GCCGAGGATC TGGGAAGTTA CGGCAGTGAT CCACCCCAAA CCAAGGCCCG AGCCGTGACG CAGGCGACAA AAAAGGGAGA TGCTCCTCTA GATCAAGCCA CAACCCGAGT AACCGCTCCG GAGTCCCGCA AGCCCGTGGT GGGGCATACC AGTACCGCCA GCATTCGAGT GGCCCGCACG TCGGAATCAA CGGAACCGAT CCAAACAGCG CACGCCGTTG CAGCCAAGAA ATCAGGCAAG GTCGAGTCAG TCCAACCGAA GCCAGCCAAG ACAGTTCAAC CGCAAGAATC CCGTCAATCG TCTTCTTCCC AGAACTTGCC CATATTGCTC CAGCTGGCGG GGTGGAGAGC ACCCAGCGCG ACAACCAACA AAGGGCAACG GAACAGCAAC GGCTCAAATG CATCTACTTC GACTGCAGCA GGTGTTGCCA AACAGCCCAA CAACATTCCC ATCTTGTTGC AGCTTGCTGG TTGGAAGACC GAAAATACAA AGGATAAACA GGGCATCCAG AGACTAACTC TTACGGAAAA GCAGCTCTCC CAAGGGCCCC CGGGCCGACG TAGTAACGTC ATGGCGACGC GAACGGTTGC ACCGACTCGG CGTCGTCAGA GTGCGCCACC ACTGTCGGCC GTTCCGAAAG ACGGCAATCG GGCTTCCAAT CGAGAATCGA GCGCCTCTGA ACCAATCTTC CGGAAAGGAA CAACATCCCA AATCCAAAAG GAGAATCAGG GTAAGAACAT TGCCACCATA CTGCAGCAAA CGTATCCAAT TGCGACGGCC TTCTCTATGG ATGCGATCCA TGACCTTCAA CCGGCACAGG TGCGTTTTGA TCCGAATCAC ATTAAGAAAA CTGCCAAGGC GGGATCGCGC CCGCTCTCCA CGGCTGTAGG AGTTGTTTCA GTGAATGGAT CAAACGAGAT TAGTACGGGG CTGAAAAGTC CGGCGGCCTC TTCAACGTGC ACTACGGAAA CACCTCTGAG TCAAGGATCT ATTCCCATAC TGTCAGAGGA GGCTATGGGT AACGCGGCCT TCTTGTTCTC TCCCAGTTAT ATGGGGAATG ACAAGGCATC CTCAGAGCAA ATGAGATTGG CACCACCAGC GGGAGTTATG CACCGTGATC ACGCTTCTTC CTTTGCCTTG TCAACCTTCG ATTCCCGCTG TCGTGTTTCT GGATCTTCCC TCTCGACCAA ACCAGTCGCG ATTCGTCCCC TTCACAATGT TGTCGACGAC AGTGGCAGCA ACCGTAGCTA TCATACCAAC AGCAAGACTG TTTGGAGCAT TGATGAGGCT CCCAAGGAGG GCAGTGACGT CGGTAAACGC AATAGCAGTA TAACAACGTA TGACGATACC ATGCGAGGAA AGTTTGCGAG CAAGGAATCA CAAAAGTTTG AAATGCCACC AATTGTTCAT GCACTCTCGG ATCTCACTGA TACGACTGGA CGGGAGAGTG GAATGGGCAC AA
|
Protein sequence | MGNMVSNERG ATGTGTSTAI PTRTNQKKAS AQAVQPPPLW NESSNKQSFH LDSPGSDLSN PSQFRHRHRH TVSNSNIHGQ AQTPLRLRRR ASKVVSKSHI ATMESSLPYG DYSKRTSLPP PSQFYFGAGF GNVDNASASF LEDRDEQDLL DPSAPEMAVF RRFRLAANTA GVGRMEEDAA RRREQILTSR QQRKIWFKSR RKALQQKARR AKTMLSACWE ERRGMCIVIP DVQHPQDGRP EAASYPPHVA VDSITTDPLR RVSSTSRRVS EEPPIVDGTG PMQFDPPQVR VLDENQQSRN FDSFYDELTV TAAPFAPVWK ETKQPKHDEH SRVPVAAEVP FYLYDPQRIL EGIVDSSESP TTATTSTKLQ QSRRRTSARY VDRSASPVVA LFSDDDEDQY DRIGDDDYTT KSIETSDKYK TEKDDDNDTN KTPSSSKTLS NRSSPFDEDS VSQEPAETPN DENASVVTPS TVQDQPIAVR RSSTQSCLDR ESIEANVQQQ IDQQQKGVAV TQKHQTRHRP SLAERAEDLG SYGSDPPQTK ARAVTQATKK GDAPLDQATT RVTAPESRKP VVGHTSTASI RVARTSESTE PIQTAHAVAA KKSGKVESVQ PKPAKTVQPQ ESRQSSSSQN LPILLQLAGW RAPSATTNKG QRNSNGSNAS TSTAAGVAKQ PNNIPILLQL AGWKTENTKD KQGIQRLTLT EKQLSQGPPG RRSNVMATRT VAPTRRRQSA PPLSAVPKDG NRASNRESSA SEPIFRKGTT SQIQKENQGK NIATILQQTY PIATAFSMDA IHDLQPAQVR FDPNHIKKTA KAGSRPLSTA VGVVSVNGSN EISTGLKSPA ASSTCTTETP LSQGSIPILS EEAMGNAAFL FSPSYMGNDK ASSEQMRLAP PAGVMHRDHA SSFALSTFDS RCRVSGSSLS TKPVAIRPLH NVVDDSGSNR SYHTNSKTVW SIDEAPKEGS DVGKRNSSIT TYDDTMRGKF ASKESQKFEM PPIVHALSDL TDTTGRESGM GT
|
| |
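The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before any computation. The sketch below shows one way to recompute a GC percentage from such a string. The helper name `gc_percent` is hypothetical, and how the record's own GC content value was derived or rounded is not stated, so a small difference from the listed figure would not be surprising.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a whitespace-grouped nucleotide string."""
    # Remove the spaces and line breaks used for display grouping.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy input in the same grouped layout (not the real gene sequence).
print(gc_percent("GCGT ATAT"))  # 37.5
```

To apply it to this record, paste the full Gene sequence field as the argument.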