Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_46891 |
Symbol | |
ID | 7204438 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 723611 |
End bp | 727235 |
Gene Length | 3625 bp |
Protein Length | 1007 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185939 |
Protein GI | 219121430 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.861173 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTGTGATCA GCTTGCTAGG TCCGGAGGTC CGTCGTCTTT ACTGCGGACG CGCGGAAACA AATTTCGGCA GGCATGCCTG TATCGGTGGC AGTCCAACGG CCAGGCCGCC AGTCGGGATC GGCCGACCGC ACCTCCGTCC GGAATCCTCC GACGCTCCCG GGCCGGAAAA AGCACTCGTC CGCCGCGTCT CCCACAATGT CGGTCACCAA TCCGGTCGTG GCGTCGCTCC CGAAAAAATC CCCGCATCCG TATTCGTCCC TGGCGTTGTC GCCCGATCGA ACGCACGCCG TGACGGCGAG CAAGGATACC ATCCAAATTC TGAGGGTCGG TGCGGATGGC CTGTCCCTTC TACAGGCGAT CCCGATCGCT CAGCACTTTC AAACCGCCAC TACCGGAGAT CATCGTGCCA TGGGAAGAAT ACACGAAGAT GTTCGGGATA GTTTCGCTTC CTTTGGTTTG GGCTCCAAAA CCGCCGCGGC TCCCCAACCG AATCTCGCGA TGAACGTCGT CATTACCAGT GTTGCGTGGA GCCATCAGCC TACCAAGGGG GTGCCACCCA CCACGGCGTC GGTGTATACG CATACGGAAG GCGCTAGTAA AAGTGGGGCG TCGGATAAGG GAGAGCAGAC TCCTCTCGGC GCCAACACTT TCTTGGCGGT GGCGGGGTCC AACGGAGTCA TTGTGGTATG GAATGCGGAC GCCTTGTTGC AGGGATCGGG GCACGCGACA CCGCACGAAG TTGTACTCAA TCCACACAAT CGTGCCGTCA ATCGTTTGGC TTGGCATCCG ACACGACTTT TGCTGCTGTC GGCCTCCCAA GATGGGACGG TAAAGCTCTG GGAACGTCGC AAACAATCGA AGCCGCCCCC GGCCCAAGGA AGGAGAGAAT CCAAACAATT TAGTCTATTT CGGAGTATGA ATACGACCTC AGCGTCGGTG CAGACGTATT CCTGGTATTG TCGTTTAAAC TTTGAGCCGA AGAGTGAAGC CGTTCGGGAT ATTGCGTGGA GCCCTTTTTA CGATGATGGT AGGTCAAATA GTTGCGAATG CCTCAGGCTT TGGGAGCTTC CCACCTCAAC CACTCTATCG TTCCAGTTTT TGCTCTAGTC ACAACGAGCG GCTCACTCGT CGCTTACAAT ATGCACCTAC CCAAGCCAGC CATGCTCAAG ATGACGGCTC ACGCCGGTGA CGCTACTTCT ATCGATTGGC ATCCGACGAA ACCGAATATT CTCGCAACGG GAGGGGCTAG TGACCGATGC GTCAAGATTT GGGATCTAGT ATCCTACTTA TCCCTCAACA AAGATGAAAA TAACCTTGCC GCCAACTACG AAACCGTAAC AAGTCGTGCC GATTCAGTCA ACACAGAGGC TTCTTCCGAC ACCGAACGGG GCAGGTGAGT TCTTATGGTA CACCCGGACT GTATTTGTAT CAGTTAGCTA CTATGACAGC GAAAATGCTC ACTATGTTGT TGCTTCAACA GTCACACTGT TTCGTTCGGG CTCACGTCGT TGTTCCCCAC CGTCCCGCGG TTGACTGGAA CCCTTGGATC CTCCGCCAAT TTGGACCGAT CGAGACACAA AAGCCAAACG GACAAGGCCA TGCTTCACGT CTTGTACATT TCCGCTTCCG TCACTCGTCT GCGATGGAGG CTGCCAGCAA ACGACAAATT TTTGTTGGAA GATGAAGATC GGCACTCGTC TATGTTAGCG GTAGCAACCG CTCCCATCAA AGGTGCAAGC GCGGGCTTTC CTGGACTTTT GGGCTTGTGG TCATTTCATA GACCCTTTAT GCCACTTAGC GTAGTCGAAG GGCATCGGGA AGGAGCTGTA ATGGACTTTG ATTGGTTGGA TACTCCGCAA CCAAAACAAA ATGACACCGG GCGTTCGCTA TCCACAGCGA GAATGTTGTC ATCGATAGAC TTGAGAGGGG ATAAACGAGG AGGAAGCTCT CGACTTCAGG GAAGCACAAA CGAAGCAGAT TCGACACGTG ATACGGGTGA AGCTGACGAA TATGATAAGC CCATCGGGAT TTGGCAGCAC GTAATTAGCG TTGGAAGGGA CGGACGGTGT TTGTTGCAAA GTTTTGTACG CGGTAAGATT GTACTCTATC GCAGCAACCG GAGTTCCATT CTGATTTTAA GCCTCACCCA CTTTCCGTAT TACAGGAGAC CGACCCATTT CGCGCGTACC ACCGTCCTGC TTTGCTATGG CGAACCTGTC TCCTTTTCAG AGGGGATATG GTTCGCTTCA AGTTTTTTCA GTCTGTCAAC AAATTCCCTC TGGTCCACGG GAGAATTTCC TAGTAACAGG GCTAAGAAAC GACAGCATCA CAGCTCAGGC ACCAGGAGTT TTTCGAGAGC TACCGAGCGA GACATCAGTC GATCCTGCAG CCTTTCTTCT GAGAAACCAT TGGATGTCAA AGAAAAGTAT GCCTTCGGAG ACGCCGACAC TGGTCTTCAA TGTCTTGGAC CAAGGGGAAC TGGACCACGA GGCCAAGCCA GTGGGGCAGT CACAGCAAGC TCTTACGATT GCACCAGAGG TGGTACATCT GTCCCGCTTT GCAGAATCGT ACGTCTTGTA TCCCAGCGAA TCCTTTCCTA CTAGGAAGGC TTTGTGTGTG GAGAATGGAG AAATTGCAAT GAGTTTGAAT TGTGGCCCAC TTGCGCACAT GTGGCGATTA CTGGCGTCTA TGCTAGAGAG CGCCCTCCTG GATGGCCTAC CAGAGAAAGG CTCGGAACCG GGTAACATTA TGCAATTCGT TCTTCTGCCA ACAATCAAGG GAATTCTTGA GGAGCGGGCG GACGCTGGTG ATGTGCAGAG CTGTGTGGCC TTGTGTGAAG TTTTGGATGT GTTGACGCCG GATCAGACCA CCCGTGTTCC CGGTCTCGAG CTAAATCTTG TTCGAGAATG GTATCTCTCG TACATAGATC TGTTGCGAGA CATGTGCCTC TTCTCACATG CTTCACTACT TATTCGGAGC TGCAAAGACC CCTTCATTGG AGCTTTGAAC CAGCAATCCA CAATGTGAGT TTTGATTTGT TTTGGGGACT GTTGTTTTAG GAGAAGACGC TAACTCTTTG CATCGCTAGA ATCTACGAAT CATGTCCAAG CTGCGGTAAG CCATTGCAGT CGTCCGAAAG TGGAGGTGCA AGCACATTGG ACGGCACTGT TCGACGTGCT TGTAAGAGCT GCCGACGTCG AATCGGTATG TGTTTTCTTT GCCATGAACC TGTAAAAGGG ATGTATGTAT GGTGCCCCGG ATGCGGTAAG AATCATTCTT CGTCGTTTGC AACATGGGCA AGTCGCTGCT AACACGCTCG CTTGAACTAT ATTAGGTCAC GGCGGTCATA TGGAACATGC GCTTCAATGG TTTGGTGGTC TCAGTGGAAA GCCTGTTCGC GAAATGTGTC CCACTGGCTG CGGTCACAAA TGTAACATGA TGCAACCGCT CAGCGCCTTT CCAAGAACGG AATCACTTCG TGATCTAGCA CAAAGCGACC ATATGGAGAT CATCCCATAG CTCATGCTTC CGCAAATAAA AACGTTTTTT CATCGTTGAG TGGAAAAGCG TAGGCATTCT TAGAATAATG TCGAATACCT TATGCTTCAG AGTGTTTAAA ACTTG
|
Protein sequence | MPVSVAVQRP GRQSGSADRT SVRNPPTLPG RKKHSSAASP TMSVTNPVVA SLPKKSPHPY SSLALSPDRT HAVTASKDTI QILRVGADGL SLLQAIPIAQ HFQTATTGDH RAMGRIHEDV RDSFASFGLG SKTAAAPQPN LAMNVVITSV AWSHQPTKGV PPTTASVYTH TEGASKSGAS DKGEQTPLGA NTFLAVAGSN GVIVVWNADA LLQGSGHATP HEVVLNPHNR AVNRLAWHPT RLLLLSASQD GTVKLWERRK QSKPPPAQGR RESKQFSLFR SMNTTSASVQ TYSWYCRLNF EPKSEAVRDI AWSPFYDDVF ALVTTSGSLV AYNMHLPKPA MLKMTAHAGD ATSIDWHPTK PNILATGGAS DRCVKIWDLV SYLSLNKDEN NLAANYETVT SRADSVNTEA SSDTERGSHT VSFGLTSLFP TVPRLTGTLG SSANLDRSRH KSQTDKAMLH VLYISASVTR LRWRLPANDK FLLEDEDRHS SMLAVATAPI KGASAGFPGL LGLWSFHRPF MPLSVVEGHR EGAVMDFDWL DTPQPKQNDT GRSLSTARML SSIDLRGDKR GGSSRLQGST NEADSTRDTG EADEYDKPIG IWQHVISVGR DGRCLLQSFV RGDRPISRVP PSCFAMANLS PFQRGYGSLQ VFSVCQQIPS GPRENFLVTG LRNDSITAQA PGVFRELPSE TSVDPAAFLL RNHWMSKKSM PSETPTLVFN VLDQGELDHE AKPVGQSQQA LTIAPEVVHL SRFAESYVLY PSESFPTRKA LCVENGEIAM SLNCGPLAHM WRLLASMLES ALLDGLPEKG SEPGNIMQFV LLPTIKGILE ERADAGDVQS CVALCEVLDV LTPDQTTRVP GLELNLVREW YLSYIDLLRD MCLFSHASLL IRSCKDPFIG ALNQQSTIIY ESCPSCGKPL QSSESGGAST LDGTVRRACK SCRRRIGHGG HMEHALQWFG GLSGKPVREM CPTGCGHKCN MMQPLSAFPR TESLRDLAQS DHMEIIP
|
| |