Gene Information |
Locus tag | PHATRDRAFT_22739 |
Symbol | |
ID | 7195070 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 302704 |
End bp | 306730 |
Gene Length | 4027 bp |
Protein Length | 396 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183461 |
Protein GI | 219126431 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.30615 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAATATTCA AATGTATGGC TATTTTCAAA ACTTGTGTCA TACGAGAAAG TAATCAGATA TTCTATGCGC GTTCGACTAT GCGCGTTCCT ATGCGCGTTC GGATGCTCTG ATTGGTCGAG AAATTGACAT TCTTCCTTGA GAGAATGATA TTCTTCTGTC CTAAATAAAA TAATTTCCAA AAGTCCTTAG TCATTCTTGT GTTGGATTGC TGGAGCCGCG GTTGTGAGCT GTGCTCACCC CCCTTTTGCT GGTCATGAGC TGTGCACCAG TGTTTTCCCA CACGGTAGAT CCGAAAGCAC CGTACCAGAA ATGGCAACCT CTCGCACATC CCCGACGCCG CGAGATATAC CAAACACCGT CCTTTTCATT TTGCACATAG TCATTTTGTT GCATCTACAG TAAAAGAGTC AGTGTTTGCT TCAATGCTTG GAGACGGAAC AAGACAAGTC CACAAGGCCA ACAATGGTCC AAGTCGAAAT AATGTCTAAA ATATATGGTG AGCTTGTAGC ATATCCACCA CTCCGGATTC TGCAAAATGG GAAGGAACAT GACCCTTACG TTGTTGTAGA TGAAGCTATT GAAAAATTTG GTGCAGCAGT CAAAGATCAA ATGCTTACAA AGGAACAAGC TTTGGTGAAG GTGACAAAAG TTGCTAATTC AATTGACTGG CTACCTACTG AACTGAAAGA ACTGGTTGAT GCACACTGTC GTAAAGACTC AGATGTTGAC AATGATGGAA AAGCTGCCAG ATACTGAGGG ACCTTGGCCG GAGGATTACA AAAGAGGAGC TTCCTCGCTT CCGGCAAACA GAACTTTGCT GCTCACAAGC CATCAGTGAC GGATCAAAGT TAAGTTCCCT TATTACAGCC AACAAACCTG TGCTCATCAG CCTCCCAAGA CATACTAAAT GGCTTGTTAT TCATGGCCTG TACAATCTAT CAGGAAGCTT GGCTGCTGCC AAAGTTTCAA ACAATGTTGG GGTGGAAGTT ACATGCTATG GCAACATGGG AACAATTATG GAAGGACTTA TAGAAGGTGC TGCCAGCTTT GATCACAGAG TGGCCACATA CAGCACTGTA ACAGACTGGA TTGCAACCTC TGCTTTGACA GGAATGAATA CCATGACACG GTTGATTGCA AGCAACAAAT TTAATAGCTT AACTGGTAGC ATCTAATGAC TACTTACTTG TGCTTTAAAT TAAATGAAGA AACAAACTTC ATGCATTCAT TTCTCACCAT CTATTGTTCA TTTGATTCTT GTCACCAGCA ACAAAAGATA GTTGTAGCAA AACCCATCAT GTCTTTTCCA AGTGTGTATA TTCTCTCAAA GCCATCATGG CAAAGGACAT TCAACATAGT TCAATTTCAT ACTCCAAAAA GTCTAGATTG TGCCAACACA AATCTTTGTT GTAATTTCTC GCGGCTCCCC TTTGGCTCTA TTCTTTGAAT CAAGTCTATT TGCTGCAGTG TTGCTGTTGT CACACCTAGG AATTACAACA AGCTCACAAA CAAGTCGTTT CTGCGTAATA ATCTTCTTTT TGTTCACTGT GATATACTGC ATTGGTCTTG TACCTATCAA CAACCTGCTT TCTTAAGTTT GTTGGGAATT GTCCCCTTGT CAGGCATGGG CCAACTTGGT AGTGCTTGGG GGTCTGGAGC TACTTGTGAT TCCGTAATTC CTGCTGCTCC CTTTGCTTGT CAGCACACTT CTCCAGGATG CATCTGACAG TACCAGTCTG TCAAGTGTTT TGACACACTG TCCTACACAG TATGCTCCAC AACCATAACT GGTTGCGCAG 
GTCGCGCAAC CTCCATCACG TTGCGGAAAC GACCCATTCC GGATAGTGGA CCCCCGGACG GTGCCTTCGG CGTTCCGCTG TATGCACGAA CAGCCACACC CAATCTCCAT TCCAACATAC ACTGTGTGGG AAACGTGTAG AAACAACGCT TGAATTGCTT CGACTTTCTG TGCTTTAACG TTTCCGAAAA CGACTTTGTC GTCTTCCAGT CCGCCCGCGA ATAGGCATTG TCTACCGCCA CCGTCCCACG GCCCTTTATG CACACGTCGG ATACCATGTA CACGCTGCTG CACATGTTGG ACCTGTTGGA TGCTGCGGTG CGTGCTGAGG GACGGTCTCC GCGGACTGTG GGCGTCGTGC GACTTTCGCC CCGACAAAGA GGAAGCGTGT CATCACCTGA TGACGAAGTT TGCCCCTTTG ATGTGGGGCA CGGAATCTAC TTGTGGCTAT ACGGCGTCCA ACAACTTGAC CCTTGTCGCT AACGGGGATG TCGAGGGCAA CGAGAGGGTC GATCTGATGT ACGCCCGCAT GGGGGTGGAT GGTGTGGGCT CTTTGACGGT CAGCGCGTTC ACAGTAAGCA AAAGGGGGTT GGGAGGGGGG TGAGCACAGC TCACACCCGC GGCTCCAGCA ATCCAACACA AGAATGACTA ATGACTTTTG GAAATTATTT TATTTAGGAC AGAAGAATAT CATTCTCTCA AGGAAGAATG TCAATTTCTC GACCAATCAG AGCATCCGAA CGCGCATAGG AACGCGCATA GTCGAACGCG CATAGAATAT CTGGAAAGTA ATAGCAGCTT GTTGTAATCG AGTGACATTG ACTGTGAGTG ACCGTGATTC TCCGGTTTTG GTTGTACGCC TCTCGACTAC GAACCATTTT GGGGTGCGTG ACTTGCGTGA ATCTTATGCG AAAGCCACTC GGGAATGACT CGTTCCCGTG ACTTTGCTGC CGAAAAGCCC TCGAGAGTCT GCCTTCTCGA TTCGCTGGGA TCGGGACCGA CCCATTCCCC TCCCCATTAC AAGGTCCATC AACACCATGG CAATGGAAAA GCGGAACAAC TCGGCCCGCC GAGCGGTAGA AACCAAATCA TCCATCGACG GAATCGAGGA AGGCCTCGCG ATGGAGGAAC GCGAGTCGCT CGTGGAGGGC TCGTCTCCCG CCTCGGGGGA CCGTACGCAC ATCGGAGCCA ACCAAACCAA TCCGGATACC GCCCACAGCA CCGTCGACGC TGGCGACAAG AAAAAAGAGG TTCCCGCCGG AGTCATTACC AGCACTGCTT TGATGGTTCT CATTCTCCTC GCCGTACAGA ATTGCTCCAA GAATTTGTTG ATGCGCTACG TCATGAAAGA CCAGCCCAAG TTCCTTACGT CGGCGGCTGT TTTGGGCAGT GAATTTACCA AATTGTCTCT CAGTGTCGGC TACATTCTGT TTGTACAGCA CCGATCTCCC CAAACCATCT TTCGTTACCT GAAAGAAGAC ATGCGCAACA CAATGCTTTT GGCCGTCCCG GCTTCGGCCT ACAATTTGCA AATGAGCTTG GAATACGTCG CCCTAGCTAA TTTGAACGCG GCGGCCTTCT CCGTCCTCGT ACAGACCAAG CTCATTTTTA CCGCTTCCTT TGCCGCAGCT GTCCTCCGGA AACGCCTGCG TTACGCTCAA GTCATTTCAC TCGTCTTGCT CACGGCCGGC GTCATGCTCT GCAACTACAA GGGCGGAAGT GTCGACGTCG ACACCAACGG TAACTCCACC AAGGGCATTC TTGCCACGCT CGGTATCGCC CTTTCGTCCG GTTTCGCCTC GGTATACACG 
GAAAAAGTCA TCAAAGGCCA AGGATCCACC AAACGGTCCG TCAATATTGA AGACTACGGC CTCGCCTACA CCCAAGTACA ACTGGCCCTG ATGAGTCTCT TGACTATTGG TGTTTACGCC ATTGCCAGCG ATTTTGCCGC CATTGTCCGG GACGGACTGT TTTACAACTT TACGTCGGCC GCCTTTGCCT CGGTACTCAT GTCCGCCCTT GGCGGACTCA TCGTGGCGTC CGTACTCAAG TACGCCGATT CCGTTTTGAA AGGTTACGCC ACGGCCATGT CGGTCATTCT GACCGGTCTA CTTTCCATGG TGCTCTTTGG TACCACCTTG TCCGTAATCT ATTTCATGGG GATCATCAAT GTTGTCATGG CGGTTTTGTT GTACAATGCC AAGGATTTGG ATCGCTTCGT GTGCTGA
|
Protein sequence | MAMEKRNNSA RRAVETKSSI DGIEEGLAME ERESLVEGSS PASGDRTHIG ANQTNPDTAH STVDAGDKKK EVPAGVITST ALMVLILLAV QNCSKNLLMR YVMKDQPKFL TSAAVLGSEF TKLSLSVGYI LFVQHRSPQT IFRYLKEDMR NTMLLAVPAS AYNLQMSLEY VALANLNAAA FSVLVQTKLI FTASFAAAVL RKRLRYAQVI SLVLLTAGVM LCNYKGGSVD VDTNGNSTKG ILATLGIALS SGFASVYTEK VIKGQGSTKR SVNIEDYGLA YTQVQLALMS LLTIGVYAIA SDFAAIVRDG LFYNFTSAAF ASVLMSALGG LIVASVLKYA DSVLKGYATA MSVILTGLLS MVLFGTTLSV IYFMGIINVV MAVLLYNAKD LDRFVC