Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48300 |
Symbol | |
ID | 7203729 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 56489 |
End bp | 59446 |
Gene Length | 2958 bp |
Protein Length | 746 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182889 |
Protein GI | 219125231 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0439212 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGG CGGCTGTACT TTTCATCATC ACGTACGTAT TGTGAGAGAG ACGGAGAGAC GATTTCCACC AAATCCTATT CTAGCGACGA TCAATCATCT TGTCGACACC AAAAAGTAGC CTCTGTATCA AAGGTAGCAC GATATCGGTT TCCATCATCT TCTGGCCTCA AATCTCTTCG TAGTGAGTAG TTGCTGAACT AGCAAGCATT AAGAACGCGC TGAAACTTCT CATATCCTTG TACTAGCTGG CTTTCACGAT TCAGTTCTCG TTGGCCAAAT CGTGGAACTC ACAGTCAAAC GAAAAGCCAA CCAGCAGTAT ACTACTGAGT CTCTGTCCTT TGTTATATTC GAGACTTAGC GGGTCGCACT TTATCGACTC CTTGTCGAAT TCTTCAGTCA TGGACGCAAA AAGGCGGTTG TCGTTCTTTA TGGGGATAGG AATGGTGTTC TTTCCAGGAG CAAACGGCAA CAACTCTTGG ATCGATATTG AAACACCATT GAAGAAAAGG ACCACCAAGT CTTTGGTTGA TGGCTCAACT TACCATTTGG TGCGTCTTTT TCTATCGCTA TGTACAGAAA TGCAATTGCA AGCTAAAATC TCACACCAGA CTTCGTAAAC TACTTTGTGG CTGCGTGCAC AACAGGTCAT GTCAGACGAG TTTAACGTAG AAAATCGAAC ATTTAAAGAT GGACACGATC CGATGTGGAC GGCCCTAGAT AGAAGCGACG ATGACGCTTC AAGCGCTGGA GGTGGATCCC TGCAGTTCTA CAATAGCTCC GCCGTTAGTA CAGAAAATGG CTTTTTAAAA ATTGCCACCT ATCTGGAAAC CACTTCTTGG ACTCGGTACG ACCACGTCAA CAAGCACTGG AAAACGGAAA GGACGAACTT TACTTCAGGT ATGGTCCAAT CGTGGAACAA ATTTTGCTTC ACCGGTGGAA TAGTGGAGGT GGACGTTGTT TTTCCTGGCG AACCATTCAT TGGGGGATTG TGGCCAGCCG TTTGGATGCT GGGTAATCTT GGACGAGCTA CGTACGAGGC CTCCACCAAC AATATTTGGC CGTGGAGTTT CGATACATGT GATCGTGAAA TGCAGGATGC CCAAGCCATT TCGGCTTGCA ATCGCGAAAA TCACTACGGA ATGCATCCGT TTCAAGGGCG AGGTGCCACC GAAATTGATA TCATTGAAGT TATGACCGGT GATTCCAACG GGCCGTTGCC GTCCACCGAA CCACCCATTA CGTTGCCCTA TGGAGATATG ACGTTACAGG TACGTTGACT GATAAACTGA TGTTGTTTCT TTATGGCTTG TCCCGGTCCT TTGGCTTTGA TGAGTGGTAG TCGGGCTCAC ATGTGGTAAT GTTCGCTTTT TTAGGTTGCG CCCGGTGTAC CAAAAAATCG ACCTCAGAGT GGATCGCTTC CTCTTCGGAA AAATACTTTC TCCGACAATG GGCATACAGA GTTTTTGGCA AATGTTTGGT ACAAAGATTT GGAAATGCAC GGCAATACTT CAATCAATCC CTTTTTTTAT GGCACGTATC TTGGAGAAAC CAAACCTGGC GAACCCGTGA CTCGCGGAAA GCATGAAGCG TTTCAAGCCG ATGCGGTTGG AGCGGCCCAT CAGTTAACAC CGGCGCACTT TAAGAGACCC CATACCTTTC GGATTGAGTG GCAACCAGGT AAAGGAGGAC GACTGGATTG GTACACTAAA GGCTATCGCA TGAATGAGAC GACGTACATG GAAGGTGATG GTGAAGGACA GGAGTGGACA CACGTATTTT CACTCAAGGA TAAATCCTTG AGCGATTTGA TGGGGTCGCA GATTCCGAAT GAACCAACAT ATCTTATTTT CAATACAGCC ATATCCAGTA CCTGGGGATT TCCTTACGAT CCGCCCGACT GGTGTCCGAA ATGTTTTGAC TGCAACGATC CGACTTGCTC TTGTAACTTC TATCCTGGCT TTTGCCAAAT GCTTGACTCG CGAACGGTTG CCATGTTGAT TGACTCTGTC CGTGTATACC AATCCTTCAA CACTTCGGCA CACGTTGGCG GCAAGCATAC CTTGGGTTGC GATCCACCGG ATTATCCTAC AAGAGAATGG ATCACAGGCC ACGAGTATCG ATACATGAGG AACGAGCCTT TTTCATATAA GGACAAAGCG CATTCTTTAC AACCATTGCA ACGAGGCGGT GGCGTTTGCC GAGACGATTC GGATTGTGGC GGAAACGTGT CTTTAACGAA TTTAACGGCC GTATACGACA TGCTGGGAAC TGACAGTGAG CGGAAACTTT TTTCGACGGA ATCGCGAGAG ACGGTTGATC TTGTCATAAG TCAAGGCCAG TGTGCCTCGC AGACGAACAC ATTCTTCTCA AGCAAGTCAT GGACGGGGAA AGTTTGTCGG TGTAGAGTGG GCTTTACGGG GCCAATGTGC TTATCGTTGG ATCGCATAGA TACTTTCCCA AGTGCGCACA AAATAAGGAC AGACGTATCG CCTTTCAATC GGATTGCCAA TTTTGAAGCC CCAACGTTCA TGCTGACGGC AATCGCCAGC ATGATTGTTA TGCTGCTTTC TATTTTGGTC TCCAAAGTTG TAGACGAAAA GAAAGCAAGG AAACGAAAGT CGGTGTCGAG ACAGTTCAAA CGTCCCACTT TTGTCACAAC AAGCAATGAC TCCAACGTCA CCATCATTAC CGGTACCAGC ATATGAGCAT GATAGCCAAA TACAATGCTA ACAATACCAT TACTAATTGA ATATGCGTTC CACATCAGAC CACGGTCTCA GGATGTGAAC TTTAAATGTG ATGATTGCTT GATTTAAGCA ACTTTGAGAC AGTTGTTGTA AACAGACCGG TGGAAGAACA CTAATGTAAT TCCAACAAGG ACTGGCTGTG AGTTCTAAAT TTGTGGAAAG ACGACTGGTT ATTTTTTCAA AAACATGTAA TCAATATACG TTAGCCTT
|
Protein sequence | MKKAAVLFII TLCIKGSTIS VSIIFWPQIS SYGSHFIDSL SNSSVMDAKR RLSFFMGIGM VFFPGANGNN SWIDIETPLK KRTTKSLVDG STYHLVMSDE FNVENRTFKD GHDPMWTALD RSDDDASSAG GGSLQFYNSS AVSTENGFLK IATYLETTSW TRYDHVNKHW KTERTNFTSG MVQSWNKFCF TGGIVEVDVV FPGEPFIGGL WPAVWMLGNL GRATYEASTN NIWPWSFDTC DREMQDAQAI SACNRENHYG MHPFQGRGAT EIDIIEVMTG DSNGPLPSTE PPITLPYGDM TLQVAPGVPK NRPQSGSLPL RKNTFSDNGH TEFLANVWYK DLEMHGNTSI NPFFYGTYLG ETKPGEPVTR GKHEAFQADA VGAAHQLTPA HFKRPHTFRI EWQPGKGGRL DWYTKGYRMN ETTYMEGDGE GQEWTHVFSL KDKSLSDLMG SQIPNEPTYL IFNTAISSTW GFPYDPPDWC PKCFDCNDPT CSCNFYPGFC QMLDSRTVAM LIDSVRVYQS FNTSAHVGGK HTLGCDPPDY PTREWITGHE YRYMRNEPFS YKDKAHSLQP LQRGGGVCRD DSDCGGNVSL TNLTAVYDML GTDSERKLFS TESRETVDLV ISQGQCASQT NTFFSSKSWT GKVCRCRVGF TGPMCLSLDR IDTFPSAHKI RTDVSPFNRI ANFEAPTFML TAIASMIVML LSILVSKVVD EKKARKRKSV SRQFKRPTFV TTSNDSNVTI ITGTSI
|
| |