Gene Information |
Locus tag | PHATRDRAFT_46203 |
Symbol | |
ID | 7201275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 574120 |
End bp | 577895 |
Gene Length | 3776 bp |
Protein Length | 1242 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180670 |
Protein GI | 219119837 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00106502 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGGGC GTTGGAGCTT TTCTCATGTG GCTGCAACGG CGGCTAGCAT CCGCGGAACG TCGCCTGATG CAGCAGACTC AATGTCGCTC CCATCAATGA ACGATTCCTT CACAGTTCTG GACACTGATC CTGTGTACTT TTCGGACGGT GAGCTGGAAG CCATGTGTAT GCAAGCGAAT GAGGAACAGA GGACTATGTT ACAGAATGCG AATGGAATGA CAGTCGAGGG AGAGAATGAT TCGCTAGACA ATGAACGGCG CCATGAGACA GAAAGTGTCC GCATGACAAC CAACAGCGCC GACCATCGTC AAAGTGTGTC CACAACCGAA TCGCCGCGCG ATAAAATCTG GAAGAAGTGT TTAATGCAAC AGCATTCTGG CAATGCGATA CAAAATCCTC TCGCAACTTC TACACCAGCA CTCCAGGCTC TAAACTACGG TACTTCTAGA CCCAACACAA GTCCTTCCAC CCTTGCTTTC GATAATGGGT CGCCTCGGTA TCATCGCCGT GCTCGACGTC GCAGCGAATC AAGTTACGGA ATGAATACGT GGAAAAAGCG ATACGCTCAC CACGTACTCT CTCCGGAGCA TCACCTATTA GGAACAGCCG CTTCCAAATC AAGGCAAAAT TCGAGTTTTT CCGGTACCGA CAACCCGGAG CAAGGCTTGA ATTCGCAACG TCAACTGTTC TCCTCCCGTG ATTTTCCAAG CCCCCATGAG CAGCGCTTTG ATTTCTACAT GGGAAGTTCG ATTAATGCAT CTCCACTTCG TCCTTCACTT CCCACGCCAC CATCACCATC CACTATATCA GACCAGTCAA CGAGAGAAAC GAGCTCGCCG CCACGCTCTG TATCGGAAAA CCATAAGAAA CCATATTGTA GCAATACCAA CACCCAAGAA AAGCTGGCTG AAGCAGCTCA TGGGTCATTG GTCTTGCCTT TGTCGCCAAA AGACGAGAAT ATTGTAACAG CAATTCCGCC GCCTGTCGAA GCCACTACAC CATCCGGGAA GATGACCACT CCCAGACAGG CGATGCACGC TTCGCTAACA CAAGAACAGG CTCTCGTCAT TCAAGAACCA GACAGCAGTC CAAAAAGACC TGCGACTGGC CCGTATTCGT ATCCGGAATC AATACCTAAG AAAATTGTGA TAGTGCCCCA GTTTGATGAT CATGATGATG ACGACCATCA CCCGATGCTC ACGTCGTATG AACCGTCCAA TGGGGAAGAG GTACAAAGGG ATGCAGAGCC GAACAAATCT TCGTCGATTC CCTCGAGAGC AACACCGACG ACTGGTACAT TGCTTGACCT TTACGAGGCG CCAGTCCTAG CAAAGGAGTT CTCAGTTGCA CTTGTCGAGC CTGTTCTACC TACAGTGAAC AGCGCCAGTG AAACATTTCA CTGTTTGACT CCATCGCAAA ATGAACCGCC AGCACGAAAT GTCTTGCAGA CGGAATCAAT GATAGCATCC TTCCATACCT TTGACTTGGA CAAGAAGCCT AAAAGGGTGG CTTCTCCCGA TGAGCCCATA GCCTCACCGA GATTGAAAGC GACAGCCAGG GCACCATTGC TGACTGTGAG TGTTTCATCA GATCTCCCCG AAGAAAGTAT ATCCCAGGCA CCTACAACTG TATCCAAAAA ATCATCAGAC CCGTTCGAAT GGGCTTATGA CATCTGGCGA GGGAAGAACT TACTCCTGCC CAAGAGTGCC GTGCGTCGCG ACCCATCGTT TACATCGCCT TGTAAAATCG AAACTTCAGA AAATGAAGTC TGCATCGAGG ACGCCCGCGT ATCTCCTCGT 
TCGACGCCGT TTTTGCTCCC TCTTGAGACC ATCGTGCCCT CTACAACACC GGGCATTTCG ACGTATCCTT GTCCAAATGC TGAAGCAGTT TATTGTTCAG CAACACCGGT GAAGGGTGAA AAGGCCTTTG CCAACGTCTT ACAGGGATGG AAAACGGTCA GTAACGAGAG ACCTTGCACT CAGTTTTTGT CTCCAGAAAA CAGCGTGATA TTCAGTCAAA CAAAGGCAGG CAGCGCAGTC CTACCTTGTG CAAATCTTGA ACACACGAGA TTAGATCCGA AAGGCAATGA TACAGAAACG CTCACTTATC CAGCATCGAC GCAGTCGCAT CGCTCTTGCA ACAATATCTC TACTCCGCTC TCCAGCACTG GCAAAGCTTC GATTTTAAAA TTGGAATGCC GTCAAGGAAA TGGCTCCTCG GATCCAATAG TATCTAGAGC GATTACAGTG ACTGATGACC AGAACATGCA GTTCCACAAT GAAGTTGGCT CTATTTTGTC CCCCAACCTG GTATCGAGCG CACCTTCTTT CTGGGACAAA GCGATTGAAG CAAACGATCC CGCACAGAAT GCGCAAGACC AAATATTTGA TCTTGCCAAA ACAACGTCGA CATTACTTCT ATCCTTACCA GGTACGACAA CATCATACTC CGAAAAGCAT TCTTTCAAAT TGAAGGAGGG ATCGACTTCA GGAGAAGCGT TGCTGAACGA TCAAAGTGAA AACTTTGGTC AACCAACTGA CATTTTGGAG AGCGACTTAG GCAGTGCAGT GTGTAAGAGT TTGGATCTCG CGTACCTCAA AAGTGCTACA TGTGATGGAT CAATACCGTC AAAAGCCATT CGCAAGAGCC GACATAGTAT CGATCACAGT CTCATGGTTG TGGGCAATAA GGCTCGATTC AAGGGCAAGC AAGAAGCGTT TGATGACGCT GTCTCCGTCG GCGAAAGCAC AATATCCTCA ATGACTTCGT GCACAAGTCG TGTCGGAAAC ATTGAGAGCC GAACGAGAAT TAGAGATAAA ATCAGGAAAG TCAATGCGAA AAAAGACTTG TCTTCAATCT CTCCAGGTGC GGCAAAAAGC CGTGACACTT CGTCGACTCA AGCCGCAAAG ATTAAAGAAG TCTACAGGAA GAAGCGGCTG GTTATGCGAC AAAGTATGGG CCAATTGTCT GAGGCCTTAC CAAATCGACC AATGGAGCCC TCGCTTTGGA GTCAAAGCGA CTTTAAGATG TACGCTGCGG AGCTTCTCGA CATGCTGCCG TCCGAGATTT CAGGGGCTTG TTCGACAGAT GATTCGCGAG ACATTGAGCC TGAGGAAAAG GAATTGTGCA ACCTTTCTTT ACAGCCGTTT TCCACACATA CAAACTTGGA CCATGCATCT ACTGTGAACG AGGGGAATGC CACCTGTAAT TGCTCCAAAT CGGTCTTCTC GGGGAACGAT GAATTAATTG AATTCTTCTT GCCACGGTTA GGAATGGCTT GCACTTGTAG CAAAGGGTTG CAAAGCTTGA ATTATCCCAG TGAACCAGAA TCCCTTGCTA ATATTTTGAG ACCATGGCAA GTTGCCTACT TGGGAGACTT TGGTATACAT CGTGGAGATC AGCTTGTGAA GGCCCACCAC AGGAGTGCTG ATGCATTGGC AAGCGCGATG CGCCAGTACC GTCGAGACCA CGGGTTGACA CCTTTCCGTA CGAAAAGCTG CGGGATGGCT CTTTCCATTT GGGCGAAGAC TGCAAAAACA TACATTCGAT CAGTTCGGAA ACAAACGACA GCACATGGGG AAGTAGCTTG GAATCTGCCC AATACTCTTT 
ACATCCTCAG CTCCTTCCTG GAAAAAAATC CAGGGAATTC GGGGAGGCTG TCCTCCCCCA TAGATCAGTT TGAGGCTGAG AGCAACGATG GATCACCATC AGAATTTAGT TGCATTTAAC TGTAAGTCAA TAAAAGTAAC ATATAGCAGA AGAAGTGACA TTAGTG
|
Protein sequence | MLGRWSFSHV AATAASIRGT SPDAADSMSL PSMNDSFTVL DTDPVYFSDG ELEAMCMQAN EEQRTMLQNA NGMTVEGEND SLDNERRHET ESVRMTTNSA DHRQSVSTTE SPRDKIWKKC LMQQHSGNAI QNPLATSTPA LQALNYGTSR PNTSPSTLAF DNGSPRYHRR ARRRSESSYG MNTWKKRYAH HVLSPEHHLL GTAASKSRQN SSFSGTDNPE QGLNSQRQLF SSRDFPSPHE QRFDFYMGSS INASPLRPSL PTPPSPSTIS DQSTRETSSP PRSVSENHKK PYCSNTNTQE KLAEAAHGSL VLPLSPKDEN IVTAIPPPVE ATTPSGKMTT PRQAMHASLT QEQALVIQEP DSSPKRPATG PYSYPESIPK KIVIVPQFDD HDDDDHHPML TSYEPSNGEE VQRDAEPNKS SSIPSRATPT TGTLLDLYEA PVLAKEFSVA LVEPVLPTVN SASETFHCLT PSQNEPPARN VLQTESMIAS FHTFDLDKKP KRVASPDEPI ASPRLKATAR APLLTVSVSS DLPEESISQA PTTVSKKSSD PFEWAYDIWR GKNLLLPKSA VRRDPSFTSP CKIETSENEV CIEDARVSPR STPFLLPLET IVPSTTPGIS TYPCPNAEAV YCSATPVKGE KAFANVLQGW KTVSNERPCT QFLSPENSVI FSQTKAGSAV LPCANLEHTR LDPKGNDTET LTYPASTQSH RSCNNISTPL SSTGKASILK LECRQGNGSS DPIVSRAITV TDDQNMQFHN EVGSILSPNL VSSAPSFWDK AIEANDPAQN AQDQIFDLAK TTSTLLLSLP GTTTSYSEKH SFKLKEGSTS GEALLNDQSE NFGQPTDILE SDLGSAVCKS LDLAYLKSAT CDGSIPSKAI RKSRHSIDHS LMVVGNKARF KGKQEAFDDA VSVGESTISS MTSCTSRVGN IESRTRIRDK IRKVNAKKDL SSISPGAAKS RDTSSTQAAK IKEVYRKKRL VMRQSMGQLS EALPNRPMEP SLWSQSDFKM YAAELLDMLP SEISGACSTD DSRDIEPEEK ELCNLSLQPF STHTNLDHAS TVNEGNATCN CSKSVFSGND ELIEFFLPRL GMACTCSKGL QSLNYPSEPE SLANILRPWQ VAYLGDFGIH RGDQLVKAHH RSADALASAM RQYRRDHGLT PFRTKSCGMA LSIWAKTAKT YIRSVRKQTT AHGEVAWNLP NTLYILSSFL EKNPGNSGRL SSPIDQFEAE SNDGSPSEFS CI
|
| |
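The coordinate and length fields in the Gene Information table are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the coding region ends in a single stop codon; the function names `span_length` and `coding_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table of this record.
START_BP = 574120
END_BP = 577895
GENE_LENGTH_BP = 3776
PROTEIN_LENGTH_AA = 1242


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally plus one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


if __name__ == "__main__":
    span = span_length(START_BP, END_BP)
    cds = coding_length(PROTEIN_LENGTH_AA)
    print("span from coordinates:", span)
    print("matches Gene Length field:", span == GENE_LENGTH_BP)
    print("coding length incl. stop:", cds)
    print("bases not accounted for by the CDS:", GENE_LENGTH_BP - cds)
```

Under these assumptions the coordinate span agrees with the listed Gene Length, while the listed Protein Length accounts for fewer bases than the full span. The difference would be consistent with the gene sequence continuing past the stop codon, although the record itself does not say so.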