Gene Information |
Locus tag | PHATRDRAFT_47888 |
Symbol | |
ID | 7203102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 356867 |
End bp | 360200 |
Gene Length | 3334 bp |
Protein Length | 854 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182206 |
Protein GI | 219123801 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.554333 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGCGCG CGACAGTAGA TATTGCTTTG CTCTCCGCAC AGTTTCTCTC TAACGTTCAG GACGGCCGTC TCAACATGAA GGAACAAGCA AGCGCTTGTG ACGCTTTCAC CATTAGAAGC CGGGTGCCCG TTTCCTTGCA TGGGTGGCAG CGTTGCAGCG GTTGGATTCT TTGTGTTTTG TTGGCCATTA TTGCACCCGC CGCTGGTGTA CCAGACGATG CTCTATCAGT TCACTCGGAA GTATCCTCTA GTCGGAGCAA ATTAGATGTT GCGAACTCTT TTATCGACAA CATCCCTTCT TCAGCACCGG CAGTTGTGGC ATCATCGACA CCCAGCTCAT CCTTTGAGCC TACCACGTCA ACGCAACCCT CATTCACGCC GACAGTGTCC TCCCAACCTA CATTAACGGC ACGACCGACC CAAATTTCAA CACTCAGTCC CACTGCACTC ACCTCAATGA ACGGACCTAT CTTCGCAGAT CCCGCCGGAA CTCCGACTGC CGCTATTGCA ACACCAGTGT CAATGAGCCC GTTCATAGAG ATTCCGCAAG AGGAAGAAAT GCCAACCGGA AAGCCGTCCC CGATTCTGCA ACTGAATCCG ACAAGTACAC CCGCTGCGCC TACAATTTCT CCAACCAAAG GACCAACTAC CGCACCAACT CTTGATACAA CGACTATGCC TACTTTGTTT CCACCTGCGA CAGAAAACTT AACGGTAATA ACAACGAGCT CTCCGTCAGC AGTTCCATCC ATTAATCCCT CATTGGTACC ACCGCCTACT GCAACATTGA ATGCGTCTGA CGCTCCTTCT GTCGCTGCTA GCACACGGCC TCCTTCCAAC CCTACGAATG GTGCTTCGGG CGCACCCACG ATGACCCTCG CCCCTGCAGT AGTTCCTTCC GTATCGCCTA CCCAATCGCC ACTGTCAACG ACAAGCGCAA TGCCCTCGGC GATCCCAAAT TCCATGGTGT CAGCGACTCC AAGCACCACG CCTACTTCCA CAACAAGCTC GACGCCCACT TCGACTCCAA ATTCCATGCC TTCGGACATC CCCTCATTCT CACCCACTCC AAATCCCACA GTGGGTCCTA GTGATCCACC AACCTCCAAA CCTTCTCCGA TGCCATCGCC AATCAGTTCG AATGGTCCAA GTCTCAGTAC TTCGGCCTCG CCGTCTACTC GTCCTTCAAG CACTCCTACG GAGCAACCCA CTCATGTTGC TTCAGATGAG CCCTCACGAG TAGCTTCGGA AACGCCATCA CTGAGCCCAA GCCATATTCC CACAGCCATT CCTTCACTAG GTCCAAGCAA TGCGCCTAGT TCTAACCCCT CCCTGGCTCC GTCTGTTTCC ATGCAGCCCA GTACTAGTTC AGCGCCATCG ATGGCACCGA CACCGGTGTG GGAGTTCGAA AAGTTCGTTG GCGTTCTGTC TGGAGTTCAG CTTATCTACG GTATGAACGG TCGTCGTATG GATGAAAGCC AGCCAGAATT GACCGATGAA GTCTGCATTG AATTGTGGCG GGATTACGTC GAGTCAAGTA TAGCGCGAGA AGTTGAGAAT TTGGTTCAGA CCATTGAATT TCTCGAGGTT ACGGTGTCGA ACGAAACACA AGAAGTGCTT ATTGATTCAC AGCGTGTTGC GTATGTCTTT GATACTACCG TAGAGCTACG ATCACCAATC AGTGAACACA ATTTGAATCG TTTTGTGGCG GGTGCTTTCA ACACTGAAGA AGAACGGCTT ACCATGGTAG AGTATCTACG GAACACGACG TGCCCTCAAT TTGCGTACAT AACGACAGCC 
GACTTAGTCA TGCCTCCGAA CAAGGCGTTA CTTCCCGAAG ATGGCAGCGA TGAAAGCTCC GGGGCTGGAT TGATCGCTGG TCTGGCTATT GCAACAGTGG CGGCCGTTAT TTTGACAAGT CTGTTTCTCT TTCTTCGCCA TAAGAAGAAA AATAATCCGT CGGTAGCTGA AGAAGAAGAA GTCATTCTTC CAATACCTGA GCACCCGACT AGTCACAACC CTGACGAGTA TGCTTCTGAG ATTGATGTAG AAGGTGGAAC CGATATTAGT ACTTTGGGAG ATCCAATGCC GCAAGCCATG TACCCAATGC TATCAGGCGA CATGTCTATA ACTGATTCGG CAACAATGGA GTATGACTTT GACAAGGCTT ACCATAGCCC ACCTTCTGTG TCAGAAGCTT CGGAGACAGC CTCGAGATTT GATACTCTAC AGTCGAGTAA TCCTGTATTG AGCAACGATG GCATTTCGAA TACACGAAAT GAGTTCGAGA CGGTAGTAGC AGTACTGGCT CCTGCCGGGC GCCTAGGGCT CATCCTTGAA TCCAGCAAAG ATGGGACTCC TATTGTGAAT AGTGTCAATC CGGGAAGTGC TTTGGAGGAT GAGGTCGAGC CGGGCGATCG TCTCTTGTCC GTGGATGGTC TTGACGTGAC GGTACTCTTG CCCAGTGAAG TGTCAAAACT GATTGCCGAG AAACGAGATG AAGACGTTCG TAACCTTGTT TTTGCCCGCT CTATGGCAAG ACCGAACAAC TTAGTGGATG GTTAAAGTCT TTTTCAGGTT GCAGATTTAG ATTGTTCGTG TTTATGCGCG GTATACTTGT TGGAAGTAAA GCCGAGTATT AATTCTAGAG AAATCGATAT TAATGAAAAT GCCTGTAGCT ATGCTCAAGT CCCACCATTG AATATGGGTT GTCATCCTAG GGAGCAACCT GATGATCTAG CAGTGGTTTG TGCTTCATAC GGGGCTTATG CTTTTTCTGT GGGCGACCCT CTACGGGCTT TGAATCAATA CCCACTGCAG GCTTGTCAGG TTGTTCTTTC TTCGCCTCTT GTTTTGATCT CAGATTCTCG GGAGGGCCAT CAAGAATCTT GGCTCCGATG ATATGTACAG GCTCCTCATA GAACCATTGC CACTCAGGAT CGTCGCGCAC AGTAGGAGAT GCAATGACCT CTTGTAAAAC ATCAAAACCG TCCACGACCT TGGCAAAACA CGGATCAGCG AATTCGTTCA GGTCATGCTG ATTTTGACCA CCAGGTCCAT GCGATTTAGT ATTATCCACC TTGTTAATAT ACCAATCGGG TCCGCCTGGT CGTCCCGTAT ATCCCAACGT CCACGGGACG TGCGGAAAAT CTTCCGAATA TTCCGGAAAC GCCAAGCTTT CGAGATTCAA CGATCGAAAC TTTCGAAGCG CGAACTCTCG TTCGTCTTCC GCCTCGTCGT ACCCTTCTTC ATCGTCTTCC GCCTGCGGAC CACCTTGGAG GACGTGCGGT CCATTGAGAT AAAACCATGC CGAGCTCCAT AGACCGTGTG CAAC
|
Protein sequence | MVRATVDIAL LSAQFLSNVQ DGRLNMKEQA SACDAFTIRS RVPVSLHGWQ RCSGWILCVL LAIIAPAAGV PDDALSVHSE VSSSRSKLDV ANSFIDNIPS SAPAVVASST PSSSFEPTTS TQPSFTPTVS SQPTLTARPT QISTLSPTAL TSMNGPIFAD PAGTPTAAIA TPVSMSPFIE IPQEEEMPTG KPSPILQLNP TSTPAAPTIS PTKGPTTAPT LDTTTMPTLF PPATENLTVI TTSSPSAVPS INPSLVPPPT ATLNASDAPS VAASTRPPSN PTNGASGAPT MTLAPAVVPS VSPTQSPLST TSAMPSAIPN SMVSATPSTT PTSTTSSTPT STPNSMPSDI PSFSPTPNPT VGPSDPPTSK PSPMPSPISS NGPSLSTSAS PSTRPSSTPT EQPTHVASDE PSRVASETPS LSPSHIPTAI PSLGPSNAPS SNPSLAPSVS MQPSTSSAPS MAPTPVWEFE KFVGVLSGVQ LIYGMNGRRM DESQPELTDE VCIELWRDYV ESSIAREVEN LVQTIEFLEV TVSNETQEVL IDSQRVAYVF DTTVELRSPI SEHNLNRFVA GAFNTEEERL TMVEYLRNTT CPQFAYITTA DLVMPPNKAL LPEDGSDESS GAGLIAGLAI ATVAAVILTS LFLFLRHKKK NNPSVAEEEE VILPIPEHPT SHNPDEYASE IDVEGGTDIS TLGDPMPQAM YPMLSGDMSI TDSATMEYDF DKAYHSPPSV SEASETASRF DTLQSSNPVL SNDGISNTRN EFETVVAVLA PAGRLGLILE SSKDGTPIVN SVNPGSALED EVEPGDRLLS VDGLDVTVLL PSEVSKLIAE KRDEDVRNLV FARSMARPNN LVDG
|
| |
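The Gene Length value above can be cross-checked against the Start bp and End bp fields, and the space-grouped sequence blocks can be normalized before any counting. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which is consistent with the listed 3334 bp). The function names are hypothetical and are not part of the source database, and only the first two 10-base blocks of the gene sequence are used for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that separate the 10-character blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Coordinates taken from the record above.
print(gene_length(356867, 360200))  # 3334, matching the Gene Length field

# First two blocks of the gene sequence, used here only as a short example.
prefix = clean_sequence("ATGGTGCGCG CGACAGTAGA")
print(len(prefix), gc_fraction(prefix))
```

Applying `clean_sequence` and `gc_fraction` to the full Gene sequence field would give a value to compare with the GC content row; the short prefix above is not expected to match the whole-gene figure.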