Gene Information |
Locus tag | PHATRDRAFT_47206 |
Symbol | |
ID | 7201974 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 812244 |
End bp | 816162 |
Gene Length | 3919 bp |
Protein Length | 1244 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181266 |
Protein GI | 219121840 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGTCG AAGGGTATTT TTGTTTCCGC CCTCTCTGGA GAGATGTTCA GACCGAGTTC AAAGAGGAGT CAAGGTATAC AGCAAAACTC CCTCTTTGGA GCGAAATATC CTGTAGCATA TTAGTCCCAA TCGATATGCC GAGGAACAAC CAACTGCAGC AGGAACGTTG CCAGCGGCAT CAAGGACTCT CCTCCGACGG TGAAGACCGC GAGGGCACCG ATGCGTTCAA TTCCCCAGCT GCTGCCTTCC AGCACTGGTC TTCCCAATTG CGGCGTCGAT CTCCTCTTCT TTCCACTGAC AAAATGTTCT CGACGGAAGA AAACGGCGAC AGCGACGGAG AGAATCGCGA CAGAAGTACG AAGAAAACTT CGATGGATGA AAGTTCTGGT TCCAATGGAA TCGTGGCTGC GCGCAAAGTG GACTTTTCAC CTCGCCTACA CTCTCACCGG AGCTCAACAA ATAAATCGGC GTCTTTTCCC AGCAAAAAAG ACTTGAACGT TTTCAGTGTT TCGAGAAAAG ACCCGGCTAT GACTTGTTCG GGCCCGAATC ATCCTAAGTC TAGTCCAATT TCTTCGAAGA TGACAGACAA TTCACCAGCG CGTCGGCGTC GAGCCGTACA TGGGCTTCGT CCACCTTCAA CGCCCCACAC TCCTCGATCC CCACAGCCAT CTCGTCCCAC ATCCGTACTT CAATCATGGA CGACACACCA ACCAGACCCT GACTTTTGGC CCGCTAACGA AAGCATGGAC GTCTTGTTAG CCACGGCATC CGTCGATTCC TCTGGCATCC ATAAAGCTCA ATCCCAGGAT CTACCGCTCC ATAGCGCCAT TACACAAGCA ACGTGCATTC ACGAGCTGTG GAACGCGACT GTGCGTGACA TAGTTACGGT AGATTCGGCT GCCTCTCAGA ACGATGAAGG TCAAACTCCT ATCCATTTGT TTGCGGAAAA TGCAGCTCTC GCTGATCAAC TGATTCTTTA TCACAACCCG ACAGCCGCCC AAACACCACC GGATGCGGAT GATATGCTGC ATGTCACTCG GTCAGGATTG GATAGTGACG CTCCACAATC CGACGATCCG ACGACTTCTC TGGCGGCGGT AGAGAGGCAG CTCGTTCATT TTGGCTTACA TTTACTGCAC GCGTATCCAG CTGCCATGAT GACTCCTGAT GGGGACGGTT TCATCCCCTT CGAGCGCACA CGTAAGTAAT ATTGAAACCA AAGCTATTGC GGCAAAGTCT GAACTCTTTA GCTAACACAA GTTTGATGCG ATCCGATTGA AAAATGAAAC CCTTCAAGTC TCTGAATGGA TCGATCAAGT TCATCGAGAG CCAACTTGCA ATCCTCGTAA GCCACAGCGC AATCAGTTCC AGTCAGCGAC TGACTTGCTT CCAACACAAA TAACTTCTTT CCTCGCATCT AGTACCACGC GCTGGGCATC GCAGCCGAAA AAGTCATTGG ACCTCTCTTA CAAAATTCGG GACGAAACGT CGTCGGACGG ATCCTTGGAC GTAGAAAGCG GAGTAACAAC TAAGGTGCTT CCCAAACTCG CGACTGGAAG CAAGACACAA GAGAAGACGT TGCATAATCG ATCTCGGCAG TTCCCGGATA AAGTGCAGTT GACTGCGCAA GCTCAATTTG CCTTTTTGAT GCTGTCTTCC TTTCTCGATC ATTTGGATCG CCGTCTCGAA GAAACTACCT TTCTCAAGAC CCGTCGCTCC AGCCTGGCTA CTTCAACACG TCCTTTAACC GGAACTCCTT CTGCTCTCCC CGACGTGGGC GCTTTAGGGG CTGAAGATTC 
CTTCCATGAA GCAATGGACG AACTCCGTCA CATGACTGTC GCCGATATTC GATCCGAGAT CGTGCGAAGC ATTGCATCTA TACCCGATCT GGTCAAGACA CTCCTCATCA TGGAAAACGA TACCCAGCGT GCGTTTTGTT TCTCAACCAC ACTAATGCAC CATGTGCTTG CGTCCAAGTA CTCTGTTGGT CCATGGCTAC CGGCTATGCT GCAGCATAGT GAGGAGCGAA TCTCGGATTT GGCAGTCGGC TATCTAAAGC TAGTCTCGGA ACGGGATATC GAGGATGGCA TTCCACGAAT GCAATGGGGA ACTAGTGTCC GCGCGTTTCG AAACTCTGGT GGAAAAGTTG AAAGGCACGG AAAGATGGAT CTACACGATG AAGTCAGTCG CCTCGAAGAC TTTGTCCCAT CGCTTCTATC TCTAGGGGAG CGCCAAATGG AAGAAGCTGC AACAACCAAG CTCGTTCGGA AGGTTATGGA TTGCATCATC TCTCGTCCGT TCGCCGTAAC TGTTGTTTTC TGCGACGCCC TGTTTCTCGC CATCCTACTT GTTGGGTTTC GTGGAGCCGT CAATTCTCTC TTGCTAGGCG GCGTCTCGAG TACCGTCACC CGACACCTAT ATTTGGCCAA CGCTGGCATC TTCTACTTCG TTGTTAGGGA GCTGGGAAAA GTTGTTAGCC TGTGTATGAT CACACGCCGC GCACGAGTCT ATTTCCAAAG TTTTTGGAAT TTGGCGGACT TGCTGTCAAC GGTCTTGGCT TTGACAAGTA CTATTGCGAT GCGTGCTGCG TTAGGACCTA CTGAAACAAA TATGGCAGGC GATACCAACC TGCGCAATTT GCTGGCGATC ACCACAGGCT TTTTGTGGCT TCGCGTATTA AATTACTTGA AAGGCATCAA CATGCAGCTT GCGACATTTG TTTTGGCGAT TCTTCAGGTA AGTTCAAAGT GTTGGCAAGG GACCTCCAAT CTTGCGCGCT TCTATGCATA GACTGACTAA AACTAAACTG TTTTTTAAAT TCAGATTACG CGAGACATTT TGTGGTTCTG CGTGATCCTG TTGACGCTCA TCGTTTCCTT CGCGCAAATG TTCTTCACGC TCTTGGCCCC GGATACGTGT GTGGCTGGAG ACATCCTGAG CAATAAGAAG GAATGTACTC AGTCTGAGTA CTACCTCAAA GTCTACGCCA TCCTACTGGG CGACTTCGGA ACGTTTGAGC GTGAAAGCTT TACTTCTATG TTTTCGGTGT TTCTGGTTGT GTTTTACTCC TTCATGGTGG TGCTGATTCT GCTCAATATT CTCATCGCTG TTGCATCGGA TAGCTACGAA AAATGTTTAA TCCGATCCCA GAGCTTGTTT GGACGTGCCC GGGTCATGAT GATTGCAGAG CTTGTTTCAT TTCAGAATTT ATTGCGCAAG AATCCACATG TACTGTTGGA CTCGTCGTCA ACGGGTCCGA CCGCGCCTAT CTATAGGACC TGGTTATCTG GCAATTCATG GGCAAACGGG TGGAGTCGAG GATCCATGAC GTTTTTCCTC TTATCGTCAA CTGTCGTGCT CGTTTGGGGT ATCGCAGAGA CTGCTGGATA CGCAACAGGG AACCGTAATG TAAACGTGTG GATGAGCATA TCGTCCATCG CGATCAACGT CGCTCTATTT GTCGGCATTG TTATCTTTCT TTCAACCGGT GGTACTGAGA TTGGCCCAAC ACAGCTTGAA GGCTACCTTC AATTTGTCAT GCTACGGCTT CTTGGGTCGA CCAAAGACAC TACAAATACG GTGACTGGCT CTGACCACGA TGGTTGGCAG 
GGCCGGCTCG TGTACATCAA GACAGAGATT AAACGCCTTT CGGACGAAGC CAAATCTGAG ACAGCTGCAT CAACTAAATC TCTAGTAAAC CTCATTGGCA CTACGGAGGG ACGGCTTCAA GATGAACTGT CTCGCGTAGA AATCGGCTTA AACGAGCTGA AACAACTTCT GCAGCCAGGG CAATTGCCAT CCTATCAACC GAACGAAAAG AAGACAACAA TTGCTAATAT TCACAACACC ATGAAAGACC TCGTCGAAAC CCTGCATCGG ATGGAGACTG CGGAAGAAGT GAAGGGAGGA CAAAGTTGA
|
Protein sequence | MSVEGYFCFR PLWRDVQTEF KEESRYTAKL PLWSEISCSI LVPIDMPRNN QLQQERCQRH QGLSSDGEDR EGTDAFNSPA AAFQHWSSQL RRRSPLLSTD KMFSTEENGD SDGENRDRST KKTSMDESSG SNGIVAARKV DFSPRLHSHR SSTNKSASFP SKKDLNVFSV SRKDPAMTCS GPNHPKSSPI SSKMTDNSPA RRRRAVHGLR PPSTPHTPRS PQPSRPTSVL QSWTTHQPDP DFWPANESMD VLLATASVDS SGIHKAQSQD LPLHSAITQA TCIHELWNAT VRDIVTVDSA ASQNDEGQTP IHLFAENAAL ADQLILYHNP TAAQTPPDAD DMLHVTRSGL DSDAPQSDDP TTSLAAVERQ LVHFGLHLLH AYPAAMMTPD GDGFIPFERT LSEWIDQVHR EPTCNPRKPQ RNQFQSATDL LPTQITSFLA SSTTRWASQP KKSLDLSYKI RDETSSDGSL DVESGVTTKV LPKLATGSKT QEKTLHNRSR QFPDKVQLTA QAQFAFLMLS SFLDHLDRRL EETTFLKTRR SSLATSTRPL TGTPSALPDV GALGAEDSFH EAMDELRHMT VADIRSEIVR SIASIPDLVK TLLIMENDTQ RAFCFSTTLM HHVLASKYSV GPWLPAMLQH SEERISDLAV GYLKLVSERD IEDGIPRMQW GTSVRAFRNS GGKVERHGKM DLHDEVSRLE DFVPSLLSLG ERQMEEAATT KLVRKVMDCI ISRPFAVTVV FCDALFLAIL LVGFRGAVNS LLLGGVSSTV TRHLYLANAG IFYFVVRELG KVVSLCMITR RARVYFQSFW NLADLLSTVL ALTSTIAMRA ALGPTETNMA GDTNLRNLLA ITTGFLWLRV LNYLKGINMQ LATFVLAILQ ITRDILWFCV ILLTLIVSFA QMFFTLLAPD TCVAGDILSN KKECTQSEYY LKVYAILLGD FGTFERESFT SMFSVFLVVF YSFMVVLILL NILIAVASDS YEKCLIRSQS LFGRARVMMI AELVSFQNLL RKNPHVLLDS SSTGPTAPIY RTWLSGNSWA NGWSRGSMTF FLLSSTVVLV WGIAETAGYA TGNRNVNVWM SISSIAINVA LFVGIVIFLS TGGTEIGPTQ LEGYLQFVML RLLGSTKDTT NTVTGSDHDG WQGRLVYIKT EIKRLSDEAK SETAASTKSL VNLIGTTEGR LQDELSRVEI GLNELKQLLQ PGQLPSYQPN EKKTTIANIH NTMKDLVETL HRMETAEEVK GGQS