Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50602 |
Symbol | |
ID | 7199440 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011700 |
Strand | + |
Start bp | 44608 |
End bp | 49229 |
Gene Length | 4622 bp |
Protein Length | 1255 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185556 |
Protein GI | 219130826 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.94407 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCTTA GAGCGTTTCT ATGGAAGGCC CGGCTCTTTG TTTTATCCTT CTTTAATGTC GTTAGTCGTT TCATAGTCTC TTTTTATTCC AGTCGCGTTT TTGTATCTAG GCATCAATGT CCAGTTACTA TATAGTCCCG GAAAGGCAAT TGAACTGACA CGTGAGCATT TTTGGGGCGG GCAACCCAAC GACAGTCCCG TTGTTTGTTT TGCGGGGGCA GCCCGAAAGA GTTTGGACCC CCAACACGAC CGTGAACTAT CAGTCCCCAC AATACACATT CATGAACTAC CTAGCCAATC TGATAATAGA ATCTGCGCGG CGGTCCAGTT GTGCAATCTA GAATAGAGGT TTTTCCCAGG AGTGTCGTCT TTCCAAGTCG CACCGGCAAA TACTGTCTCG GCGCAGCCAG ACTTTTCTGG AGCACCTTTG GTAAAATTAG TGAAGAACCG GTACGTGAGA CTCGGAAGGC AGGAATGCGT ATCCACGAGA CTTAGTGTGT CTCGGTTGAC TCCAACCGTG TGTGCAACGT TTCGATTCTC GGGTCAGTAT CTCCGCATAC ACTATCTACC AGATCGATCG GCTGTCGTAG TAGTAGTTAC CTCCACCGCT TGCCCTGCGG CTTACTCGCT GCGTTCCTGG AGAGCAGCCG CGGAACTGTG AGTCTCTACC TTATTGTGTT TGGCGTGTCT CACCTAGCAA ACAATTTTAC TCAAGCATGT TCAGTCTCTT CTTTTGTTGA CCCAGCGTAG CAACGCCATG TCTTCTGCTG AGCAGAAACT TTCATCCGAC TCGTTTTCTG TCTCAAACTT GAGTCAGCTG ACCGAAGAAG TTGATAGTCT GGATGGTAGC TATGAAACCA GCCGCTTTCT GAGAAAAATG CCCAGCCAAT CTCGCTCGTA TCCTGCCTCG TTTTCGAACA ACCAACGGCA CTCTCGCCTG CCCTTTTCGA CTATTGGTCT TTACGGAAGG GATAGTCACA AAGAAACTCT GCAGGCTAGG TTTCGAGATC AAGCCTTGTC GGTCGAAGGA GGATCCACGT GTTTGGTTAC ATTGGCTGGT CCAGCGGGGG CGGGAAAATC TGCCCTTGCC TCGCAAATGG AGCCGTTCTG TGCCGACTAT GGCGGCTTTT TCGTGTCAGG AAAATTCGAC GCTCATCGTG TTGGCGAAAC TGATCAGGAT CCTTACGCTG CGATCCGTGT GGCTTGTAAT CAGCTAGCGG AAAAAATATT GCGCTGTGAT CAGGACCTAC TCTCTGAGCG TTGTGACACC GACTCTCAGC CGCTCTCTAG AACATCCTCC ACGGTTGTTC AGCCATCGAC ACATTCTTCA AAAAGCATCA GCCAACAAGA GGAGCAGATA GCTGCCCGAA ATTTTTCCTT TACAGATATT CAGTCCAAGC TTCAACAAGA ATTAGGTGAC GAAGACTTGA CTTTGCTACT GCGTGTTGTT CCAGTGCTGG CTGAAGTGCT CGGACTGAAT CGCAGCACCA TTGTTGCGCT GGACCGCCCT TGGGACCAAG GAACTGTAGA AGCCACTGAC GTCCAAGCGC AAAAACACCG CCTGTATAGT TCATTTCGCC ATTTTTTTCG CACTGTTGGA AGTTTTGGAA CGGTCCTCTT GGTTTTGGAC GATGTGCAGT GGGCCGATGC TGCCAGCTTG GAATTAATCG AATTTTTGGT CGCTACTGAA GGTCCCACGC GGGAAATGAG GCAGTCGACT CGGCTGATGC TCATAGTATC TTACCGAGAC AACGAAGTTG ACGAGGATCA TCGTTTTGCA CAAATGATGG ATCGGTTGAA GAAACGGGAT CTTGACGACG TAAATAAAAA TGGCAAAGCA GTATCTCATA TCGAAGAAAT TTCAGTGGAT AATCTGTCCG TCGCGGAAGT CACAGAGTTC TTGGCGGATT TGATGTGGAT TGAAGAGCAT CGAGTTCGTC CATTGGCGGA GCTTGTCCAC GGAAAGACTT TGGGCAACGC TTTTTATCTT ATCCAGTTCT TGACCTCCCT TGTGGATGAA GACATTCTAC AGGCAGCACC TGGAAGTCGC TCCTGGTCGT GGGATATGGA ACGTGCTGCC ATGCACATTG ACGCTACAGC TAATGTTGTC GATCTCATGC GTGTGAAACT AAACAAAATG CCTCGCGATA CTCGTTCCAT CTTAAAGTTA CTTTCATGTT TGGGATCCAC CTTTGCACTC AGCATAGTTC GTATTATTAT GGACGATCGT TTCAGGAACC AAATTCATTT GATTTGGGAG AGTCGGCTCC TCGAGAGGCC CCATCGACTG GTCGTAAAGC ACTTGAATGT CTCGAACATT GCGTGATGGA AGGATGGATT GTGCGCTTAC GACGGAATTT GTACCTTTGG GTGCACGATA AGATTCAGGA AGCTGTGCTC TCGCTGATTG AACCCGCATA TTTGCCTACG TTGCAACGTC GGATTGGTGA ATTATTACTG GACAAGCTTT CCGACTCGGA GCTCGAAGGC AATGCATTTG TTGTGGCTAA TCTGTTAAAT GCTGGTATTC AAGGAATGGC TCTGTCAGCG GCCCGGCGTA TTCAGACTGC AAAAACAAAT CTTGCTGCTG CGAAAAGGGC CATAGCTTCG GCATCGCTTT CGTCGGCCGT GCGGTATCTA GAAAGAGGCA TTATGGTAGT TCCCGAAGAT CACTGGTCTA CCCACTACAA GTTCAGCCTA GATTTGTTCT CGACTGCTGC TGAAGCCGAG TATTGTATTG GAAATTTCAA TAGAGTCCAG AGGTATTCCG GTCAAATACT GGCTCAGACC CAACGTCCAT TGTTGGATCA TAGACGAGCG TACAATGCAC TGATGGATGC AATTGGTTCG CAAGATCAAC ATCTTGAGGC TGCGAATTTA TGTTTGGAGG TACTTGAAAA GCTTGGCTGC CCTTTCCCAA AACGTGGTGT CGGACTCAAG ATCGCTTTTG GATTTTTAAA AGCAAAAGTT TCTGTCAAAA CTATATCGTC CGAGCTTGTG TCGAAAATGC CTGTTATGAC AAAAGATCTG GAGCTCTGGG TGATGTCATT GTTGGATAAA TTATTTACGT TTTTGTACCT AGCCGGATCT GAGATGTTAC CGCTTTCGGT TTTGAAAAGC CTTCAATGGA CTCGCAAAAA GGGTGTTAGT GAGTTTTCTC CTTCGGCTTT TGCTCGACTG GGCATGAGTT TTACAGCTTT TCTTGGCGAC CCCCAAACGG GAAAAGAGTT TGCGGAACAT GCTTTGTCGT TGCTTAATAA AGTGAAGTCC GGCAAGGCTG AATCTCGAAC ACTTATGCTT GTCCATTCCT TCGTCATGCC ATGGTCATGC CCATTGAAGA AAACACTTGA GCCTTTGTTT CGGGCTTACG AAACAGGAAT GGCTACCGGA TTTTCAGAAA ATCCGATGTG GTGTATTTAT TTTATGAAAG AGCACAGCAT CCATTTAGGG ATTCCGATTA GCAAGGTTTT AGCCGATTTT CCCAGTCTCA TTTCTTGCAT GGACAAATTT CAGCAATTAA GGCAGCGGGA CTGCTGCAAA ATTGTTGGAC AGGCCCTGCT TAATCTGTCT GGACAAGGAA AAACACCGTA TGTTCTGACG GGTGAACTCA TGAATCAGGA TCTGATGTGG AATGCGGCCA TGGACGCCAA TGATGTGGTA GCCATAGCCT CTCTTCAGCG CTGGCGATTG TATGTAGCTT TCTACTTCGA AGAATATCAA ATAATGATGC AGCTACTGGA AGACACAAAT CTGGGGTCCA CAATTGAGAA AGCGCAGCCT GGATTGTACG GGCTTTGTCC GATGATCTTC CATAACGGTC TTGCTTGTAT ATCAATGATG CGGGAAACAG GCAACAACAA ATACACCGTA TGGGCGAGAA GATTCGCCAA TACTATCAAG AAATGGGTAG ACAAAGGCGT GAGTCAATGA CAATAGCGAC GTTTTGGCTA CACTTCTGAA TGGAGACTGA TACTTGTTTC TTGCCTTTCG TAGAATCCTA ACGTTAAGCA CTATGACGCG TTGCTGAATG CAGAATTGGC CGCTCTCTTC GGGAAACATT CCTTGGCCAT GAGATTCTTT GGTTCCGCCA TTCTATTTTC CGGTAGCGGT GGCTTTAAAA ACGACCAAGC CTTAGTATAC GAACGTTTTG GTGAATACAA TCTGAAGCAG GGCAACGTCA ATGAAGCCAA GTACGGACTC GAGCAGGCAA TACAACTGTA TGAAGCATGG GGCGCGCATG GCAAGGTCAA TCTTATCAAA CAGAAGCATA TCTTGCTATT GGCTCCGCCA GTTGAAATTA ATGTTTCGAT CGATGAATAA GGTGGTGATA CCCATTTGCA ATGAGGCTTG AGAAGCGCAG GAGGTGAATT AGCTGTTTGA ATCACTGCCA GCTCTATTCT ACCCTACCAG CAGAGCTTTA TGCATTTACG TAGCCTCTAC CTATAGAGCA ATTGACACGG ACGCTTAAGA GCGTTAGTCG TATCAAGCAC CAAACAATGC TACTTTGCTG ATAGAATGAA GCCAGTCTGT TTTTGCAACC ATTTACCCGA CTACGATTGC TACTGTCAAA ATTCTTGCTG TAAATGCAAG ATTGCAAATT TTAAATTTTT TACTAACATT TC
|
Protein sequence | MNLRAFLWKA RLFVLSFFNS LFIPVAFLYL GINVQLLYSP GKAIELTREH FWGGQPNDSP VVCFAGAARK SLDPQHDPRL FWSTFGKISE EPSLLLLTQR SNAMSSAEQK LSSDSFSVSN LSQLTEEVDS LDGSYETSRF LRKMPSQSRS YPASFSNNQR HSRLPFSTIG LYGRDSHKET LQARFRDQAL SVEGGSTCLV TLAGPAGAGK SALASQMEPF CADYGGFFVS GKFDAHRVGE TDQDPYAAIR VACNQLAEKI LRCDQDLLSE RCDTDSQPLS RTSSTVVQPS THSSKSISQQ EEQIAARNFS FTDIQSKLQQ ELGDEDLTLL LRVVPVLAEV LGLNRSTIVA LDRPWDQGTV EATDVQAQKH RLYSSFRHFF RTVGSFGTVL LVLDDVQWAD AASLELIEFL VATEGPTREM RQSTRLMLIV SYRDNEVDED HRFAQMMDRL KKRDLDDVNK NGKAVSHIEE ISVDNLSVAE VTEFLADLMW IEEHRVRPLA ELVHGKTLGN AFYLIQFLTS LVDEDILQAA PGSRSWSWDM ERAAMHIDAT ANVVDLMRVK LNKMPRDTRS ILKLLSCLGS TFALSIEPNS FDLGESAPRE APSTGRKALE CLEHCVMEGW IVRLRRNLYL WVHDKIQEAV LSLIEPAYLP TLQRRIGELL LDKLSDSELE GNAFVVANLL NAGIQGMALS AARRIQTAKT NLAAAKRAIA SASLSSAVRY LERGIMVVPE DHWSTHYKFS LDLFSTAAEA EYCIGNFNRV QRYSGQILAQ TQRPLLDHRR AYNALMDAIG SQDQHLEAAN LCLEVLEKLG CPFPKRGVGL KIAFGFLKAK VSVKTISSEL VSKMPVMTKD LELWVMSLLD KLFTFLYLAG SEMLPLSVLK SLQWTRKKGV SEFSPSAFAR LGMSFTAFLG DPQTGKEFAE HALSLLNKVK SGKAESRTLM LVHSFVMPWS CPLKKTLEPL FRAYETGMAT GFSENPMWCI YFMKEHSIHL GIPISKVLAD FPSLISCMDK FQQLRQRDCC KIVGQALLNL SGQGKTPYVL TGELMNQDLM WNAAMDANDV VAIASLQRWR LYVAFYFEEY QIMMQLLEDT NLGSTIEKAQ PGLYGLCPMI FHNGLACISM MRETGNNKYT VWARRFANTI KKWVDKGNPN VKHYDALLNA ELAALFGKHS LAMRFFGSAI LFSGSGGFKN DQALVYERFG EYNLKQGNVN EAKYGLEQAI QLYEAWGAHG KVNLIKQKHI LLLAPPVEIN VSIDE
|
| |