Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47010 |
Symbol | |
ID | 7202244 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 169475 |
End bp | 172978 |
Gene Length | 3504 bp |
Protein Length | 1133 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181151 |
Protein GI | 219121600 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.671968 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAAGA TTGCGGAAAT TTCCGATCGC GCGGCCTGTG CAGCGTGGTG CCCACTCCAA GCACATCCCA ACGTTGTAGC TCTCGGAACC AAGGTACGTA CAAAGTACTG TTCTATTGAG ATTCACGGTA AAGAGTAGAT GGCTTCTTGA TCGCATGCTT ACGTAACTTG CTCTGGTTCC TCTGTTCTCG CTCAGGACTC TGGTGGAGGT GGTTTTGACG ACACCGGCGG AGAATTGGAA GTTTACGACT TGTGGAGTAG CAACATCACT TCTGGTGCTG ACGACAACGA CAAGGTTGGC TCTACGCAAC CCAAACTACT GGGATCCATC AAGACCGGAG CCCGATTTGC TAGCGTCGCA TGGACACCGT ACACACGCAA TGGAAAGTAC CCCATGGGAT TGCTCGCGGG CGGGATGGTA GATGGAACGA TTTACAATTG GGATCCTAGT CTAGTGATTG CCGCGTTTTC GCAAAACCAA ACAGCGCAAG CTGAGTTAGC GGCGTGTTTG GTGCACACCG TCCAGGCTCC GCCGGCATCG TCCAACAGCT CGTTCGGAGC CATGCAATTC AATCCACTCG AGCCGCATCA GCTCGCAGCG GGAGCCGCTA ATGGACACGT TTCCATTATC GATATTTCCG CCGATAAAGC GTCGGTGCAG GAACCCACCA CCGTAGCCTC ACAATTTCAA ACCTCCGAGA TTAAGGCCGT CGCCTGGAAT CCTCAAGTAT CCCATATTGT TGCGTCTGCC GCGGCCGATG GTTCCGTTGT CGTTTGGGAC TATAAATCCC GCAAAGCTTG GTGCGAGTTA CGGGCGGCAC ACGCTGCCGC GGCAGTCAAT GACGTGTGCT GGAATCCGTC ACAGGGACTG CATTTGCTCA CCTGCAGCGG AGATGACCGG GATGCCGTGC TCAAAGTATG GGATTTAGGA GCCAGTACGT CCATGCCACT CACGACTTTG ACGGGACACG CCGCTGGAAT TTTGAGCGCG GGCTGGTGTC CTCACGACGA AACATTGTTG CTCACGACGG CCAAGGATAA TCGTACGTTG TTGTGGGATT TGCAAACACT CCGACCCGTA GCGGAACTCC CGAATGCAGC GGCGGAAGCA GAATTGCAAC AGAAGCAGCA ACAACAAACA CAGCAAGCTC ATCCCACAAA CGTCTTTGCA GCCGCTGGTG CTGGTTCAGC CGGGGGTATG TCCGATCAAC GACACATGCG TTACGCCGTC GCTTGGTCTC CGTGGAAACG TGGTGTTGTC CTGACGTGTA GCTTGGATCG TAAAGTCGAG GCGCATTCCG TTTTGACCCT GGCCACCAAG TCGGGACGAC CGCCGGCCTG GATGCGACCA GCCTCGAGTG TATCATGCTC CTTTGGAGGT CTCGTCGTGT CCTGCGGAAG TCAGGATAAG GTTGTTTCGT TGCGAACCGT GTGTGAGCAA CCCGAATTCG CGGAAACGTC ACAACGACTA GAAACGGAAC TCAAGTCCCA AACCATTGTG GACTTTTGTC GATTCCGTCA TGTGACGGCA GCGAGAAACA AGACGGAAGC CGATATGTGG GGATTTATGC AAGTCCTCTT TGACGCCAAC GCGCGTCAAG CTCTTTTGCA GCATTTAGGA TACGATGCGG ATAGTATTGC GTCCACGGTG GCGGCCCGGT ACCCCCATGT AGCAACTACC ACCGAGCACG GAGAGTCATC CGTTAATGGT AGTAGCAACA AGAGCACCAC TGGGCTCCCC AAGCCGCCTT CCATGAAAAC CGCGTCAAGC ATGCTAGCAA AGTCTACCTT TGACCAGGCC GCCGAAGATT CCGTGAAGCA AGCCTTGCTT GTGGGGAACT TTGAGGCTGC CGTAGACGTT TGCTTGGCCA CCGGAAATTG GGCTGACGCA CTGGTCTTGG CCAGCTGCGG GGGTCCCGAT CTCTGGCAGG TTGTACAAAC GCGCTTCTTT GCTTCGGAAA CTGCACAGCG TCCATATTTG AACATTGTCA GCGCCGTAAT TCGTTCTCAA CTCTCCGATC TCGCCACTAG TCCGGACATA GCCACAAAGT GGTCGGAAAC GCTGGCCATT TTTTCGACCT ACGGTGCGTC GGAGGAATTC CCACAGCTCT GCGTGGCGCT TGGTGAATCT TTGGAAGAAG CGGGCGACCC GGCCAGTGCT ACACTCTGCT ACATGTGTGC TCTGAATTTG GATCACACAG TGAGGTTCTG GAAGTCACAA TTGGCGGAAG CTAGTCGCCA GAAAGGTGAG GGAACGAGTG ATGTCTTGGC ACTGCACGAA TTTGTGACGA AAGTGTCCGT CTTTTTGGAA GCGGTAGGGC CGTCCGCGAT TCTGTCGGAG GATATTGCAA CTTTGTTTGC CGATTATGCC GAGGTGTTGG CACAACAAGG TCTTTTGGTT ACGGCGGCCA AATATGCCAG AAAGGGGACT TCGATCGAAA GCCAGAAGCT GCGCGATCGT CTCTACAGGA GCCGCGCAAG TGCTTCTTGT TACGCGTCCT TAGGTACTGC ACCAGAATTT CCCTACCGCA TGTCAGCTGT GGAACCCAGT CGAGGACCGA CGGTTGTCAA GAATGCGTCT TATGCGCAGC ACCAACCGGA TGCAATTCAT CTGAATAAGC CGCAACCGAA AACCACCTAC GAACAGCGTC AGCAGTCCTC GATATACGGA CGGCAACATG AACGACGACA GACGTCATCT GGAGCACCAG TCCCAGCTGC AGTCATCGAG CTTCCTAGTG GATGGATGGA GCTTCATGAC CCTAACAGTA GACTCCCATA CTACGCAAAC CAGGCCACAG GGGAAACTAC GTGGGATCGT CCACAAGCAA TCCCGTCAAC AAATTCTCAG ACAGTGCCGC AGCAGACGTA CGCTAGTGTA CCAGCTAGTC AAGAGCCGGT GATGGATTCG TCTCGACACT CAACGCGTTC TCAGACCTCG GTCACTTCTG GCGTTTCAAC GCTGAGACCC AAGCCTAGTG TTGTCTCCAA ATACGGCGAC GGTTTTGTGA CGTCCGCGTC GCATCCAGAA CTTGCCGATC AATACGGCAA CATTGGCACC AGCAATCCCT ATAGCGGTGT AAATCGACCC GGTACAGCCG CAGCTGTGGC CGCGACAGCC CAGTCCGCTG TAGCTCCAAT ATCAGACACC CTCAACATTG AAACCCTGGA GGTCTCACCC GAGTTTGCTC ACATCAAGGA CACGCTGCTG GCGTGTGTCA ATGCCTTGAA AGACTATCCG TTATCTCCCG CTGACAAGCG ACAGTTGGCG GAAGCCGAAA AGGGAGTTGC TATCCTCGTC AAAAAAATTG TCCGCTATGA TATCGACGAA GAGACAGTCT CAAAGGTCTC ATTTATGATC GGCGCTCTGG CCAACGGTGA CTACCCCGCA GCGACGGCGA CCAATACCGC ACTAGTTAAT AGCGATTGGC GCGACCACAA GGACTGGCTT AAAGGGATGA AATCGTTGCT AGCGTTAGCG TCGAAGAAGT TTGCTCGTCA GTAG
|
Protein sequence | MTKIAEISDR AACAAWCPLQ AHPNVVALGT KDSGGGGFDD TGGELEVYDL WSSNITSGAD DNDKVGSTQP KLLGSIKTGA RFASVAWTPY TRNGKYPMGL LAGGMVDGTI YNWDPSLVIA AFSQNQTAQA ELAACLVHTV QAPPASSNSS FGAMQFNPLE PHQLAAGAAN GHVSIIDISA DKASVQEPTT VASQFQTSEI KAVAWNPQVS HIVASAAADG SVVVWDYKSR KAWCELRAAH AAAAVNDVCW NPSQGLHLLT CSGDDRDAVL KVWDLGASTS MPLTTLTGHA AGILSAGWCP HDETLLLTTA KDNRTLLWDL QTLRPVAELP NAAAEAELQQ KQQQQTQQAH PTNVFAAAGA GSAGGMSDQR HMRYAVAWSP WKRGVVLTCS LDRKVEAHSV LTLATKSGRP PAWMRPASSV SCSFGGLVVS CGSQDKVVSL RTVCEQPEFA ETSQRLETEL KSQTIVDFCR FRHVTAARNK TEADMWGFMQ VLFDANARQA LLQHLGYDAD SIASTVAARY PHVATTTEHG ESSVNGSSNK STTGLPKPPS MKTASSMLAK STFDQAAEDS VKQALLVGNF EAAVDVCLAT GNWADALVLA SCGGPDLWQV VQTRFFASET AQRPYLNIVS AVIRSQLSDL ATSPDIATKW SETLAIFSTY GASEEFPQLC VALGESLEEA GDPASATLCY MCALNLDHTV RFWKSQLAEA SRQKGEGTSD VLALHEFVTK VSVFLEAVGP SAILSEDIAT LFADYAEVLA QQGLLVTAAK YARKGTSIES QKLRDRLYRS RASASCYASL GTAPEFPYRM SAVEPSRGPT VVKNASYAQH QPDAIHLNKP QPKTTYEQRQ QSSIYGRQHE RRQTSSGAPV PAAVIELPSG WMELHDPNSR LPYYANQATG ETTWDRPQAI PSTNSQTVPQ QTYASVPASQ EPVMDSSRHS TRSQTSVTSG VSTLRPKPSV VSKYGDGFVT SASHPELADQ YGNIGTSNPY SGVNRPGTAA AVAATAQSAV APISDTLNIE TLEVSPEFAH IKDTLLACVN ALKDYPLSPA DKRQLAEAEK GVAILVKKIV RYDIDEETVS KVSFMIGALA NGDYPAATAT NTALVNSDWR DHKDWLKGMK SLLALASKKF ARQ
|
| |