Gene Information |
Locus tag | PHATRDRAFT_35923 |
Symbol | |
ID | 7201229 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 91941 |
End bp | 97607 |
Gene Length | 5667 bp |
Protein Length | 1289 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180569 |
Protein GI | 219119627 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
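The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the record uses 1-based, inclusive coordinates, which is a convention the record itself does not state. The function names `span_length` and `coding_length` are hypothetical helpers for illustration, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


# Values copied from the Gene Information table.
gene_len = span_length(91941, 97607)   # Start bp, End bp
cds_len = coding_length(1289)          # Protein Length in aa

print(gene_len)            # 5667, matching the "Gene Length" field
print(cds_len)             # 3870
print(gene_len - cds_len)  # 1797 bp of the gene span not needed to encode the protein
```

Under these assumptions the gene span is longer than the sequence required to encode a 1289 aa protein, which is consistent with the "Is gene spliced | Yes" field, though the arithmetic alone does not show where any introns lie.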
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.584547 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCCTC TCGAGCATGT CCTTGTGAAC CTTTTGGGAG CGACAACGCT GGATTCGTCG TACCGTCGGT TCTTTGAAGA GTATGGCATT ACTCAGGCCA GTGAATTGGC CTCAATCACT GAACATCGTC TTGCAACGGT GTCTTACGGC GTCTTGACCC CTGCTGTGGG AGACGGCCCT GCTGCAATTG TTCGTACATT CCTTCCGCCT GCGCAACAGG ACCGGATTTT GAAGATTGTA CAATGGTTCC TTTCGAAAGG CACCAATGTG ACAAACGACA CCTGGCTTGA ACTTACCTCT GATGTTCTCG AGTATTGGCA ACCCGCCTCT GCTACTGTTG CCCCAGCTAC TCCTGTTGGA TCGGATGCTC GAAGTTCCTT TGTTGAAAGT GCTGCCGCGA AATTTCGGAA GACGATCAAA AATCATTCCG TCCCGTATCC AAAGTTCAGT GAAGACCGTT TTTGGGTCAC TTGGAATACG AATATTCGTA TCAAGCTCCG TATTCATGGT GTTCAGTTGG TACTTGACCC GGATTATTTG CCTGAGACCG TCGACGAGAC GGATACGTTT GTTGAAATGC AGAACTTTGT CTTTGGCGTT TTCAACGATA TTTTGTTGAC CCCTCGTGCG CGTGGAATCC TCCACAAGCA TGTGGATGAG CTGGATGCTC AGTCGGTCTA CCGCGACCTT GTTGCCTCGT ACGGCAAAGG TATTAACGCG CAGATCACGG CCACATCCAT TGAAACGAAA CTTACTTTGT ATTCATTTGC GACTTCAAAG AGCAAGACCT GTGTTGCTTT TTTGACGACC TGGCGCAATT TGATTTACGA TCTTGAACGG ATCAACGAGT TCCCTTTGCC GGATCACCAG AAGAGCGTAC GACTGAAGTC AGCTGTCCGT TCCCATCCGC AATTGAAACT TTTCCTCGGA AATGTTCAAC TTTACTCTCG TACCCATGTG GGTAAGAGTG CTGATGATTC CGATTTCGAG TATGTTTATG ATTTGATGCT TGAACATGCA ACTGATATTG ATCAGACCGA TTTGGAAGAC CGCGGTAACA ACCGTGGTGG ACGCTCAGCA AACAATGCGA AGTTCCAGTC TTCTTCCAAG AAGAAAACTA ACAAACCGAT TGGTAAGAAG CACAAGAATT ATGTGCCTCC TGAGAAGTGG AATGCTCTCT CTCCCGAAGA GAAGCGGACC ATTATGGATC AACGAGGACC TCGCCCTGCT CCAGCCCCTG CCCCTGCCTT ATCGGTGAAC GCCGCTGCCA CTCAGCCCCC TCCTACGGTG TATGTCAGTG ACTCGACGGC TGTTGACAAC CAAAGCCTTG CTTCGACCCA CGTCCCACCT GCTGCTGGAC CTGGTCACCT GCTTCGTTCG CTCATTTCGA ATTCAGCTGC CCGCCAGCAC TCTGCCCCAT CGAATGGAGC CACGTCTGAC TCTTTTTCGG TCAATGGGAC CACCTATCGC CGCGAAGTGA ACCGTGCTTC TGTGCAGTAC CGTCTTTCCA CTCACGATGT TTCGTTGAAT AAGGACTCTT TGGTCGATGG TGGTGCCAAC GGTGGCCTTA GCGGCTCAGA CGTAACCGTT ATTTCGCAAT CCCTGTCAGA GGCAACTGTC TCTGGAATTG GAAATTCGGA ATTGACCAAC CTCCGTTTGT CAACAGTGGC CGGACTCATT CACACGACGG ATGGTCCCAT TATTGGTGTG TTTCACCAGT ATGCTCATCT TGGTACTGGT AATACCATCC ACTCGTGCAA CCAAATGCGC TCCTGGGGAG TCACGGTTGA CGACGTCCCT 
CGTACTTTTG GTGGCAAACA GCGTATTGTC ACGTCCGATG GTCGTTTTGT CATCCCGCTT TCGGTTTCTG GCGGACTCAC TTACTTGTCT ATGCAGGCCC CTACCGAGGA GGACCTGGAC ACTTTCGAAT GGGTGCCTTT TACCGCTGAC AACGAGTGGG ATCCAAATAG TCTCTCTTCT CCTGCCGCTG CCGACGATGA CCTCAGTTTG CAGCTTCCTG TCGGCCATGT TCCGTTCCGT GACGAACGCA TCAACAACTT TGGTCTCCTT GCGCATTCCG CGGCAGTCAG TCGATCCCCT TTGAATGTCG ATGCTTTGCA ACCCAATTTT GGATGGGTTC CCAGTGCTCG TATCGCTCGT ACGTTTGAAA ATACCACGCA ATTTGCTCGT GCCGATGCCC GTTTGCCCTT GCGCAAACAC TTCAAATCGC GTTTCCCTGC TGCCAATGTC TCTCGTCTGA ACGAAATTGT GGCAACCGAT ACTTTTTTCT CGGATACCCC TGCGGCCGAT GACGGCATTT TTAACCATGG TGGGGCTACG ATGGCCCAAC TTTTCGTTGG AAAAAGTTCG CAAATCACCT CTGTCTTCCC GATGAAGCGC GAGTCTCAGT TTGCCCATAC TTTCGAGGAT TTTATCCGTA CCCATGGTGC TCCCGATGCC CTCCTCAGCG ACAATGCCCG TGCTCAGATC GGTAAGCAGG CACTTCAGAT CTTGCGCATG TATGCGATCG ACGACATGCA GTGCGAGCCG CATCATCAGC ACCAAAATTA CGCGGAACGC CGCATTCAAG AGGTGAAAAA GATGGTGAAC ACAATCATGG ATCGTACAAA CACTCCTCCT GAATATTGGT TGCTCTGCTT ATTTTATGTG ACCTACTTGC TCAATCGCCT CTCTGTCGAA AGCTTGAATT GGCGTACCCC GCTTCAGGTT GCCCATGGAC AGCGTCCCGA TATTTCTGCT TTGCTCCTTT TTCGTTGGTT TGAGCCCGTT TATTATTACG ACCCTGACCA TGCGTCTTTC CCATCGCATT CTCGCGAGAA AACTGGTCGT TGGATTGGTG TCGCCGAACA TAAAGGTGAT GCGCTGACTT ATTGGATTTT GACAGACAAT ACTCACCAGG CCATTGCTCG TTCTGTTGTT CGTCCAGCCA ATGTCGATAA TGGTTTGAAA AACCATCGTG CTGCGGATTC CTCTCCCGAT GGTGGGGAGC CCTCGAATCC TAAGCCCATT GTCTTGGCTA CGAGTGACCT ACGCCATGAC GCTACGATTG ATCCATCTTT TGAGAAATCC CATGCATTCT CTCCTGACGA ATTGATCGGC AGATATTTGA TTCGTGAAGC CCCTGACGGC CAGAGCCATC GAGCCCTTGT TGCTCGTAAA ATTATTGATG CCGACTCCGA TAACCACCAG GCAATCCGCT TCTTGTTGCA AATTGATGAA AAGGATGCTG ACGAGATCAT TTCGTACAAT GAACTCTCCG ATTTGATGGA AGCCCAACAA TCAGAGCCCG CTACGAACGG AAATATCGAA GATCATTTCA AGTTTACTAG TATTATTGGA CACCAAGGCC CTTTGCAACC GACCGATGCG GGCTACAAGG GATCCTCTTG GAATGTTTTG GTTCAATGGG AAGATGGTTC CCAGTCGTAC GAACCTCTAA TTGAAATGGC AAAGGACGAT CCAGTCACAC TCGCGATGTA CGCGTCTGAC AACGATCTTC TTAACGTGCC CGGGTGGCGC CGCTTCAATC GTCTGCTTCG CAACCGTGAT GACTTCAATC GATCTGTTTC GTTAGTGAAA CAACGCAAGG 
GAGACCCAAT TTTCAAGTTT GGCGTCCTTG TCCCTCGTAA TTCTCGTGAA GCCCTGAAAT TTGACGCTAA CGCCGGTAAT ACTCGTTGGG CCGATGCCAT GGAGCTCGAA CTGGCTCAGC ACCGTGAGTA TAAAACGTAC AAGGACCTCG GTCAGGGCGC AGCCAAACCT GGCCCTGGAT ACCAGCGGAT CAATGTTCAT TTCGTGTTTG ATGTGAAGCA GTCACTCAAA TACAAGGCCC GGCTCGTCGC CGGTGGACAC ATGACGGCGC CTCCGAAGGA CAGTGTTTAT TCCGGTGTTG TATCCCTTCG GTCCATTCGT CTCGCTCTTC TTGCTGGTGA GGTAAATGGA CTGGAAACGT GGGTTGGTGA CATTGCGGTC GCCTACTTAG AGGCATACAC GAAGGAACAG GTGTATTTTG TTGCCGGTCC TGAGTTTGGG GAGCTTTCTG GGCATACCTT GCTCATTGAC AAGGCCTTGT ATGGCCTACG TACTAGTGGA GCCCGTTTCC ATGAGCGTCT CTCGGACTCC CTCCGTACGA TGAATTTTAT TCCTTGCAAA GCGGACCCCG ACCTCTGGAT GCGTGACTGC GATGACCATT GGGAGTATGT ATGCGTATAC GTCGACGATA TTGCATGCGT CTCACGAAAC CCCAAGGCCT TTTTTGATTC CCTTGTTTCA GACCATCATT ACACTCTGAA GGGTGTAGGC CCTCCAACTT ACTTTCTTGG TGGTGATTTT ACTCGCGACA GTAAGGACAA TACTCTCGCA GTTGGTGCGA AAACATACGT GAAACGTATT ATTTCGAACT ACAAGACCTC CTTTGGTACG GAGCCCAAGT TGTACTCGTC TCCTCTCGAG AACGGTGACC ATCCGGAAGT CGACGCTTCG CAACTCCTTG ATTTCGGTTC GATAAAATTG TATCAGTCCC TCATCGGTGC CCTCCAGTGG GCCATCACTC TCGGGCGCTT TGACATACAG TGTGCCGTCA TGACGATGGG ACGCTTTCGT GCTGCCCCTC GTGAAGGACA TTTGAATCGT CTCCGTCGTA TCATCGGATA CCTCCGTCGA TACCCTGATG CTGCCATTCG TTTTCGTACT GGAATTCCCA ACCATGAGGC TCGGGGAGAC GTCCCTCAAC ACGACTGGAT GTACTCAGTG TACGGTAAAT CGAGTGACGA GGACTCTCCG ACAGGCGTTC CTCCTGCTCG TGGTAAGCCC ATGCGCATTA CTACCTTTGT TGATGCTAAC TTGTACCATG ATTTGACTAC AGGCCGTGCC ACAACTGGTG TCTTGCATTT GGTTAATCAG ACTCCTGTGT CTTGGTTTTC CAAACGGCAG TCTACAGTTG AAACGGCTAC GTACGGTTCA GAGTTTGTTG CTGCTCGTTT GGCTACAGAA CAAATTATTG ACATGCGTCT TACGCTACGT ATGATGGGTA TTCCTTTGGA CGGTCCCGCT TGGCTCCTTG GCGACAATCA GAGCATTGTT ACCAGCTCTA CGCTTCCTCA TTCCGTTCTT TCTAAACGAC ACAATGCCCT TGCGTACCAC CGCGTCCGTG AGGCGATTGC TTTTGGGATC ATGCATTTTT TATGGATTGA GGGCAAAGAC AACGCTAGCG ATGTGTTGAC CAAACCTCTC GGTCATGCAG TTGCTTGGCC TTTGATTCAG CCTTTGTTGT TTTGGAAAGG TGAGACGAAA TCTCCCGCTT GTTCTGTCTC GAGTACCTCA CAGTGGGGAG TGTCAACTGG AATGACAGTT TTGAATACCG GTTCGCGCGA TGGCGCGCAC TACTTGGGTC GTGGCACCCA 
AGGCGAGCCT GTCGAAACTA ATCTTGTGTG TTTGGCCACA AATCCCGACA CGGATGGTTT TGGACAGGCG CAATCTCAGT TGAGTATTCA GAAACATGAT GTGACTGTTT CGGGACTCGA TCGTTACCAT GCTAAGAAGC ATGGTAACTA TGATGAATGT GCCCACGGTG CCGTGTATAA CACCGTGGAG AATAGTGAAA CGGACCACGA TGGCGCTTGG CAGATCGTGG GGCCGAATGG AAAGTGA
|
Protein sequence | MDPLEHVLVN LLGATTLDSS YRRFFEEYGI TQASELASIT EHRLATVSYG VLTPAVGDGP AAIVRTFLPP AQQDRILKIV QWFLSKGTNV TNDTWLELTS DVLEYWQPAS ATVAPATPVG SDARSSFVES AAAKFRKTIK NHSVPYPKFS EDRFWVTWNT NIRIKLRIHG VQLVLDPDYL PETVDETDTF VEMQNFVFGV FNDILLTPRA RGILHKHVDE LDAQSVYRDL VASYGKGINA QITATSIETK LTLYSFATSK SKTCVAFLTT WRNLIYDLER INEFPLPDHQ KSVRLKSAVR SHPQLKLFLG NVQLYSRTHV GKSADDSDFE YVYDLMLEHA TDIDQTDLED RGNNRGGRSA NNAKFQSSSK KKTNKPIGKK HKNYVPPEKW NALSPEEKRT IMDQRGPRPA PAPAPALSVN AAATQPPPTV YVSDSTAVDN QSLASTHVPP AAGPGHLLRS LISNSAARQH SAPSNGATSD SFSVNGTTYR REVNRASVQY RLSTHDVSLN KDSLVDGGAN GGLSGSDVTV ISQSLSEATV SGIGNSELTN LRLSTVAGLI HTTDGPIIGV FHQYAHLGTG NTIHSCNQMR SWGVTVDDVP RTFGGKQRIV TSDGRFVIPL SVSGGLTYLS MQAPTEEDLD TFEWVPFTAD NEWDPNSLSS PAAADDDLSL QLPVGHVPFR DERINNFGLL AHSAAVSRSP LNVDALQPNF GWVPSARIAR TFENTTQFAR ADARLPLRKH FKSRFPAANV SRLNEIVATD TFFSDTPAAD DGIFNHGGAT MAQLFVGKSS QITSVFPMKR ESQFAHTFED FIRTHGAPDA LLSDNARAQI GKQALQILRM YAIDDMQCEP HHQHQNYAER RIQEVKKMVN TIMDRTNTPP EYWLLCLFYV TYLLNRLSVE SLNWRTPLQV AHGQRPDISA LLLFRWFEPV YYYDPDHASF PSHSREKTGR WIGVAEHKGD ALTYWILTDN THQAIARSVV RPANVDNGLK NHRAADSSPD GGEPSNPKPI VLATSDLRHD ATIDPSFEKS HAFSPDELIG RYLIREAPDG QSHRALVARK IIDADSDNHQ AIRFLLQIDE KDADEIISYN ELSDLMEAQQ SEPATNGNIE DHFKFTSIIG HQGPLQPTDA GYKGSSWNVL VQWEDGSQSY EPLIEMAKDD PPLLFWKGET KSPACSVSST SQWGVSTGMT VLNTGSRDGA HYLGRGTQGE PVETNLVCLA TNPDTDGFGQ AQSQLSIQKH DVTVSGLDRY HAKKHGNYDE CAHGAVYNTV ENSETDHDGA WQIVGPNGK