Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45915 |
Symbol | |
ID | 7201004 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 656708 |
End bp | 660608 |
Gene Length | 3901 bp |
Protein Length | 1242 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180088 |
Protein GI | 219118639 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.144901 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGTCT TGACGGCCTT CCCGTCTCTC GATGCCACAC TCCGTTTCTT CTGCCTCCCT TATTTGCCGT TGTATTCGTT CGTTTGTTTG TCCACGATCA ATAGCACCAA ACACATGTCC GAGTTCCTAA CCACAGACGT CTTGGACGCG CTCCTTTCGG CGTCCCACGT CGAGCGACAG GCGGCGGAAG CCGTACTCCG GAACTGGACG GTGGATCAGC GCATTCGTGC CTTGCTACAG GCCTTGACGC AGCAGCCTTC AACGAACTTG CCACAATCAC AGGCAATTGC CGTACGGGAA CCGGTGTCGC GTACGGAACA CCACCAACAA TTGCTCGCCG TCTTGTTGCG CCGGGAGATT CAGCAATCGT CGCACGTGGC GCTCCTGCAC GACGCGATGG AACCTCTCCT CCAGCGGATT GTAACACACT CCGATTCTCC CGGTGTTAGT ACCAATGTGG TGGGAGACTG TGTGGCCGAA ATTGTTTCCG TTACGGCGGC ACTGGCCTCC CACGACGATG GCGTCCGTCT CGTACGCCGT ATCCTGTCCT CCGTGGCCGA GCCGGTACGT ACCCGATCTA GTAGTCCGCG GGACGCTATG ATGCAGTGTG ACACGCACAC TCAGTATACT CCCTTGCGAA ACGACACACG CATATGCACA TACTACATAT ATTAATCTAC TTACTCACTC ACAATCAATG TCGGTCTCTC TCGTAGGCAT CCCACGCCGA CCTTTCCTCA TTGAAACTCT TGGTTGCCGT GGCGGAACGG GCACCGACCG TCTTTGCACC CGCCGTGGCC GATGACGTTT CGACTCTGGT TTCAACCGCA CTCCGTCACG CGATCCATAC CGGCCCTCCA ACGGCCTCCC AGGTTCAACA CCCTGGCAAC GTCATCCGCA TCCTCACCGA CCTCGTCGTC GCCACCGCCC GGGCTCACGA ATCTGTACAC AGTTACGGTG ACAGTAGTGG TACCACCCTC GATCCCAACA CTCGGTCGAG TCAACTCGGC CGTACCTGTC TCGTACCGCT CCTCGAAACT ATACGACACG TACACGATCG GGACGTCGTC TCCAAACTGT TGCAAGCCTT GACCTCGGCG GCGGAACACG TTCCCGCGCT CCTGGCCGGA TCGCCGCACA CCCTACCAAT ACTCGTCACC GTACTCAGCG ATGTCGCACA CGATGCCACC GACACGGATG TCGATCTACA ACTCCAAGCC GTACAAGTAC TGGCCTCGCT CGTTGAACAC CACAACGTCA AACATCATCG CCTGACCCCC CAACTCGTTC AAGCCATACT CCAAGGCACC AACGGTAAGC ACGGAGTTGT CCAACTCTGT CTACACGCCA TGGTACACGG CACCGACGAC GATTGGGACG ACGAACCCCT CGTCTGGCAC CAATATAGCA ATGACAACGC CTCCGATGAA ACGGCCGCCT TTGCCCAGGA ACTCCTTCAC ACCGTACTAC AGGCACTCGG TAAACTCGCT TTGGACGTCG TCCTTCCCAG TGTGGAACGC TTATGTTCGT CTCCCGAACC CACCGCCGTC CGCGCGGGAC TGGCCGCACT CCAAGTCGCC GTCCAAGCCG CTCCCGTCAG CATTCAACCC CATTTGCCCG TCGTCGGCCG CGCCGCCTTG ACCTGGGCTG CACCGACCCA CCATTCTTTC CGAGTCCAGT TCCAAGCCAC ACAACTGGCC GGCGTCTTGT GTGAACTCCC CGGTGACGCT ACCGTACGTA CACTGTACGG ACCGCAACTC CTGCAAGCGC TAGCCGTCGC CACCGGCAGT CCCTGTCCCC ACGTAGCCGC CGTCGCCTCC ACCGCCATTG TTTCCTACTG CCGCGGCGAC GGCATCACCG AGGTAGACGC TGCACAATTC GTCGTACCGT ATCTAACTGA CGTACTGCAC GCCCTCGTTC ACGGTCCCCT GTCGCTCTCC CGAACCGACC GCAGCCAGGT AGTGGTCGTG ATTCGTGCCG TCGGTGCCTG GCCTGCCTGG CGCAAGCTTC CGGTCCCGCC TTTGCCCCCT ATTACTCCCA CGTCGTTCCT GGTCTGTGGG CCATCTCGCA GGATGCCGCA ACCGGTAATC CCGAACTCGC CCACTTGCGT GGTGCCGCGC TCGAGGCCGC TACCATCATC GGGCAGGCCT TGGGTGACAC ACACCGGGAA CTCTTCGTCG CCGATGCCGT AAACATGATG GACTGGGCGG TGCCTTACTT GAACTCCGGG GCGACCCACG TACCACTCGA ACAGCTCTTG TCAGCCTGTG CCCGCGTCGC TTCGGTCCTC GGTGAAGACT ACGCTCCCTA CGCGGGAGTC GTACTACCGC ATTTGATGCA ACGCGCCACC GCAGCCGCGG ATATGGAAGT CACTGAAGGC GACCAGGCCG GATGGGACGC CACCCAACGA CAACAAGTCG TCCGGGATGA TGAACAAGGC ACCGAGAGTA TGACCATTGC CATACCCGGT CGTGGCCTGG CGAAAGTCAC CGTCAACACA ACCAGAATTC AGGAAAAAGC CCAGGCCGTT CGCGCCATCT ACGAGCACGC GGTTGCTCTG GGTGCCGCCT TTCCGCAATC CGAAGCGTGT CTGGACGCAT TTCTAGAGTT GGTGCGCTTC CCGTACTCGG CTGAGATTCG GGCCGTGTCG GCGCAAACCC TAGCAGCCAT TTACGAAGCT TCCTGCGCCC ATGGCGAAGA CGGGGGTATG CGTGTTCCAG CAACGTACCT TCCACTCATA GCCCAAGGAA TTGCCACACA AATCTACGAG CAGGATGAGG CCGATATGGA CGCGCTCTAC GCCATGGCAG ATTCCCTCAG CGAGATCTAC TATAGTATCT ATCGCAGGCT CGCCAAGTTT GGACCAGTGT TGCTAGAGAA GTTTACCGTG GGTACGGCGT CAGCAACGGT ACAGTTGTTC ATGCAAGCTA TGGTGGCTTG TTTAGAACGA CGACGCGAAA CGGCTGATAT TCTTTCCGGG AGCCCACAAT CTCCGCTCGG CGAAGATGAA CACGCCGAAT ACGCAGAGTT GTTGCGGCTA GAGGAAACAC TCTTGACGCC CCTCGTCGAT GCAGTTGGAT ACACGCTAAA ATTCCTGCGC CACGAGTTCC TCCCGATTTT CGAGGCGCAC GTACTCCCCG TGCTCGGTCC GTACCTGTCG ACCGGCAACG ACATTCGCGC ACGGCTCGCG ACCGTTTGTC TCTTTGACGA CTGTGTAGAA TATTGCGGTG CCGCAGCCGC CGCCAAGTTC GCTCCCATGC TTATGGAGGG AGCCTTGTTG GGTATGAACG ATGCTAGTAA TGGGCAGGAC GAAGAGCTCC TGAGGGCGGC CATTTATGGA ATTGCACAAA TTGCCCGCTA CGCTCCGAGT TCCGTACTAG CACCTCACGC CCACAGTCTT GTACAGCATT TAGCAACCAT CTCCAGTCAG CCCAAGGATG AAGCCGACAA TGTGGCTATT CACGAGAATG CTGTGTCGAC GCTAGCGTCA CTCGTTCTGA TCGGCAACGC TCCTTTCCGA GGATCCGCGT TTGTCAAGCC GGAGACGGCC CTTCACATTT TTCTCGCGAA TTTGCCGTTG CGCGAGGATG CAGACGAAGC CAAAATTTGT CACAGCGGAT TGTGTGATCT AGTCGAGCGT AATACGATCG ACGTGACAGA GACCTGTCAG GAACTAATTC GCATCATCGG TGAGATACTG GTTTATGTGG ACGACGAGGA GGATCTTGCA AGTCCCGAAA CGCTCCTGCG CAGTGTTGGT ATCTTATTTC GCATGCAAAA GGAAGTCCAT GGCGACGCAA TGCAACGGGC ATTCGCGTCA ATTCCGGATG AGGCACAGGC GGCGATTAAT AACGCCATGC AGCAACATTC CCGTCAATTC AACTGTGTCG TGACACCGTA A
|
Protein sequence | MAVLTAFPSL DATLRFFCLP YLPLYSFVCL STINSTKHMS EFLTTDVLDA LLSASHVERQ AAEAVLRNWT VDQRIRALLQ ALTQQPSTNL PQSQAIAVRE PVSRTEHHQQ LLAVLLRREI QQSSHVALLH DAMEPLLQRI VTHSDSPGVS TNVVGDCVAE IVSVTAALAS HDDGVRLVRR ILSSVAEPAS HADLSSLKLL VAVAERAPTV FAPAVADDVS TLVSTALRHA IHTGPPTASQ VQHPGNVIRI LTDLVVATAR AHESVHSYGD SSGTTLDPNT RSSQLGRTCL VPLLETIRHV HDRDVVSKLL QALTSAAEHV PALLAGSPHT LPILVTVLSD VAHDATDTDV DLQLQAVQVL ASLVEHHNVK HHRLTPQLVQ AILQGTNGKH GVVQLCLHAM VHGTDDDWDD EPLVWHQYSN DNASDETAAF AQELLHTVLQ ALGKLALDVV LPSVERLCSS PEPTAVRAGL AALQVAVQAA PVSIQPHLPV VGRAALTWAA PTHHSFRVQF QATQLAGVLC ELPGDATVRT LYGPQLLQAL AVATGSPCPH VAAVASTAIV SYCRGDGITE VDAAQFVVPY LTDVLHALVH GPLSLSRTDR SQVVVVIRAV GAWPAWRKLP VPPLPPITPT SFLDAATGNP ELAHLRGAAL EAATIIGQAL GDTHRELFVA DAVNMMDWAV PYLNSGATHV PLEQLLSACA RVASVLGEDY APYAGVVLPH LMQRATAAAD MEVTEGDQAG WDATQRQQVV RDDEQGTESM TIAIPGRGLA KVTVNTTRIQ EKAQAVRAIY EHAVALGAAF PQSEACLDAF LELVRFPYSA EIRAVSAQTL AAIYEASCAH GEDGGMRVPA TYLPLIAQGI ATQIYEQDEA DMDALYAMAD SLSEIYYSIY RRLAKFGPVL LEKFTVGTAS ATVQLFMQAM VACLERRRET ADILSGSPQS PLGEDEHAEY AELLRLEETL LTPLVDAVGY TLKFLRHEFL PIFEAHVLPV LGPYLSTGND IRARLATVCL FDDCVEYCGA AAAAKFAPML MEGALLGMND ASNGQDEELL RAAIYGIAQI ARYAPSSVLA PHAHSLVQHL ATISSQPKDE ADNVAIHENA VSTLASLVLI GNAPFRGSAF VKPETALHIF LANLPLREDA DEAKICHSGL CDLVERNTID VTETCQELIR IIGEILVYVD DEEDLASPET LLRSVGILFR MQKEVHGDAM QRAFASIPDE AQAAINNAMQ QHSRQFNCVV TP
|
| |