Gene Information |
Locus tag | PHATRDRAFT_27518 |
Symbol | COPbeta2 |
ID | 7201516 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 8873 |
End bp | 12404 |
Gene Length | 3532 bp |
Protein Length | 962 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180359 |
Protein GI | 219119187 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTTGGCACT GTAGACCTTC GATCTCACCA TGGTGCGTCT TTAATTGTCA TTGAGTATTA ATGACAAACC TGAAAAGAAT TTTTAGAACT CGCATGCCCC GACGAGTTGC GTTCAGGGTT GAGGCAAGCG TTTCGTCATT CCTCTGCCAC TCCTCGCGAG ATCTTCGTGC AGCTAGCGCT TGTTTGGGAT GCGGGACGAT GTTCAAAGAG CTAATGCGAA GAATTCTTCC AATGTGCGAG TTGAAACTAG AAGATCCTAA ACGCCTGTAA ATGGCAAATC GGGTGTGGGA TCATCCATGG CTTTGTCAAA TGTCGCATCG ATGTTGTAGA CGCGTGTGGC GACACGTTAC ACTTTTTACG GATAAGCGCT CTCCTAATCG GGACTCTCAA ACTCTTACCT TTTCTCACCT TCTTGAACTG TCATCTTACA GCCGCTTCGT TTGGATATCA AAAAGAAGCT TTCCGCCTCT TCGGAACGTG TTAAATCGGT TGACTTGCAC AACTCGGAGC CATGGGTGCT CGCTGCGCTC TACAGCGGCA ACGTCATGAT TTGGGACTAC GAATCCGGCA GTCTCGCCAA GTCTTTTGAA GTCTCCGAAC TCCCCGTGCG CTGCGCGAAA TTCATTGAAC GTAAACAATG GTTTCTGGCG GCATCGGATG ATATGCGCCT GCGCGTTTTC AACTACAACA CGATGGAAAA AATCAAGGAG TTCGAAGCGC ACGCCGATTA CATTCGTTCG CTCGAAGTAC ACCCCTCTCT ACCTTACGTT TTTTCGTCGT CGGACGACAT GACGATTAAG CTTTGGGATT GGGATCGTGG CTTCGATTGT ACGCAATTGT TCGAAGGACA CGCGCATTAC GTAATGCAAG TCAAGATCAA CCCCAAGGAC ACAAACACCT TCGCTTCCGC AAGTCTCGAT CGGTCCATCA AGGTGTGGGG ATTGGGATCT CACGTCCCGC ACTACACCTT GGAAGGTCAT GAGCGTGGTG TCAATTGCGT GGACTATTAC CCATCCGGTG ATAAGCCTTA CATTCTATCT GGAGCTGATG ATCGTACTGT GAAAATCTGG GACTATCAGG TACGTGCACA AAAGTTTAGA TTCGTCCCAC TTGTCAGGCA AAATCTCAAC TTCGCATATT ATTCGTAATA GACAAAATCC ATTGTACATT CCCTGGAAGG CCATACACAC AATGTTTGCG CTGTCATGTT CCACCCCAAG CTGCCCATCA TCGCTTCCGC TTCCGAAGAC GGTACAGTCC GTATTTGGCA GAGCACCACG TACCGTGCGG AAACCACCCT CAACTACGGT ATGGAACGTG CGTGGGCACT CGCCGCGTCG CCAGAATCCA ACAAACTGGC AATAGGTTTC GATGAAGGAT GTGTGTGTAT CGAATTGGGC TCAGACGATC CGGTTGCTTC GATGGATACA ACCGGAAAGG TCGTTTGGGC GACCAACAAC GAAATCAAAA CTGCTTCGAT CCGCGGTGTT GCAGGTAGTG GCGAAGATGC CTTGCCCGAT GGCGAACGGC TCCCGGTAGT CCCTCGTGAT CTGGGCGCTT GTGAACTATT CCCGCAAATG CTTCGTCACA ACTGCAACGG ACGCTTTGTT GCTGTGTGTG GCGATGGCGA ATTTATCATT TATACCGCCC AAGCACTTCG CAACAAGGCT TTTGGGCAAG CTCTAGACTT TGTATGGTCT GGATCGGGTA CTGGAGACTA TGCGATTCGT GAAACGATCA ATAGTGTGAA AGTTTTCAAA AACTTTAAGG AATCACAGAG TATCGTACCT GCTACTGCCT 
CAGCTGAAGG CTTGTTTGGA GGACAAATGG TCGGAGTAAA AGGCGGCGAC GGTGCTGTGT TGTTCTATGA CTGGGATAGC GGCATCTTCG TTCGTAAAAT TGATGTAAAC CCGAAAGAAG TGTACTGGTC AGACAGCGGC AACATGGCAC TTTTGGCTTG CGAAGGAACA GCGTACGTTC TCTCGCATAA CGCCGAAGTG ATGGCTCAAG CGATTGTATC TGGGCAGGTC TCTCCTGAAG AAGGCATCGA TGGTACTTTC GATCTTTTGT TCGAAATAGA TGATACGATC ACGTCCGGAA AGTGGGTTGG GGATTGTTTC ATCTACGTCA ACAACGTCGG GCGTCTCAAC TACAGCGTTG GTGGGCAGAT TGAAACATTG GTTCATTTAG ATACTTCGGC GGGCGGGTCA GTACAGCACA CAATTCTTGG ATATCTGGCC AAGGAAGACC GAATATTCCT GATCGACAAG TCCTTGAACG TTGTTTCGTA CAAGGTTACT TTGGCGGTAT TGCAGTATCA AACAGCCGTC ATGCGCGGTG ACTTTGATTC GGCTAATGAG CTGTTGCCTT CAATCCCCGA AGAAGAATAT ACCAAAGTCG CTCGTTTCTT GGAATCTCAA GGATTCAAGG AAGAGGCGTT GGCTGTGACG CAGGATCCGG ATCACAAATT TGACTTGTCG CTCGAGCTAG GCCAAGTCGA TTTGGCGCAC CAGATCCTAT TGGAAACGCC CGAAGAGGAC AAGGAATCGA CCGACACACA GGCGAAGTGG AAACGGCTCA GCGATGCTGC CCTTAAGGAC ACCAATTTGG AACTGTGCGA ATCTGCCAGC ATTTCAAGCA ACGATTACTC TGGACTGCTT CTTTTGTACT CGGCAACTGG AAATCTTTCG GCGATGGAAA AGCTGGCGAA GCTCGCATCG GACGGAGGAA AGACAAACGT AGCTTTTGTC GCGTACATGC TGACCGGCAA TGTAGAGGCT TGCGCCGATT TATTGATCGC TACCAAAAGG CTGCCAGAAG CTGCATTCTT TGTGCGAACA TACTTGCCGT CCCGAATCGA AGAAGTTGTG GCTCTTTGGA GAAGAGATCT TTCCTCGATT AGCGAGTCAG CGGCAACTGC TCTTGCTACT CCATCAGAGA ATGCCACACT GTTCCCGGAT ATGGATGTTG CTTTGCAAGT CGAGCAAATG TTTCTAGGAC AACGAGAGGC GACGAAGGCT ACGGGTATCC CCGCATCGGA GTACCTGAGT GCCAAGGACG ACCTGGATTT GAATTTGATT GATCTTATCA AAACCCGTTC GCAGCCGGCA GTAGACCATT CTATGGCAGA GACTCATCAA TTGGTGGATG AAGAAAAGGA GGCAGACCCC ACGGACGACC ATGATGATGA AGAGGATGCC GATTTAGCTG CTGTACGTGA AGCCGAAGAA CGAGGAGCGA GTGAGGCTGC AGCGGTTGCT GAAGTACAGA GGCCGGCGGA AGGAGCGGCA GAATTTGAAG ACGATGTACC GTTAGAAATG ACTGAGGATG TTCCTGGAGC GACGAAAGAA GTAGACCGAG ACGATAGTGG ATTCGACGAG GAGTGGTAGA TAAGGTGGGC TTTACTATTT CTACCAATAA ACGATATGAG AAATGAGACA ATTTGATCAT ACCATGAACT GTGCACTGTA AATCAGCAAG TTCTCCCTTA CCCACAGGGT ACCGGTAGAT ATATGGCTAA GAAATCATCT GC
|
Protein sequence | MPLRLDIKKK LSASSERVKS VDLHNSEPWV LAALYSGNVM IWDYESGSLA KSFEVSELPV RCAKFIERKQ WFLAASDDMR LRVFNYNTME KIKEFEAHAD YIRSLEVHPS LPYVFSSSDD MTIKLWDWDR GFDCTQLFEG HAHYVMQVKI NPKDTNTFAS ASLDRSIKVW GLGSHVPHYT LEGHERGVNC VDYYPSGDKP YILSGADDRT VKIWDYQTKS IVHSLEGHTH NVCAVMFHPK LPIIASASED GTVRIWQSTT YRAETTLNYG MERAWALAAS PESNKLAIGF DEGCVCIELG SDDPVASMDT TGKVVWATNN EIKTASIRGV AGSGEDALPD GERLPVVPRD LGACELFPQM LRHNCNGRFV AVCGDGEFII YTAQALRNKA FGQALDFVWS GSGTGDYAIR ETINSVKVFK NFKESQSIVP ATASAEGLFG GQMVGVKGGD GAVLFYDWDS GIFVRKIDVN PKEVYWSDSG NMALLACEGT AYVLSHNAEV MAQAIVSGQV SPEEGIDGTF DLLFEIDDTI TSGKWVGDCF IYVNNVGRLN YSVGGQIETL VHLDTSAGGS VQHTILGYLA KEDRIFLIDK SLNVVSYKVT LAVLQYQTAV MRGDFDSANE LLPSIPEEEY TKVARFLESQ GFKEEALAVT QDPDHKFDLS LELGQVDLAH QILLETPEED KESTDTQAKW KRLSDAALKD TNLELCESAS ISSNDYSGLL LLYSATGNLS AMEKLAKLAS DGGKTNVAFV AYMLTGNVEA CADLLIATKR LPEAAFFVRT YLPSRIEEVV ALWRRDLSSI SESAATALAT PSENATLFPD MDVALQVEQM FLGQREATKA TGIPASEYLS AKDDLDLNLI DLIKTRSQPA VDHSMAETHQ LVDEEKEADP TDDHDDEEDA DLAAVREAEE RGASEAAAVA EVQRPAEGAA EFEDDVPLEM TEDVPGATKE VDRDDSGFDE EW