Gene PHATRDRAFT_27518 details


Gene Information

Locus tag: PHATRDRAFT_27518
Symbol: COPbeta2
ID: 7201516
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011677
Strand:
Start bp: 8873
End bp: 12404
Gene Length: 3532 bp
Protein Length: 962 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002180359
Protein GI: 219119187
COG category:
COG ID:
TIGRFAM ID:
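
The listed gene length is consistent with the start and end coordinates if both are read as 1-based, inclusive positions. The record does not state its coordinate convention, so that reading is an assumption. A minimal sketch of the arithmetic, with `span_length` as a hypothetical helper name:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(8873, 12404)
print(gene_length)  # 3532

# A 962 aa protein needs 962 codons plus one stop codon of coding sequence.
coding_nt = 962 * 3 + 3
print(coding_nt)  # 2889
```

The coding requirement (2889 nt) is shorter than the 3532 bp span, which fits a record flagged as spliced, since the span can also include introns and untranslated flanking sequence.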


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGTTGGCACT GTAGACCTTC GATCTCACCA TGGTGCGTCT TTAATTGTCA TTGAGTATTA 
ATGACAAACC TGAAAAGAAT TTTTAGAACT CGCATGCCCC GACGAGTTGC GTTCAGGGTT
GAGGCAAGCG TTTCGTCATT CCTCTGCCAC TCCTCGCGAG ATCTTCGTGC AGCTAGCGCT
TGTTTGGGAT GCGGGACGAT GTTCAAAGAG CTAATGCGAA GAATTCTTCC AATGTGCGAG
TTGAAACTAG AAGATCCTAA ACGCCTGTAA ATGGCAAATC GGGTGTGGGA TCATCCATGG
CTTTGTCAAA TGTCGCATCG ATGTTGTAGA CGCGTGTGGC GACACGTTAC ACTTTTTACG
GATAAGCGCT CTCCTAATCG GGACTCTCAA ACTCTTACCT TTTCTCACCT TCTTGAACTG
TCATCTTACA GCCGCTTCGT TTGGATATCA AAAAGAAGCT TTCCGCCTCT TCGGAACGTG
TTAAATCGGT TGACTTGCAC AACTCGGAGC CATGGGTGCT CGCTGCGCTC TACAGCGGCA
ACGTCATGAT TTGGGACTAC GAATCCGGCA GTCTCGCCAA GTCTTTTGAA GTCTCCGAAC
TCCCCGTGCG CTGCGCGAAA TTCATTGAAC GTAAACAATG GTTTCTGGCG GCATCGGATG
ATATGCGCCT GCGCGTTTTC AACTACAACA CGATGGAAAA AATCAAGGAG TTCGAAGCGC
ACGCCGATTA CATTCGTTCG CTCGAAGTAC ACCCCTCTCT ACCTTACGTT TTTTCGTCGT
CGGACGACAT GACGATTAAG CTTTGGGATT GGGATCGTGG CTTCGATTGT ACGCAATTGT
TCGAAGGACA CGCGCATTAC GTAATGCAAG TCAAGATCAA CCCCAAGGAC ACAAACACCT
TCGCTTCCGC AAGTCTCGAT CGGTCCATCA AGGTGTGGGG ATTGGGATCT CACGTCCCGC
ACTACACCTT GGAAGGTCAT GAGCGTGGTG TCAATTGCGT GGACTATTAC CCATCCGGTG
ATAAGCCTTA CATTCTATCT GGAGCTGATG ATCGTACTGT GAAAATCTGG GACTATCAGG
TACGTGCACA AAAGTTTAGA TTCGTCCCAC TTGTCAGGCA AAATCTCAAC TTCGCATATT
ATTCGTAATA GACAAAATCC ATTGTACATT CCCTGGAAGG CCATACACAC AATGTTTGCG
CTGTCATGTT CCACCCCAAG CTGCCCATCA TCGCTTCCGC TTCCGAAGAC GGTACAGTCC
GTATTTGGCA GAGCACCACG TACCGTGCGG AAACCACCCT CAACTACGGT ATGGAACGTG
CGTGGGCACT CGCCGCGTCG CCAGAATCCA ACAAACTGGC AATAGGTTTC GATGAAGGAT
GTGTGTGTAT CGAATTGGGC TCAGACGATC CGGTTGCTTC GATGGATACA ACCGGAAAGG
TCGTTTGGGC GACCAACAAC GAAATCAAAA CTGCTTCGAT CCGCGGTGTT GCAGGTAGTG
GCGAAGATGC CTTGCCCGAT GGCGAACGGC TCCCGGTAGT CCCTCGTGAT CTGGGCGCTT
GTGAACTATT CCCGCAAATG CTTCGTCACA ACTGCAACGG ACGCTTTGTT GCTGTGTGTG
GCGATGGCGA ATTTATCATT TATACCGCCC AAGCACTTCG CAACAAGGCT TTTGGGCAAG
CTCTAGACTT TGTATGGTCT GGATCGGGTA CTGGAGACTA TGCGATTCGT GAAACGATCA
ATAGTGTGAA AGTTTTCAAA AACTTTAAGG AATCACAGAG TATCGTACCT GCTACTGCCT
CAGCTGAAGG CTTGTTTGGA GGACAAATGG TCGGAGTAAA AGGCGGCGAC GGTGCTGTGT
TGTTCTATGA CTGGGATAGC GGCATCTTCG TTCGTAAAAT TGATGTAAAC CCGAAAGAAG
TGTACTGGTC AGACAGCGGC AACATGGCAC TTTTGGCTTG CGAAGGAACA GCGTACGTTC
TCTCGCATAA CGCCGAAGTG ATGGCTCAAG CGATTGTATC TGGGCAGGTC TCTCCTGAAG
AAGGCATCGA TGGTACTTTC GATCTTTTGT TCGAAATAGA TGATACGATC ACGTCCGGAA
AGTGGGTTGG GGATTGTTTC ATCTACGTCA ACAACGTCGG GCGTCTCAAC TACAGCGTTG
GTGGGCAGAT TGAAACATTG GTTCATTTAG ATACTTCGGC GGGCGGGTCA GTACAGCACA
CAATTCTTGG ATATCTGGCC AAGGAAGACC GAATATTCCT GATCGACAAG TCCTTGAACG
TTGTTTCGTA CAAGGTTACT TTGGCGGTAT TGCAGTATCA AACAGCCGTC ATGCGCGGTG
ACTTTGATTC GGCTAATGAG CTGTTGCCTT CAATCCCCGA AGAAGAATAT ACCAAAGTCG
CTCGTTTCTT GGAATCTCAA GGATTCAAGG AAGAGGCGTT GGCTGTGACG CAGGATCCGG
ATCACAAATT TGACTTGTCG CTCGAGCTAG GCCAAGTCGA TTTGGCGCAC CAGATCCTAT
TGGAAACGCC CGAAGAGGAC AAGGAATCGA CCGACACACA GGCGAAGTGG AAACGGCTCA
GCGATGCTGC CCTTAAGGAC ACCAATTTGG AACTGTGCGA ATCTGCCAGC ATTTCAAGCA
ACGATTACTC TGGACTGCTT CTTTTGTACT CGGCAACTGG AAATCTTTCG GCGATGGAAA
AGCTGGCGAA GCTCGCATCG GACGGAGGAA AGACAAACGT AGCTTTTGTC GCGTACATGC
TGACCGGCAA TGTAGAGGCT TGCGCCGATT TATTGATCGC TACCAAAAGG CTGCCAGAAG
CTGCATTCTT TGTGCGAACA TACTTGCCGT CCCGAATCGA AGAAGTTGTG GCTCTTTGGA
GAAGAGATCT TTCCTCGATT AGCGAGTCAG CGGCAACTGC TCTTGCTACT CCATCAGAGA
ATGCCACACT GTTCCCGGAT ATGGATGTTG CTTTGCAAGT CGAGCAAATG TTTCTAGGAC
AACGAGAGGC GACGAAGGCT ACGGGTATCC CCGCATCGGA GTACCTGAGT GCCAAGGACG
ACCTGGATTT GAATTTGATT GATCTTATCA AAACCCGTTC GCAGCCGGCA GTAGACCATT
CTATGGCAGA GACTCATCAA TTGGTGGATG AAGAAAAGGA GGCAGACCCC ACGGACGACC
ATGATGATGA AGAGGATGCC GATTTAGCTG CTGTACGTGA AGCCGAAGAA CGAGGAGCGA
GTGAGGCTGC AGCGGTTGCT GAAGTACAGA GGCCGGCGGA AGGAGCGGCA GAATTTGAAG
ACGATGTACC GTTAGAAATG ACTGAGGATG TTCCTGGAGC GACGAAAGAA GTAGACCGAG
ACGATAGTGG ATTCGACGAG GAGTGGTAGA TAAGGTGGGC TTTACTATTT CTACCAATAA
ACGATATGAG AAATGAGACA ATTTGATCAT ACCATGAACT GTGCACTGTA AATCAGCAAG
TTCTCCCTTA CCCACAGGGT ACCGGTAGAT ATATGGCTAA GAAATCATCT GC
 
Protein sequence
MPLRLDIKKK LSASSERVKS VDLHNSEPWV LAALYSGNVM IWDYESGSLA KSFEVSELPV 
RCAKFIERKQ WFLAASDDMR LRVFNYNTME KIKEFEAHAD YIRSLEVHPS LPYVFSSSDD
MTIKLWDWDR GFDCTQLFEG HAHYVMQVKI NPKDTNTFAS ASLDRSIKVW GLGSHVPHYT
LEGHERGVNC VDYYPSGDKP YILSGADDRT VKIWDYQTKS IVHSLEGHTH NVCAVMFHPK
LPIIASASED GTVRIWQSTT YRAETTLNYG MERAWALAAS PESNKLAIGF DEGCVCIELG
SDDPVASMDT TGKVVWATNN EIKTASIRGV AGSGEDALPD GERLPVVPRD LGACELFPQM
LRHNCNGRFV AVCGDGEFII YTAQALRNKA FGQALDFVWS GSGTGDYAIR ETINSVKVFK
NFKESQSIVP ATASAEGLFG GQMVGVKGGD GAVLFYDWDS GIFVRKIDVN PKEVYWSDSG
NMALLACEGT AYVLSHNAEV MAQAIVSGQV SPEEGIDGTF DLLFEIDDTI TSGKWVGDCF
IYVNNVGRLN YSVGGQIETL VHLDTSAGGS VQHTILGYLA KEDRIFLIDK SLNVVSYKVT
LAVLQYQTAV MRGDFDSANE LLPSIPEEEY TKVARFLESQ GFKEEALAVT QDPDHKFDLS
LELGQVDLAH QILLETPEED KESTDTQAKW KRLSDAALKD TNLELCESAS ISSNDYSGLL
LLYSATGNLS AMEKLAKLAS DGGKTNVAFV AYMLTGNVEA CADLLIATKR LPEAAFFVRT
YLPSRIEEVV ALWRRDLSSI SESAATALAT PSENATLFPD MDVALQVEQM FLGQREATKA
TGIPASEYLS AKDDLDLNLI DLIKTRSQPA VDHSMAETHQ LVDEEKEADP TDDHDDEEDA
DLAAVREAEE RGASEAAAVA EVQRPAEGAA EFEDDVPLEM TEDVPGATKE VDRDDSGFDE
EW
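
Both sequences above are printed in space-separated blocks of ten residues, six blocks per line. Before checking the listed lengths or GC content against the sequences, the whitespace has to be removed. The following is a minimal sketch, not the database's own method; `clean_sequence` and `gc_fraction` are hypothetical helper names.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one string by dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "CGTTGGCACT GTAGACCTTC GATCTCACCA TGGTGCGTCT TTAATTGTCA TTGAGTATTA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applied to the full gene and protein blocks, `len(clean_sequence(...))` can be compared with the Gene Length and Protein Length fields, and `gc_fraction` with the GC content field.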