Gene PHATRDRAFT_50517 details


Gene Information

Locus tag: PHATRDRAFT_50517
Symbol:
ID: 7199240
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011698
Strand:
Start bp: 288859
End bp: 291258
Gene Length: 2400 bp
Protein Length: 728 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002185411
Protein GI: 219130519
COG category:
COG ID:
TIGRFAM ID:
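The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state its coordinate convention) and that the unspliced coding region ends in a single stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
GENE_START_BP = 288859
GENE_END_BP = 291258
PROTEIN_LENGTH_AA = 728


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature on 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode a protein: three per residue plus one stop codon."""
    return 3 * (protein_aa + 1)


gene_bp = span_length(GENE_START_BP, GENE_END_BP)
coding_bp = coding_length(PROTEIN_LENGTH_AA)

# Prints the gene span, the coding length, and the bases left over.
print(gene_bp, coding_bp, gene_bp - coding_bp)
```

Under these assumptions the span matches the listed 2400 bp, and the 728 aa protein accounts for 2187 bp of it. The remaining 213 bp would then be sequence flanking the coding region.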


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTGTGAGTCC TACTCCTCAA TAGGTTACAC AGTCCGAGTT GTGCGTCGCT GCCATTGTGG 
TTACACTGAC TGTGTCGTGG ACTTTGGCAT CTTCCCCTGA AAAACAACAC ACTCACACAA
GTACGTTTCC AGTGAAAGAC ACGGTAGGCA CGCAACATGA AGCTGAAAGA TCGTATTCAA
CAATTCGAAA AGGCGGCTGG ATCCAACGTG CGCCAGGGGT TGCGTGTCGT ACACAAGGGC
ATTCGACAGG GCGTCCAATT GGCACAAAAG GGAGTCCCAC ACGTCAATCC CCCGGAAGTC
AATCCCACCC GATCCTCTGC CGTTGTGGAA GATCCGGAAA TGCCCACCAC GGCAGCGGCT
ACGGACACCT TGACAGCACC AACACCAACA ACAGCACCAC CAACCACCAC AACAAAGCTT
CCTTCCGGTG TGGTGCCGCC GTCTCCCGAC GACGGGCGGT ACGATTTCTC CTTTTTCCAA
GCTTGGTACG ATTCTCTCCA CGCCACATTG TCCGCCACGA ATGTTCTACC CACGTTCGAG
TCGGACCAAC AGACACACCC AGACCCAAAC GTACAGAAGT TCCTGCTCAC GTTAACGACG
ATTGACGCGG CTTGGAACGG ACAAAAGGCG GCCCAGACTC TGGTGGATCA ACTCGGCAAG
GCCAAACAGC CACCCACATC ACCGGAAGAA TTGGCACGAG CCCAAAAGGC CTTGCAGGAT
GCCACACAGA TGGTCAACGA TCTCCTACTC ACCGGAGAAG AAGCGGCACG GGTTCTTCTC
CACACGGATA GCGTCTTGGC GCACTTTGTC TCCTCTGATT ACGACGATAG CAATCTCATG
ACGTTGGCCG TACTCCAATC CGCCACTCCG AAAGGATTGG CAGCTTGGTG TGATCGCGGA
GATGCACCCA CCCAACAACT CTTTCGATTG TTACGGAGCG CTGATTTGAT GCGTGTCTTC
CTACAGCACG GTGGGCCCAA AAACGGGGCC TACGGACGAG CCATGGAATT GCATGAACAG
TTGTCCGCTC ATCAACACAC AGCGACTATC GATCCCGACG CGCACCGCAT CGATCCCGTA
CTCGAACGAT TGGCCCTCGC CGTGGCCTTG GAGCTGTGCA TCGATCTAAC GTCCTTTGGT
ACCAAACACC ACATTGATCC GGTGCAACGC TACGTACACT ACGAGCAAGC GTACTTGTTG
GGAGAGCTTG ATCCTGCCTT TCCCACGTTT ACGGTATGGG AACTGCGCAA CGTTGTCAAC
AGCGACGTTC CCGACGAACA GCTAGGCTGG GGAAGACAGT GCCTGAAGAA TTACCGACCC
GATTTGGTCT TGACCGACGA TCCGCAGTGG CGTTATTGTC GTATCGTCAA AACGGACGTG
GCCTACAAGG TCCCAGACTG GTACAAAGAG CCACGGTCCA TGGACCAAAT TCTATCGGGC
GGCGGAAAGT GCGGACCGCG GGCTTGGTAC GGACGTTTCG CCTGCAAAAG CTTTGGTATT
CCCACGTGGG GATGCAAACA ACCCGGACAC GCCGCCATGA CCCGGTGGAC CACCAAGGGC
TGGATGACCT GCCTTGGTGC CGGCTTTCCG TACAGCTACT GGGAAAATCG AGGCGGACTG
GATTTCCTGT TGGAGACTCA AGCGCGTGCC GCCTTCGGTA CGGATCGAAT GTATCTACAA
AGGGTTCTCC GTCTCGAGTG GTTGGCTATC TTCTACAAAG AAAGCAACAA GACTATTGTG
GACAAATGCA TGCCAAGTTC GGATAGTCCG TGGTGGGCGC TTTCGATGAT GCAACGAAAG
ACGGTCGCTC GAATCCCCCA AAGCAAACCA GCGAGGCAGT CGGACAACTT TAAGATTGTG
CAGACTCACA TTGAGACCGT CCAAGCTCAG CCCGACTCTT TGGAGACCAT CCACAGAGAT
CTCACTACGG GTCGAATCAC GGTTCCTGCC GTGGCCTTTG GCAGTTCTTC CCGAAAGGCG
GATGTCATGA AATCGTTCCT GGGCGGTCGT CAGGTCTTCC TCAAGGAGGA CGCGGTCGTT
GCGTATGTGT TGAATACCGA CCTCATCTCT CCATTGGCGG AGCGCTACCG GCTCTCGTGC
AGAATTTGTA CGGTCCACCG ATTCGAACAA CCACTCCAGT TAACAGTCAG CGCAGAGAGC
ACTGCTGCTG GCAACTCCTC GACACTGCAC GAGATCCCGA TTCCCTACAC CATGGGCATA
TGGGAGGCCA CGGAACATGT CGAAGTGGAG ATTGCGCTGA AAGGGAACAC TACACTTTCC
TTTGCCCGAG CCAACCAACA GTTTGGTTTC GCGTTGAAAG ACGTTGAGTT GCTTCCAGTT
TGATACAGCT ACTAGCAAAC AGTAACACTT AATACTATTA CTGCCAACAT GTTAATTACC
 
Protein sequence
MKLKDRIQQF EKAAGSNVRQ GLRVVHKGIR QGVQLAQKGV PHVNPPEVNP TRSSAVVEDP 
EMPTTAAATD TLTAPTPTTA PPTTTTKLPS GVVPPSPDDG RYDFSFFQAW YDSLHATLSA
TNVLPTFESD QQTHPDPNVQ KFLLTLTTID AAWNGQKAAQ TLVDQLGKAK QPPTSPEELA
RAQKALQDAT QMVNDLLLTG EEAARVLLHT DSVLAHFVSS DYDDSNLMTL AVLQSATPKG
LAAWCDRGDA PTQQLFRLLR SADLMRVFLQ HGGPKNGAYG RAMELHEQLS AHQHTATIDP
DAHRIDPVLE RLALAVALEL CIDLTSFGTK HHIDPVQRYV HYEQAYLLGE LDPAFPTFTV
WELRNVVNSD VPDEQLGWGR QCLKNYRPDL VLTDDPQWRY CRIVKTDVAY KVPDWYKEPR
SMDQILSGGG KCGPRAWYGR FACKSFGIPT WGCKQPGHAA MTRWTTKGWM TCLGAGFPYS
YWENRGGLDF LLETQARAAF GTDRMYLQRV LRLEWLAIFY KESNKTIVDK CMPSSDSPWW
ALSMMQRKTV ARIPQSKPAR QSDNFKIVQT HIETVQAQPD SLETIHRDLT TGRITVPAVA
FGSSSRKADV MKSFLGGRQV FLKEDAVVAY VLNTDLISPL AERYRLSCRI CTVHRFEQPL
QLTVSAESTA AGNSSTLHEI PIPYTMGIWE ATEHVEVEIA LKGNTTLSFA RANQQFGFAL
KDVELLPV
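The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the listed GC content and protein sequence could be checked against the gene sequence. It rests on two assumptions that the record does not confirm: the standard genetic code is used (the Translation table field is empty), and translation starts at the first ATG in the displayed sequence. All function names are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed), in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate_from_first_atg(seq: str) -> str:
    """Translate from the first ATG up to the first in-frame stop codon.

    Choosing the first ATG is a heuristic for illustration, not the
    annotation pipeline's method. Unknown codons become 'X'.
    """
    start = seq.find("ATG")
    if start < 0:
        return ""
    residues = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# A short fragment copied from the third line of the gene sequence above.
fragment = join_blocks("CGCAACATGA AGCTGAAAGA TCGTATTCAA")
print(translate_from_first_atg(fragment))  # MKLKDRIQ
```

The fragment translates to MKLKDRIQ, which matches the first eight residues of the protein sequence. Passing the full gene sequence through join_blocks and then gc_percent or translate_from_first_atg would allow the same comparison against the GC content and Protein Length fields.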