Gene PHATRDRAFT_50081 details


Gene Information

Locus tag: PHATRDRAFT_50081
Symbol:
ID: 7198679
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011694
Strand:
Start bp: 336901
End bp: 338869
Gene Length: 1969 bp
Protein Length: 569 aa
Translation table:
GC content: 56%
IMG OID:
Product: predicted protein
Protein accession: XP_002184865
Protein GI: 219129374
COG category:
COG ID:
TIGRFAM ID:
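
The Gene Length field agrees with an inclusive span between the Start bp and End bp fields. The sketch below is a minimal illustration, assuming 1-based inclusive coordinates (the record does not state its convention); `span_length` is a hypothetical helper name, not part of any database tool.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the Start bp and End bp fields of this record.
gene_length = span_length(336901, 338869)
print(gene_length)  # 1969, matching the Gene Length field
```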


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTGTGTAC GTAATTCACA GTCACTCCCC ACCCAAAGGA CGTTTACAGT TCCACTGGAC 
ACACCACCGC GACGTTGCCA ACGGAACCGT GGATTCCCAA ACCCACCACC AACTACGAAC
CCATGGCATC GTTCGGAATC ACCGCGGGCT CACGGACGTT CCTCCTTCCC ATCCCACCGG
TGCGGCTCAT TCCGTGGCGT CGTGTCACGG AAACATTCAC GTCGACGACT GCGTCGGCAC
ACGTGTCGTG GACGCATCCC GTTTGGTTTT GCGTGGCGGG ACGGCGCGGG ACGTCGACCT
ATTCCTTGCG CCATTCGACC ACCGCTCCTT CGACGTGGGC CCGCAACGGC GCCCGATTGT
ACGCACGGCA CGTGCTGGCT GCCGCCGCAG TCGCCGCGGC GACGGCGCGG CGGCGGACGC
GTCTGGAGCA ACTCGTGGTG GTCGAGCCCC CGTCGGATTG GTTGCGGTGG GAAAAGAAAT
CCTGGGCCGA CGAATTGCTC GCTAGTCTCG GACGGAATAA ACTGGAGCGG TTCGGGTCGG
CCACGCGACG GATTGCCTCC TTGTTGGTAC TGGCCGCGCC TTTGACGCTC CTCGTGCCCT
TGTCGTGCGT GTCGGAACGT GCCACGGCCT GGTCCTGGGC CTACGCGTTA TGGAGTATTG
AACAGGCCGG ACCGACCTAC ATCAAGTTGG TGCAGTGGGC CACCACCCGA CAAGATCTGT
TCTCGCCCGA GTTTTGTCAA TATTTTGGAA AACTCCGCGA TGAGACCACC GGACACGCCT
GGCAAGCCAC GGTGGACACG CTGTTGGAGG ATTTGGGCAT TGGCGCCGAT TTTCTGCAAC
TCGAAACGAA ACCCATCGGC TCCGGATGCA TCGCACAAGT CTACAAGGGA AAGTTGACGC
AACCCTCCGG TCCCTATCCC GTTGGTACCG ACATTGCCGT CAAAGTACAA CATCCCGGAA
TATGGGACAA GGTCTGCGTC GATTTTTACA TACTCGGCAA AGCCGCGGCC TGGTTGGAAC
GCATACCCTA CTTGAATTTG TCCTACCTTA GTTTGGCCGA CAGTGTCCGA CAGTTTCGGG
ACATTATGCT CCCGCAACTC GACTTGACCC TGGAAGCCAA TCATTTGCAA CGCTTCAATC
GAGATTTTCG GGACGACGAT CGGGTGGCCT TCCCGGAACC CTTGAAGGAA CTTACCACCA
CCCGGGTCCT CACGGAAACC TTTTGTCACG GGACTCCCAT TCTGGAATAC ACCAAGGCCC
CTCCCAAGGT CCGCGAGGAA CTGGCCTATC TCGGACTCTC CACCACCCTC AAAATGATCT
TTCTACACGA CTTTTTACAC GGTGACTTGC ATCCCGGTAC GTACGAGTGT GTCTGTGTCG
TATGTTGTGT GTGTGTGCAT TTGTATTGGT ACAGTATATA TTGGTATAGT GTGTGTGTGT
GTGTGTGTGT ACGCGTGTGG GCGCTTCTCA TTCCACGTCT CGCTTCGTTG CAGGCAACAT
TCTCGTCAGT AACACCCCCA AGGGCGACAT TAAGCTGAAT CTGCTCGATT GTGGATTGGT
GGTGGAAATG GGTCCGGAAC AACACATCAA CTTGGTCAAA ATCTTGGGCG CCTTTACGCG
TCGCGATGGT CGTTTGGCGG GACAGCTCAT GGTGGACACC AGTAGTCACT GCCAGGCCAG
TCCGTTGGAC GTCGAACTCT TCGTCAACGG CATTGAACGA ATAATTTTGG ACGACGCCAA
GAACAATTTT GTCGAAAACG TGGGGGACTA CATTACGGAT ATCTGTTACA TGGCCTGCGT
ACGCAAGGTG AAACTGGAAG CTTCCTTTAT CAACGCGGCG TTGGCGATTG AGATTATTGA
AGGCATTGCC CAACAGCTAC ATCCGCAAAT CGTCGTGACG AAAGAAGCAC TGCCACTCAT
CGTCAAGGCG GAAATGATGC ACCGGTTGCC CAAGTTTTCT CTCTGGTAA
 
Protein sequence
MASFGITAGS RTFLLPIPPV RLIPWRRVTE TFTSTTASAH VSWTHPVWFC VAGRRGTSTY 
SLRHSTTAPS TWARNGARLY ARHVLAAAAV AAATARRRTR LEQLVVVEPP SDWLRWEKKS
WADELLASLG RNKLERFGSA TRRIASLLVL AAPLTLLVPL SCVSERATAW SWAYALWSIE
QAGPTYIKLV QWATTRQDLF SPEFCQYFGK LRDETTGHAW QATVDTLLED LGIGADFLQL
ETKPIGSGCI AQVYKGKLTQ PSGPYPVGTD IAVKVQHPGI WDKVCVDFYI LGKAAAWLER
IPYLNLSYLS LADSVRQFRD IMLPQLDLTL EANHLQRFNR DFRDDDRVAF PEPLKELTTT
RVLTETFCHG TPILEYTKAP PKVREELAYL GLSTTLKMIF LHDFLHGDLH PGNILVSNTP
KGDIKLNLLD CGLVVEMGPE QHINLVKILG AFTRRDGRLA GQLMVDTSSH CQASPLDVEL
FVNGIERIIL DDAKNNFVEN VGDYITDICY MACVRKVKLE ASFINAALAI EIIEGIAQQL
HPQIVVTKEA LPLIVKAEMM HRLPKFSLW
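
The sequences above are displayed in space-separated blocks of ten residues, so they need cleaning before reuse. The sketch below shows one way to strip that layout and compute a GC fraction, assuming plain-text input copied from this page; `clean_sequence` and `gc_fraction` are hypothetical helper names introduced only for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a short illustration.
first_line = "GTGTGTGTAC GTAATTCACA GTCACTCCCC ACCCAAAGGA CGTTTACAGT TCCACTGGAC "
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq) * 100))
```

Run over the whole Gene sequence block, the cleaned length can be compared with the Gene Length field and the percentage with the GC content field.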