Gene PHATRDRAFT_47667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47667 
Symbol 
ID7202857 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp471390 
End bp473573 
Gene Length2184 bp 
Protein Length606 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181914 
Protein GI219123193 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0434926 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTGACTTTGA GGAGTCTCCA TGTCCAACGC ACAATTCTTC TGTGCCCGAA ATCAGCATCA 
TTATCCCAAA CCAAAAGATT GGCTGACGAA TCGTAACTAT GGTAAGTTGA CTTCAGGGTC
GATCACACAT TATCGCCGTA CTAACTGTAA AGCGAGCTGG ACCTCATTGG ATTGAAAAGG
GAGAGAGTGA TATGGCAGAT TTCTGTCAGA CCGATGGCTC TCATTAATCA AGAGATTATT
TTTCCAAGTC ATTTATCTAA CTTGTATTGT GCATTTTTGA TGGGCAGGCT GAAGAACAGA
ACGACGTTGA GGCGATGGAG GTCGACAAAG CCACTGAGGC TAAATCTCGC AAGGATCGTC
CTACTGGATC AATCCATTTC GGCGACACCG ACTTTGACGT TGACGAGCAA GTTGGGGACG
CTACATGGGC CGAGGTATGC AACGCCTGTT GTGTCCACTC CGGACAGGAA TGGGCAATGA
TCGCTCTTGG AATCTTCCTG GTTTGCTTCT TCCTCTACTT CTTCCTGTTG GGTCTTGACC
TGCTTGGAAA TGGTGCAAAG GTCATGTGTG GATGCACTGC CGGAGAGTTG TTCGGCGACG
ACACGAATCC CATCGCTGGT CTAATGGTTG GTATTCTTGC GACTGTATTG CTTCAGTCGT
CTTCAACAAC TACCTCGATT GTTGTGTCTT TGGTCGGATC CGCTGTCTCC GTCCGCCAAG
GAATTTACAT GATCATGGGA GCGAATATTG GTACTTCAGT CACCAACACA ATCGTCGCTA
TGGGCCAAAT GGGCGATGGG GATCAGCTCG AGCGTGCTTT CGCTGGTGCC ACCGTCCACG
ATATGTTTAA TTTCTTGTCA GTGGCCATAC TTCTTCCTGT AGAAGTCATC ACAGGATACC
TCTATCGGCT TACTAAGGCT ATGGTCAAGA ATGTCAATCT CGAAGACGGT GATAGCTGGG
ACGGTCCTAT TAAAAAGATG GTTGATCCTC TGACCGATAA GATCATCATT TCCAACAGCA
AGATTATCAA ATCTGTGGCT TTGGGTGAAG CGACCTGTGA TGTGGGTGGT GGATTTTACC
CTATGAACTG TACAGAAGAC ACATACTTGG GTTGTGGCAA GGCATTTGGA CTCATTTCGT
GCAGTAAGAC GAGCGGCGAT TGCCCTGCTT TCTTTCAAGG TGATGCTTCC GCAAAAGATG
ACAAGGTCTC TGGGGGTGTT GTCTTTTTCA TTGCTATTGT CATCCTGTTT GTATGCCTTG
CCGGGCTTGT TACTGTTCTT CAGAAGTTGC TGCTTGGAAT GTCCACTCGC GTTGTCTACA
AAGCCACTGA CATTAACGGA TATCTTGCGA TTGCTATTGG CGCTGGTCTC ACTATGATTG
TGCAGTCCTC CTCCATTACT ACGTCCGCTT TGACTCCGTT GGTTGGTATG GGAGCGCTTC
GTCTTGAGCA AATGTTACCT CTTACACTTG GTGCCAATAT TGGTACAACG CTGACTGCCA
TTTTGTCTGC CCTTGTGTCT GCAAGCAAGG ATTCGCTCCA GGTTGCCCTT GCCCACTTGT
TCTTCAACTT GACTGGAATC CTCATCTGGT ACCCGGTGCC TTTCATGCGT CATGTCCCTC
TCGAGGCAGC TCGTAGACTT GGAAAATTGA CTCGAGTCTG GCGTGGTTTC CCCATTGTTT
ACATTGCGGT GATGTTTTTT CTCATTCCGC TTCTTCTGCT CGGCCTGTCG TCTCTTTTCG
ATGATGGCAG CAAGGGTTTC ACTGTCCTGG GATCCTTCCT TACCATCCTT CTGTTCCTTG
TCATCATTTA CACTGTCTAC TGGTGCCGTT ACAAGGACGG TCAGCAGAAG TGCGCAAACT
GCATGGCGGA GCGTGAGAAG AAGCGCGTTG TGATTAAAGA GCTCCCTGAG GATATGGCGT
ACCTGAAGGA GCACATGAAG CGCCTCATTG AACACACTGG ACTCCCCGAA GACGAAGATG
TTCCGGCCAA GGATGAATCT CCTGATACTT CGGATGCTGA GGTTGATGCC TAAGCACCAA
ACAAGAGGTC GATATCGCCG AATATTGGTT AGAGGTCCCC TTTTGATTAT CCGTTTAGTA
TATATAGTAT ATACAACCTG TGCCAAGACC ATAGTTCTCG CAGATTCTGC TTGAGAATAA
AATAACTTGG GAATGTTTAT CTTA
 
Protein sequence
MSNAQFFCAR NQHHYPKPKD WLTNRNYEQN DVEAMEVDKA TEAKSRKDRP TGSIHFGDTD 
FDVDEQVGDA TWAEVCNACC VHSGQEWAMI ALGIFLVCFF LYFFLLGLDL LGNGAKVMCG
CTAGELFGDD TNPIAGLMVG ILATVLLQSS STTTSIVVSL VGSAVSVRQG IYMIMGANIG
TSVTNTIVAM GQMGDGDQLE RAFAGATVHD MFNFLSVAIL LPVEVITGYL YRLTKAMVKN
VNLEDGDSWD GPIKKMVDPL TDKIIISNSK IIKSVALGEA TCDVGGGFYP MNCTEDTYLG
CGKAFGLISC SKTSGDCPAF FQGDASAKDD KVSGGVVFFI AIVILFVCLA GLVTVLQKLL
LGMSTRVVYK ATDINGYLAI AIGAGLTMIV QSSSITTSAL TPLVGMGALR LEQMLPLTLG
ANIGTTLTAI LSALVSASKD SLQVALAHLF FNLTGILIWY PVPFMRHVPL EAARRLGKLT
RVWRGFPIVY IAVMFFLIPL LLLGLSSLFD DGSKGFTVLG SFLTILLFLV IIYTVYWCRY
KDGQQKCANC MAEREKKRVV IKELPEDMAY LKEHMKRLIE HTGLPEDEDV PAKDESPDTS
DAEVDA