Gene PHATRDRAFT_50054 details


Gene Information

Locus tag: PHATRDRAFT_50054
Symbol:
ID: 7198803
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011694
Strand:
Start bp: 261659
End bp: 262884
Gene Length: 1226 bp
Protein Length: 352 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002184931
Protein GI: 219129512
COG category:
COG ID:
TIGRFAM ID:
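
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, although it reproduces the listed gene length) and that the coding sequence ends with a single stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 261659
END_BP = 262884
PROTEIN_LENGTH_AA = 352


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_length_aa + 1) * 3


gene_bp = span_length(START_BP, END_BP)
cds_bp = coding_length(PROTEIN_LENGTH_AA)
untranslated_bp = gene_bp - cds_bp

print(gene_bp, cds_bp, untranslated_bp)  # 1226 1059 167
```

The 167 bp remainder is the part of the gene span that is not translated (untranslated or intronic sequence), which is consistent with the record marking the gene as spliced.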


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.702034
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TACGGACATA GAAAGATCTA TCCGTGGAAA GTCCAACAAA ACAACAACAG CAACACAAAA 
CAGCACACAC GAGCGGCACA TTAACGAAGG AACACCGAGG GAGTGAGAAG GCCTTTTCGT
GGATGAACGG TTGGCCATGA AAAAGCCCGA AGTACTGCGT GTTGTCAAGT GTTGCTTGCC
CGCAATCCTC TTCGATTATA AGAGAAAAAA GGCTAGTGAG GATGCTAGCG AAATAAATAT
CGACTTGCAA GTTGACGATC TACAGGTGAC AACATTGTGT CGGTTGTGGG CAGGAATGGG
GCACATTCAT CGCGTCCGCA TTTCCTTACC GGTAGGCTCA ACAACCAGTG GCAATAGAAC
TACCGGCAAC GACGATATTG CTATTAAGCA CATAATTCCG CCTCCTTCAT CACAGCGGTC
GTTCGGCGAC CATCGCAAAG CCTCCAGCTA CCGCGTCGAA GCCAACTTTT ACGAAAATCT
CGCACAAGAA CTCATCGCCA AAGGTGTCAG TGTACCCACT CCGTACCATG TGGAGCGGGG
TAAGGGTGAT TCTGTCATCA TCGCCATGTC GTACCACGAA AGCAAGGTCA ATCCGACGGA
AAACCAGCAA CGCGTGCGGC TCGTCTTGTC GTGGTTGGCA CAATTTCACG CTATGTATTG
GGATGCCGAT GCGGCGGATC GCGTGGTCCA ACAAGCGGGG CTGCAGGCTG TGGGAAGCTA
CTGGTATCTG GCAACGCGTC CAGATGAACA CGAGGAGATG CCGGACCACG GCTGGCAAGG
AAGACTAAAG CGCGCCGCCC GTGCCATTGA TGCGCGGCTA CAGCGCGATC CTTTGCAATG
CGTTATACAC GGGGACGCCA AGGATGCAAA CATCTTGATG GACGAGCATG GAAAGGTCAC
CTTTTGTGAT TTCCAGTACG CGAGATCTAG CTTACTTTTT CTGCAGCTCC GTATCAATCG
ACGACGAAAA GGAAGCTCTG GAATATTATT GGAACGAGCT GAAAGCTCGA TTGCCACTAA
ACGTGTCGCC CGCGCCTACT TGGGAGCAGC TGCAAGATTC CATGGAATTT GCATATGCGG
ACTTTTATCG ATTCATGAGT GGGTGGGGAT TTTGGGGGTC GGGAGCTGAA CGTCGCGTAA
TTGCGTTGTT GGACCGGCTC GATCACGGTA GCAAGTTGGC GACCGAGGAA GACTATGACG
AAGCCGTACA ACGAGAGTTT GGCTAA
 
Protein sequence
MKKPEVLRVV KCCLPAILFD YKRKKASEDA SEINIDLQVD DLQVTTLCRL WAGMGHIHRV 
RISLPVGSTT SGNRTTGNDD IAIKHIIPPP SSQRSFGDHR KASSYRVEAN FYENLAQELI
AKGVSVPTPY HVERGKGDSV IIAMSYHESK VNPTENQQRV RLVLSWLAQF HAMYWDADAA
DRVVQQAGLQ AVGSYWYLAT RPDEHEEMPD HGWQGRLKRA ARAIDARLQR DPLQCVIHGD
AKDANILMDE HGKVTFCDFH SVSIDDEKEA LEYYWNELKA RLPLNVSPAP TWEQLQDSME
FAYADFYRFM SGWGFWGSGA ERRVIALLDR LDHGSKLATE EDYDEAVQRE FG