Gene PHATR_33153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_33153 
Symbol 
ID7204272 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp196183 
End bp197613 
Gene Length1431 bp 
Protein Length443 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186012 
Protein GI219112857 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGACCG GGACAAAAAA CAGCTTCTTC GCCACCACAG TCTGCGTTGC CTTGTTTTGC 
TTTGTAAGAG CCCTTACCTC TTTGGGTACT GTCCAGAATC TCCAAGTCTT CGATACCGAA
CTCAAAGACA TAGCAGAAAG CGTTCATTCA CTTTCGGTCC ACGGGACCGG TGCTGACAGC
AACAACTCAT TGCCAAAGGC AAAACCTGAT GATGTCTACG TATTAGAATC AAAGACCAAG
GCTGCAGTCC GTGAAAGTAG CCTCAGGGAA AGCACTCTGG TCACCAATGC CGGTATCGAA
TTTCGGAAAC CAATATGGAG TAACGCTTCC AGCTTTGACA ATGGCGCACC GAGCATTCTT
GTCCAACTCA ACGGAGAATT GGCAAACTAT CTCGGATTTA TTGCCAAGGC TTTTGGACTG
GTGTGGTGGC TGGAGCGGGA GTATGGTGTG AATCCTACAA TTGTGCTGAG ACATCAACAG
CATCCCAAAT GGGTTGGTGC TCACGCGGAT GTTACTCGAT GTTTTCCGTA CTTGAGAGAC
TTTAATTTTG GGGCCGGAAA TACTCGGGAT ATAAGCAAAG AACTGAGTGT TCTGAGTCAA
TCTCATCAAC AAAGCAATGG TACGGCTGAA AGAGTTGTTG ATATCAGGAG CGAAGTGCCA
TACGACAAAA CAATCCAGTC CTTTTTGAGT CTCTACGCGA AGAGCCACAT CCACATTGGG
GGAGAAAAGA GCCGTATCAA CATACCTTTT CTGACCACTA AGCAAATGAG TTGCAGAGAC
CTCATAGTGG ACAAGTACTA TGACGATATT CGAAGAATAT ACCGATTTGA CAAAAGCTGC
TGCGTTGATG TACCCGACCC GGATGAATCT GTCTTTGTAG GTAGCTATTG CCAATCAGAT
ATGTGCAGAT TTTGGCGTTT CCGTCACTGA CTTTTCTTTG TTTTGCTGTG CTAGCATTTT
CGCAACTTTG TAAAGGAAAG GTCTGGACTA CGAAAGCAAC CTGGATATGA AGAGCTTGCT
CCGGAGCAAG TGGCGAACGA GCTGTTTGCG CATCTGAATC CGGGCGACAA AGTAGCAATA
ATCTCTCGCT ACCCCAATGA CTTCCGAACC CAAATGATTG TGGGCGCATT TGAGAAGCGG
AAGATTCGGG CTCGAGTTGT AGAGCCACGG TCCGGGGTGG CAGACTTTTG CTTTTTGATG
CATACGCAAA AGGAGATGGT TGGCACGGCT TGGTCTACTT ATTTTCTTTG GGCTGGCCTG
CTCGGAAATG CTACAAGCGT ACGACCGTAT ACAGCCATTG TACCTGGTCG GAACAGCAAA
ATCGACTCTC ACAATTTTAC GCATCCAGAC CTCAAATCCC GTTTTCGTTT TGAGCACTAT
ATCAGCAACT TTACTGCTGG GGATCTACGT AAACCAAAAG GTGAACAATA G
 
Protein sequence
MSTGTKNSFF ATTVCVALFC FVRALTSLGT VQNLQVFDTE LKDIAESVHS LSVHGTGADS 
NNSLPKAKPD DVYVLESKTK AAVRESSLRE STLVTNAGIE FRKPIWSNAS SFDNGAPSIL
VQLNGELANY LGFIAKAFGL VWWLEREYGV NPTIVLRHQQ HPKWVGAHAD VTRCFPYLRD
FNFGAGNTRD ISKELSVLSQ SHQQSNGTAE RVVDIRSEVP YDKTIQSFLS LYAKSHIHIG
GEKSRINIPF LTTKQMSCRD LIVDKYYDDI RRIYRFDKSC CVDVPDPDES VFERSGLRKQ
PGYEELAPEQ VANELFAHLN PGDKVAIISR YPNDFRTQMI VGAFEKRKIR ARVVEPRSGV
ADFCFLMHTQ KEMVGTAWST YFLWAGLLGN ATSVRPYTAI VPGRNSKIDS HNFTHPDLKS
RFRFEHYISN FTAGDLRKPK GEQ