Gene PHATRDRAFT_33916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_33916 
Symbol 
ID7197701 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp661732 
End bp662890 
Gene Length1159 bp 
Protein Length319 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178273 
Protein GI219114955 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.859586 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAAGG CTTCATCTCT CAAACGAGGA GTGTCGTCCC AAAGGGGCCT TCTGGCTTCG 
CTATGGTCGC TTGTAACAGT AATGACAGTA TTAGCTTTCA TAGTTGCTCT CATCTTTACA
TTTGCTGGGG TACAAGAAGG CGACTCCTAT TCCAACCGAA ACCAGAATAC GAACGATCAA
GACGCGAGCG ATCCAAAAGT GGCAGTGACG AGTCGAGGCA TGGCTGTAGC GGCTCTCTGG
ACAGCTGTTC TGGCTACTCT AATCTCAATT TTCGGGACTG TCATTCTGGG ATGGCAGTCG
CCCACAGGAC GGTATTACAG TTGCTGCTCC ACAAAGGTAC ACCAAACAAA CGCACTGTCC
CTGGGCAGTT TCATCGGAAC CGTTTTTATG TTCGCGAATC TGACCTTCAT ATGTGCCATT
CTGTTCGGAG AATTCGAGGT AATTTTCATC CGGGTAAAGA GCACTTTTTG ACTTTCCTTT
GGACGTCTCT GACATATATT TTCTTCTTTC TGCCTTAGAT TCGTGACCGA TGCGGAGAAG
GAGAAGGACG GAAACGAGAT GGCGTAGCAA CCGCTTCAGC GGTCGCACGG TCGTCGACGG
CCTTCAGCAT AGCGTGCCTC TTCCTAACGC TCCTGTATGG GGGGTTCGTC TCCATTTTGT
TTGTCTATTC CGAGAGCATT TTGGAGGAAC TGACAGCCGT CGAAAAGGAA GATTTGCACA
ACGCAAGGTG TATCCATTCC GCATCGCCAA CTCCATTTGC AACCGCCTAC ACCGGCTACA
TTGGAGAGCG ATTCGATGTA CGACGAAATC TAGGCAGTAG TGCTGCTGGC TTACTGTTAA
TGACTCCTAA ACCGGCGAAC TGCTTATCCG ATGGTACGCT GACATAATGA CAATGATGGC
AATCAATTCT CTACGTGTAG AGTGATGAAG ATACATGGGT CCTAGCGGGA GTACATGATG
CTCTCTTGGC CCATTCTTAC CGAGACTTTT AGGTACACCG TTTTTGGCAG CTCAGTTTGT
AGCAGCTCTA GATTCTTTGA ATTTGAATCC GCTTTCCACC ACGTTGTACT TGAATCCGAG
TTGGAGTGTG ACCTACAACC ATCTCTCAGT CCCAGGCTAC TCTCAGCGCA ATGACTTTCA
CAAGGAAATT GTTTCCTGA
 
Protein sequence
MAKASSLKRG VSSQRGLLAS LWSLVTVMTV LAFIVALIFT FAGVQEGDSY SNRNQNTNDQ 
DASDPKVAVT SRGMAVAALW TAVLATLISI FGTVILGWQS PTGRYYSCCS TKVHQTNALS
LGSFIGTVFM FANLTFICAI LFGEFEIRDR CGEGEGRKRD GVATASAVAR SSTAFSIACL
FLTLLYGGFV SILFVYSESI LEELTAVEKE DLHNARCIHS ASPTPFATAY TGYIGERFDV
RRNLGSSAAG LLLMTPKPAN CLSDGTPFLA AQFVAALDSL NLNPLSTTLY LNPSWSVTYN
HLSVPGYSQR NDFHKEIVS