Gene PHATRDRAFT_50236 details

Gene Information

Locus tag: PHATRDRAFT_50236
Symbol:
ID: 7199014
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011696
Strand:
Start bp: 67765
End bp: 68909
Gene Length: 1145 bp
Protein Length: 264 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002185117
Protein GI: 219129904
COG category:
COG ID:
TIGRFAM ID:
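
The listed gene length can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the 1145 bp figure in this record but is not stated on the page. The helper names `inclusive_length` and `coding_length` are hypothetical and exist only for this illustration.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both included."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_len = inclusive_length(67765, 68909)  # Start bp / End bp from the record
cds_len = coding_length(264)               # Protein Length from the record

print(gene_len)            # 1145
print(cds_len)             # 795
print(gene_len - cds_len)  # 350
```

Under these assumptions, about 350 bp of the gene span do not encode protein. That fits the "Is gene spliced: Yes" entry, although the record itself does not say how that remainder divides between intron and flanking sequence.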


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGGAAAGCAC AATCCCACAC GTTGATATCC CACACTCTTC ACATACCCCC GTATCTGCTT 
CATCATGGCC ACCAAGTTTG CTGATCTCTC CAAGGGTCCC AAGGGTAGGT AGTCAAAGTC
GCAGCCGAAG AATGGATTGA AAGGACCCAA GAAATTTTGA TATTTTCGTC ACACGAAACC
TCTGTGCTTG ACTACGAATA CCGTGTTCCC GAAGTCGAGT CGGCTTGCCA TTTAGGTTTT
GCCAGAATGA GAAAGGACCT GTCTATACCG AGTGCTTGAT CTTACATTTA CCGTTATCCT
CTTTTGGCAG ATCTCTTGAA TGACGACTAC ACGTCTTCTG TCGTGCTCAA GGCCAAGAAG
AACGCCGGAC CTGTTGCCGT TACCATCGAA ACAACCCGCG GAGACGATGG TGCACTCACG
TCCAAGGTCG GCACTAAGTT CGCTTACGCT AAATTCAACG TTGACAAGGG CCAGATCAAG
GCCGATGGTG GCCGAGTCTT GGAGACATCC CTGAAGGTTA CACCGGAGGT CAAGCTTTCC
TTTTTGGCTA GCAAAGGTGC TGACTTGGGA GTGGATTACA CCAAGGGCAA CTTCTACGGA
ACCGGTGTTT TGGACGTTAT GGACATGTCC TTGGTTAGCA CGTCGGCTTG CTACGGTTTG
AACTCGGGAC TCAAAGTTGG CGGAGATGCC GCCTACAACC TTTCCGGAAG CAAGGGTCTC
AGTGGATTCA ACGTCGGTGC CTCATACACC GCTGGACCGC TGTTTACCTC TCTCACGGTC
TCCTCGAAAT CCGCCGCGAC CATTGGCCTT CTCTACAAGG TCAATAGCGA CCTTATGCTG
GCGTCTCAAA CGGTCCATAC CTCAAACAAG GTCTGCGACG TTTTGGGAGT CGGTGCTGCC
TTTAAGGCCC CTGTCGGTAC CATCAAGGCC AAGTTCAACA GCGGAGGTGT TGTGTCGGCC
TGTTTGATCA GGGAAATTGC GCCGAAGGTT GTCATGACGG CCTCGGGTTC CGTCACCGGT
GCGGACTTTT CGACCTTTAA ACCAGGTTTC CAGATTGCCA TGTAAACATG ACGTGTAACG
AAGAACAAAG TATCGCAATT GCGATATGAG ATTTTAAAAA TTGTGCGTTA CGAAATTGCT
TTGAT
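
The gene sequence above is laid out in space-separated groups of ten bases, so it has to be joined before any length or GC calculation. The following is a minimal sketch of that cleanup and of a plain GC percentage, assuming GC content means the share of G and C among all bases (the page does not define how its 51% figure was computed). The function names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence block, copied from the record.
first_line = "CGGAAAGCAC AATCCCACAC GTTGATATCC CACACTCTTC ACATACCCCC GTATCTGCTT"

seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this one line only
```

Passing the whole block through `clean_sequence` and comparing `len(...)` with the listed gene length is a quick way to confirm that no bases were lost when copying.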
 
Protein sequence
MATKFADLSK GPKDLLNDDY TSSVVLKAKK NAGPVAVTIE TTRGDDGALT SKVGTKFAYA 
KFNVDKGQIK ADGGRVLETS LKVTPEVKLS FLASKGADLG VDYTKGNFYG TGVLDVMDMS
LVSTSACYGL NSGLKVGGDA AYNLSGSKGL SGFNVGASYT AGPLFTSLTV SSKSAATIGL
LYKVNSDLML ASQTVHTSNK VCDVLGVGAA FKAPVGTIKA KFNSGGVVSA CLIREIAPKV
VMTASGSVTG ADFSTFKPGF QIAM