Gene PHATRDRAFT_35537 details

Gene Information

Locus tag: PHATRDRAFT_35537
Symbol:
ID: 7200779
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011676
Strand:
Start bp: 146446
End bp: 147568
Gene Length: 1123 bp
Protein Length: 352 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002179983
Protein GI: 219118420
COG category:
COG ID:
TIGRFAM ID:
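
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene span includes one stop codon; the helper name `span_length` is hypothetical and not part of the record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(146446, 147568)

# Assumption: the 352 aa protein is followed by a single stop codon
# that lies inside the gene span.
coding_nt = 3 * (352 + 1)

# Whatever remains is not translated; for a spliced gene this would
# typically be intron sequence.
noncoding_nt = gene_length - coding_nt

print(gene_length, coding_nt, noncoding_nt)
```

Under these assumptions the span works out to 1123 bp, matching the Gene Length field, with 1059 coding nucleotides and 64 bp left over, which is consistent with the gene being marked as spliced.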


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.991073
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTTCT CTTTTCTCCC TATTGCTTTA TTCTTGGCTT TCATCGGCGG TTACGCAGAG 
GAACTTTCTC GGGCGCAAAG AGTATCCTTT GGCGAACAGA AGACACGAAA TGCTCAGGTG
GTCGTCGCTA ATTATTCTGA AGCGGAGATG CTTGGCTTCA AGATCCAGCA CCAAGTGCAC
AGCCAAGCCG ACCTATTCGT CCAGCAGCTA GAAGGAAAGT TGCAACTCGT GAAGAATGAG
ATGATATCAC TTGAGAATCG CGACCTGGCC AATCTCGATA TCTTTACTTC TTACGCCAGT
GTTGTCGCCG ACGGCACAGA GAAGAACGCC GCGTCCTCAT TGTTTGTGAG CAGACAGGAT
CCGGCCATAA CGATTGCACT GGATTCCGAA GGCAACTTGA GGGAAGCCGT ACGCCTTAAC
CCTGAAATCG GTGAAGCAAT ATCCATTTCA CGCATTGATT CCCGAAAGGC GGATCGATTT
GTTACGATCA CTGCGGAAGA CTTCGATCAA GACAAACTTG CTAGTTTTGA AGTAGAAGAC
AGAGTAGCTC CATTAGCACG TCAACTCAGA AGCTCACACA AGAGCGGAAC AAGCAGAGAG
AGGTCTCTCC AAGCCATCGG TGCCTGCTCC GAATACGGTT TTGACGTAAT CGAAGTTGCT
GTTGTGGTGG ATTCCCTTCT CTGTGCTGCT GTAGGTGGAA CTGAAGGAGC TGCTTCCACC
GCTGCACAGT CTGTCATTGC AGGCGCCAGC CAGTTCTACG AGGTTGACGG ACTTTGCAAG
AAACTCCGCA TTTCGTATTT GGAAATTCAC TGCAATGCTG GTACCAATCC TATCGCTCCT
TTGCTTCAAC AAGCAGGAAA CTCTGACATT TGTAATACCG ACGCAAATGG TTTATTGCAG
AACTTTATAT GCTACACGGT AGATCAAGGT ATTGCCGCGG ACTTGAACCT TCTGTTCCAC
GGCAAGTTCT TTACTGTTAG TGGCTCTCTA TCAACTGGCT GCGCTTTCAC TGGAACACTT
TGTCTTACTG ATGGTACAGA TTCTGGAGTC AATCAGATCA ACTTTACAAC CGATCCCGTG
TCCCGGGCCA AATTGGTCGC TCACAAGGTG GGCCATATCC TAA
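
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be cleaned before its length or GC content can be compared with the Gene Length and GC content fields. The following is a minimal sketch, assuming GC content means the fraction of G and C among the A/C/G/T bases; the function names `clean_sequence` and `gc_fraction` are hypothetical helpers, and only the first row of the sequence is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among A/C/G/T; other symbols are ignored."""
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A, C, G or T")
    gc = sum(1 for base in counted if base in "GC")
    return gc / len(counted)


# First row of the gene sequence, copied from the record.
first_row = "ATGTCTTTCT CTTTTCTCCC TATTGCTTTA TTCTTGGCTT TCATCGGCGG TTACGCAGAG"

seq = clean_sequence(first_row)
print(len(seq))
print(f"{gc_fraction(seq):.0%}")
```

Passing the full multi-line sequence to `clean_sequence` instead of `first_row` gives values that can be checked against the 1123 bp and 48% reported in the record.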
 
Protein sequence
MSFSFLPIAL FLAFIGGYAE ELSRAQRVSF GEQKTRNAQV VVANYSEAEM LGFKIQHQVH 
SQADLFVQQL EGKLQLVKNE MISLENRDLA NLDIFTSYAS VVADGTEKNA ASSLFVSRQD
PAITIALDSE GNLREAVRLN PEIGEAISIS RIDSRKADRF VTITAEDFDQ DKLASFEVED
RVAPLARQLR SSHKSGTSRE RSLQAIGACS EYGFDVIEVA VVVDSLLCAA VGGTEGAAST
AAQSVIAGAS QFYEVDGLCK KLRISYLEIH CNAGTNPIAP LLQQAGNSDI CNTDANGLLQ
NFICYTVDQG IAADLNLLFH GKFFTILESI RSTLQPIPCP GPNWSLTRWA IS