Gene PHATRDRAFT_50229 details


Gene Information

Locus tag: PHATRDRAFT_50229
Symbol:
ID: 7199010
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011696
Strand:
Start bp: 52872
End bp: 54593
Gene Length: 1722 bp
Protein Length: 520 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002185195
Protein GI: 219130066
COG category:
COG ID:
TIGRFAM ID:
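
The reported gene length matches the start and end coordinates if both are read as 1-based and inclusive. A minimal Python sketch of that arithmetic follows; the function name `span_length` is illustrative and not part of the database, and the inclusive convention is an assumption that happens to reproduce the 1722 bp listed above.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a coordinate span, assuming 1-based inclusive ends."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(span_length(52872, 54593))  # 1722
```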


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TCGTCGATTT TTTGCTTGTC TGATTAATCG AAGACTAGCA AAGTTTGCAA AAGTTGAAGC 
AAGATCCCTG ACTTACAAAA GTAACTATGG ACGACTTCAA CTCGCAATTC GGGAAGCTCT
CCACTACGGC GGCAGAATGG AAGCCAAGCG GAAATGCAGA CAACGATTCT GATGTACAGT
TAAACGCCAA GGTTGTCAAG GAATTTGTAC CAGGCCAAGG ATGGATCGCC TCGTCGACCA
AGCCTGCTGC GCGTGAAGCG AATATCCCTT CGTCGGTGAG GGAATATACA TTGGATGCGG
CAAATTCCTC CGCGAGGTTT GAAACGACAG TTCCAACAGG GCCTTTCCCT TCTCCGATGC
CAAGCTTCCG GGCCTTGCAA ACTCTGGGAT TAGGAGACGA TTTGTGGCGA CATTACAGAG
ATATTTCTTT AGAGAGCTGT AGGCAAATGG ACCCCGATGA CCAGCGGCAG AAGGCAGTCC
CTTTACCTTA TTGTAACGCC TACTGTCTAG ACGATATCTC CCAACGGGGA CGCTCCTCGT
TCGGTTATCC ATCGGCAACT TTCCATGTCA CTTCCAGAGA AGACGGCAAT GCTTACTGTC
TGCGCCGTTT CGACAATGTG AGGTGTGTGT CACCCAAAAT AGCCTCCACC GTATCAGACC
GATGGACCAG TGTCGCAGTT GTACAAGAAC ATCCCGGCAT TGCACCCTTT TACCAATGCT
TTATGGCCCA ACGTGCTGTG TTCTTTGTGC ACCAGTACAT ACCTGGAGCG CGTAGTCTCA
AGGAACGCCT CGGGGGTCCG TTGTCGGAGT GCGTCCTGTG GAGTTGTATC TCCCAACTTG
TGTCGGCTTT ACGAACAATT CACGGAAGAA ATTTGGCCGC TCGTACTTTG CAACTTCATC
ACATCTTATC GAATACGGAT TCGGCTGCAA GTCGGCTGCG TGTGCGTTTG AACTGTTTGG
GCATTGCAGA CGTTCTGGAG TTTGAAGCGC GCAAAAAGGT GGCCGACCTG CAGCGACATG
ATGTTCGGGA TCTTGGAAGG TTGATTCTCT CACTAGCGTC AGGTACCGAA ATCACCCATT
CCACCGACAT GGAAACGGTA GGATCATGTG AACAATTCTT GGCACAGAAC TACTCGCCAG
ACTTACACAA TTTAGCCATG ACATTGATCA GGAGTACACC TCAGCCGCCG TCGATTCTTG
ACGTAAGTAG AGTTGTCGCT CAACGGGCTT TCGATGAGCA AGATGCAGCT TATCAGTCCT
TTGATCGCAT GGAACGAGCG CTATCCGCAG AGTACGATTC AAGCCGGATG TTACGAATCT
TGCTGAAACT AAGTTACGTG AATGAACGAC CCGAATTTGG CCCAAATCGA AGATGGGCGC
AGTCGGGAGA TTGCTATGCT CTGACGCTAT TTCGTGATTA CGTCTTTCAC CAAGCTGATG
GTGGTGGCTA TCCAGTTATG GATTTAGGGC ACGTCATATC AGCGTTGAAC AAGCTTGATG
GCGCAGATGA AGAAAAGATT GTTTTGTCGT CTCGGGATGG GAAGAGCCTA ATGGTAGCAA
GCTACGCAGA GATAGCTCGA TGCCTGGAAA ATGCGTTCCA GGAACTATGC GTGGGCGCAG
TATCGCATGA TGCGTTGCAT TACTGTTGAC TGAGAGCGAA TCAGATACAT CACTTATTGT
CACAAATTAA ATTGATACTG TAAATGGGAT GCTCAAGACG TG
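
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks need to be stripped before any calculation. The sketch below shows one way to do that and to compute a GC percentage like the one reported in the Gene Information section. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database itself rounds or computes its 49% figure is not stated in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "TCGTCGATTT TTTGCTTGTC TGATTAATCG AAGACTAGCA AAGTTTGCAA AAGTTGAAGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full listing through `clean_sequence` and then `gc_percent` gives a figure to compare with the reported GC content.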
 
Protein sequence
MDDFNSQFGK LSTTAAEWKP SGNADNDSDV QLNAKVVKEF VPGQGWIASS TKPAAREANI 
PSSVREYTLD AANSSARFET TVPTGPFPSP MPSFRALQTL GLGDDLWRHY RDISLESCRQ
MDPDDQRQKA VPLPYCNAYC LDDISQRGRS SFGYPSATFH VTSREDGNAY CLRRFDNVRC
VSPKIASTVS DRWTSVAVVQ EHPGIAPFYQ CFMAQRAVFF VHQYIPGARS LKERLGGPLS
ECVLWSCISQ LVSALRTIHG RNLAARTLQL HHILSNTDSA ASRLRVRLNC LGIADVLEFE
ARKKVADLQR HDVRDLGRLI LSLASGTEIT HSTDMETVGS CEQFLAQNYS PDLHNLAMTL
IRSTPQPPSI LDVSRVVAQR AFDEQDAAYQ SFDRMERALS AEYDSSRMLR ILLKLSYVNE
RPEFGPNRRW AQSGDCYALT LFRDYVFHQA DGGGYPVMDL GHVISALNKL DGADEEKIVL
SSRDGKSLMV ASYAEIARCL ENAFQELCVG AVSHDALHYC
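
Because the record marks the gene as not spliced, the protein can be checked against the gene sequence by a direct translation. The following is a minimal sketch, not the database's own method. It assumes the standard genetic code (the Translation table field above is empty) and naively starts at the first ATG in the input. The names `CODON_TABLE` and `translate_from_first_atg` are illustrative.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_from_first_atg(dna: str) -> str:
    """Translate from the first ATG until a stop codon or the end of the input."""
    seq = "".join(dna.split()).upper()
    start = seq.find("ATG")
    if start == -1:
        return ""
    protein = []
    for pos in range(start, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks unknown codons
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# Four consecutive blocks from the second line of the gene sequence above.
print(translate_from_first_atg("GTAACTATGG ACGACTTCAA CTCGCAATTC GGGAAGCTCT"))  # MDDFNSQFGKL
```

The printed residues match the beginning of the protein sequence listed above.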