Gene PHATRDRAFT_39281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39281 
Symbol 
ID7195024 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011687 
Strand
Start bp112755 
End bp114852 
Gene Length2098 bp 
Protein Length671 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183293 
Protein GI219126080 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0784232 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGTCCAT GGCATCGTAC TCACAGTCAA GCATCGATCG CTCTTCGGGG TAACGTGGGT 
AGTAGCAGCT GTAAGCAAGG AGTAGCTCAA GGGCTCAGGT GCATCGCCTG TTACTTGGTC
AAACGCATTC TCGGAAATAG GGGTCCTCTC AGGCGAATTT TGTTTGGCAT CGGAGTTATC
ATACTCCTAA CGACTGCGAG GCAGGGAAGG TCCTTCGCCA TAATTCTTAA AACGAACACC
AAAGAGAGTT GGATAAAGCT TAAGCTGGCT CCTGCTTTTA ATCCTGCTGT GAAAGCCGTG
GGTGGTAAAG GACCAGTGCG CGCATCAGAG CGTAATCGCG CAAATTTTTC TTCATGTAAC
ACAGATCCTG GCCAAGCCGA GAATGCTGAG GGAAGTGCAA TGGGCTACAA TAATCTTCTC
AGAATCCGCG AGATCAAGAT GAATGAAAGT TCAGCGAAAG GGCAACGACT TCTATGCGTA
TTTCCTTCAT TTCCTTCGGT TCAGCTTCAG AACGCTGCGG TAGAAACGTA TGGAAAGAAT
TGCGATGGTC GTCTCGTAGT AAGCCGCAAT ACGCCGCTGG TAAGCTACAG TTCCCAGACG
TATTGGATAA ACGAGAGAGA CCCTACATAT AGAGATGGAG CGGGGGCAAT ACTTCAGCAA
ATTCTGCACA AAGTCGGAAA TGAGTATGAT TGGTTTTATT TTGCTCAACC AGGGCGCTAT
TTGATCGGAT CCAATGCCCA GATTCGCCTA TCCTTTACGC GCAACGGAAC CGATTTAGCA
AGGAAGAATC CGTTAGCGCT GCGAACATAC ATACACAATA TTTGGGAGTT TCCCAAAAAG
GTTCATGCAC GAGGGAGTTG CCCTGGAACT GAGTTTTTAA ACCGCGCTGC CGTCGATGAA
CTCGTTCACG TCTATCTCAA GAACAAAGAG TCTACATCAG GAGATTCCAA TTTGATTACA
TGGATACAAT CAGTGATCTC AGACACGGGC ACACGATGTG TGAAAGGCCT GCTAGCAAAA
ATGCCAAATC CACAGAAGTA TGCCGAGCTG CTACTGTCGC CCAATGCGAC ATTCGAGCAA
CAGGCCGTGG GACTGTATCG AATTCAATCA ATACTGAACG GTACATGCGA CCGCCAATGG
CGAGAAAAAT TCGGATTTCT GAATGGAGAT GTTCGGGATG TAACTTTACT GCGCAAGCAT
GGCCCAGCGT TTGATTTCTT TCCTGCAGGT TTTCACAAAT CCGTTTGTGA GACCCCCTTC
GGAACAGGAA CCGAAGGACA CATGGGATAC AGGGGTTTGC GAAAGATCCA GATTGCGCAG
CAGTCTCAAG AGAAACGAAT CTTGTGTATG ATTTACACCC ACGAGAGTCG CCATGAGCAG
TTGCGCTCAA TAGTAGAGAC ATGGGGCAAA GGATGCGACG GGTTCTTTGC AGCTTCAACA
AAAAACGACG AAAGTTTAGG AGCAATCAAC TTACTGCACG AAGGTCCGGA GCTATATGAT
AATTTGTGGA TGAAAGTCAG GGCTATGTGG CAGTATGCTT TTGACCATTT CTTGAACGAC
TACGATTTTT TCCATATTGG TGGGGATGAC CACTACGTCA TCGTCGAGAA CCTAAAATAC
GCTGTTGCTA CGGGAAATTG GAAAGAACAC TGGAACCAGA GTGTTCCTCT CTTCTTAGGT
GGATCAGTTG CGGACCACGC AGACCTCCAA AGACGATACT GCGGAGGTGG CAGTGGCTAC
ACTTTGAATC GTATAGCTCT ACGAAGGCTG GTAGAAGAGC TTTTTCCCAA GTCACAGTGC
TGGCCGCATT GGACATCAGC TCAGGAGGAC AGAATCATGG CAGGTTGCTT CCGGTCAGTT
GGAATTCAGT GTATGGATAC AAACGATTAT AAAAATGAGA CCCGCTATCA TCCTTGGGGC
GTGGACTATC ACGCTTCTTG GACAAAGAGA AAAAAAGGAA ACTGGCACCC AAAAGTTTTG
GAAACAGTTC ATGGGATTGC GCAGCCCGAA GGCTTGGCAC AAATATCAGA TTCTAGCGTA
TCCTTTCATC TGAAACCACT CCGCACAGAT CCTGATTTGC CGCCCGATCG AGGAATGA
 
Protein sequence
MCPWHRTHSQ ASIALRGNVG SSSCKQGVAQ GLRCIACYLV KRILGNRGPL RRILFGIGVI 
ILLTTARQGR SFAIILKTNT KESWIKLKLA PAFNPAVKAV GGKGPVRASE RNRANFSSCN
TDPGQAENAE GSAMGYNNLL RIREIKMNES SAKGQRLLCV FPSFPSVQLQ NAAVETYGKN
CDGRLVVSRN TPLVSYSSQT YWINERDPTY RDGAGAILQQ ILHKVGNEYD WFYFAQPGRY
LIGSNAQIRL SFTRNGTDLA RKNPLALRTY IHNIWEFPKK VHARGSCPGT EFLNRAAVDE
LVHVYLKNKE STSGDSNLIT WIQSVISDTG TRCVKGLLAK MPNPQKYAEL LLSPNATFEQ
QAVGLYRIQS ILNGTCDRQW REKFGFLNGD VRDVTLLRKH GPAFDFFPAG FHKSVCETPF
GTGTEGHMGY RGLRKIQIAQ QSQEKRILCM IYTHESRHEQ LRSIVETWGK GCDGFFAAST
KNDESLGAIN LLHEGPELYD NLWMKVRAMW QYAFDHFLND YDFFHIGGDD HYVIVENLKY
AVATGNWKEH WNQSVPLFLG GSVADHADLQ RRYCGGGSGY TLNRIALRRL VEELFPKSQC
WPHWTSAQED RIMAGCFRSV GIQCMDTNDY KNETRYHPWG VDYHASWTKR KKGNWHPKVL
ETILICRPIE E