Gene PHATRDRAFT_38757 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38757 
Symbol 
ID7203745 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011685 
Strand
Start bp133754 
End bp135490 
Gene Length1737 bp 
Protein Length559 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182778 
Protein GI219125000 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0269601 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGGTCC CTAGTAGCGA CAACGGACTT GCTACTTTCT CACAATTCCC CGTCACGAAG 
AAGAACGGCA GGCATTCTAG AAACCCAGCT ACAGGTACCC CGTTAAGAGT TGATCCGGAT
GACGTATCCA ATGACGCGAC CTCTGCTCAC TTCACAGCTC ACTGTCATAT ACAATCAAAG
ACATTGAAAA GAGCCCTCGT CTTCAGTCGT AGGCGTACTT TAGATATCTT TTTCATTGGT
TGCCTCGCTA ACGCCTTCTT TTGCATCATA ATTGTATTCG GCATACTGAG GCAGTCTGGT
TTAACTGCTA CGACGACGAC CCTCGAAGAC GAAAGAATAG GGGCCTTACG CATGCCGTTT
TTAATCAAGA ACCAGACTTT GCAGAAAGAG TTGTACAATG CTTCGATGGC TGTAGATTCT
GGGCCCCTTG CTGTTGATAC CGTCGTAGAA TCGTTTGTAA TCAGTAGCGG CCGTACATCT
GACCGAGCCT TCAACCAGAT TATGCAGGTC ACCGAAAGGT TCTCGGCCTG CATCCTGTTT
ATGGACGACA ATCCCCGTCT TGTGGAGTGG CTAGCCTACC ATTTCTTTGC CTTAAACTTG
CGCGAAGTTG TCGTGGCCGT CGATCCCCGA AGCAAGTCGA GTCCGTGGCA GTCTCTAGAA
CGCTGGACGC CGTACATGAA TATCACTGTC TGGAACGACA CCGACTTCGG CTTTGTGGTT
GACCAATATA TTACGGTCAA CGGAACTCGC AAGCAAAAAA TTGATGTACA TCGTGGGAGA
CAAAAATTCT TTTACGGAAA ATGTATAAAG TATCTACAAG GACGAAATCG GACATGGACC
GCATTCCACG ACATCGATGA GTATATCACT GTTGACGAGC GGGTCGTTTT CGATGCCAAA
GAGCGTAGTT CCAAGCCTGG AAGCGTCTTG CAGATGCTTC AAGAGGTGAA GAGCATGAAA
CCTGTTCCTG ACGGTTGGAC CGAGAGCTGT GTCCCCGTCC CACGGTGTCG TTTCTCAGCT
GTGGAAAGCC AACCAGAAGA GGTAAGCCTG GAGGTCCCTC CTCTTATTGA TGCAAAACAG
CTCGAAACTC TACGCTGGAG GTATCGCTCT TTAAAAGGTC GAGATGGTCA GCCAAAATCT
ATTGTTGATA TTTCCGAAGT CACGCTGCAC AGAAATACTA AGTTCGGCCC TCATGCAGTT
ATCCTTGGAA TTTGTCCCCC GCATCTTTTT GATCGCAGTT TTTTAGTGAT AAATCACTAC
TTGGGCGACT GGGATATGTA GTAAGTGTCT GTGTTGGTAC AATAATGATT CAAAGTTTGT
TTGTGGAGCG TGTACTCTTT CTACTCCCGC AACCTTGTAA AATCTTGCGT TCTAACAGTA
CGTCTTGTTT ACCATGAAAG TTCGTTTCGC GATGATTGTC GCATAGGCAG CATGAAAAAC
AGAGAGGCTT GGGAATTCCG ATCCAGCGAG AGTGAAGGCG GTACAACCGA CCAGATCCGA
CCTTGGATTG GTGGGTTTGT AGCAGCCATG GGCGAAGAAC GCGCGTTGCA GTTGTTGAAA
GATGTTGGAC TGCCAAAGAA TTACACGAAC CCTTACAACA AAACGGAATG GAGGATTGAA
CAAAGCACGT TGGATGCCTT GTTGAAAAAG CGCCCGAGAA GGGCTACGAA CTACGTCAAG
TTTCTTGAGC AGCGGATTAG GCAATCCAAT ATAAATCATT CGACTGATTA TAATTAG
 
Protein sequence
MEVPSSDNGL ATFSQFPVTK KNGRHSRNPA TGTPLRVDPD DVSNDATSAH FTAHCHIQSK 
TLKRALVFSR RRTLDIFFIG CLANAFFCII IVFGILRQSG LTATTTTLED ERIGALRMPF
LIKNQTLQKE LYNASMAVDS GPLAVDTVVE SFVISSGRTS DRAFNQIMQV TERFSACILF
MDDNPRLVEW LAYHFFALNL REVVVAVDPR SKSSPWQSLE RWTPYMNITV WNDTDFGFVV
DQYITVNGTR KQKIDVHRGR QKFFYGKCIK YLQGRNRTWT AFHDIDEYIT VDERVVFDAK
ERSSKPGSVL QMLQEVKSMK PVPDGWTESC VPVPRCRFSA VESQPEEVSL EVPPLIDAKQ
LETLRWRYRS LKGRDGQPKS IVDISEVTLH RNTKFGPHAV ILGICPPHLF DRSFLVINHY
LGDWDMYLFV ERVLFLLPQP LRLVYHESSF RDDCRIGSMK NREAWEFRSS ESEGGTTDQI
RPWIGGFVAA MGEERALQLL KDVGLPKNYT NPYNKTEWRI EQSTLDALLK KRPRRATNYV
KFLEQRIRQS NINHSTDYN