Gene PHATRDRAFT_45386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45386 
Symbol 
ID7200006 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp1025384 
End bp1026782 
Gene Length1399 bp 
Protein Length454 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179561 
Protein GI219117533 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.917332 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATAACCATGA CAACCAGAAA TGATCCCCGG AGAGAAGTGG GTAGTTGGGT CGAAACTAAG 
GCCACTGCTG TCACCTGCGA AGCCGAATGT CGTCGTAGAT ATGGTGCCTT GTGGAATTCA
AAAATGGTGC AAGGAGTGAT TTCAGAAGTG CTGGTGACTC CAGGCTTTCG AACCAATCGG
TCGACAACCA ACATAAAGGC ACAATATTAT CTCGGGGGAG GAACTTTCAG GGTGAAAACA
TTGAATATTC GAAGAGTGAA ATTATTTGCG CCTTCTGCGC TTAATATCAC CAATCGCAAT
TGTGAAATAT CAAACCTTGA CCAAGCATCG TTATCTTTGC TTCCGCGAGA GGAAGCACAC
TCTGATTATC CCAAGGAGAC ACAAACTCCC ATTCCGTACA CTCCTATACC CCCCATACCT
ATGGATATCC ATCCAGGATT AGCTCAAGTG ACGGGTAACA ACAATGGAGA TGAAAATACT
GGGTTGGACT CTGAAGATCA ACCCACGGAC CGAGATGTAG CGCCCGATGC GGATGCCCAC
GGAACATTCT GGTATAACGA CAGCAATGCA ACTAAATGTC AAATGAATGG AGAGGTCTCC
TTCCGGCCGT GGGGTGTTAA AAATACTGTT GGTGAAATTT TTGGTCAAGG TACTGATTCA
CGAAGATCCG TTTCTTGTCT GGATTACTTT CTAATGATGT TTCCAACTAC AACCCTCAAT
ACAATGTCAG ATGAGACAAG TAAAGTTCTT TCTTCTATGG GTCAAAAAGA AATATCAAAT
GGAGAGATGT TAAAGTTCTT TGGTGTATTA ATCCTTGCCA CACGCTTCGA GTTTTCGGCA
AGGGCAAGCC TTTGGTCGAC AAGGAGCACC TCAAAGTATG TGCCGGCTCC AGCTTTTGGT
AGGACTGGGA TGTCTCGGGA AAGATTTGAC AAAGTGTGGC AGTGCCTTCG CTGGACCAAA
CGAGCTGATC ATTTGTCTGC TGAAATGGGG AATGAAAGCG TTCGGTGGAC TATGGTAGAT
GGCTTTGTTC AGCAATTTAA TGCTCACCGT GAAAATCGGT TCAGACCATC TGATCTACTT
TGCGTCGACG AATCAATATC ACGGTGGTAT GGACAGGGTG GCCATTGGAT AAATCATGGA
CTCCCCATGT ATGTTGCAAT TGATAGAAAG CCGGAGAACG GGTGTGAGAT TCATAACACT
GCTTGCGGTC GCAGCAGCAT CATGCTTCGG CTAAGGTTGG TAAAGACAGC AGCAGAAGAA
GCACTTAATG GAGAGCGTCA CAGAGAAACT CTTCATGGTA CAATGATTCT CAAGTATCTT
GTACAGCCAT GGACAATGTC TGACCGCATT GTATGTGCAG ATAGCTACTT TGCTTCAGTG
GTTGCAGCAG AGGAATTGA
 
Protein sequence
MTTRNDPRRE VGSWVETKAT AVTCEAECRR RYGALWNSKM VQGVISEVLV TPGFRTNRST 
TNIKAQYYLG GGTFRVKTLN IRRVKLFAPS ALNITNRNCE ISNLDQASLS LLPREEAHSD
YPKETQTPIP YTPIPPIPMD IHPGLAQVTG NNNGDENTGL DSEDQPTDRD VAPDADAHGT
FWYNDSNATK CQMNGEVSFR PWGVKNTVGE IFGQGTDSRR SVSCLDYFLM MFPTTTLNTM
SDETSKVLSS MGQKEISNGE MLKFFGVLIL ATRFEFSARA SLWSTRSTSK YVPAPAFGRT
GMSRERFDKV WQCLRWTKRA DHLSAEMGNE SVRWTMVDGF VQQFNAHREN RFRPSDLLCV
DESISRWYGQ GGHWINHGLP MYVAIDRKPE NGCEIHNTAC GRSSIMLRLR LVKTAAEEAL
NGERHRETLH GTMILKYLVQ PWTMSDRIWL QQRN