Gene PHATRDRAFT_39076 details

Gene Information

Locus tag: PHATRDRAFT_39076
Symbol:
ID: 7194801
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011686
Strand:
Start bp: 267566
End bp: 271163
Gene Length: 3598 bp
Protein Length: 1175 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002183062
Protein GI: 219125596
COG category:
COG ID:
TIGRFAM ID:
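
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record. The function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span runs from the start codon through the stop codon.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode a protein, counting one stop codon."""
    return (protein_aa + 1) * 3


# Values copied from the Gene Information fields.
gene_len = span_length(267566, 271163)
cds_len = coding_length(1175)

# Bases inside the gene span that are not accounted for by the protein.
noncoding = gene_len - cds_len

print(gene_len, cds_len, noncoding)  # 3598 3528 70
```

Under these assumptions, 70 bp of the gene span do not code for protein, which is consistent with the record marking the gene as spliced.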


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.455166
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCCTC TCGAGCATGT CCTTGTGAAC CTTTTGGGAG CGACAACGCT GGATTCGTCG 
TACCGTCGGT TCTTTGAAGA GTATGGCATT ACTCAGGCCA GTGAATTGGC CTCAATCACT
GAACATCGTC TTGCAACGGT GTCTTACGGC GTCTTGACCC CTGCTGTGGG AGACGGCCCT
GCTGCAATTG TTCGTACATT CCTTCCGCCT GCGCAACAGG ACCGGATTTT GAAGATTGTA
CAATGGTTCC TTTCGAAAGG CACCAATGTG ACAAACGACA CCTGGCTTGA ACTTACCTCT
GATGTTCTCG AGTATTGGCA ACCCGCCTCT GCTACTGTTG CCCCAGCTAC TCCTGTTGGA
TCGGATGCTC GAAGTTCCTT TGTTGAAAGT GCTGCCGCGA AATTTCGGAA GACGATCAAA
AATCATTCCG TCCCGTATCC AAAGTTCAGT GAAGACCGTT TTTGGGTCAC TTGGAATACG
AATATTCGTA TCAAGCTCCG TATTCATGGT GTTCAGTTGG TACTTGACCC GGATTATTTG
CCTGAGACCG TCGACGAGAC GGATACGTTT GTTGAAATGC AGAACTTTGT CTTTGGCGTT
TTCAACGATA TTTTGTTGAC CCCTCGTGCG CGTGGAATCC TCCACAAGCA TGTGGATGAG
CTGGATGCTC AGTCGGTCTA CCGCGACCTT GTTGCCTCGT ACGGCAAAGG TATTAACGCG
CAGATCACGG CCACATCCAT TGAAACGAAA CTTACTTTGT ATTCATTTGC GACTTCAAAG
AGCAAGACCT GTGTTGCTTT TTTGACGACC TGGCGCAATT TGATTTACGA TCTTGAACGG
ATCAACGAGT TCCCTTTGCC GGATCACCAG AAGAGCGTAC GACTGAAGTC AGCTGTCCGT
TCCCATCCGC AATTGAAACT TTTCCTCGGA AATGTTCAAC TTTACTCTCG TACCCATGTG
GGTAAGAGTG CTGATGATTC CGATTTCGAG TATGTTTATG ATTTGATGCT TGAACATGCA
ACTGATATTG ATCAGACCGA TTTGGAAGAC CGCGGTAACA ACCGTGGTGG ACGCTCAGCA
AACAATGCGA AGTTCCAGTC TTCTTCCAAG AAGAAAACTA ACAAACCGAT TGGTAAGAAG
CACAAGAATT ATGTGCCTCC TGAGAAGTGG AATGCTCTCT CTCCCGAAGA GAAGCGGACC
ATTATGGATC AACGAGGACC TCGCCCTGCT CCAGCCCCTG CCCCTGCCTT ATCGGTGAAC
GCCGCTGCCA CTCAGCCCCC TCCTACGGTG TATGTCAGTG ACTCGACGGC TGTTGACAAC
CAAAGCCTTG CTTCGACCCA CGTCCCACCT GCTGCTGGAC CTGGTCACCT GCTTCGTTCG
CTCATTTCGA ATTCAGCTGC CCGCCAGCAC TCTGCCCCAT CGAATGGAGC CACGTCTGAC
TCTTTTTCGG TCAATGGGAC CACCTATCGC CGCGAAGTGA ACCGTGCTTC TGTGCAGTAC
CGTCTTTCCA CTCACGATGT TTCGTTGAAT AAGGACTCTT TGGTCGATGG TGGTGCCAAC
GGTGGCCTTA GCGGCTCAGA CGTAACCGTT ATTTCGCAAT CCCTGTCAGA GGCAACTGTC
TCTGGAATTG GAAATTCGGA ATTGACCAAC CTCCGTTTGT CAACAGTGGC CGGACTCATT
CACACGACGG ATGGTCCCAT TATTGGTGTG TTTCACCAGT ATGCTCATCT TGGTACTGGT
AATACCATCC ACTCGTGCAA CCAAATGCGC TCCTGGGGAG TCACGGTTGA CGACGTCCCT
CGTACTTTTG GTGGCAAACA GCGTATTGTC ACGTCCGATG GTCGTTTTGT CATCCCGCTT
TCGGTTTCTG GCGGACTCAC TTACTTGTCT ATGCAGGCCC CTACCGAGGA GGACCTGGAC
ACTTTCGAAT GGGTGCCTTT TACCGCTGAC AACGAGTGGG ATCCAAATAG TCTCTCTTCT
CCTGCCGCTG CCGACGATGA CCTCAGTTTG CAGCTTCCTG TCGGCCATGT TCCGTTCCGT
GACGAACGCA TCAACAACTT TGGTCTCCTT GCGCATTCCG CGGCAGTCAG TCGATCCCCT
TTGAATGTCG ATGCTTTGCA ACCCAATTTT GGATGGGTTC CCAGTGCTCG TATCGCTCGT
ACGTTTGAAA ATACCACGCA ATTTGCTCGT GCCGATGCCC GTTTGCCCTT GCGCAAACAC
TTCAAATCGC GTTTCCCTGC TGCCAATGTC TCTCGTCTGA ACGAAATTGT GGCAACCGAT
ACTTTTTTCT CGGATACCCC TGCGGCCGAT GACGGCATTT TTAACCATGG TGGGGCTACG
ATGGCCCAAC TTTTCGTTGG AAAAAGTTCG CAAATCACCT CTGTCTTCCC GATGAAGCGC
GAGTCTCAGT TTGCCCATAC TTTCGAGGAT TTTATCCGTA CCCATGGTGC TCCCGATGCC
CTCCTCAGCG ACAATGCCCG TGCTCAGATC GGTAAGCAGG CACTTCAGAT CTTGCGCATG
TATGCGATCG ACGACATGCA GTGCGAGCCG CATCATCAGC ACCAAAATTA CGCGGAACGC
CGCATTCAAG AGGTGAAAAA GATGGTGAAC ACAATCATGG ATCGTACAAA CACTCCTCCT
GAATATTGGT TGCTCTGCTT ATTTTATGTG ACCTACTTGC TCAATCGCCT CTCTGTCGAA
AGCTTGAATT GGCGTACCCC GCTTCAGGTT GCCCATGGAC AGCGTCCCGA TATTTCTGCT
TTGCTCCTTT TTCGTTGGTT TGAGCCCGTT TATTATTACG ACCCTGACCA TGCGTCTTTC
CCATCGCATT CTCGCGAGAA AACTGGTCGT TGGATTGGTG TCGCCGAACA TAAAGGTGAT
GCGCTGACTT ATTGGATTTT GACAGACAAT ACTCACCAGG CCATTGCTCG TTCTGTTGTT
CGTCCAGCCA ATGTCGATAA TGGTTTGAAA AACCATCGTG CTGCGGATTC CTCTCCCGAT
GGTGGGGAGC CCTCGAATCC TAAGCCCATT GTCTTGGCTA CGAGTGACCT ACGCCATGAC
GCTACGATTG ATCCATCTTT TGAGAAATCC CATGCATTCT CTCCTGACGA ATTGATCGGC
AGATATTTGA TTCGTGAAGC CCCTGACGGC CAGAGCCATC GAGCCCTTGT TGCTCGTAAA
ATTATTGATG CCGACTCCGA TAACCACCAG GCAATCCGCT TCTTGTTGCA AATTGATGAA
AAGGATGCTG ACGAGATCAT TTCGTACAAT GAACTCTCCG ATTTGATGGA AGCCCAACAA
TCAGAGCCCG CTACGAACGG AAATATCGAA GATCATTTCA AGTTTACTAG TATTATTGGA
CACCAAGGCC CTTTGCAACC GACCGATGCG GGCTACAAGG GATCCTCTTG GAATGTTTTG
GTTCAATGGG AAGATGGTTC CCAGTCGTAC GAACCTCTAA TTGAAATGGC AAAGGACGAT
CCAGTCACAC TCGCGATGTA CGCGTCTGAC AACGATCTTC TTAACGTGCC CGGGTGGCGC
CGCTTCAATC GTCTGCTTCG CAACCGTGAT GACTTCAATC GATCTGTTTC GTTAGTGA
 
Protein sequence
MDPLEHVLVN LLGATTLDSS YRRFFEEYGI TQASELASIT EHRLATVSYG VLTPAVGDGP 
AAIVRTFLPP AQQDRILKIV QWFLSKGTNV TNDTWLELTS DVLEYWQPAS ATVAPATPVG
SDARSSFVES AAAKFRKTIK NHSVPYPKFS EDRFWVTWNT NIRIKLRIHG VQLVLDPDYL
PETVDETDTF VEMQNFVFGV FNDILLTPRA RGILHKHVDE LDAQSVYRDL VASYGKGINA
QITATSIETK LTLYSFATSK SKTCVAFLTT WRNLIYDLER INEFPLPDHQ KSVRLKSAVR
SHPQLKLFLG NVQLYSRTHV GKSADDSDFE YVYDLMLEHA TDIDQTDLED RGNNRGGRSA
NNAKFQSSSK KKTNKPIGKK HKNYVPPEKW NALSPEEKRT IMDQRGPRPA PAPAPALSVN
AAATQPPPTV YVSDSTAVDN QSLASTHVPP AAGPGHLLRS LISNSAARQH SAPSNGATSD
SFSVNGTTYR REVNRASVQY RLSTHDVSLN KDSLVDGGAN GGLSGSDVTV ISQSLSEATV
SGIGNSELTN LRLSTVAGLI HTTDGPIIGV FHQYAHLGTG NTIHSCNQMR SWGVTVDDVP
RTFGGKQRIV TSDGRFVIPL SVSGGLTYLS MQAPTEEDLD TFEWVPFTAD NEWDPNSLSS
PAAADDDLSL QLPVGHVPFR DERINNFGLL AHSAAVSRSP LNVDALQPNF GWVPSARIAR
TFENTTQFAR ADARLPLRKH FKSRFPAANV SRLNEIVATD TFFSDTPAAD DGIFNHGGAT
MAQLFVGKSS QITSVFPMKR ESQFAHTFED FIRTHGAPDA LLSDNARAQI GKQALQILRM
YAIDDMQCEP HHQHQNYAER RIQEVKKMVN TIMDRTNTPP EYWLLCLFYV TYLLNRLSVE
SLNWRTPLQV AHGQRPDISA LLLFRWFEPV YYYDPDHASF PSHSREKTGR WIGVAEHKGD
ALTYWILTDN THQAIARSVV RPANVDNGLK NHRAADSSPD GGEPSNPKPI VLATSDLRHD
ATIDPSFEKS HAFSPDELIG RYLIREAPDG QSHRALVARK IIDADSDNHQ AIRFLLQIDE
KDADEIISYN ELSDLMEAQQ SEPATNGNIE DHFKFTSIIG HQGPLQPTDA GYKGSSWNVL
VQWEDGSQSY EPLIEMAKDD PVTLAMYASD NDLLN
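
The sequences above are laid out in blocks of ten characters, so they need to be cleaned before use. As a minimal sketch (not part of the original record), the Python below strips the whitespace, translates the first ten codons of the gene sequence, and compares the result with the start of the protein sequence. The helper names are hypothetical. Because the Translation table field is empty, the standard genetic code (NCBI table 1) is assumed here.

```python
from itertools import product

# Standard genetic code (assumed), in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; a trailing partial codon is ignored."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence and first 10 residues of the protein.
gene_start = "ATGGATCCTC TCGAGCATGT CCTTGTGAAC"
protein_start = "MDPLEHVLVN"

print(translate(gene_start))                   # MDPLEHVLVN
print(translate(gene_start) == protein_start)  # True
```

Because the gene is marked as spliced, translating the full genomic sequence end to end is not expected to reproduce the complete protein exactly. The same `gc_fraction` helper can be applied to the full gene sequence to compare against the GC content field.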