Gene PHATRDRAFT_34964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_34964 
Symbol 
ID7199949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp767088 
End bp769052 
Gene Length1965 bp 
Protein Length654 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179287 
Protein GI219116985 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACCA TGGCCCTCGT TGTGGGCGTT CTCGCTCTAT CTCTTCGTAT GGGTGATTCC 
TTTTCGTCAC AATGTACAAA ATTACCACGG AGAACGGCAC TACCGTCTCG GCGTACGAAT
CGTTACGTTT TATGGAGCAG CAGTAGCCGT AGGAGTACAG TTGCACCACC GGATCTCGAT
ACGGGTGCGA ATCCGATTCC GGACGCCGAC GTAATTACAT CCACCGACGC TGTGGTGGTA
GGCGGAGGTC CCACTGGACT CTTGACAGCT ATCATGTTGG CACAAAAGTA CCCCGACCGA
ACCCTCCAAC TCTACGATCG TTTGGCGGAA CCGCCGTCTC CAGACGACGA CACCGTCTGG
AACGATGTGG CCAAGTTTTA TTTGATCGGC TTAGGAGCAC GTGGACAATA CGCCTTGCAA
ACCTTTGGAG TATGGGACGA AGTTGAGCAA CGCTGCGTAG CGGTCGTCGG ACGCAAGGAT
TGGCCGCCAG ACTCGGAAGA AGGTGTGGAA CGGATCTTTA CCAAAGAAGA CAAGAAGGTC
GCAACGCAAG TCTTGCCTCG TGACAAACTA GTCGGTGTCT TGCATCAGCA TATTCGGGAA
AATTATGATG GAAAAATATT CTTAAATTAT GGCTACGAAG TGCATCCGGT AGATTTTGAA
TTTCGTGGCG GTAGTCAAGT CCTACTGCAG ATTGTCCAAT GTTCCGAGAC CGTCGTACGG
CTGAATCCTT CGGCTGTACG AACAGCTATA GATAAACAGG ATGAGATGCT ATGTGATACC
CAAGGTGGCA AGTTTGTGGC GTCAGATTTG GTGATTGCTG CGGATGGTAC GGTTCGTACC
ATTGCCAATG CCATGGAACG TCAAGATCAG CAGCGATTCC GTGCAATGAA TCCACTCCAA
AGGCTAAGGG CTGGCCGACC TTTTCGCGTG AAGCGATACC TTGATGACAA TCAGCGTATA
TATAAAACCA TTCCGATGAA AATCCCCAAG GATTGGCGCC CTGACCTAAA CTACTCGGCT
CGTACGAAAG AGGGACGAAT CAATTACGAT GCTCTACCGG CCAACGAAAA TGGGGAATAC
TGCGGGGTGT TGCTGCTCAA AAAAGGAGAC CCAATGGCGC AGGCGGACAC CAGTCCAACC
GAGCTCCGCC AATTAATGGA CGACGTTCTT CCACAATTTA GTGCTCTTCT GGATGACGAG
GTTGTCGCTG CGGTTGCCCA AAAGCCTGTT TCGTATCTAC CGGGCTTTCG GTACGCCGGT
CCGCGTCTTA ACCAAGGCGA CCGTTGTGTA CTTCTCGGCG ATTGCGCCCA TACTGTCAAG
CCGTACTTTG GACTCGGTGC CAATTCGGCC TTGGAGGATG TTAAGATCAT GAGCGAGATT
CTCGATGCCA CGGAACATGA TATATCGGCG GCGGTCCGTG AGTTTTCACG ACGCCGGGCT
CCCGAATCGG AAAGCCTAGT GCGCATCTCC CGTGACCTCG ATCGTCCCGG AAAGATTGGA
TTCGTCACGT TTATTCTGCC TCTGATCCTG GACTCTATCT TTAGCAAAGC CATGCCGAAA
TTGTTCCAAC CTAATATCAT CACCATGCTG CAAAAAGAAA ACTGGACTTT TCGACAGGTA
GCATCACGAA AACGGCTGGA TCGGCTAGGA CAGCTTTCCA TCATTGCGGC AGGCTTAACA
GGGATGGGTT TCGTTGCGCG AGTGTTAGTT CGTTCGGTGG CAAGAATGAT GGGCAAGAGT
ACGACAAAGG TTGCAATGGG ACTTATCGGA GCCGCCTTTG GAATTGGGCT GCTCCGACGG
TTCGCTGGGC TAGTGGCACC AGGATCAGCA CCAGCCGACG TTGTCACCAA AATGGCTACA
AACAAAAAAT CCAAGGAGCA AAGCGATAGC CCACTCAGCA GTCGACAATC ATTTCTGACA
CCTCGTCTTG GCTTTAGCAA CAAAGGAGAA AGGAAGTCCA AGTAG
 
Protein sequence
MTTMALVVGV LALSLRMGDS FSSQCTKLPR RTALPSRRTN RYVLWSSSSR RSTVAPPDLD 
TGANPIPDAD VITSTDAVVV GGGPTGLLTA IMLAQKYPDR TLQLYDRLAE PPSPDDDTVW
NDVAKFYLIG LGARGQYALQ TFGVWDEVEQ RCVAVVGRKD WPPDSEEGVE RIFTKEDKKV
ATQVLPRDKL VGVLHQHIRE NYDGKIFLNY GYEVHPVDFE FRGGSQVLLQ IVQCSETVVR
LNPSAVRTAI DKQDEMLCDT QGGKFVASDL VIAADGTVRT IANAMERQDQ QRFRAMNPLQ
RLRAGRPFRV KRYLDDNQRI YKTIPMKIPK DWRPDLNYSA RTKEGRINYD ALPANENGEY
CGVLLLKKGD PMAQADTSPT ELRQLMDDVL PQFSALLDDE VVAAVAQKPV SYLPGFRYAG
PRLNQGDRCV LLGDCAHTVK PYFGLGANSA LEDVKIMSEI LDATEHDISA AVREFSRRRA
PESESLVRIS RDLDRPGKIG FVTFILPLIL DSIFSKAMPK LFQPNIITML QKENWTFRQV
ASRKRLDRLG QLSIIAAGLT GMGFVARVLV RSVARMMGKS TTKVAMGLIG AAFGIGLLRR
FAGLVAPGSA PADVVTKMAT NKKSKEQSDS PLSSRQSFLT PRLGFSNKGE RKSK