Gene PHATRDRAFT_29633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_29633 
Symbol 
ID7194765 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011686 
Strand
Start bp61680 
End bp65055 
Gene Length3376 bp 
Protein Length949 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183150 
Protein GI219125779 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCGATCGACC GAATCGTTGT TGCTTCCTTT CGTACATCTT CAAGGACTCG CTGTAACAGC 
CAATCCTTGT CCATTGTATT TAACAACACC CAGTCTTTCG GTTGTCACTC TCGATTGTGT
AACCATTCAG ACAAGGCAGG CACTCTACGG CCAACATGGT AGAGCACGCA GCACCTTCCG
CCTTGACCGA CCAGAAAGTC GCCATTGTCG GTGGCGGAGT AGCGGGATTG TCCGCTGCTT
GGCACTTGTC CGTCAACACC GGTGCACACG TCCAACTCTT CGAAGCGGAA TCCCGTCTCG
GCGGGCACGC CTACACAACC AACGTGGACG GGGTCGACGT GGATATTGGC TTTATGGTGT
ACAACGAAAC CAATTATCCC AACATGGTGG AGTGGTTCCG GACGTTGGGT GTGACACAGG
AAGACTCGGA TATGAGTCTT TCCGTCAGCC TCGACGGGGG TGACACGGTC GAATGGAGTT
CCGACGGACT CGACGGTCTC TTTGCGAATC GCCGACAGCT CGTTAGCCCC CCGTTCTACC
GTTTCCTCAA GGACATGATC CGTTTCAATC AGCAAGCCGC CAATATTCTG CTCCTCACCG
ACGACGATCC CCGCAAACAC GTCACCACGG CACAGTATCT CCGCGAACAC GGATATTCAA
CAGAATTTGC CAAATTCTAC GTATTTCCCA TGATGGCGGC GCTATGGAGT GCCAGTATGG
AGGACGTCTT GCGATTCCCC GCCGCCCAGC TCATTGGATT CTTGTGCAAT CACAAAATGC
TCCAACTGTT CGATCGACCG CAGTGGAAGA CGGTCGCCGG AAGGTCGCAG CAGTACACCA
ACCTGGTACA GAGTATTCTC GGCTTCGAAG CGGTCCATTT GGACACGCCC GTTCACAAGG
TGGAAAAACT AGAGGACCAA ACCTACCGTC TCGTTACTTT GCGGAAAGAG GACGGCGAGC
ACGGATCAGC CACCGAAGTC TCCCTCGGCG TGTTTGATCA AGTAGTCTTT GCCTGTCATC
CACCCACGGC CCACGACATT CTGCAACGCA GCACTAGTGT GAGCAACAAT CCCACCAATA
AGGAGCACCA AGATCACCAG CTTCTCCTCC AACTTTTGGC ACAGATTGAA TACGCCGACA
ACGTTGTGTA CGTCCATTCT GATCCGTCAC TCATGCCGAA ACGTCGTCAC GCATGGGCGT
CCTGGAATTG TCTCGGACGC AGTCAACACA TGCTACCCTT CCATTCCACC AGTAGCAAAA
AAGGCGAAGC CTTTGAAGGT GCCGAATCGG GATTTGGTAA CGTTGCACAC TCTGCACTAC
CCGAAGACCA AGATTCATTC CACGAAAAAA CAGCAGTACC AGCACAGCCT ACGCTGGAAG
GCATCCACGG TCGCATGAAG GCAGTCTTTG TGACCTACTG GTTGAATCGG CTGCAAAATC
TTGAAACGGA CCGCGACATT TTCGTATCAC TCAATCCCCA CCACGCGCCG GAACCAGCCT
TGACGCACCA GCGGGTCATT CTCGCACACC CTCAGTTCAA CAGTAAGACG CTCCAAGCCA
GAGAAAAACT CGGCGCTCTC CAGGGAAAGG ACGGTCTCTG GTTTTGTGGT GCCTGGTCCG
GGTACGGGTT TCACGAAGAC GGATGTCGCG ATGGATTTCG AGTCGCCACG GCCATGTCCG
CTGTTCCCTT ACCGTGGGTG ACGGAGGCGC GGGCTACTGA CGAGACTCCT AGTACACCAG
ATGCACTCAT CTTGCCACCA CCGGATCTCT CGAGCAATCA CACCCGAATG ACCTTGTGGC
AAGCCCTGTA CCAACGTGTC ACGTACGATT TGCCCGTCGC AATCTGCCGA CAACTTTCCT
TTTATTTCAT GAAGCAGGCC GTCCAAATGG GTCGCTTGCG TCTGAAGTTT AACGATGGAT
CCGTGGTTAG CTTTGGCGAC GGCACACCGT GTGGCTGTGA CACTAGTGAC GTGACAATTC
GCGTATTTGA TCCATGGTTC TTTGTCAAAC TGGCGACCGA ATACGACCTG GGTCTCGCAC
GATCGTACAT GGCTGGGCAT TTCATTGTGG AGCCTCTCGA AAAGACGTCA TCCTACCATC
CTGTCATTCG CCCGGAACAT GCGTCCGAGG AAGCAACCAT CACTCTGGGT GATCCGATTG
GCTTGACCCG TCTCTTTTTG CTGCTGATCG GGAACCGTGA TGACAATGCT GCAAAAGCAC
ACATACCTCG TCGAGCGGGT CGGGGGCACA AGTACGCCAA CGCGTTGTCC AACGCATCGG
GATTGGTATT GGCTCAGATG GGTTCCTTCG TCAATTACCT TCGCTACAAA TTGACAATGG
ACAATTCCGA ACGAGGGGGT AGCCTAAAAA ACATCCACGC GCACTACGAT CTTTCCAACG
ATCTATTCAA GACTTTCTTG GACAAGGAGA CACTTATGTA TTCGTCGGCG ATTTACGATG
CCGTGCCTGC TCCACGCCCG CACTCTGGAC TCGTCTTTCG CGGGTCCCTC GAAGAAGCAC
AGTGGCGTAA GTTGGATACA CTGTTGGATC GTGCCCAGAT TCAGCCCGGA CAAACGGTCC
TGGACATTGG CTTTGGTTGG GGCGGATTGT CTATTCATGC CGCCAAAAAG TACGGATGCA
AAGTGACTGG TATAACGCTT TCAGTGGAGC AAAAAGCACT TGCCGAAAAG CGCGTCAAAG
AAGAAGGTAT TGAGTCCCTC ATTACGTTCG AAGTCGTGGA TTATCGGACC TTTTGCGCGC
GCAAGAGCAA CTGCGGTATG TTTGACCGTG TGCTGAGCTG CGAAATGATT GAAGCTGTTG
GACACGGTCA TTTGGTAGAA TTCTTCTGGG CCGTCGAACA GGTCCTGTGT CGTGACGGCG
TCCTTGTTAT GGAGGCTATT ACGACACCAG AAGAACGATA TGAGAACTAT TTACGTTCGA
CCGACTTTAT CAACACCATA ATCTTTCCTG GCTCCTGCTG CCCTTCCTTA CACGCGCTCG
TAGACGCCGC CTACCGAGGA TCGACGTTGA CGCTAGAGCA CGTCGATAAC ATTGGACTGC
ACTACGCCCA AACTTTGGCT GAGTGGCGTC GTCGTTTCAA CGCCGAAGAA CCCTTTGTGC
GCCAGCTTGG TTTTGACGAT GTCTTTTTAC GGGCCTGGAA CTATTACCTG ACATACTGCG
AAGCGGGTTT CTTTTCACAA ACGGAGAATT GCTTGATTTT GGTCTTTGCC CGACCAGGAT
GCAAGGCATT GACGGCTTTG TGCGAGACGC GGTCAGTGGT GCAAGCATCA CCTTTTAGCG
ACAAGGAGAT CGAGACTTTT GTGGCCGAAT GCAAATAGGC AATTGGCAAG TAAAATGATC
GAACATATAG CTTCGA
 
Protein sequence
MVEHAAPSAL TDQKVAIVGG GVAGLSAAWH LSVNTGAHVQ LFEAESRLGG HAYTTNVDGV 
DVDIGFMVYN ETNYPNMVEW FRTLGVTQED SDMSLSVSLD GGDTVEWSSD GLDGLFANRR
QLVSPPFYRF LKDMIRFNQQ AANILLLTDD DPRKHVTTAQ YLREHGYSTE FAKFYVFPMM
AALWSASMED VLRFPAAQLI GFLCNHKMLQ LFDRPQWKTV AGRSQQYTNL VQSILGFEAV
HLDTPVHKVE KLEDQTYRLV TLRKEDGEHG SATEVSLGVF DQVVFACHPP TAHDILQRST
SVSNNPTNKE HQDHQLLLQL LAQIEYADNV VYVHSDPSLM PKRRHAWASW NCIHGRMKAV
FVTYWLNRLQ NLETDRDIFV SLNPHHAPEP ALTHQRVILA HPQFNSKTLQ AREKLGALQG
KDGLWFCGAW SGYGFHEDGC RDGFRVATAI NHTRMTLWQA LYQRVTYDLP VAICRQLSFY
FMKQAVQMGR LRLKFNDGSV VSFGDGTPCG CDTSDVTIRV FDPWFFVKLA TEYDLGLARS
YMAGHFIEAT ITLGDPIGLT RLFLLLIGNR DDNAAKAHIP RRAGRGHKYA NALSNASGLV
LAQMGSFVNY LRYKLTMDNS ERGGSLKNIH AHYDLSNDLF KTFLDKETLM YSSAIYDAVP
APRPHSGLVF RGSLEEAQWR KLDTLLDRAQ IQPGQTVLDI GFGWGGLSIH AAKKYGCKVT
GITLSVEQKA LAEKRVKEEG IESLITFEVV DYRTFCARKS NCGMFDRVLS CEMIEAVGHG
HLVEFFWAVE QVLCRDGVLV MEAITTPEER YENYLRSTDF INTIIFPGSC CPSLHALVDA
AYRGSTLTLE HVDNIGLHYA QTLAEWRRRF NAEEPFVRQL GFDDVFLRAW NYYLTYCEAG
FFSQTENCLI LVFARPGCKA LTALCETRSV VQASPFSDKE IETFVAECK