Gene PHATRDRAFT_45345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45345 
Symbol 
ID7200034 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp909410 
End bp911320 
Gene Length1911 bp 
Protein Length575 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179534 
Protein GI219117479 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.824285 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCTTTTGTTC CACACACTCC AGTCCATCGT TGAAGGCCCC CACTTGTTGG TTTGGTACCT 
TCGCTCTATA CCCTTTTCAG TTCTGTGCTA TTGTTCACCA TTGCCTCGGT TTTGCAGACT
CTTGGTTACG AGGCGAGGCG AGTTTACGAA CGCGGTGCAT CATGAAGGAT AGTAGATGTA
CGAGGCTGGG CGCGCCGTTA CTCGTACTCT CGTTGTCGTC CTGGTCCGGG GTGGTGAAAG
CCTGGACGAA TCATTTCGTC CACAATCCCC AACGTCACTC CTCGGCACGG AATCCTGGCG
TAGCCTGGCG TGGAAGACCT TCGTATCGCG TCGGACCGTT GCGGGAACTC GTAGTGACGG
ACCAATCGTT CCGACCGACA CGGGGAGTAC CACTCCACGC CACGGGGAGC CGTGATGGCC
GTGCCACCCA AGCCGACATT GAATGGGACG AAGACGAATA CGATTTCGAT CAGCAAAAGA
GTATGGACGA GTATCGGACC CAGTTTCAAG CACTCGCCGC CGAAACGTCC CAAAATCCCC
ACGCCGTACA GCAAGCCCAG GATCTCTTTG ACGAGCTCTA CAAGGCGTAC ATCATGACGG
AAGACGCCTC CTACTGGCCC GGGACCGATA TTTACAATCT TCTGCTCGAA ACACACGCGT
ATTCTCCGCA CAAGAATGGT GCAGTGGAGG CAGAAGCCAT TGTTGCCCGT ATGGAACAAC
AGGAACACGG GGTGGCCAGG CCCAACGTCG CAACCTACGC CAAACTTATG GAGGCGTGGA
CGCAACGCAA ACGCTTGGAT AAGGTCACGG CCGTATGGGA ACGCATAGCC GAACAGGGAT
TGCAACCCAA CATTTCTACT TACAACAAAC TCATCAAAGC CTACGGGGTT GCGGGCAAGG
CGGAACAGGC TTTGCAGGTA CTGGAAGATT TGTTGCTGCA ACACCAAGGG AAAGACAACA
ACAAAGAAGA GGACGACCAG ACGGATGCGG AGACCAGCAC AAAACCCACG CAGAAAACCT
GGGTACAAGT GTTACGGGCC TTTGCCAGCA AAAAATACGT CCGGAATGGA GAGGGCGTGG
ACCAAATACA AGCCTTGCTG CGCCGCATGG CACAGGCCTA CCGACAGGGC GAAGCAGACT
GGAAACCAGG TGTGGATGCC TACAACTCCT TGCTCAAAGC CATGAGCTTT CAAAAGGGAT
CCGGTAAAGA GTCGGAAAGC GTTCTGTACG GCATGCTGGA ACAGTTCCGG GAAGGAGAAG
AAGCGTTGCG GCCGAACGCG GGTAGTTTTT ATCACGTGTT GCACGCGTAC CGGGGTGACA
AGGATGCGGG TGTTTCCTTC AAGGTGGAAA AGTTGATACA GTTACAGGAA GCTTTGGCAG
TCGAGCGCAA CGACCCAAAC GATCCAGCTC GAACGACAAC TCGCGTCTAC AACGCCGCCA
TGGCGGCTCT ATCGCGGACC AAAGATCCGC AAAAGGCCGT CCGCGCCAAG AGGTTTATGG
ACCGTATGAA TCTGCAGCAT AACGATCCAG ACATGCGTCC GAATGAAGCC ACATACACGA
GTTTGCTGAA CGCGTGTGCG TACACAACCG AAGGTGAACC AGCCGACAAG CTGGCCGCGT
TTCAAATTTC CGTGGACGCG TTGAAAGAAA TCCGCCAATC CCCATCCATT TCGACAAATT
CAAAAATGTT TGGGTTGTTT TTGAGAGGAT GCGCCAATCT CATGCCGCAT AGTCGAAAAC
GCGATGCGGT GGTAGAAAGT GTATTCGCAT CTTGCTGTGA CGAGGGCTAC ATTTCAGACT
ACGTCCTTGA ACAGTTTGAA AGGGCGGCCT CGGAACAATT GCAGCTCAAA GTGTTGGGCG
GCTTTCTTGT AGATGGCGTC GAAACTCCAG CCGCATGGAG ACAAAATGTA G
 
Protein sequence
MKDSRCTRLG APLLVLSLSS WSGVVKAWTN HFVHNPQRHS SARNPGVAWR GRPSYRVGPL 
RELVVTDQSF RPTRGVPLHA TGSRDGRATQ ADIEWDEDEY DFDQQKSMDE YRTQFQALAA
ETSQNPHAVQ QAQDLFDELY KAYIMTEDAS YWPGTDIYNL LLETHAYSPH KNGAVEAEAI
VARMEQQEHG VARPNVATYA KLMEAWTQRK RLDKVTAVWE RIAEQGLQPN ISTYNKLIKA
YGVAGKAEQA LQVLEDLLLQ HQGKDNNKEE DDQTDAETST KPTQKTWVQV LRAFASKKYV
RNGEGVDQIQ ALLRRMAQAY RQGEADWKPG VDAYNSLLKA MSFQKGSGKE SESVLYGMLE
QFREGEEALR PNAGSFYHVL HAYRGDKDAG VSFKVEKLIQ LQEALAVERN DPNDPARTTT
RVYNAAMAAL SRTKDPQKAV RAKRFMDRMN LQHNDPDMRP NEATYTSLLN ACAYTTEGEP
ADKLAAFQIS VDALKEIRQS PSISTNSKMF GLFLRGCANL MPHSRKRDAV VESVFASCCD
EGYISDYVLE QFERAASEQL QLKMASKLQP HGDKM