Gene PHATRDRAFT_30334 details

Gene Information

Locus tag: PHATRDRAFT_30334
Symbol:
ID: 7195792
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011690
Strand:
Start bp: 189069
End bp: 191513
Gene Length: 2445 bp
Protein Length: 645 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002184078
Protein GI: 219127721
COG category:
COG ID:
TIGRFAM ID:
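
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the protein is followed by a single stop codon. The function names are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon (assumption)."""
    return (protein_aa + 1) * 3


# Values copied from the Gene Information fields.
start_bp, end_bp = 189069, 191513
protein_aa = 645

gene_length = span_length(start_bp, end_bp)
coding_bp = coding_length(protein_aa)
# Bases in the gene span that are not accounted for by the coding sequence.
noncoding_bp = gene_length - coding_bp

print(gene_length, coding_bp, noncoding_bp)
```

Under these assumptions the span matches the listed gene length of 2445 bp, and the coding sequence is shorter than the span. That is consistent with the gene being flagged as spliced, although the record itself does not say how the remaining bases are divided between introns and other non-coding sequence.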


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGACTCACGA GTGTTGGCAG GTATTCTAGC AGACCAGCAC AATCCCCATG ACATTGGGCT 
GTAGTCCGTC GTTTCTTCGT CTCTTTGTAC ACTTTCAGCT ATCAGAGTGA TTCCAGTAGT
TTCCGACGTA GCGGTCATTT GGTTTTGGGG AAAAGAGCAC CAACAGTCGC AGGCACGTTT
GAATCTGTTG CTTGCCTTGC AGTACTGTAC CGACTCAACC TTCCGCACCG CCATGCCCAT
TCCGATGGAT GTGGACCCCG TCGACGATGA CCCTCGTCCC TCTTCATCTC CTTTGTCGTC
CCCTCTTCTG TTGCGGACAG TGGTTCCTAT CACCCGGGAA GAAGAGGCTC GACAAGCGAT
TGAACTACTG CGAGGAAACG ACATGGCCGA ACGGGTGGCG GCAGCGCACC GCTTGGATGC
CGTTGCAGCT ATTTTAGGCG AGGAGCGCAC ACGAAACGTA AGACAACAAC TACCAACATT
TATACTGAGC GCGTGGATGA GACAATGGAA TCGACGTTAG CTAGCGTGTG AATCGTAATC
AAGGTATCTC ACACTGGCCT TTGTGTTTCT CTAGGAATTG TTACCGTTTC TGAGTGATGC
TGTGGATGAT GAAGATGAAG TGCTGTTAGC AATGGCACAA GCACTGGGTC GCATGATAGA
TCTTGTTGGC GGGCCAACGC ACGCCGAAAG CTTGTTTCTA CCTCTGGAAC TCCTTCTGAG
CGTCGAAGAA ACGACGGTGC GCCAGGCCGC CGCCGAAAGT ACACTCTTTA TTGCGTCGCA
ACTTTCCGAG TCCGACTACC AATCCTGTTA CGCAAAAATG ATCGCACGTT TGGCTACGCA
AGAATGGTTC ACCGCCCGTA TTTCTGCCGT GGGCCTCTTG TCACAGGCCT ACACCAGTCT
GCGTGTCGAA CAACAGAAAG AACATTTGGA ATTTTATGCG AATCTCGCAA AGGACGATAC
TCCCATGGTC CGTCGAGTAG CGGCACAATA CTTGGGACAA ATGGTACAAA ATGTGGTCCG
AGCTACAGGA AGGAGCTCTT TGTCGGAGAC CGGAAGTGTC GTAACAACTC TCTTGCCCAT
ATTTGAAGAG TTCTCTTCGA ACGAACAGCC CGATTCCGTT CGACTGCAAA CCACCGAGAA
CTGTGTGGCG TTCGGACAGG TACTGAGTGA GGTGGACGGT GATTTAAACG AATCCGAGCA
AGGATTACTG AAAAAAGTAT TGCCTTTGAT ACTTGCAACG ATTGACGATC GGTCCTGGCG
AGTCCGATGG ACGACGGCTG CCAAATTCTC AGAAGTAATT TCCGCCTATG GTCGACTCCC
CGATGTCATG GATTCCCTGG TACAAGGCTA TGAAAAGCTC TTGCAAGATC CAGAGGCTGA
AGTGCGAACA GCTGCTACTT TCAACTTAGC TCACGTCGCA AAGGGCGATG CTCAAGTCCT
CGTTCCGCCT CAACACCGTC CTCCACAAGA CGGGGATTCC GAAGGAGATG CTACGGATCT
TCGTGTTCCT GTAGCAGAGC GCCTGGTAAA GCGAGTAACA AGTCTAACGG AAGACGAAAG
CGAACATGTC CGCGCAGCGC TTGCCATGGT GGCGACTGAG CTAGCTCCGA TTTTGGGTCG
CGAAGGTACC ATTACATTTC TTGTTCCCCC TGTTCTCTTA TTGCTTCGTG ACGCAGCTTC
GGAGGTTCGA TTGAATCTTA TTTCCTCGCT GTCAGCTTTA AATGAAGTGA TCGGTGTCGA
TCTACTATCG CAGTCACTAC TACCAGCCAT TCTCGATTTA GCCCAAGATG GCAAGTGGAG
AATCCGTATG GCAATTATCC AACTTATACC TCTGCTCGCC AAGCAGCTAG GACAAGAATT
TGTGAGCGAG AAGCTTGCTT CTCTCTGCGT TGAATGGCTC GGTGATGATA TTGCAACTAT
TCGGCAAGCG GCAGCTAGCA ATATCAAGGA CTTGACTGCC TTGTTTGGCA CAACTTGGGC
AACTGAGTTT CTTCTTCCAT CGATCGTGGA CATTCGAGAA AACGAATCTT ACCTTAGACG
ATTAACGGCG CTTCAGGCAT GTTCGATGAT GGCAACCGAA ATGGATTCTG ATTCGGCACG
GATAGAAATA TTGCCTCTCA TTCTTGAGAT GGCAACAGAC CTTGTAAGTT TGAATGATGT
CATGCATGCT TTTGTCGTGT TCGAGCTATC CGCTCATCAC CTTCAAACTC GTTCCTTTCA
GGTACCAAAC ATCAGATTTA ATGTGGCAAA GGAACTGCAA TCCATGGCTC ACGCGTGTGG
CATCTCGGCT TACGAATCAC AGGTTTTGCC TGTATTGAAT ATGCTGCTCG AAGATGAAGA
TAGGGATGTT CGATTTTATG CCGAAAAGGC TGCCACTGCT TTGGATGAGA CATTTGCCGC
TATGGATGCA TTAATCAAAT AGCAAAGCTG CTCGCAATTT CACTG
 
Protein sequence
MPIPMDVDPV DDDPRPSSSP LSSPLLLRTV VPITREEEAR QAIELLRGND MAERVAAAHR 
LDAVAAILGE ERTRNELLPF LSDAVDDEDE VLLAMAQALG RMIDLVGGPT HAESLFLPLE
LLLSVEETTV RQAAAESTLF IASQLSESDY QSCYAKMIAR LATQEWFTAR ISAVGLLSQA
YTSLRVEQQK EHLEFYANLA KDDTPMVRRV AAQYLGQMVQ NVVRATGRSS LSETGSVVTT
LLPIFEEFSS NEQPDSVRLQ TTENCVAFGQ VLSEVDGDLN ESEQGLLKKV LPLILATIDD
RSWRVRWTTA AKFSEVISAY GRLPDVMDSL VQGYEKLLQD PEAEVRTAAT FNLAHVAKGD
AQVLVPPQHP ERLVKRVTSL TEDESEHVRA ALAMVATELA PILGREGTIT FLVPPVLLLL
RDAASEVRLN LISSLSALNE VIGVDLLSQS LLPAILDLAQ DGKWRIRMAI IQLIPLLAKQ
LGQEFVSEKL ASLCVEWLGD DIATIRQAAA SNIKDLTALF GTTWATEFLL PSIVDIRENE
SYLRRLTALQ ACSMMATEMD SDSARIEILP LILEMATDLV PNIRFNVAKE LQSMAHACGI
SAYESQVLPV LNMLLEDEDR DVRFYAEKAA TALDETFAAM DALIK