Gene PHATRDRAFT_46551 details

Gene Information

Locus tag: PHATRDRAFT_46551
Symbol:
ID: 7201694
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011678
Strand:
Start bp: 683503
End bp: 684866
Gene Length: 1364 bp
Protein Length: 414 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002180882
Protein GI: 219120280
COG category:
COG ID:
TIGRFAM ID:
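
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so, but the numbers agree with that reading). The function name `gene_length` is hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Values copied from the record above.
START_BP = 683503
END_BP = 684866
PROTEIN_LENGTH_AA = 414

length_bp = gene_length(START_BP, END_BP)

# Bases needed to encode the protein plus one stop codon.
coding_bp = PROTEIN_LENGTH_AA * 3 + 3

# Bases in the gene record that lie beyond the stop codon.
trailing_bp = length_bp - coding_bp

print(length_bp, coding_bp, trailing_bp)
```

Under that assumption the computed length matches the listed gene length of 1364 bp, and the 414 aa protein accounts for 1245 of those bases once a stop codon is included.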


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATACCGT TGCGCCGTCA ATCGAAGCGA TCAAACTCAA AACTTCATCC GCACGGTTTC 
GTGGTTTTCC TAGTCGCTCT AGGAGGATGT TTGTTCGCAT TACACCAAAG CGTTTGGAGA
CACGTTCGTG TTGAAAGAGT TGTTGCCCTT GGATCGCATT CTTTAGTCAA TGCAGCTCAT
AAAGCGGGGG TTTCAAAGAC CAACTTTTCA TGGAAAAACG AAGCCGTACT GATTCCGAAG
CCCAAGAATG AACCCGGTGC ACATGAATCT AGTCTACCTT CTTGGATGAT AGACTATATC
GAATGGCATG CGGAACAGCG AAAATTGCTG TTGGAGTCCA ACTACATGTC GTTTCGTTAT
TTGATTCTTC GATGTGAGGA CGGAAGAGAC TTCAAATGTG GAGGACTCTC AGATCGCCTC
AAGCCGTTGC CGCTGTTATT TCTGGAGGCT GCCCAGTCCC GGCGACTCCT TTTTATTCAT
TGGACAAGAC CTGCGCCGTT AGAGGAATTT TTACAGCCTT CCCGCGGAGG CTTGGACTGG
AGAATACCTC CATGGTTCGA GTCCAAAATG GAGTCGGAGA AAGTCCTACT CATTACTACT
GCTAGTACGC TAGCAGCGGG TGTCAGGCGG ACGAACAGAT TCGTTGCCGC TCGCATACAA
GATCAACATG GGGGCTCAGT GCTGTACAAC AAAATTCAAG GCACCCGCGC GTACCGTAGA
GTATTCCGCT CCCTTTGGAA ATCTATCTTC AAGCCATCAC TGCAGGTACA AACCGCATTA
CAGTCACAGC TCAAAAAGAT GGAACTGATA GATGGACAGT ATGTTGCTGC TCATTTGCGA
GCCTTGTATG ACAAAAGTAC CCTCGATGAT ACATATTTGC GCGCTATAAC CCTCAATGCC
GTGAATTGTG CCAGCCAGTT GCGGCAATCC TCCGAGACGC CGGTGTACTT TGCAGCTGAT
ACAGAGGACG CACTCGTTTA CGTGCGCCAC TATGCGAGGA CTCAGAATTT ACCTATTGTG
GCAGCACACC GGCAGGAGCC GTTGCATTTG GACAAGTCGA AAAGTCGAAA TCCGCGGGAT
TATGACAATA TATTTGTAGA TCTCTTCTTG TTGAGTCAGG CAAGCTGCAT TGCCCACGGT
CGTGGTGGTT TTGGAAGGCT GGGTGTGCTT CTGTCGCACA ACGCTTCTTG CGTGTTCAAA
TTTGTGGAGA ATGGAACGTT CAAGACCTGC GACTGGCGAG CGTAGCAATA TGAAAGTTTT
TTCCTCCCGC TAGATTTCCC TAAGCATGTA TCGAATGATT CCTTCGCTTT CAATTTCTGC
TATTGTTAAA AGCTCGGTCA AGACGCGATG CACATTTACT TGTT
 
Protein sequence
MIPLRRQSKR SNSKLHPHGF VVFLVALGGC LFALHQSVWR HVRVERVVAL GSHSLVNAAH 
KAGVSKTNFS WKNEAVLIPK PKNEPGAHES SLPSWMIDYI EWHAEQRKLL LESNYMSFRY
LILRCEDGRD FKCGGLSDRL KPLPLLFLEA AQSRRLLFIH WTRPAPLEEF LQPSRGGLDW
RIPPWFESKM ESEKVLLITT ASTLAAGVRR TNRFVAARIQ DQHGGSVLYN KIQGTRAYRR
VFRSLWKSIF KPSLQVQTAL QSQLKKMELI DGQYVAAHLR ALYDKSTLDD TYLRAITLNA
VNCASQLRQS SETPVYFAAD TEDALVYVRH YARTQNLPIV AAHRQEPLHL DKSKSRNPRD
YDNIFVDLFL LSQASCIAHG RGGFGRLGVL LSHNASCVFK FVENGTFKTC DWRA
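
The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch, assuming the standard genetic code (the record leaves the translation table field blank, so this is an assumption) and a reading frame that starts at the first base. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the record does not state which translation table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA from its first base, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    sequence = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE[sequence[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_codons = "ATGATACCGT TGCGCCGTCA ATCGAAGCGA"
print(translate(first_codons))
```

Applied to the first 30 bases, this yields the same ten residues that open the protein sequence (MIPLRRQSKR). Pasting in the full gene sequence should stop at the TAG codon that follows the final alanine.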