Gene PHATRDRAFT_46387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46387 
Symbol 
ID7201644 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp182275 
End bp185543 
Gene Length3269 bp 
Protein Length623 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180773 
Protein GI219120052 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.7871 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTCACA TAACCAAATG GTGCTTTCCG GGAATTAAGG ACTGGTTGAC AACATTTGTG 
ATCGGATTTT TAGTAACAAG CTACTGGCGT GGGACATGGA CACTTCTGGA CATATGGCTG
TGCGACCAAC CCGCAGATGC TGGGCTTACG TCAGCTGACT CGTTTTGCTT TGCTGGCCTT
CCTGATGAAG CTGTGAAGCA TCGAAATTCT GGGTGGCTTT CCATGGGAAT AGGAATGTTT
TTGACTGCAA TCGGTGTCAG CCTCATGTGG TTGGACTTTT GGAGGCCTCA GGTGTCGAAT
GTAAAGCACC GAGTGCAAAT TCCAGCCCGT CGAATTGTCA TTCGTTTCTT ATTAGTGTAC
ACTCTGGGCA TGGCATCCGT CAACATATGG CGTGGCGTTT GGTATTTGAC AGATTATTTT
CTATTGCCCA ACAAGCTCAC TGCAACTGAA TGGGGAGATT TTCCATTAGC ATCTTATTGG
GTCTCTTCGG TAGTTGGTTC GACAGTCTGC TTCCTTTTTT ATGCTGGCCC GTGTTTATTG
GCCCCGCCAG CAATCTTTTT GATTGATGGT CCTGGAATCA ACCCACCGTA AGTTTACTAA
AACATATTGG CATCTCACAT TGCCTTACAT GTGCTTACGC CGTGTCTCGC GGTTCCAGAC
CTATTGCAGT AACGCTGATT TCATCCTACT ATTCCCTGAC GCTACCAGCG GATCACAGTA
TTCCGGATTT ATCGCACACA GTGATTGCGC TAGATTTGCT TTTTAGCTTT CTCGGCGTAC
CAATCATGGT AGTATGGTTC TGGAGAGGCA GTTGGTTGTT GCTTGACTAC TATTTATACG
GATTTTCACC AAATTCGCAC GATGTTCACT TCTCCATACT TTGGTCATCC ATTGTTGGGG
TTTCTTTCCT GATCGTATGT AGTGAAACCA TTTTTGCCTA CATAAGGGTT CGCAACACAG
TTGTTCTACT ACTTCTTGGT CGGCTGAGAA CTTTTATCCT GGCTTGGGGT ACGGTCAATT
TTTGGAGGGC TGTTTGGTAT ATTTGGGACG AGTTTTTGGG AGGTTCCACA CAATGGTCCT
GTTGGCTGGC ACATGCTGCA TCTATTGCGT TGCTAACAAG CTTCGGTTGC ATGTCTTGTA
TTTTGGGTAA GTCAAGGCGC CTACTCTCCG CCCGATGTAT TTAACCCAAT TACCTCCTTT
GAACTAATAA ATGTCATTTT TCTTGAACCA GCCCCGGCTT CCACTCTCGG AGTGGACGCA
GTACCTCATA AGGATTGTGC TGATGATCCT CTCTTCAGCA ATCTTCCGGT GCCCGCTGAT
GACTTGTTTA TGTTTGGAAT TGGACGACAA CCACTAACTC TGGAGCAGTC TGCGTGACCA
CGAAAATCTT TGGAAAATCA AAAGCACCAT TTGCAGACCT CTTCGAAAGG GCTGAATGTT
GATGGCTCTC CTTTAACACG CACCTCCGTG GCAATATCGT TGGAGAGCAA GAATGATGAA
GGTGAAGCCT GGCGTAGCGG GGGGACGGGT CGTCGGGGAT CGTACAGCAG CGTTCGCAGC
GAAGAAATTT CTTTGTCGAA CAATTCTTAT CTCGGCCGTC AGCGTCCCGA TCTTGATCGA
CGTGTCAGTC GAGCTGACCT TGTCTCTTCG GGGCAATCTG TGAGACGCAG CAGCCAGTTT
TTTCGAAACA GATAGAAGTA AGACAGGGCA ATTGTGAGTA GAGGCGCTCA ATCGCAAGAA
CTAACATTCA ATGATGTCCT TCTCTCCCTT TTTTGCGGAT GTTCACTTGT ACATTTCATT
AGCGTTAGAG GAGATTCACT AATCCGGCCT CGATGAGGCC GCAAAATCAC TAATCCGGCC
AAAATAAATT TATTAAGTAC CAAAGGAAGG ACCACATTGC TTTCCTTACT AGCGATGATT
ATGAAAAAAT ATGGATCTTA AATAATTCGA GTATTTAAAG ATCACATTCT ATGTTAATTT
TTGAGATGCT AGATTTTTTA ATTTTTAGGC CTGCGATGGA CGATGTTCTG CAAAAGACGC
TTCGGTCGAA GGAGGCAACT TGACGGTGAG AGACGATCCA CTGAAAAATC TTTACGAACA
GTGGCTTGAG AAGTAAGTTT CAAGTTTATT AATGTATGTG ATCCAATTAC TTAGGAAGAA
ACTCTGGCCC TACCTTAGCT ACTTACAGTC AACGAAATCT AGGTTTTGGC CGGATTAGTG
ATTTTGCGTG CTCATTGAGG CCGGATCAGT GATTTTCCTC TTAGCGTTAC TATTGCATGA
ACGAATCGAT CAGTAGTTTT TTAAAGAGAA ATTTCGGTAT TTGGGATGAG CAGGGGTGAA
ATTTTCGCTA TTTGGGAAAA ATCACACTGT TTCTAAGTGT TTTTATTCTC GCGGGAAATA
CTTTATAAGT ACATTTTTTA ATTAAGAACA ACCCCATATA CTATGCATAG CCATTTAGAT
TCAATTCACC AGTTCCTTCC ATGAAATTTT TGACTCTCCA CCCAGCAGCC AACTCTTTCT
TGTTCACAAC TGTGAAGTAC AACACTGGCT GAGATATTGG ACAGCGGCGA CAAAAAGGAG
AAAAAACAGA AAGCAACGCT AGGCTTTGAC AGTAAGGGGG ATCAGCACGA ATTCATCCTG
TCACAAAGGG GGTTTGTTGG GATCAGCTTT TACTCGCAAG CCGCCGAAGA GATATCGGAC
AACGAAGAAC CCCACGAGCC GTTGTTCTCG GGCAAAGCTG CGAATGAGCA AGCACTCGGT
CCGGCACAGT GCATTCTAGA AAAAAAAGAT GTTCTCGCTG GCCGGGCATG TTGCCAAGCC
CGTCAGCCAG AAGAAGATAT CGTAGCGTTT TACCAATCCT ATTGCACGGA GACGCCGCCT
TTGCCGGGCA AGGGGTAGTC TACGAGACCA TGCAAATGGC CGAGGTGCCC GATTTTGATG
TCGGTGGAAC CATTCACGTC ATCATCAACA ATCAGATTGG CTTTACCACC AACCCCCTAC
ATTCGCTCTC AATGCCCTAC TCGTCGGAGT TGGGCAAGGC CTTCAATTGC CCCATCTTTC
ACTGCAACGG CGACGATCCC CTGGCAGTAT CGACGGCACT CGAGACCGCC GTCGAATGGC
GTCACGAATG GGGCATGGAT GTCATTATCG AGATGGTCTG CTACCGTTGT AATGGTCCCA
ACAAATTGGA TCAGCCGGCC TTTACACAAC CCAAACTCTA TAAGGAAATC TCTCAACACC
CACCAACCCT GGATATTTTC GAAAAGTGA
 
Protein sequence
MFHITKWCFP GIKDWLTTFV IGFLVTSYWR GTWTLLDIWL CDQPADAGLT SADSFCFAGL 
PDEAVKHRNS GWLSMGIGMF LTAIGVSLMW LDFWRPQVSN VKHRVQIPAR RIVIRFLLVY
TLGMASVNIW RGVWYLTDYF LLPNKLTATE WGDFPLASYW VSSVVGSTVC FLFYAGPCLL
APPAIFLIDG PGINPPETIF AYIRVRNTVV LLLLGRLRTF ILAWGTVNFW RAVWYIWDEF
LGGSTQWSCW LAHAASIALL TSFGCMSCIL APASTLGVDA VPHKDCADDP LFSNLPTSSK
GLNVDGSPLT RTSVAISLES KNDEGEAWRS GGTGRRGSYS SVRSEEISLS NNSYLGRQRP
DLDRRVSRAD LVSSGQSACD GRCSAKDASV EGGNLTVRDD PLKNLYEQWL ENGDKKEKKQ
KATLGFDSKG DQHEFILSQR GFVGISFYSQ AAEEISDNEE PHEPLFSGKA ANEQALARRR
YRSVLPILLH GDAAFAGQGV VYETMQMAEV PDFDVGGTIH VIINNQIGFT TNPLHSLSMP
YSSELGKAFN CPIFHCNGDD PLAVSTALET AVEWRHEWGM DVIIEMVCYR CNGPNKLDQP
AFTQPKLYKE ISQHPPTLDI FEK