Gene PHATRDRAFT_49971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49971 
Symbol 
ID7198654 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp465389 
End bp469010 
Gene Length3622 bp 
Protein Length957 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184710 
Protein GI219129048 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGATGCAGTA CGTAGTCGTA GGACTAAGTG TAAGTGTTAT ATCTCTCACT GTCAATCCAT 
CATAACTGCA AAGGAAGCTT CCACTGGTTC CTGTTCGACT CGTTGCCCAA CAAAAGCAAG
GCCGTATGTG TGCGCCCCAG AACTCGGCTT TTTCTGTTAG TTCACTGGCA GAGTCTTTGG
GGAAGCGGAA GCGACTCTCG TACAAAGCTT GCGTCCTTGC AACGGTTATT TGTCTTGGAA
CCGCAACTGG TGGGTGTTGC GCGTTTCATC CCGTCGTTTT GTCACGGGTA TCCACACGTC
GACGTGTTAG TTCATCGTCG CCTTCGTTGC CATCCGTACG GATACAGCAG CAATATCATC
GAGACTGTCG CCATCAACAT GGACTAGTGC TCTTTGCGTC TATACCCGTC GTTCCCAACG
CAGCTTCGAT AGCGACTGGC AACAGCACGA ATATCGATAC CAAGACAACA AGCCATCTAC
AACCGAGAAC ACAACCACCA AACCCACTCT CTCGGACTAC TAAGAAGGAG TCTTCGTCAA
AGCAACTCAC GACAGCGTTG GCATTTTTGA CGGGAGTTGC CGACGTTTTT CTCATTCGCA
AATACCAAAC CTTTTGTACC ATGATGACGG GAAACACCAT GTGGATGATG AAAGCCGCGA
CGGAGTGCCA GTTTAGTCTG GTTGGCTACT ACGTCGCCGT TATCTCGTCT TACATAGCCG
GTCTCATCAT TTTTCGCAAG GCCGAAATGA CGTGGAAGAC GCAAAGTCTC GGCCGATTTT
GTGCACCTTT TGTTACCATC AGCTTTCTAC TGGCCGACTT TTGGTCCTCC CGAAACGCTG
CAATTCGGTG GCCCGCCGCC ACTTTGCTGA GCGCCTCCTA TGGTATCATA AACTCGGTAG
GGTCGGAAAT GGCGGGGTCT TTGGCGTTTG TCGTCACCGG GCACATGACT AAACTTACTC
ACGTGTTGAC GGACCGCTTT TCCAAACAAG CCGGAAACAA ACCCATCGCC GACAAGGATA
AGTCCACACT CTTACAATCA TCACTAATCA TAGTGGGGTT CGCGGCCGGA GCGTTCGTCG
CGTGTGCGCT ATTGTTCAAA CGTCCGCACC TTCTCGATCA ATGGGGAGCA TTTACGGCTT
TAGGAATCCT CTACGGCTCA CTCTTTGTCT GGCAAGATCG AGAGTCGATC CAGGCTTGGT
GGCTGGCGCA CAGATCACCA ACGTAGCTGC ACAATATGCC AAGCCGTGCA ACGATGCATC
GCTGACGCGG TAAGGGTTTT TTGATCGTTA CGTTTTGTGT TGGTGGCCGA GTGGTTTCTA
TTTTGGAGCC ACACGGATTG ACTCTTCTCT CGGCTAGCTT ACTATTTTCC CGGTATGCTT
CACTCGTGCA TTGATAGCGA TGTTCCCTCT TTTCACGCTG CTTCGCCACG AGTAGAAAGG
AAGGGATAGC CACTGCTTAA AGGAGGATTA TTTGTAACAG TAAGTCCAAA GCGGTTTTTC
GGGGCCGTCG TAAACCTAAT GTAAAAATGA CAGTCGGCCT TGGCCCGTGG TAATCGTAAA
CTGCGACTGG TCGTTGGGAA CGTTGTGAGT CCCGGGTGTC CGGGTTCCTG TCATGTCCGT
CGTTGGGCTA ATTGTAACAA AATAATCTAG GCTCCACAAA ACGGCATCAC TTTCAACTTG
AAATAACGAG CGCGGGCATC ACGGAGAAAA TTTCCGAATT TTGCGAGCAC GTCAAACCTC
TGACTAGTCC ATTGTGAAAA CAGGAGGAGA AGGGTTCCGT TCTGTCTGGA CCATCGCGAC
CGAGACGATG CAATCTCGAT GCTCGGTCAT GTCGCAAGCG ATTCCAATGG ATCGGCTAGT
AACGGAAAGA GCAATAACTC CCAAACGGAC TCGATTGGAC CTCCACCGAT CCGAGGCGTA
TACAGATCGG AGCGCCATCA GTTGCTTCGA GCTAAATTCT GGAACTCCAC CTGCGAAGAG
GTCTACCCTT CGTTGGCGTA AAGGTGCTAG ATCTTGTACT TCACCTTCAT ATACGCTGCT
CAATGTACTA AGCTTGCTGC TTACGCTACT GGGACAGTGC AACTATACTT CTGCCGCCAG
TAACTTGCCG CCATGTTTGC CCAAGATCAA CTCTGGTCTA TCCATCATGA CAATACGGGG
AGGTGCGCGG ATATCCTCTA GCTTCCAGGG AACAAAAGCA TTTACTCGGC CCGCGGTAAA
GTCCCGCATG TCGACTTTGG AAGCCAGGTC TCCACCTTTG CAAGATTCTT TATCACGTTC
AAAATTGCTT TTGATTCGAC TCATGTTCCT GACGTACTAC GGATCCTTAG GCACAATCAT
GCCATATCTT CCCGTCTACT ATCATCACTT GGGACACGGC GGACAAATTA TCGGTTTGTT
GGGCGCCGTC AAACCCTTTA CCACCTTCTT GGTTGCTCCT CTTTGGGGTT TGATTGCCGA
CCAAACACAA AAGCCGTTCG TCATCCTGAA CATTACTTTT TTGGTATCCT TGGTCGGTCA
ACTGCTTGTG GGTGTTCGTC ACGAAGCGCT GTATATCACG TTTATGGTGT TTCTCACGGC
CGTTTTTAAT GCCCCTGTCA AGTCATTGAT CGATTCCATG GTTATGGAGC ACATTCCGGA
GCAGTCGAGC TATGGCCGGC TTCGGTTGTG GGGTCAGATG GGATTTGGCG TGGCGAGTTC
GTGCGTCGGT ATTTTGTTGT CCAAGAGCAA GCATGTACCG TGGCCGGACA CCAACGACTT
CTCGTTATCC ACCGAGAATA CCCTTGCACG GCTTCCCTCC TTCCTGCAGA AGTTGGTGCA
ATTTACCGAT AAATGGTGGC GTTCGATGAC GGGGTACAAG CTACTGTTTT TGACGTACGC
TGCCCTTTCT GCACCAACTT GGTTTTGCAT TCAAGCATTC CGACAAATGG ATGAAAAAAG
CAAACGAGTA GCGAAGAAAT CCAGAAAAAG AGAAGAGACT ACCAAAGTAG GCGAAGGTTT
GCTACTCTTG CTCCAGAACG CCGATGCCCT TCTTTTCTTT TCTTTGGTTC TAGTCGTGGG
TATTTCGAGC GGAGTAATTG AGAATTTTGC CTACGTTCGG ATGCGTGAAG TCGGTGGAAC
GGGTAAACAA ATGGGATTGA GTAGGCTCGT CAGTAGTTTG GCCGGTGCTC CAATGTTTTG
GTTTTCGGGA CCTTTGACAG AAACGCTGGG AGCCGACCGT GTGATTGTGC TCTCGCTACT
CAGCTACGTG ACGCGATTTC TTATCTATGC TTTCATGCGT AATCCATATC ACGGCCTCCC
AGCAGAAGCG TTGCGTGGCG TGACATTTGC GGCGTTTTGG TCCACAGCGA CAATTTACGC
TCATCGAGTG TCGCCACCGG GACTGCACGC TACCATGCTT ATGTTTCTGA ATGCAATATA
CGGAGGACTT GGACAGTCGG TGGGTGCCAT CATCGGGGGT AAAATGCAGC ATCGCTTTGG
CACGGTGAAA ACTTTCCTGT ACTCGGCGGG GGTTGATCTT GTGTTCGTAT GCGGTGTGGT
GGCGTATTTA AATATCCGGC AGGATTCCAG CTTTAAGAAT CCCAAGCCGA TCGTAGCCCG
AAAGAGAGGA AAACAGAGTT GA
 
Protein sequence
MCAPQNSAFS VSSLAESLGK RKRLSYKACV LATVICLGTA TGGCCAFHPV VLSRVSTRRR 
VSSSSPSLPS VRIQQQYHRD CRHQHGLVLF ASIPVVPNAA SIATGNSTNI DTKTTSHLQP
RTQPPNPLSR TTKKESSSKQ LTTALAFLTG VADVFLIRKY QTFCTMMTGN TMWMMKAATE
CQFSLVGYYV AVISSYIAGL IIFRKAEMTW KTQSLGRFCA PFVTISFLLA DFWSSRNAAI
RWPAATLLSA SYGIINSVGS EMAGSLAFVV TGHMTKLTHV LTDRFSKQAG NKPIADKDKS
TLLQSSLIIV GFAAGAFVAC ALLFKRPHLL DQWGAFTALG ILYGSLFVWQ DRESIQACCT
ICQAVQRCIA DASALARGNR KLRLVVGNVS IVKTGGEGFR SVWTIATETM QSRCSVMSQA
IPMDRLVTER AITPKRTRLD LHRSEAYTDR SAISCFELNS GTPPAKRSTL RWRKGARSCT
SPSYTLLNVL SLLLTLLGQC NYTSAASNLP PCLPKINSGL SIMTIRGGTI MPYLPVYYHH
LGHGGQIIGL LGAVKPFTTF LVAPLWGLIA DQTQKPFVIL NITFLVSLVG QLLVGVRHEA
LYITFMVFLT AVFNAPVKSL IDSMVMEHIP EQSSYGRLRL WGQMGFGVAS SCVGILLSKS
KHVPWPDTND FSLSTENTLA RLPSFLQKLV QFTDKWWRSM TGYKLLFLTY AALSAPTWFC
IQAFRQMDEK SKRVAKKSRK REETTKVGEG LLLLLQNADA LLFFSLVLVV GISSGVIENF
AYVRMREVGG TGKQMGLSRL VSSLAGAPMF WFSGPLTETL GADRVIVLSL LSYVTRFLIY
AFMRNPYHGL PAEALRGVTF AAFWSTATIY AHRVSPPGLH ATMLMFLNAI YGGLGQSVGA
IIGGKMQHRF GTVKTFLYSA GVDLVFVCGV VAYLNIRQDS SFKNPKPIVA RKRGKQS