Gene PHATRDRAFT_47702 details

Gene Information

Locus tag: PHATRDRAFT_47702
Symbol:
ID: 7202707
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 577252
End bp: 580530
Gene Length: 3279 bp
Protein Length: 994 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002182092
Protein GI: 219123563
COG category:
COG ID:
TIGRFAM ID:
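
The gene length reported above is consistent with treating the start and end positions as 1-based, inclusive coordinates. The short Python sketch below reproduces that arithmetic; the function name `span_length` is illustrative and not part of the source database, and the coordinate convention is an assumption inferred from the reported values.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Start and end positions taken from the record above.
gene_length = span_length(577252, 580530)
print(gene_length)  # 3279
```

Because the gene is spliced, the 994 aa protein accounts for fewer bases than the full 3279 bp span.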


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTTGACAGTG AGCAGGATTT TGAAAAGGTC TTGTCCCGGA CGAGTAGTTG TCGTCTTTCT 
TCTTCCTACA TGGAAATTAT ATTCTTACGT TGCTACTCTT CCAAAAGTTG CCAATCCTTT
GGGAGGCGTG CCGCAATTGA CAGCCGCGGG TAGCGAACGC TTTTTTCACG GATTCTCGCG
AGAAGCTCGT TACGTCTTTC CACTTCCGGC ACGATGCTAT CGTCGCTGCG CTACTCCTCC
GCGAGATGTC GGGTGTCACT GGCATGTCGT TACCAAACTA AGCCATGCGC TGGCAATCCT
CGGTGTGCAA TTCCTATTCA AAGTAAATCT GCCGTGGGCT TCTGGAAATG TGACACTTTT
GGAGACCAGA GAGCATTGCC AATTCGAAGC ACCCGATGCG TGTCGACCGA TACCAATACT
CCTTCAATAG ATCCCACCAC TTTGTCGACA ATCGACCACG TGACGAAAGA CACTAATCCT
CGCACTGCGA CGGAAGATTT GTACGAAAAA CTCGAGCATA CATCCCTGTC GCTCTTGACT
CAAACTCCCA TTTCTCTCTC GCACCAAACC GACATTGTCA AAGTTTTGGA AGCTTGGAAG
CTTCTGTTAC CCAAACTTCA AGAGCTGGAT GAAATACCTG ATAGAACGCT GCCTACGAAT
CAATGCAGTG CGGCGATATC CCGTATGCAG GACCTTTACG AACTTTGGAA ACGCCGACCA
ATTGTCACTA ACCGCCCCTT TCAAATCATG CTGGAAGTGT TCGCGCACAC GCCGACGATC
AGCGACAACA GTAATGCGCG TGGTATGGCT GCACTCCAGG TTCTGGAAGA CTGGAACCGA
TCGTTTATGG GGGACATGGA GCTAGAACCG CGCCGTACAG ATTATCACCT TGTCCTTCAC
GCCTTTGGCA ATCAACCCCT ATCTTTGAGT TCCGCATACA TTGAGCCAAC TACTGCCGCC
CAACCCGGCG AAGTCGCGCA GGAAATTATG ACCCAACTCG TGGCCTGGGG TGTCACCATG
AAACCCACTG CCGAAACCTA CCAATTGGCG ATAAGGTGTT TGACAAGGGG CATCCAGCAA
CTGACGACAT CCTGGGATAA CGAACGTGAC ACCGAGGTAA TGTCGAACGT TGAGCAGCTT
CAACACACGG TGCGTGAGTA TGCGTCGCGT CTTATCAACT TAACGGTATC CTCTGGCTCT
TCATCGTCCA TGACCATCTG GCTGGGGCTT TCTGACGCCT TCCAAGCGAT GCATGTACCG
GTGGATCGGT TTGAAGAGGA CGGCCCCAAA AGAATTCAAG ACGCGGATTG GTATCGGCAA
ACTTTACCGG TATGGAAGCA GGCACTAATC GAGAGCCGAA GAGCATTGGA CCGGAGAATA
TTGCGTGAAA ATATGGATCG CACGTGTAAA GCTCTTCTGT TACTGCACGA GCGGTACTTG
GAACCAGCAG AGCAAATCAC GGCAATACGA GATACGCTGG AAGAACTCAA GGGGCTGTAC
ACAAGCTTGC CGTCGGCTGA GCACTACTGT ATTGCCATAA ATAGTCTCAC TTCGATGAAG
GAGTCGGTGG CAAAAAAATT CGCTCTAGAC CTGGCAGTTG GCATGGAAAA GCAACACCGG
CGTAAATTCG ATTCCAGCCA AAGCGATGGA TGTTTGGACT CGGAAGAGAT GGTGAAGTCG
TGGAATCTAC TTATGACGGT GTACATGCAA GTAGGAGAGC TGTCTAAAGT ACTCGATCTC
TGGAAAGAAA TGACGGTCAA TAAAATACCA CGGAACAGCT TAAGCTTGTT CCTGGTCCTT
AAGGTCCTGG CCCAGCAAGA AACATCGCAG TCAGCCAATC AGGCGACATC GATCCTCTTC
AAATATCTAT CCATGGACCG CAAACCCTTT GAACCTACCG CTGAGCATTT TCGGTGCGTC
ATGATGGCAT GGTGCAACAG TCGAGATCGC CACGCAGCCT CTCAGTGCCA ACGCGTCTTT
GATCGGATGC TACAATACAG TGCCGAGTCG CGAAAACAAT CGAGAAGCGA CGGCACAGAA
AAGCCGACAC TAGAGCCACA CGCACTTCAT TTTACTGCAC TGATTAAGAC ATTGGGATAC
AGTCGACGAC CCGATGCTGC CAAGAAAGTG TCGGCTTTGC TCGTTGAAAT GTTGGAGTTG
GGTCTAGATC CTGATTTGCA AACGTATACG TCTTTTTTCT CGGCGTTGTC ACATACGAAG
AGTCTTCAAG GCGCAGAGGA AGCCCAGGAA TGGTACGACA AACTGCAAAA GGAATGCTCC
AAGGTAGTAC TAAATGTCCA CTGTTATGCG TCGGTTATGT TCGCCTGGAC CAAAAGCGGC
GCCGTCGATG CACCTGAACG ATGTCGAGCT ATCTTTGACG AATTATGGAG GGCTTACTTA
TCTACCCCTG CGTCCGAAGA AGGGGATCTC CGTCCAACTA GCGCCGTATA CCGCGCTTTG
ATGGAAGCCT GGGCTAGTAG TGGACGAGCG CAGGCACCAG ACGAGGTAGA TGCGCTACTG
TCCTTAATGG AGAAAAAGGC CGAGCAGGGC TTGATCGATC CCCCTGACAA AAAAGTATAC
GCTTTGGTTA TGGCAACTCA TTGGAAACAC AATGACCGCA ACGCTGTAGC AAAAGTACAA
GACGTATATA GTCGCATGAC TGCGAGTTAC GAAATGGGAA ATATTGCGGC TAAACCAGAC
GCTCACTGTC AGACTATATT GATGAATGCT TGGGCGAAGA GCGACGTCCC CGAGCGGGCA
AAGATTGTGT TGGATCTTCT CCGGGAAATG TTTCAAGCTT ATAGCCAAGG TGACTTGGAT
ATGCAACCCA ATGCCTATGC CTTGGCGGCC GTGCTGAACG CCTGTGCTTT TGTCGATAAG
GACAATGAGA CTCTTCGACG ACAAGCCGTG CAAATCGCTC TGACTGCGTT TAATGACTTC
TCAAACAGTG AGTTAGAGGG GACGAACCCC TTTATCTACT GCTATCTTTT TCGAGTACTT
GGCCATCAAG TCGATGACAT GGTCGAAAGA ACGCGTTTGG CCAGTGTTAT CTTTCAACGT
GAGTCGATCG TGAAAACCTA GCTATTTTCT CGTCGCTCCT CTCTTACGTT CTCACACGTT
GGCTCTTCCG TCGCAACAGG TTGCTGCCAG GAAGGATTCG TCGATGACCA AGTCATCAAA
ATGATGAGAC GCTATGTTCC CGTATTGTAC AAAAAGATTC CATTGGATGG CAAGAACAAA
CCTCGTTTGC CTATCGGCTG GACGCGTCAA CTGGACTAG
 
Protein sequence
MLSSLRYSSA RCRVSLACRY QTKPCAGNPR CAIPIQSKSA VGFWKCDTFG DQRALPIRST 
RCVSTDTNTP SIDPTTLSTI DHVTKDTNPR TATEDLYEKL EHTSLSLLTQ TPISLSHQTD
IVKVLEAWKL LLPKLQELDE IPDRTLPTNQ CSAAISRMQD LYELWKRRPI VTNRPFQIML
EVFAHTPTIS DNSNARGMAA LQVLEDWNRS FMGDMELEPR RTDYHLVLHA FGNQPLSLSS
AYIEPTTAAQ PGEVAQEIMT QLVAWGVTMK PTAETYQLAI RCLTRGIQQL TTSWDNERDT
EVMSNVEQLQ HTVREYASRL INLTVSSGSS SSMTIWLGLS DAFQAMHVPV DRFEEDGPKR
IQDADWYRQT LPVWKQALIE SRRALDRRIL RENMDRTCKA LLLLHERYLE PAEQITAIRD
TLEELKGLYT SLPSAEHYCI AINSLTSMKE SVAKKFALDL AVGMEKQHRR KFDSSQSDGC
LDSEEMVKSW NLLMTVYMQV GELSKVLDLW KEMTVNKIPR NSLSLFLVLK VLAQQETSQS
ANQATSILFK YLSMDRKPFE PTAEHFRCVM MAWCNSRDRH AASQCQRVFD RMLQYSAESR
KQSRSDGTEK PTLEPHALHF TALIKTLGYS RRPDAAKKVS ALLVEMLELG LDPDLQTYTS
FFSALSHTKS LQGAEEAQEW YDKLQKECSK VVLNVHCYAS VMFAWTKSGA VDAPERCRAI
FDELWRAYLS TPASEEGDLR PTSAVYRALM EAWASSGRAQ APDEVDALLS LMEKKAEQGL
IDPPDKKVYA LVMATHWKHN DRNAVAKVQD VYSRMTASYE MGNIAAKPDA HCQTILMNAW
AKSDVPERAK IVLDLLREMF QAYSQGDLDM QPNAYALAAV LNACAFVDKD NETLRRQAVQ
IALTAFNDFS NSELEGTNPF IYCYLFRVLG HQVDDMVERT RLASVIFQRC CQEGFVDDQV
IKMMRRYVPV LYKKIPLDGK NKPRLPIGWT RQLD
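
Both sequences above are displayed in blocks of 10 characters, six blocks per line. The following is a minimal Python sketch of how such a listing might be collapsed into one continuous string before checking its length or base composition. The helper names `join_blocks` and `gc_fraction` are hypothetical and do not come from the source database.

```python
def join_blocks(listing: str) -> str:
    """Remove all whitespace (spaces and line breaks) from a block-formatted listing."""
    return "".join(listing.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for empty input)."""
    dna = dna.upper()
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# Last line of the protein listing above: three full blocks plus "RQLD".
tail = "IKMMRRYVPV LYKKIPLDGK NKPRLPIGWT RQLD"
print(len(join_blocks(tail)))  # 34
```

Applied to the full listings, the joined lengths should match the reported 3279 bp and 994 aa, and `gc_fraction` on the joined gene sequence can be compared against the reported GC content.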