Gene PHATRDRAFT_47270 details

Gene Information

Locus tag: PHATRDRAFT_47270
Symbol:
ID: 7202357
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 140437
End bp: 143782
Gene Length: 3346 bp
Protein Length: 897 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002181665
Protein GI: 219122672
COG category:
COG ID:
TIGRFAM ID:
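
The Gene Length field can be cross-checked against the Start bp and End bp fields. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the values in this record; the function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates.

    Assumption: both endpoints are counted, so the length is end - start + 1.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp as listed in this record.
gene_length = span_length(140437, 143782)
print(gene_length)  # 3346, matching the Gene Length field
```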


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CACATAGATG CAAGGCAGAA AAGTACCGTG ACGACCATGG CAACTCCGGA CCAAATATTG 
GCAGCTTCTC GCTACCGAGC GGGAGGTGGT GCTCTTTTTA CGAGTTCCAA GTGCCCGCCG
GGTCCAGGCC TTGAAAATAG TTGCCAACTT CCTTTCGGGT TCATCTACAC TCCACTTTCT
CCTCCCGATA ATATTCAAGT CGTGCCTATC CACGACGAAA ATCTGCCGCC TGTAATCTGC
TTAACGTGCC TTTCGTACTT AAACTTGTAC TGTGATGTGG ATGAAACAAC AGGCGTTTGG
ACGTGTGCTC TTTGTGGATG CAAGAATGCT GCGCCGCCGG AAAGCTTTCA CAACGGAACG
CTTTCCCCAA TATTGATATC TCCCATTGTA GAGTTTCGGC AGCCCATTGC GGAGGCGCAT
GATCGTGTGA ATACCATTTC GGTGGTGGTT GTGATGGATG CCAATCTTCC ACGAGCTGAG
GCGCAGGCGG TAGGATCGGC GTTGCAAGCT ATCCTTCCCG AAATGGCCGA CGCGAAAACA
CGAATCAACC TTGGATTTAT TGTTTTTTCT AAACATGTGT CAATTTACCA ACTTAACTCA
ACCGGTGTTG CTTCCGCTGA CATATTTTCA ACCCATGAAG GACTCACCGA GAAGCACTTG
GAATCAAGAC AATACCTCAC CGAAATCGGA CAGGATGGCA GCCTAGAATG CATGTGGCGA
TGCCTGTCCG CTGTGTACGG AGTTGTGTTG GATAATGAGG AGGGAAGCGA AATAAACGTT
TCCAAGGGAA AGCAACTATC TCGATTGGAA CAACTAAAAC AACGCAAGGA GACACGAATG
CGTAAAGAGC TTGAGAGAGA CGACGACGAT CCAGATGTTG TAGTCAAGTC GCCTTGGGTG
TTGGCGAAGG AAAACAGTGC ATCCAGGCAC CCTTTGCGGT GTACAGGAGA AGCTATACAG
TGTGCCATTG ATCTCGTCGC CTCGCCTTCC AATGTCGATT TGATCGAGTC GAGAATTTTG
GTCTTTACTA ACGGATGTCC CAACTATGGG GATGGCAGTG TCGTTTACGA CGATAGAGAC
ATGACAACGA CAGCACGAGC CCGTCCAACA GCAGATGTGG TTGATCCTCT CAAGCTGTCC
GGGGCGGTGG AGTATTTTAG CATTATCGCA AAGGCAGCCG TGGAAGGGGG TATTGCCATT
GATGTATGTT GCTCTGGTGC GTCTGAGCTT TGCCTTCCAG TGTTCCAGGC GTTAGTCGAA
CCAAGCTCGG GCTATGTTTT GCCTCACGAA ACCTTTGCGG GACCGCATCT GAAACACAAT
ATGAACCACT TTCTGAAAGA AACAAATATG ACAATGGCAG CGTGTAGCGA AAGTCAGGCG
GAAAAGTCGA AAAGCCTAGC GCCTTCTGGC TGTACAATCG ATATCCGTAT GCCAAGGTAA
GTTGCCGGCC AGTGAGTGTT CGATCTCCTT TTGAGCGGTA GTATCTAAAT GAAATTGAAC
AGCTTTGTGA ATCCCACACA CCTGGTTGGT CCGGGTGAGA TTCTTGATGA CTTCAAAGGT
TTGTTGCTGA ATGAGCGTTC TGCTTTCGCC GCCGGATGTA AGCTAGCCGC TCGTATCGGA
ATGAGAACAA ATCATTTACC CCAGAAAGAT TTTGTTGACG ACGCAGTGAC CCGACTATCA
ATGGGAAGAA AAGATCCGTT GTCAACTTTT TCCGTTATGC TTGAGATCAA CAATTTCTTC
CAGAAAGACG CCTTTGCTTT CGTTCAGTGC ATTGCTCGTT TCGTCGACCG CAGAGGACAA
ATTCTGATAA CGAGAGTGTT TTCACATCGA ATTTCTATTG CAAACGACGT CGGTGAGTTC
TTGGATTCTA TTGATGAGGA GGTGGTGCCC GTGGTCCTTG GAAAAGAAGC TGTCTATAGG
TCAATGTATG GAAGAGAAAT TGACGCCAGA AACGAAGACG AAACAGAAGT TGCAACCTCT
GATGAACTCG ACGATCTTGC GTATGATGCA CAAAAAGATC TTGACGCGAC AATTCATCGC
ATATCCGTTG CTTTTCGCTT GCTTGGACTG GAACAAGGAA ATCGTGGGTA AGTGGACATT
TGTGTAAATT TGCATTGAGT CGGCAACTAC TCTTACATGT TGCTACTTGC GACGATCCTC
CAGATTGGAT CTTACCGAGG AAGGGGGAAT TCGCACAGTA GGGTCTTCGA TTGATTTCGC
GTTTCCCCCC GAGCTTTCGG ACGCACTGCG TCGTCTATAC CACCTGCGAC GTGGTCCTCT
GCTGAGCCCC GGCCCGATGC GATCGGATGA CGATCGGGCG CAGATCCGAT CTCTTTTCCT
TCGCTTACCC TTGGAGGATT GTCTGTGCAT GTGTGCGCCC TCTTTGTGGC GCACGGAGGT
TACTCCAGAG TGCAAGTCTG ACTCTGTAGA GTGGATCGCT GTTCCACCCG AATCTCTAGC
TTTATGGGAC AAAGTGGGTA ACGTTGAGAT GGAGTGCGTA TTTACAATGA TGTGGCATGA
GAACTAACAC TCATGCTTGT CGGTCTCTTC GTATAGACGG CTATTGTGGC AGACTGCTAT
CATAGTCTTT TCATTTGGTT TGGGAGGGGG GTACCAGAGT CTTTCTTTGA TTCGATTCGG
CAGCAGGCGA GGACATATTT ATTGGATCGT TCTGTTATAC GCTTTCCGAT GGCAGAGATA
TACACTGTTT CCGAGGGCGA ATCCATGGAT CGTAGATTCA CTGCTCTTCT AGCTCCCTCG
TACGGGGATC CTGTCGACCA TCAAGTAGCA AATTTTCCGG CGCTTGGTCA GTTGTCACCA
CAAGAGCTCG AAAGTCTTCG TTGTAAATTC AGATTTTACG ACCCTACTTC AGACCCAAGT
TTCCGAACGT GGTTCTGGGA CGTTGCGAGC GCAACTAGCT CAAGTAAAGA GTTCGGTCTG
TCGCTCTGCG AGTAAGGAAC TTACTTAAGC CTATTCACTT CCACACGTGA GGGTCCGATA
AGAAGATCTC TTTTTGGTCT AGCAACATTT TCCTCGCAAG TCGGCTCCCG TAGCTAATGA
TGGCTGTAAG TGGTACGGAG CTTGAAGTTG CCGTGTACCT GAAAAGCATT TTACCTACTG
TGTGAGATGG AAGTCTCGCA CACGTAAATG GGTAGAACTA AAGCTTCATA CGCACAACTC
AAATCCTTGT CCATTCATTC CAAGCATAGT TATCTTTCCG CTCGTAATGT ATGAGAGACC
TTGCAGATTT GGAGGTGCTT CGGCAAAAGC TATTGCTGGA CAGAAGCAAC CGTTTTCTGA
GGAGGTCAAA ATATCGGATG GCGCGGCATC GTCCGAGATC ATATCC
 
Protein sequence
MATPDQILAA SRYRAGGGAL FTSSKCPPGP GLENSCQLPF GFIYTPLSPP DNIQVVPIHD 
ENLPPVICLT CLSYLNLYCD VDETTGVWTC ALCGCKNAAP PESFHNGTLS PILISPIVEF
RQPIAEAHDR VNTISVVVVM DANLPRAEAQ AVGSALQAIL PEMADAKTRI NLGFIVFSKH
VSIYQLNSTG VASADIFSTH EGLTEKHLES RQYLTEIGQD GSLECMWRCL SAVYGVVLDN
EEGSEINVSK GKQLSRLEQL KQRKETRMRK ELERDDDDPD VVVKSPWVLA KENSASRHPL
RCTGEAIQCA IDLVASPSNV DLIESRILVF TNGCPNYGDG SVVYDDRDMT TTARARPTAD
VVDPLKLSGA VEYFSIIAKA AVEGGIAIDV CCSGASELCL PVFQALVEPS SGYVLPHETF
AGPHLKHNMN HFLKETNMTM AACSESQAEK SKSLAPSGCT IDIRMPSFVN PTHLVGPGEI
LDDFKGLLLN ERSAFAAGCK LAARIGMRTN HLPQKDFVDD AVTRLSMGRK DPLSTFSVML
EINNFFQKDA FAFVQCIARF VDRRGQILIT RVFSHRISIA NDVGEFLDSI DEEVVPVVLG
KEAVYRSMYG REIDARNEDE TEVATSDELD DLAYDAQKDL DATIHRISVA FRLLGLEQGN
RGLDLTEEGG IRTVGSSIDF AFPPELSDAL RRLYHLRRGP LLSPGPMRSD DDRAQIRSLF
LRLPLEDCLC MCAPSLWRTE VTPECKSDSV EWIAVPPESL ALWDKTAIVA DCYHSLFIWF
GRGVPESFFD SIRQQARTYL LDRSVIRFPM AEIYTVSEGE SMDRRFTALL APSYGDPVDH
QVANFPALGQ LSPQELESLR CKFRFYDPTS DPSFRTWFWD VASATSSSKE FGLSLCE
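
The sequences above are listed in space-separated groups of ten letters, so they need to be joined before any length or composition check. The following is a minimal sketch of one way to do that in Python; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the GC calculation simply counts G and C over all letters, which may differ from how the GC content field in this record was derived.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    # str.split() with no arguments splits on any run of whitespace,
    # so the spaces between 10-letter groups and the line breaks are removed.
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First row of the gene sequence listing, copied from this record.
first_row = "CACATAGATG CAAGGCAGAA AAGTACCGTG ACGACCATGG CAACTCCGGA CCAAATATTG"
seq = clean_sequence(first_row)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this row only, not of the whole gene
```

Applied to the full listings, `clean_sequence` should yield lengths that match the Gene Length and Protein Length fields (3346 bp and 897 aa).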