Gene PHATRDRAFT_45473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45473 
Symbol 
ID7200571 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp293841 
End bp298085 
Gene Length4245 bp 
Protein Length1226 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179615 
Protein GI219117648 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACATCC TGGCACCGAG AGGTAACATT ATTGCACTAT TGGGTCTCCC ACGCGGCAGC 
GTTATTAGTT TGGATGGGCA AACGGTAGCT TTGAAACGGG ATGACTTTGT AGGCTTTTCG
AACGTCCCCG CAAGTGACAG TTGTCATTTT GTAACGATAA GGGCGACTTC GAAAACTAGC
ACCACGAGCG ATTTCCCAGC GCAATTGGCA GCAGCAGTGA CGGTTGCCTA CATGACCTGG
GACAATGACT TGATTCGCAA ATTCGATCCG CAAACAGAAG AAATGTCCAG TGTACCTGCC
GACACTTGCA CAACCGACAA TCTACTGCAC AGAATTGAAA CACGGCACAT CGATGGACAA
AGTTTAATTT CTTACCCACA GCTATTAAAC GACGAAAAGG AGAAGGACTG GGTATTGCTA
ACAAATCACG TCTCGAAACG ACTGCTACAA AAACGCCACA TAGGAACAAA TGATAAAATT
GTTCCAGGAT TGTGGGATGA GCAGGAAAAC GAAGCAATCG ACGGTACACC GATCCACTAC
CCCATCATTC CAATCTTAGC CCCACATGCA TCAAGGCATG CAGCTACAAA ACGATACTTG
GCAAACTTGT CTCCCTCCGC TCGGACAAAA TTATATACCC ACGAATCACC TTCCGATTCC
ATTTTTGAGC GAGTGTTGCT TGAAAAGTAT GACAATGACA GCGGGCTACT CTTGGGCGAT
ATACAATTGG CATATGTAGT ATTTTTGCAC TTGCACTGCT TCGTCTCTTT TGAACACTGG
CGGGACATGA TCGTCATGCT AAGCTTGCTA TCTGAAAAGG TTTTTGCCAA CGACCGCATG
AAGCGCTTTC CAAACAAGCT GATACAAGTG CTCAAATCAC AGTTGATGGT AATCGGAGAA
GATTTGATCG AGAATTCAGA CTTGTTCGAC GACAATGATT TATCACCTGC CATGCAGCGT
CTGATTGCCA TCTTGATGAA GGTTGTAAAA GATGGTGAAG GGCAGGCTCT TTTGTCTCGA
CTATGGGCCA TGCTTCGTAC CCGATTTCCG ATGTTTACAG ACAACGAGTT TCATTTATCA
AGTAATTACA GCTCTGGAAC AAACAACGAA GAGACAGAGG AAAGTGACGA GGATAGACCA
GTTCTAGTGG CCAGCGAGGA AGTGCAGGCT TCGTTGGCTC GGTCACAAGG CGCGCGGTTG
TCAACAGGTA TTGCCTTTGA ACCTGTTTTC ATGAAGACCG AGCTTCAGGA AGCTTATCCA
CTCTTACTTG CAGCGGTCAT GCCGCACGAA GATATACTTA TGACGTGCGC CAGGGCGCTG
GACGAAGCCA CTGATGTTTC TTTGGTTCGC GAAGCCGCAG CTTACTTGGA GCAAGTCGAG
CTACAAAGAC ATACTACAAG GTAAGGCGAC TTGGACTTTT CACCATTGAC CATCAATTTG
TAGAGAGATC GAACGACGTG GTGACTAAAA GAGAACCTCC CCTCGATCAT GTACTTTGTT
AGCATAACAT GCCTGGATTT ACAGTTATTC ACATAAACTT ACAAAGACAC ATCATTCTTC
GACTTCTTTG GTCCCCCACC GTTTCCAACC AATAAAATCT GTTCAGACAT TGTGGATAGT
GTCGTATCGA CATAGCTTTG TGTCGTCCAC AAGCCTTTTC TGGCCAAATC CTTGGAAGAC
ATCGAACAGT AGTAGTGAAA AAGACATCAT CTCATCGTCT CACTGCCAAA TGAAGTATAG
GATCGAATTT ATCGTTCACT GGTACCGGAA CTGGACTTTG TTTTACCTTG TATTTCTACA
AGCAGCTATG GCGCAAGTGG ACTTTGTAAA GGATGCGAAG GAAATTCGTT CTTATTTGAA
GACAGAGGAG GACGCGTTTG TGACCGATGG CATACATGGT CTTATCCGAG GCTCCTCTTC
CATCGCACGT CTCATCCGCA ACGCGTTGCA TACAGATGGG TACTTTGTTT TGCATAGTAT
CTTGACGACC AGAGAGTGCC AGCAAGCCGT CGATAGGATG TGGGACTTCG TACATGATAC
ATCGGCGGGA TCCGTGCAAC GGGAGCAAGA TGAAACCTGG GCTTCGTGGC CATGTGATGA
ACCCATCGAT CGTTGCAATA GTGTGGAAAC ATTTAATGTA AACGGAGCTG GCTGGTTACT
TGGAGATCTC CGGGAGCAAT TGGCGGAGAG GGTTTTCGAA GATTTGTTCG GTACTTCTGA
GTTGCATTCT TCTAAGGAAG GGTTTGTGTT TGGGCGTCCG AACCATGTAG CCGCCGACGG
CAATTCCGAC ATGTCGTTTA CGCGAAACAG CGATGTGACG ATTCGATCTG TAGTGGCGCT
TGAGGATGCA GGTGCTGCTA AGGGTGGATT TTCTTTCTTC CCCGCGTCAT TTCGAGCTTC
ATTGGAGGAA TGCTTGGACA AAAAGCCAGA AATAGTCAAT CTAAAAAAAG GAGATGTGTT
ACTTTGGCGG TCGGATCTCC TTCATGCTTT GATCCCACCT TCACAGCCCA CGCTTCAATT
TCAAACATTG GCACTCGTCA GCATGCAACC CGCTAGCCGA ACTCCCGTGT ACCTGCGCAG
CCTCAAAATG GAAGCGTACA AACAACGAAG AAGTGGCAGT CATTGTGTCC ACGAAGAGAA
CTGGAGCCGA AATAGTGGCA TCGGTCGCCC CTATTTTCGC TGCAGTCCAC CTCTACTAAC
CCGACGACAA GCGGAATTGT ACGGTCTAAT ATCATACACA AGCTGTGACG AAGCATGGAA
AGAAGAAAAG AAGCGAGCCA TGGTATGCGG TGTGCGTTTC CAAGATGAAT TTGAGGCGCA
CACATTTCCT ACGGCTCGCC CGTGTTCGGC TGTACTTGAC TATTTGACGA CAAATAATCC
AGGCGACATG ATGGGAAAAG ACAAATATTT AGGTGGAGTG GCATCGCCAT GTGGAAAATA
TGTGTATGGC GTCCCAGGAT CGGTACGCTT TTTCTTCTTT TGGCGTTGCG TCTAAAAAGA
GTGGACTTAG TAGCTTACAT GGATCACTTG TTCTAACAGG CGCGACGCGT GTTGAGAATT
CACGTAGAAG ATGGAAATAT GGACTGTATC GGACCTTCAT TCGAAGGGAA ATTTAAATGG
TTACGGGGCG TCGATGTTCC AGCGGAATCG ATGATGGACA AAAGGTATCC GCAAGGATGC
TGTTTGGCTC TCCCTTGCAA CCATAGCTCC ATTTTAAAAA TCAATCCATC GACAGATGAG
GTTTATTCAT TTGGCCAAGA CACCATAAAA GGCTGCGGCA GCGACGATTG GCTCTACCAT
GGTGGAAACC TGGCTTCAAA TGGTTGGATT TATGCGATTC CTGCAAACGC GAAACAAGTG
CTTAAATTCC ACCCGGTAAC AGACAAAGTA TACTTGATAG GGCCAAACTT TCCTGGTCGG
TGCAAGTGGT TTGGTGGAAT CCTTGGTTCC GATGGTTGCG TCTATGGTAT CCCTCACAAT
CAGACCGGCG TACTGAAAAT CGATCCATCA ACAGATCAAG TCGCAATTCT CTACCATGAC
AGTGGAAAGC CGCTGCCGGA TGGTCGCTGG AAATGGCATG GTGGTATACG TGCTGGTGAC
AAGATCTATG GGTTTCCAAA CAATAGCGAC AACATTCTAG TCATCAACTG TCGCCTCAAA
CAAGTCTACA CAATTGGGGA TAGTTCAATT TTGAGATCGG GTAGACATCG AGTGAACAAT
GATAATCGTT ACAAGTACCT GGGTGGCGCT TTAACATCAG ACGGTGGGTT TGCCTACCTT
TTTCCGTGTG ATGCAGAACG TGTTCTCCGA ATTAATTGCG ATACGGATGA TCTCGCGCTC
GTGGGGCCAT TTTTGCTCGA AGGGGAAAAC AAGTTCCAGA ACGGCTTTGC CGCACGTGAT
GGATGTTTGT ACGGTATTCC ACAGAGAAGC TCAGGTGTCT TGAAAATAAC ACCGTCTTCG
AATCCCAGCG AAGAGGACCA CGTTGATATT GTATATTGCG GCGATGACAT GATAGGTTGC
AAAGATAAGT TTGAAGGAGG AGTCCTGGGA CTTGACGGGC GTTTATACTG CATTCCTTTG
CGAGGTGAGC AGCTTCAACA TATCTTGTTA AATGCATTCA CAGTCTGCAT GCTCTGACAC
ATACTACACA TTCTCAACTA CAGCAAACAC ATGTCTAAGA ATCACACCTT CGTATAGTAC
ACCATCCTAA AAAAAAACAA GAATTCCAAA CAGTATTCTA ATTTG
 
Protein sequence
MNILAPRGNI IALLGLPRGS VISLDGQTVA LKRDDFVGFS NVPASDSCHF VTIRATSKTS 
TTSDFPAQLA AAVTVAYMTW DNDLIRKFDP QTEEMSSVPA DTCTTDNLLH RIETRHIDGQ
SLISYPQLLN DEKEKDWVLL TNHVSKRLLQ KRHIGTNDKI VPGLWDEQEN EAIDGTPIHY
PIIPILAPHA SRHAATKRYL ANLSPSARTK LYTHESPSDS IFERVLLEKY DNDSGLLLGD
IQLAYVVFLH LHCFVSFEHW RDMIVMLSLL SEKVFANDRM KRFPNKLIQV LKSQLMVIGE
DLIENSDLFD DNDLSPAMQR LIAILMKVVK DGEGQALLSR LWAMLRTRFP MFTDNEFHLS
SNYSSGTNNE ETEESDEDRP VLVASEEVQA SLARSQGARL STGIAFEPVF MKTELQEAYP
LLLAAVMPHE DILMTCARAL DEATDVSLVR EAAAYLEQVE LQRHTTRIEF IVHWYRNWTL
FYLVFLQAAM AQVDFVKDAK EIRSYLKTEE DAFVTDGIHG LIRGSSSIAR LIRNALHTDG
YFVLHSILTT RECQQAVDRM WDFVHDTSAG SVQREQDETW ASWPCDEPID RCNSVETFNV
NGAGWLLGDL REQLAERVFE DLFGTSELHS SKEGFVFGRP NHVAADGNSD MSFTRNSDVT
IRSVVALEDA GAAKGGFSFF PASFRASLEE CLDKKPEIVN LKKGDVLLWR SDLLHALIPP
SQPTLQFQTL ALVSMQPASR TPVYLRSLKM EAYKQRRSGS HCVHEENWSR NSGIGRPYFR
CSPPLLTRRQ AELYGLISYT SCDEAWKEEK KRAMVCGVRF QDEFEAHTFP TARPCSAVLD
YLTTNNPGDM MGKDKYLGGV ASPCGKYVYG VPGSARRVLR IHVEDGNMDC IGPSFEGKFK
WLRGVDVPAE SMMDKRYPQG CCLALPCNHS SILKINPSTD EVYSFGQDTI KGCGSDDWLY
HGGNLASNGW IYAIPANAKQ VLKFHPVTDK VYLIGPNFPG RCKWFGGILG SDGCVYGIPH
NQTGVLKIDP STDQVAILYH DSGKPLPDGR WKWHGGIRAG DKIYGFPNNS DNILVINCRL
KQVYTIGDSS ILRSGRHRVN NDNRYKYLGG ALTSDGGFAY LFPCDAERVL RINCDTDDLA
LVGPFLLEGE NKFQNGFAAR DGCLYGIPQR SSGVLKITPS SNPSEEDHVD IVYCGDDMIG
CKDKFEGGVL GLDGRLYCIP LRQTHV