Gene PHATRDRAFT_46203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46203 
Symbol 
ID7201275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp574120 
End bp577895 
Gene Length3776 bp 
Protein Length1242 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180670 
Protein GI219119837 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00106502 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGGGC GTTGGAGCTT TTCTCATGTG GCTGCAACGG CGGCTAGCAT CCGCGGAACG 
TCGCCTGATG CAGCAGACTC AATGTCGCTC CCATCAATGA ACGATTCCTT CACAGTTCTG
GACACTGATC CTGTGTACTT TTCGGACGGT GAGCTGGAAG CCATGTGTAT GCAAGCGAAT
GAGGAACAGA GGACTATGTT ACAGAATGCG AATGGAATGA CAGTCGAGGG AGAGAATGAT
TCGCTAGACA ATGAACGGCG CCATGAGACA GAAAGTGTCC GCATGACAAC CAACAGCGCC
GACCATCGTC AAAGTGTGTC CACAACCGAA TCGCCGCGCG ATAAAATCTG GAAGAAGTGT
TTAATGCAAC AGCATTCTGG CAATGCGATA CAAAATCCTC TCGCAACTTC TACACCAGCA
CTCCAGGCTC TAAACTACGG TACTTCTAGA CCCAACACAA GTCCTTCCAC CCTTGCTTTC
GATAATGGGT CGCCTCGGTA TCATCGCCGT GCTCGACGTC GCAGCGAATC AAGTTACGGA
ATGAATACGT GGAAAAAGCG ATACGCTCAC CACGTACTCT CTCCGGAGCA TCACCTATTA
GGAACAGCCG CTTCCAAATC AAGGCAAAAT TCGAGTTTTT CCGGTACCGA CAACCCGGAG
CAAGGCTTGA ATTCGCAACG TCAACTGTTC TCCTCCCGTG ATTTTCCAAG CCCCCATGAG
CAGCGCTTTG ATTTCTACAT GGGAAGTTCG ATTAATGCAT CTCCACTTCG TCCTTCACTT
CCCACGCCAC CATCACCATC CACTATATCA GACCAGTCAA CGAGAGAAAC GAGCTCGCCG
CCACGCTCTG TATCGGAAAA CCATAAGAAA CCATATTGTA GCAATACCAA CACCCAAGAA
AAGCTGGCTG AAGCAGCTCA TGGGTCATTG GTCTTGCCTT TGTCGCCAAA AGACGAGAAT
ATTGTAACAG CAATTCCGCC GCCTGTCGAA GCCACTACAC CATCCGGGAA GATGACCACT
CCCAGACAGG CGATGCACGC TTCGCTAACA CAAGAACAGG CTCTCGTCAT TCAAGAACCA
GACAGCAGTC CAAAAAGACC TGCGACTGGC CCGTATTCGT ATCCGGAATC AATACCTAAG
AAAATTGTGA TAGTGCCCCA GTTTGATGAT CATGATGATG ACGACCATCA CCCGATGCTC
ACGTCGTATG AACCGTCCAA TGGGGAAGAG GTACAAAGGG ATGCAGAGCC GAACAAATCT
TCGTCGATTC CCTCGAGAGC AACACCGACG ACTGGTACAT TGCTTGACCT TTACGAGGCG
CCAGTCCTAG CAAAGGAGTT CTCAGTTGCA CTTGTCGAGC CTGTTCTACC TACAGTGAAC
AGCGCCAGTG AAACATTTCA CTGTTTGACT CCATCGCAAA ATGAACCGCC AGCACGAAAT
GTCTTGCAGA CGGAATCAAT GATAGCATCC TTCCATACCT TTGACTTGGA CAAGAAGCCT
AAAAGGGTGG CTTCTCCCGA TGAGCCCATA GCCTCACCGA GATTGAAAGC GACAGCCAGG
GCACCATTGC TGACTGTGAG TGTTTCATCA GATCTCCCCG AAGAAAGTAT ATCCCAGGCA
CCTACAACTG TATCCAAAAA ATCATCAGAC CCGTTCGAAT GGGCTTATGA CATCTGGCGA
GGGAAGAACT TACTCCTGCC CAAGAGTGCC GTGCGTCGCG ACCCATCGTT TACATCGCCT
TGTAAAATCG AAACTTCAGA AAATGAAGTC TGCATCGAGG ACGCCCGCGT ATCTCCTCGT
TCGACGCCGT TTTTGCTCCC TCTTGAGACC ATCGTGCCCT CTACAACACC GGGCATTTCG
ACGTATCCTT GTCCAAATGC TGAAGCAGTT TATTGTTCAG CAACACCGGT GAAGGGTGAA
AAGGCCTTTG CCAACGTCTT ACAGGGATGG AAAACGGTCA GTAACGAGAG ACCTTGCACT
CAGTTTTTGT CTCCAGAAAA CAGCGTGATA TTCAGTCAAA CAAAGGCAGG CAGCGCAGTC
CTACCTTGTG CAAATCTTGA ACACACGAGA TTAGATCCGA AAGGCAATGA TACAGAAACG
CTCACTTATC CAGCATCGAC GCAGTCGCAT CGCTCTTGCA ACAATATCTC TACTCCGCTC
TCCAGCACTG GCAAAGCTTC GATTTTAAAA TTGGAATGCC GTCAAGGAAA TGGCTCCTCG
GATCCAATAG TATCTAGAGC GATTACAGTG ACTGATGACC AGAACATGCA GTTCCACAAT
GAAGTTGGCT CTATTTTGTC CCCCAACCTG GTATCGAGCG CACCTTCTTT CTGGGACAAA
GCGATTGAAG CAAACGATCC CGCACAGAAT GCGCAAGACC AAATATTTGA TCTTGCCAAA
ACAACGTCGA CATTACTTCT ATCCTTACCA GGTACGACAA CATCATACTC CGAAAAGCAT
TCTTTCAAAT TGAAGGAGGG ATCGACTTCA GGAGAAGCGT TGCTGAACGA TCAAAGTGAA
AACTTTGGTC AACCAACTGA CATTTTGGAG AGCGACTTAG GCAGTGCAGT GTGTAAGAGT
TTGGATCTCG CGTACCTCAA AAGTGCTACA TGTGATGGAT CAATACCGTC AAAAGCCATT
CGCAAGAGCC GACATAGTAT CGATCACAGT CTCATGGTTG TGGGCAATAA GGCTCGATTC
AAGGGCAAGC AAGAAGCGTT TGATGACGCT GTCTCCGTCG GCGAAAGCAC AATATCCTCA
ATGACTTCGT GCACAAGTCG TGTCGGAAAC ATTGAGAGCC GAACGAGAAT TAGAGATAAA
ATCAGGAAAG TCAATGCGAA AAAAGACTTG TCTTCAATCT CTCCAGGTGC GGCAAAAAGC
CGTGACACTT CGTCGACTCA AGCCGCAAAG ATTAAAGAAG TCTACAGGAA GAAGCGGCTG
GTTATGCGAC AAAGTATGGG CCAATTGTCT GAGGCCTTAC CAAATCGACC AATGGAGCCC
TCGCTTTGGA GTCAAAGCGA CTTTAAGATG TACGCTGCGG AGCTTCTCGA CATGCTGCCG
TCCGAGATTT CAGGGGCTTG TTCGACAGAT GATTCGCGAG ACATTGAGCC TGAGGAAAAG
GAATTGTGCA ACCTTTCTTT ACAGCCGTTT TCCACACATA CAAACTTGGA CCATGCATCT
ACTGTGAACG AGGGGAATGC CACCTGTAAT TGCTCCAAAT CGGTCTTCTC GGGGAACGAT
GAATTAATTG AATTCTTCTT GCCACGGTTA GGAATGGCTT GCACTTGTAG CAAAGGGTTG
CAAAGCTTGA ATTATCCCAG TGAACCAGAA TCCCTTGCTA ATATTTTGAG ACCATGGCAA
GTTGCCTACT TGGGAGACTT TGGTATACAT CGTGGAGATC AGCTTGTGAA GGCCCACCAC
AGGAGTGCTG ATGCATTGGC AAGCGCGATG CGCCAGTACC GTCGAGACCA CGGGTTGACA
CCTTTCCGTA CGAAAAGCTG CGGGATGGCT CTTTCCATTT GGGCGAAGAC TGCAAAAACA
TACATTCGAT CAGTTCGGAA ACAAACGACA GCACATGGGG AAGTAGCTTG GAATCTGCCC
AATACTCTTT ACATCCTCAG CTCCTTCCTG GAAAAAAATC CAGGGAATTC GGGGAGGCTG
TCCTCCCCCA TAGATCAGTT TGAGGCTGAG AGCAACGATG GATCACCATC AGAATTTAGT
TGCATTTAAC TGTAAGTCAA TAAAAGTAAC ATATAGCAGA AGAAGTGACA TTAGTG
 
Protein sequence
MLGRWSFSHV AATAASIRGT SPDAADSMSL PSMNDSFTVL DTDPVYFSDG ELEAMCMQAN 
EEQRTMLQNA NGMTVEGEND SLDNERRHET ESVRMTTNSA DHRQSVSTTE SPRDKIWKKC
LMQQHSGNAI QNPLATSTPA LQALNYGTSR PNTSPSTLAF DNGSPRYHRR ARRRSESSYG
MNTWKKRYAH HVLSPEHHLL GTAASKSRQN SSFSGTDNPE QGLNSQRQLF SSRDFPSPHE
QRFDFYMGSS INASPLRPSL PTPPSPSTIS DQSTRETSSP PRSVSENHKK PYCSNTNTQE
KLAEAAHGSL VLPLSPKDEN IVTAIPPPVE ATTPSGKMTT PRQAMHASLT QEQALVIQEP
DSSPKRPATG PYSYPESIPK KIVIVPQFDD HDDDDHHPML TSYEPSNGEE VQRDAEPNKS
SSIPSRATPT TGTLLDLYEA PVLAKEFSVA LVEPVLPTVN SASETFHCLT PSQNEPPARN
VLQTESMIAS FHTFDLDKKP KRVASPDEPI ASPRLKATAR APLLTVSVSS DLPEESISQA
PTTVSKKSSD PFEWAYDIWR GKNLLLPKSA VRRDPSFTSP CKIETSENEV CIEDARVSPR
STPFLLPLET IVPSTTPGIS TYPCPNAEAV YCSATPVKGE KAFANVLQGW KTVSNERPCT
QFLSPENSVI FSQTKAGSAV LPCANLEHTR LDPKGNDTET LTYPASTQSH RSCNNISTPL
SSTGKASILK LECRQGNGSS DPIVSRAITV TDDQNMQFHN EVGSILSPNL VSSAPSFWDK
AIEANDPAQN AQDQIFDLAK TTSTLLLSLP GTTTSYSEKH SFKLKEGSTS GEALLNDQSE
NFGQPTDILE SDLGSAVCKS LDLAYLKSAT CDGSIPSKAI RKSRHSIDHS LMVVGNKARF
KGKQEAFDDA VSVGESTISS MTSCTSRVGN IESRTRIRDK IRKVNAKKDL SSISPGAAKS
RDTSSTQAAK IKEVYRKKRL VMRQSMGQLS EALPNRPMEP SLWSQSDFKM YAAELLDMLP
SEISGACSTD DSRDIEPEEK ELCNLSLQPF STHTNLDHAS TVNEGNATCN CSKSVFSGND
ELIEFFLPRL GMACTCSKGL QSLNYPSEPE SLANILRPWQ VAYLGDFGIH RGDQLVKAHH
RSADALASAM RQYRRDHGLT PFRTKSCGMA LSIWAKTAKT YIRSVRKQTT AHGEVAWNLP
NTLYILSSFL EKNPGNSGRL SSPIDQFEAE SNDGSPSEFS CI