Gene PHATRDRAFT_47057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47057 
Symbol 
ID7202148 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp294414 
End bp299163 
Gene Length4750 bp 
Protein Length1458 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181176 
Protein GI219121652 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCATG AAAAAGATGT TTCCGAATGG AGCGCTTGGG TCCACAGCCT GTCCTGGAGG 
GAATTGCAAA ATGCGTCCGA GTTTACCATC CCATCCCCAA CAATGCCGGA CGATGAGACT
ATATGGCGTG ATTTGTCCCG TCTCCATACA CCGCTGCCAA CACCAATCTT TCCCGCTGCA
GCGTGTCCGC ATCAGGCCGC GGCCGATCGG TTCGTCGACC ACGCAAACGA AGCCCGACGA
CGACGTCGCG ATGCGCGGCA TCGGCCGCAA CTGTTTCAAC TCATCCCGAC TGCAACACCA
ACACAGCCCT CGTCATCGAT AGTGCACACA CGAAGAAATA ATGTACCAGC GTATCGAGTG
CACGCCCGTA GATTTCGGTC GGCGGCGGGA GATCAATGGG GATTGGGGTG TACGTGGGAA
CAGCGTGAAG CGGATTGGAA TCTTACCAAC GCGTTTCGAA TCGTCCACTT GCCAGAATCG
GGCCAAGCAT TCGGTAGACT ATTTCTCATC GACGATGGCA AAGAAAGATA CTCTCCATTA
CATAAGATCG ATTTTCTACG TTGCTTGGTG GTGGCTTCGC GTGGCCACTG GGGAACCAAA
GGGCCTCCTA ACAAGAACAA ATCTTTCGCT CGCTGGTTGG AACCAACACA TCGATGGTTT
TCTTTGCCCA TGTACATTGC GAGTCGATTG GAGGCTGCCT TGTGGGCAAC GTACCTGATT
CGTCAGCGCA AAACAGGTGC TCAACCACCG TCTCTCTGGG AGTCGGCGCT TTGGAATAGC
ATCTCGGAAA AACTAACTGA AGAAAAGATA CTTCACCTGA TTGCCCAATC CATTGGTCAA
GGTTTGCAAA CAGAAATTAT GGTCAAAGAC CGGGTCATGA CCCCCATGTT TCGAGATACA
GTCATGTACG GCTTGCTTGA CGGCTCTGTA CGTTGGCCTC TGTCGGATCA CACGAAGAAC
AAGAGTCATT GGATGGAGAC TCTTTTGACA AGCCGTGTGT TGGAGATATC GTCGCCGGAA
GATAGACTCC GCGCAAGTAC TCGAGAAGCC CTGCACCTAC TCTTAACTCA CGCCGTGGAA
GAAGCGCTCT TGCAAGAGCT GACACAAAAT GACAGGACAT CACATAAACT GTCACGCCCC
AAGTCCACGA AGCGACCCAC AAGGAATCGC AACAAAAAAA GAAAGGTATT GCAGTTCAAA
GGGCGATCGT GGAACGATCA CTGCCGCCAT TGGTACCGGA AGACGAAGAA TTGTCCTGCT
TGTATCCAGA AGAAGAGTCG GAGCGGATAC AGCGTATCCA CTTTTCGAAC AGTGGCGTGT
CTGTTCGTGT ACGCAATCGA AATATTGTAT TGTCGTTGTC GATATTGGAA GAAGTTCTGA
GTGCTGTGTT TGTCAAAGTC GGTCTGACAC CGATTGAACT ACCATTTGCC GAGCACGATG
CCAAACTAGC GCAACGTCGA GCTCAACAAT CTGAAGTAGA TCGAGCTCGA AAGGAATTTC
AGCAGAGAAG ACATACTGAA CTTTCGTCAA GTGGAACTAC TCGGCAAACT CTCGAGAATT
CATTAAAACT CGAAGGAGAG CGAACTCAAG CTCTGACAAG CCAAGTCATG AGTCAGCAAG
AATGCCAAGA GCAAGGCGGC TCATTCTTCT CTGGTCAGCA GTCACCATTT ATCAATCGGC
GACGGTTCGA CTCGATAGCG GTCGCTAACG CGTCCGCTAC CAGTGGGTCC TTTGCGGGTC
TGTTGTCCCC TCGTTACGAA ACAGTGGGAG GAGAATTTAT TCATCGATCA GAGGCGGTCG
ATTCCTCGTG GGAGTTAAGT GGATTCGCTC TCGATGGGTG GGGCCGGATT CAAGGTTTTA
CAAGTCGAGA TCGTAGTATT ATGGAAGATT TCTTTGATGG CCAAACTCAG GAAGCGGTAT
GCGTAAATGA TATCATGGCC TCTTCAACGG CGGCATCTAT CGCTTCGTCG ACGGAGGTCA
ATGCCGTGGA TGTGGACGTC GAAACGATAA TGGACGAAGA TGGATGTGGT TTGACAGACG
CAGTAAAAGA ATTGGCCATC CGGGAAGCCG CCACGGCAGG ATCAAGAATC TCGATGAAAT
CAGTTGTGGC CAACATACCG AAAGAGGTTC AATCTGGCAC TGCGATACCA TTTGTGGAGA
GCGCAATGCG ATTGAGCCAA GAATCAATGC TGCATGTCCA GCCGGAATGC GAGGATCTTG
CCATGACTGA CGTAGTCAGT ATTGATGATG AATGCCATTC TCCTTCACCT CCCCCGCCAT
CGACACCATC GCCGACGTTG TCCCCAATTC TTGTCTCTTT AGCAGATCTT CAAGAAATGA
GAAAGGAGTC GGTATGTCGA GACAACCCTG TGGATACAGC GAATCTTTCG TCGTATCGGT
CATTCTCAGC CGCTACAACA AGTCTTCCGA ATTCGCCTCG ACTTCTTACG CCACCACGTC
CACAGATACC AACCAGCCTT TCAAGGGACA ACCTCAAACT AGCAGATGAC TCCGACCAGA
AGACAACGGG GAAAGAATCC CCTGCATCCT TCAAACTCCA TCTCAAATCT CCCTCGGTTC
TACGAAAGTC GATGAAAGTA AAGTCCATGG ATGACACTGA GATAAAAAGC AAGTCTAACT
TGCGACGGTT CAACGATGGA ATCGTCTCTT ATGTACATGG CGCTTCCCGT TTACCCAGAT
TCCACGATGA CCACGCCATT CGTCCGCGAC GGTCCATATT GCTGCGTTCA CCGGATGTCT
TGACCCCTTA CCGAACTGCG GCGACGCGCG CCGTAGCTTC GTCCAATAGG ACTGAAAAAT
TCGATTCCTT GAAACCGTTT GAGAACCTGA ATCTGAATCA TAACACAAAT GCAACAGGGC
GATCGATTGG CGATGGCTGC GCTCGCAGCG AAATTGCTGG TGACGCCCAC GACGACTATT
TGAATTGGAA TGACAGTCAC AGGAGCACTG CAAATGACGA TGGCGACAAC CATACAGTGA
CGAAGGATGG CGACAACCAT ACAGTGACGA ACGATCACTT CAGCTCTATC GCAAAGGGAA
CCAGAGGAAG TATCAACCCT ACGAGAAGAA CGCGATGCAT ATCGTGACAT GTGTTTGACC
ATGGGTGCTG AGGTTGCTAA GTTAAAAAAT ATACTCGCTG CGCAAAAGTC ATCAATGTTC
TTTTCAGTTG CCCCCGAATA CGCAGATCCC TTGCTTTACC CTGTCGTTCA CACTCACAGT
GTTGGTCCAG ATGCCGTTAA GTTTGAGAAC ATTCCTCGAG CTCGAACACT TGCTGCAATG
AGCGATGCCG GATACAAAGG AGAGTATGAA TCGCTAGCCA GCGACGACGA TGGCGGGAGA
TTGGCAGGTT CACGACACCC ATCTTCAAGC GTGACCGGAG TCGGTTCGGA TGTATCTGTC
GAAAACTGCG GCCACTATGC TCTGCAATCT CCGCTGGGTG CACCTCTCAC CAGGGACCTT
CACGATCTTA CGTCGCTTCG AGGAATGCAG TCTCGCCTTT CAAAAGATAT TCTTCATTTT
CTTGATGCCA CAAACATGCA CCTTCGAAAG CAAGACAGCA AACGGATGAA AGCGGTGGAA
CGTATGACTC GATTGGTGAA TACTGTTTGG CCTCGTGCAC AAGTCAAAGT GTATGGTAGT
CATGTTACCG GTCTCTGCTT GCCGTCGTCG GACCTTGATT TTGTCATCTG TCTGCCGGCT
GTCCACAAAC GTGCACCCGC TGTTGCACCT GGTGCCTTGG AAGGTCGAAA TGCCATCAAC
GAAACGTCTC AGAAGCTGCT GGCCCGAAAG TTGAAAAGTG AGTCATGGAT TGATCCAAGA
TCGATGAAGC TTATTGAGAG GGCAGTCGTC CCAGTAATCA CAGTGTCTAC AAAGGATACA
AAGGCGCGCA CAATTCAACT TGATATTAGC TTCGATGGGC CCGGCCACCA TGGGTTGCAA
GCAATCGATA TGGTGTCCGA AATCCTTGAA GAGCTGCCGA TGCTAAGGCC TCTTGTCCTT
ATCCTCAAGC AATTTCTCCT CGACCGTGGC CTTCTGACAG CGTACACTGG CGGGCTATCC
TCATATTGTT TGTTTTTAAT GGTGGCACGG TACCTTCAGG AACAACAGAC GTCTTGGGGC
GACTGCGGAT CTTTACTAAT GGGATTTCTC GACTTTTACG GAAACTGCGT AAGTTCCGGC
TGTCTACACT TGTTTTGAAC ACACACGTAG GTACGCTCAT TCCCCTTTCA CTTTGAACGG
AATAGTTTGA CCCTCGAACA ACTGGTATAA GTGTACGTCA TAGGCAGTAC TTTCCCCGGC
CAAACTATTC ATCAGCTCGT ATGCATTCCC CTGGAATGCC TGTCTGGAGT GTGTCACCAC
CTCCCGTTGT TGGGAGTCCG CCTTTGACTC AATTTTTGCG ACGAAACAGT TTCAATGATA
GAGGATCGAG TGACGGTATG CAAATGGAAA ACTTTGCTAG ACCACCGCGA TACCAGCCAG
CTCTATCAAA TCGCTATACG ACGCCCGAAA TTCCTCATAC AACAGACCAT GCGGCCCACG
AACACTCTAT GCCGTATACC TTGGATCCCT TTCTTGTCGA AGATCCGCTT TCGCAAGGTA
ACAATGTCGG TCGAAATGCT TTTCGAATTT TCCAAGTGCG GAGAGCTTTT TCTGATGCAC
ACCGAGCGCT GGTGGCGGCT CTCGAATGGG ATATGCAGTC AGGAGGAGAC TTTGGCCACA
GCAGCCCTGA
 
Protein sequence
MKHEKDVSEW SAWVHSLSWR ELQNASEFTI PSPTMPDDET IWRDLSRLHT PLPTPIFPAA 
ACPHQAAADR FVDHANEARR RRRDARHRPQ LFQLIPTATP TQPSSSIVHT RRNNVPAYRV
HARRFRSAAG DQWGLGCTWE QREADWNLTN AFRIVHLPES GQAFGRLFLI DDGKERYSPL
HKIDFLRCLV VASRGHWGTK GPPNKNKSFA RWLEPTHRWF SLPMYIASRL EAALWATYLI
RQRKTGAQPP SLWESALWNS ISEKLTEEKI LHLIAQSIGQ GLQTEIMVKD RVMTPMFRDT
VMYGLLDGSV RWPLSDHTKN KSHWMETLLT SRVLEISSPE DRLRAIHEAT HKESQQKKKG
IAVQRAIVER SLPPLVPEDE ELSCLYPEEE SERIQRIHFS NSGVSVRVRN RNIVLSLSIL
EEVLSAVFVK VGLTPIELPF AEHDAKLAQR RAQQSEVDRA RKEFQQRRHT ELSSSGTTRQ
TLENSLKLEG ERTQALTSQV MSQQECQEQG GSFFSGQQSP FINRRRFDSI AVANASATSG
SFAGLLSPRY ETVGGEFIHR SEAVDSSWEL SGFALDGWGR IQGFTSRDRS IMEDFFDGQT
QEAVCVNDIM ASSTAASIAS STEVNAVDVD VETIMDEDGC GLTDAVKELA IREAATAGSR
ISMKSVVANI PKEVQSGTAI PFVESAMRLS QESMLHVQPE CEDLAMTDVV SIDDECHSPS
PPPPSTPSPT LSPILVSLAD LQEMRKESVC RDNPVDTANL SSYRSFSAAT TSLPNSPRLL
TPPRPQIPTS LSRDNLKLAD DSDQKTTGKE SPASFKLHLK SPSVLRKSMK VKSMDDTEIK
SKSNLRRFND GIVSYVHGAS RLPRFHDDHA IRPRRSILLR SPDVLTPYRT AATRAVASSN
RTEKFDSLKP FENLNLNHNT NATGRSIGDG CARSEIAGDA HDDYLNWNDT LSQREPEEVS
TLREERDAYR DMCLTMGAEV AKLKNILAAQ KSSMFFSVAP EYADPLLYPV VHTHSVGPDA
VKFENIPRAR TLAAMSDAGY KGEYESLASD DDGGRLAGSR HPSSSVTGVG SDVSVENCGH
YALQSPLGAP LTRDLHDLTS LRGMQSRLSK DILHFLDATN MHLRKQDSKR MKAVERMTRL
VNTVWPRAQV KVYGSHVTGL CLPSSDLDFV ICLPAVHKRA PAVAPGALEG RNAINETSQK
LLARKLKSES WIDPRSMKLI ERAVVPVITV STKDTKARTI QLDISFDGPG HHGLQAIDMV
SEILEELPML RPLVLILKQF LLDRGLLTAY TGGLSSYCLF LMVARYLQEQ QTSWGDCGSL
LMGFLDFYGN CFDPRTTGIS VRHRQYFPRP NYSSARMHSP GMPVWSVSPP PVVGSPPLTQ
FLRRNSFNDR GSSDGMQMEN FARPPRYQPA LSNRYTTPEI PHTTDHAAHE HSMPYTLDPF
LVEDPLSQVR RRLWPQQP