Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47057 |
Symbol | |
ID | 7202148 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 294414 |
End bp | 299163 |
Gene Length | 4750 bp |
Protein Length | 1458 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181176 |
Protein GI | 219121652 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCATG AAAAAGATGT TTCCGAATGG AGCGCTTGGG TCCACAGCCT GTCCTGGAGG GAATTGCAAA ATGCGTCCGA GTTTACCATC CCATCCCCAA CAATGCCGGA CGATGAGACT ATATGGCGTG ATTTGTCCCG TCTCCATACA CCGCTGCCAA CACCAATCTT TCCCGCTGCA GCGTGTCCGC ATCAGGCCGC GGCCGATCGG TTCGTCGACC ACGCAAACGA AGCCCGACGA CGACGTCGCG ATGCGCGGCA TCGGCCGCAA CTGTTTCAAC TCATCCCGAC TGCAACACCA ACACAGCCCT CGTCATCGAT AGTGCACACA CGAAGAAATA ATGTACCAGC GTATCGAGTG CACGCCCGTA GATTTCGGTC GGCGGCGGGA GATCAATGGG GATTGGGGTG TACGTGGGAA CAGCGTGAAG CGGATTGGAA TCTTACCAAC GCGTTTCGAA TCGTCCACTT GCCAGAATCG GGCCAAGCAT TCGGTAGACT ATTTCTCATC GACGATGGCA AAGAAAGATA CTCTCCATTA CATAAGATCG ATTTTCTACG TTGCTTGGTG GTGGCTTCGC GTGGCCACTG GGGAACCAAA GGGCCTCCTA ACAAGAACAA ATCTTTCGCT CGCTGGTTGG AACCAACACA TCGATGGTTT TCTTTGCCCA TGTACATTGC GAGTCGATTG GAGGCTGCCT TGTGGGCAAC GTACCTGATT CGTCAGCGCA AAACAGGTGC TCAACCACCG TCTCTCTGGG AGTCGGCGCT TTGGAATAGC ATCTCGGAAA AACTAACTGA AGAAAAGATA CTTCACCTGA TTGCCCAATC CATTGGTCAA GGTTTGCAAA CAGAAATTAT GGTCAAAGAC CGGGTCATGA CCCCCATGTT TCGAGATACA GTCATGTACG GCTTGCTTGA CGGCTCTGTA CGTTGGCCTC TGTCGGATCA CACGAAGAAC AAGAGTCATT GGATGGAGAC TCTTTTGACA AGCCGTGTGT TGGAGATATC GTCGCCGGAA GATAGACTCC GCGCAAGTAC TCGAGAAGCC CTGCACCTAC TCTTAACTCA CGCCGTGGAA GAAGCGCTCT TGCAAGAGCT GACACAAAAT GACAGGACAT CACATAAACT GTCACGCCCC AAGTCCACGA AGCGACCCAC AAGGAATCGC AACAAAAAAA GAAAGGTATT GCAGTTCAAA GGGCGATCGT GGAACGATCA CTGCCGCCAT TGGTACCGGA AGACGAAGAA TTGTCCTGCT TGTATCCAGA AGAAGAGTCG GAGCGGATAC AGCGTATCCA CTTTTCGAAC AGTGGCGTGT CTGTTCGTGT ACGCAATCGA AATATTGTAT TGTCGTTGTC GATATTGGAA GAAGTTCTGA GTGCTGTGTT TGTCAAAGTC GGTCTGACAC CGATTGAACT ACCATTTGCC GAGCACGATG CCAAACTAGC GCAACGTCGA GCTCAACAAT CTGAAGTAGA TCGAGCTCGA AAGGAATTTC AGCAGAGAAG ACATACTGAA CTTTCGTCAA GTGGAACTAC TCGGCAAACT CTCGAGAATT CATTAAAACT CGAAGGAGAG CGAACTCAAG CTCTGACAAG CCAAGTCATG AGTCAGCAAG AATGCCAAGA GCAAGGCGGC TCATTCTTCT CTGGTCAGCA GTCACCATTT ATCAATCGGC GACGGTTCGA CTCGATAGCG GTCGCTAACG CGTCCGCTAC CAGTGGGTCC TTTGCGGGTC TGTTGTCCCC TCGTTACGAA ACAGTGGGAG GAGAATTTAT TCATCGATCA GAGGCGGTCG ATTCCTCGTG GGAGTTAAGT GGATTCGCTC TCGATGGGTG GGGCCGGATT CAAGGTTTTA CAAGTCGAGA TCGTAGTATT ATGGAAGATT TCTTTGATGG CCAAACTCAG GAAGCGGTAT GCGTAAATGA TATCATGGCC TCTTCAACGG CGGCATCTAT CGCTTCGTCG ACGGAGGTCA ATGCCGTGGA TGTGGACGTC GAAACGATAA TGGACGAAGA TGGATGTGGT TTGACAGACG CAGTAAAAGA ATTGGCCATC CGGGAAGCCG CCACGGCAGG ATCAAGAATC TCGATGAAAT CAGTTGTGGC CAACATACCG AAAGAGGTTC AATCTGGCAC TGCGATACCA TTTGTGGAGA GCGCAATGCG ATTGAGCCAA GAATCAATGC TGCATGTCCA GCCGGAATGC GAGGATCTTG CCATGACTGA CGTAGTCAGT ATTGATGATG AATGCCATTC TCCTTCACCT CCCCCGCCAT CGACACCATC GCCGACGTTG TCCCCAATTC TTGTCTCTTT AGCAGATCTT CAAGAAATGA GAAAGGAGTC GGTATGTCGA GACAACCCTG TGGATACAGC GAATCTTTCG TCGTATCGGT CATTCTCAGC CGCTACAACA AGTCTTCCGA ATTCGCCTCG ACTTCTTACG CCACCACGTC CACAGATACC AACCAGCCTT TCAAGGGACA ACCTCAAACT AGCAGATGAC TCCGACCAGA AGACAACGGG GAAAGAATCC CCTGCATCCT TCAAACTCCA TCTCAAATCT CCCTCGGTTC TACGAAAGTC GATGAAAGTA AAGTCCATGG ATGACACTGA GATAAAAAGC AAGTCTAACT TGCGACGGTT CAACGATGGA ATCGTCTCTT ATGTACATGG CGCTTCCCGT TTACCCAGAT TCCACGATGA CCACGCCATT CGTCCGCGAC GGTCCATATT GCTGCGTTCA CCGGATGTCT TGACCCCTTA CCGAACTGCG GCGACGCGCG CCGTAGCTTC GTCCAATAGG ACTGAAAAAT TCGATTCCTT GAAACCGTTT GAGAACCTGA ATCTGAATCA TAACACAAAT GCAACAGGGC GATCGATTGG CGATGGCTGC GCTCGCAGCG AAATTGCTGG TGACGCCCAC GACGACTATT TGAATTGGAA TGACAGTCAC AGGAGCACTG CAAATGACGA TGGCGACAAC CATACAGTGA CGAAGGATGG CGACAACCAT ACAGTGACGA ACGATCACTT CAGCTCTATC GCAAAGGGAA CCAGAGGAAG TATCAACCCT ACGAGAAGAA CGCGATGCAT ATCGTGACAT GTGTTTGACC ATGGGTGCTG AGGTTGCTAA GTTAAAAAAT ATACTCGCTG CGCAAAAGTC ATCAATGTTC TTTTCAGTTG CCCCCGAATA CGCAGATCCC TTGCTTTACC CTGTCGTTCA CACTCACAGT GTTGGTCCAG ATGCCGTTAA GTTTGAGAAC ATTCCTCGAG CTCGAACACT TGCTGCAATG AGCGATGCCG GATACAAAGG AGAGTATGAA TCGCTAGCCA GCGACGACGA TGGCGGGAGA TTGGCAGGTT CACGACACCC ATCTTCAAGC GTGACCGGAG TCGGTTCGGA TGTATCTGTC GAAAACTGCG GCCACTATGC TCTGCAATCT CCGCTGGGTG CACCTCTCAC CAGGGACCTT CACGATCTTA CGTCGCTTCG AGGAATGCAG TCTCGCCTTT CAAAAGATAT TCTTCATTTT CTTGATGCCA CAAACATGCA CCTTCGAAAG CAAGACAGCA AACGGATGAA AGCGGTGGAA CGTATGACTC GATTGGTGAA TACTGTTTGG CCTCGTGCAC AAGTCAAAGT GTATGGTAGT CATGTTACCG GTCTCTGCTT GCCGTCGTCG GACCTTGATT TTGTCATCTG TCTGCCGGCT GTCCACAAAC GTGCACCCGC TGTTGCACCT GGTGCCTTGG AAGGTCGAAA TGCCATCAAC GAAACGTCTC AGAAGCTGCT GGCCCGAAAG TTGAAAAGTG AGTCATGGAT TGATCCAAGA TCGATGAAGC TTATTGAGAG GGCAGTCGTC CCAGTAATCA CAGTGTCTAC AAAGGATACA AAGGCGCGCA CAATTCAACT TGATATTAGC TTCGATGGGC CCGGCCACCA TGGGTTGCAA GCAATCGATA TGGTGTCCGA AATCCTTGAA GAGCTGCCGA TGCTAAGGCC TCTTGTCCTT ATCCTCAAGC AATTTCTCCT CGACCGTGGC CTTCTGACAG CGTACACTGG CGGGCTATCC TCATATTGTT TGTTTTTAAT GGTGGCACGG TACCTTCAGG AACAACAGAC GTCTTGGGGC GACTGCGGAT CTTTACTAAT GGGATTTCTC GACTTTTACG GAAACTGCGT AAGTTCCGGC TGTCTACACT TGTTTTGAAC ACACACGTAG GTACGCTCAT TCCCCTTTCA CTTTGAACGG AATAGTTTGA CCCTCGAACA ACTGGTATAA GTGTACGTCA TAGGCAGTAC TTTCCCCGGC CAAACTATTC ATCAGCTCGT ATGCATTCCC CTGGAATGCC TGTCTGGAGT GTGTCACCAC CTCCCGTTGT TGGGAGTCCG CCTTTGACTC AATTTTTGCG ACGAAACAGT TTCAATGATA GAGGATCGAG TGACGGTATG CAAATGGAAA ACTTTGCTAG ACCACCGCGA TACCAGCCAG CTCTATCAAA TCGCTATACG ACGCCCGAAA TTCCTCATAC AACAGACCAT GCGGCCCACG AACACTCTAT GCCGTATACC TTGGATCCCT TTCTTGTCGA AGATCCGCTT TCGCAAGGTA ACAATGTCGG TCGAAATGCT TTTCGAATTT TCCAAGTGCG GAGAGCTTTT TCTGATGCAC ACCGAGCGCT GGTGGCGGCT CTCGAATGGG ATATGCAGTC AGGAGGAGAC TTTGGCCACA GCAGCCCTGA
|
Protein sequence | MKHEKDVSEW SAWVHSLSWR ELQNASEFTI PSPTMPDDET IWRDLSRLHT PLPTPIFPAA ACPHQAAADR FVDHANEARR RRRDARHRPQ LFQLIPTATP TQPSSSIVHT RRNNVPAYRV HARRFRSAAG DQWGLGCTWE QREADWNLTN AFRIVHLPES GQAFGRLFLI DDGKERYSPL HKIDFLRCLV VASRGHWGTK GPPNKNKSFA RWLEPTHRWF SLPMYIASRL EAALWATYLI RQRKTGAQPP SLWESALWNS ISEKLTEEKI LHLIAQSIGQ GLQTEIMVKD RVMTPMFRDT VMYGLLDGSV RWPLSDHTKN KSHWMETLLT SRVLEISSPE DRLRAIHEAT HKESQQKKKG IAVQRAIVER SLPPLVPEDE ELSCLYPEEE SERIQRIHFS NSGVSVRVRN RNIVLSLSIL EEVLSAVFVK VGLTPIELPF AEHDAKLAQR RAQQSEVDRA RKEFQQRRHT ELSSSGTTRQ TLENSLKLEG ERTQALTSQV MSQQECQEQG GSFFSGQQSP FINRRRFDSI AVANASATSG SFAGLLSPRY ETVGGEFIHR SEAVDSSWEL SGFALDGWGR IQGFTSRDRS IMEDFFDGQT QEAVCVNDIM ASSTAASIAS STEVNAVDVD VETIMDEDGC GLTDAVKELA IREAATAGSR ISMKSVVANI PKEVQSGTAI PFVESAMRLS QESMLHVQPE CEDLAMTDVV SIDDECHSPS PPPPSTPSPT LSPILVSLAD LQEMRKESVC RDNPVDTANL SSYRSFSAAT TSLPNSPRLL TPPRPQIPTS LSRDNLKLAD DSDQKTTGKE SPASFKLHLK SPSVLRKSMK VKSMDDTEIK SKSNLRRFND GIVSYVHGAS RLPRFHDDHA IRPRRSILLR SPDVLTPYRT AATRAVASSN RTEKFDSLKP FENLNLNHNT NATGRSIGDG CARSEIAGDA HDDYLNWNDT LSQREPEEVS TLREERDAYR DMCLTMGAEV AKLKNILAAQ KSSMFFSVAP EYADPLLYPV VHTHSVGPDA VKFENIPRAR TLAAMSDAGY KGEYESLASD DDGGRLAGSR HPSSSVTGVG SDVSVENCGH YALQSPLGAP LTRDLHDLTS LRGMQSRLSK DILHFLDATN MHLRKQDSKR MKAVERMTRL VNTVWPRAQV KVYGSHVTGL CLPSSDLDFV ICLPAVHKRA PAVAPGALEG RNAINETSQK LLARKLKSES WIDPRSMKLI ERAVVPVITV STKDTKARTI QLDISFDGPG HHGLQAIDMV SEILEELPML RPLVLILKQF LLDRGLLTAY TGGLSSYCLF LMVARYLQEQ QTSWGDCGSL LMGFLDFYGN CFDPRTTGIS VRHRQYFPRP NYSSARMHSP GMPVWSVSPP PVVGSPPLTQ FLRRNSFNDR GSSDGMQMEN FARPPRYQPA LSNRYTTPEI PHTTDHAAHE HSMPYTLDPF LVEDPLSQVR RRLWPQQP
|
| |