Gene Information |
Locus tag | PHATRDRAFT_45350 |
Symbol | |
ID | 7199982 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 922004 |
End bp | 926117 |
Gene Length | 4114 bp |
Protein Length | 1335 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179319 |
Protein GI | 219117049 |
COG category | |
COG ID | |
TIGRFAM ID | |
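The coordinates above are consistent with 1-based, inclusive positions: End bp minus Start bp plus one gives the stated Gene Length. The sketch below checks that arithmetic in Python. The function names are illustrative and not part of the source database. Reading the leftover bases as non-coding (for example intronic) sequence is an assumption based on the "Is gene spliced | Yes" field, as is counting one stop codon in the coding length.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate span."""
    return end_bp - start_bp + 1


def coding_bases(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein; optionally counts one stop codon (assumption)."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values copied from the Gene Information table above.
gene_len = span_length(922004, 926117)   # Start bp, End bp
cds_len = coding_bases(1335)             # Protein Length in aa
non_coding = gene_len - cds_len          # bases not accounted for by codons

print(gene_len, cds_len, non_coding)
```

Under these assumptions the span is 4114 bp, of which 4008 bp would be coding, leaving 106 bp unaccounted for by codons.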
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCTGGC GATTCTTCGT GAACGGTCGT CGGTGGCGCT GTGGCGTCGG CACTTTCAGT TTGATCGGAA TGTTAGGGTT CTGGAACCCG GCGCGCGGAT GGCTGGTACA GCGAGCAGTA CGAAACGTGA GAGTCCCTGG ACGTCTCAAT GACGAAATCT TACGGCAGAA AATGGGCTCT ACCCGAAGTT TGACGCGACG GAGTATGGTA GCACAGTGGA ACGTTGGTGA TACCGTCTTG GTCCCTTCCA CGAGAAATAA GAACGACCAG TCGCTCGCAC AAGGTGTAAT CTGCGATCAC CGGGGTTCGG GATGGTATTC CGTACAACTT GGTAGTGGAG CAGACAAGCG CATCGTCAAG TTCCGTGGTA CGCAGCTTGA ACGGTATCTT TCCGACGAAG CAAAGTCTCA AGCTTCCATG GAATCGTCGG CGCCGTTCGG AAATTTCGAC AACGTTCCGT TCAATATTTC GTCTCTTCCT GTCATGGACG TTACCAATCC GGTTACCAAG ACCAAGTCAA TTTTGAAAAG TGGAGCACCG GGACCTGCTC CACCGCCACC GACGATCATT GATCTAGACG CTGCTGTTCG GTCTATGGAC GATCCTTTCG ATCCTGTGGA AAACGTTGCA CAAAGGAACT ACTTGCAACA CGTAGCGCAC CATTCCAGCT TTGAACAATG GGTGGTCTTC ACAGATCTCC ATTGCTCAGC ATCGACAATG GATGCCACAA TAGAGACCTT GCGAACTGTG CACCAGCACG CTGTGAAGAG AAAAGCAGGC ATTCTCTTTC TAGGAGACTT TTGGCATCAT CGACGAACGT TGCGCATCGA CTGCCTCAAC ACTGTTCTAC ACGAATTATC GACCTGGACC GTTCCCATGG TCATGATCCC AGGCAATCAC GACCAAGTCA CTCTGGGTGG CCTAGTGCAT GGACTTACAC CATTGGAGCA CGCCTACCGC GTTACGGCCA ACAAGGGCTC GTTTTCCACA ACCTTTCCCG GGCCGCTCGT CTTTTCACAC GCCACCGTAT TCGCGAACGC CCTGTTTATA CCACACATTC GTGACAATGC TATCATGGAA TCTGTTTTGC AGTCCACGCA CGCTCAGAAC GCAGAAGCGC TCTTTGTACA TGCCGACATA ACTGGTGCTT ACATGAACGA TCTAATTGTC TCTCTGGGTG GTGTTCCACC ACGCATGTTC CCTGGTAACA AGCCAATCTA TTCCGGACAT TTTCACAAGC CTCACACCGT CAAGCAAGGC AATAAGGCGA TCGAATACTT AGGATCTCCG TACGAGGTGT CGCTGGCAGA GGCTCAGCAA CCGAAAGCTC TCGCCGTGCT GGATGCATCC AATGGTTGGA AATGCATCGA AAAAATTCCT TTGAGCATCG GCCGAAAGCA CTTTCGGCCT CTAAACGAGG ACGAATTCTT GGCACTTCGT CCAAAGCAAT TTGGAACACG GGACCGGGAT ACAGATGTGT TGGCTTCTAT TAGCGTGGAC TCCGGTGATC GTGTGCTTTT CTCCGTGGAC AAGGATAAAC TAGAGAAGTT GCGGCGGTCT TCAGAGGTTG GTGAAACCAA TCCAATCGAC ACTCATGTGA GCATTTTGCG TCAAAAAGGC ATCACCGTGG AACTTCGTGA AACACGGGAG TTGCCTGTTG GCCCAATGGA ATCTGCATCG CCCGACATGA AGGGTGACTA CATCAACCTT TCGCTCGAAT CCACTTGGAC CTCATTTATA GAAGGGGAGG TGCGACGAGG GGCCATGACT GAAGAAAAAG CTGATTTTTT ATCGAAGCCT 
GGTTTAGATA TTTTGGCAGA TCTCGATTCC GTGGTAATTG GTAGTATGTC AGGAAATAAA ACTGATGTTG AGCTTTATTC GCTGACCGTA GAAGGGTTTG GCCCATTTCG ACAGCCAGTT ACCTATCCGC TACTAGAACG AGGGCTAGTT CTGTTGAGAG GGTCGAATAA AGATGGGGGA TCTGACAGGT AATTGATTGG GATGTATTCT TTTTGCTGAA TTGATATGCC GTTTCTGAAT TGAAGAGTCA AATTGATATT GCAGCAACGG AAGTGGAAAG TCGTCACTGG CAATGTCGGC ACTTTGGGCT TTTACTGGGA GCATTGATCC CCGCCCATTG CAAGACTCAA AAGTTTCGGA CGTCGTGCAC GATTCGTGCA AGGTAATTGG ACTTCCACGC TGTGACGCGT TATGTCTTTC ACAAGTGCAC TTACTCTTTC GCTCTTTCTT GTAGGCAGCT CGCGTTACGG TGAAGGGAGC TTTCAATGGT GTTGAATTTT CCGTCACTCG AACAAAAACG GCAACAAAGG GGAACATTGT ATTTACCCTA GGAGGCGAAG ACCTGACGAC GCAGTCTGCC AAGGAAACGC AAGAGCTTAT CGACGAGACC TTTGGCGTGA ATTCACAGAT TCTCGCTAGA ACAATATTTA ATGGGCAGCA TGCATTGAAT GATTTATTAG AGGCTACCGA CTCGAAACTA AAAGACGAGC TTGCCACTGT CGTGCCGTTG AGTGGGTGGC AAGATGCGGT AACGCTTGTG CGGAAAATGG GACGCGAAGC TGGTAAAAGG GCATCCGAAA TTGAAGGAAT GTTAGCTCTT CGCGAAAAGG ATTTGGAAAG GCTCGACCGG CGACTAGAAG ACGCTACTTC TGTAGTTTAT GAAACGGAAG CCAGCCTCCG AAGTACAGAA CAATCCGTCA CAGACGAGTT GGAAGGGTTG TATTTTGCTG GAACCCATTG CATGGAACTA GATGACTGGG ATGCTCGGCT TCTTGATGCA TCTGAAAAGG TAAAGGCTCT TGAAAGGAGC TTGCGGTCCA AGCAGACTGA AAGAGACGAA GTAATGAAAT CAGCAGCGGC GGAGGCGACT CGCCGGTCGA GTTTTCTCGA TTCTGCTGCT GATAGCTTCA GGAGGGTCGA AGCCCGATAT GGTCGGTTGG CGATGGATTT TGAGACGGCA ACCAAAAAGG TCCAAGAACT CGAGAAACTT TGGAGTTTGG ATCTGTCCTC AGGAGAGCTT GATACAGCAT ACGCTCCGGT CCTTTGCCCA ACATGTGGTC AAAGCGTTTC AAGTGACGAC TCCGGACATG ATCTCAGAAG CCTTAAGGGG GCTATGGAGG ACGATATTAG CGTTGCGCTG CTCCGGCTCC ACGAAGCGCA GACCATGGTG CAAGACGTTG GCGGTGAACT CGCCGCTGCG AAAGCTCAAC ACGGCGAAGC CCTCTCTTTG GTCAAAGACT TGAATACTCA AAAAGAAAAA GAATCGCAGG TATGGTCAGA AACCATCTGT AAACAAGAAA GAGCACTTGC TGACGCTCGA GAAGCCCAGT CGGTTGCATC ATTTGAGTAT ACATTGGCAG TCAAGGCTTT CCAGCAAAAA GCTAGAAGAG ATGAACTTCA GTCTCAAATC GATAGACAAC ACCAAGCCTT GAGCAACGTG CGAGCACACG CAGAAGCTGT AGAAGCGGAG ACTATGGAGT ACAGGAACTT GGTGAAGGAG TTACAAGCAA GCCTAGATAC AGAAGAGAAA CAAGTGGCTC TGATGAGCGA TCTGTCAGAT GCATTCGGAC AAAGGGGAGT GCAGGCTTTT 
GTGCTTCAAT CGGAGATCGA AATCTTACAA ACGTTGACCC AATCCTTTCT CGATGACTTC AGCGACGGGA CCCAAAAGCT TAGCCTTTCG CTTGACGCAG GTGACCGGAT TTCTCGGCGG GCTTACGTCC GCTCCCCCGA TGGCGCCTAT CATGAACGCC CGTTGGCCTC GCTCTCTGGT GGACAATGGA GACGATGTTC ACTTGCACTC AATCTTGGCT ATGCTGACCT TGTTGCTAGA CGAGGCAGAT TCCGATCGTC TCTTTATATA ATGGACGAAC CCTTGACTCA CTTGGATCGC TCAGGTAGAG CGGATGTAGG AAGGGTATTT CGAAAGCTCC TGCGTCGATC TACTACTAGC GGAGAAGGAG GGCTCGCCCC GTCGACGATA ATTGTCATTT TGCAAGATCT TGCCGCTGAG GAACTTGGAG AGGCCTTCGA TTGCATCGAT GAAGTTGTAA AATTCCAAGG GACAAGCCAA GTTTTTGTAG ACGAAGAGCT TTAG
|
Protein sequence | MFWRFFVNGR RWRCGVGTFS LIGMLGFWNP ARGWLVQRAV RNVRVPGRLN DEILRQKMGS TRSLTRRSMV AQWNVGDTVL VPSTRNKNDQ SLAQGVICDH RGSGWYSVQL GSGADKRIVK FRGTQLERYL SDEAKSQASM ESSAPFGNFD NVPFNISSLP VMDVTNPVTK TKSILKSGAP GPAPPPPTII DLDAAVRSMD DPFDPVENVA QRNYLQHVAH HSSFEQWVVF TDLHCSASTM DATIETLRTV HQHAVKRKAG ILFLGDFWHH RRTLRIDCLN TVLHELSTWT VPMVMIPGNH DQVTLGGLVH GLTPLEHAYR VTANKGSFST TFPGPLVFSH ATVFANALFI PHIRDNAIME SVLQSTHAQN AEALFVHADI TGAYMNDLIV SLGGVPPRMF PGNKPIYSGH FHKPHTVKQG NKAIEYLGSP YEVSLAEAQQ PKALAVLDAS NGWKCIEKIP LSIGRKHFRP LNEDEFLALR PKQFGTRDRD TDVLASISVD SGDRVLFSVD KDKLEKLRRS SEVGETNPID THVSILRQKG ITVELRETRE LPVGPMESAS PDMKGDYINL SLESTWTSFI EGEVRRGAMT EEKADFLSKP GLDILADLDS VVIGSMSGNK TDVELYSLTV EGFGPFRQPV TYPLLERGLV LLRGSNKDGG SDSNGSGKSS LAMSALWAFT GSIDPRPLQD SKVSDVVHDS CKVIGLPRCD ALCLSQAARV TVKGAFNGVE FSVTRTKTAT KGNIVFTLGG EDLTTQSAKE TQELIDETFG VNSQILARTI FNGQHALNDL LEATDSKLKD ELATVVPLSG WQDAVTLVRK MGREAGKRAS EIEGMLALRE KDLERLDRRL EDATSVVYET EASLRSTEQS VTDELEGLYF AGTHCMELDD WDARLLDASE KVKALERSLR SKQTERDEVM KSAAAEATRR SSFLDSAADS FRRVEARYGR LAMDFETATK KVQELEKLWS LDLSSGELDT AYAPVLCPTC GQSVSSDDSG HDLRSLKGAM EDDISVALLR LHEAQTMVQD VGGELAAAKA QHGEALSLVK DLNTQKEKES QVWSETICKQ ERALADAREA QSVASFEYTL AVKAFQQKAR RDELQSQIDR QHQALSNVRA HAEAVEAETM EYRNLVKELQ ASLDTEEKQV ALMSDLSDAF GQRGVQAFVL QSEIEILQTL TQSFLDDFSD GTQKLSLSLD AGDRISRRAY VRSPDGAYHE RPLASLSGGQ WRRCSLALNL GYADLVARRG RFRSSLYIMD EPLTHLDRSG RADVGRVFRK LLRRSTTSGE GGLAPSTIIV ILQDLAAEEL GEAFDCIDEV VKFQGTSQVF VDEEL
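The GC content field in the Gene Information table is conventionally the fraction of G and C bases in the gene sequence. Below is a minimal sketch of how such a value could be computed from a space-grouped sequence like the one above. Here `gc_fraction` is a hypothetical helper, not the database's own method, and the example input is only the first two 10-base groups of the gene sequence, not the full gene.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C among A, C, G, T characters; spaces and other symbols are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)


# First two 10-base groups of the gene sequence shown above (illustration only).
fragment = "ATGTTCTGGC GATTCTTCGT"
print(gc_fraction(fragment))
```

Running the same function over the complete gene sequence would be the way to compare against the 50% reported in the table.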