Gene Information |
Locus tag | PHATRDRAFT_50551 |
Symbol | |
ID | 7199382 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011699 |
Strand | - |
Start bp | 89639 |
End bp | 93175 |
Gene Length | 3537 bp |
Protein Length | 564 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185518 |
Protein GI | 219130744 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
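The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the coding region is three bases per residue plus one stop codon. The function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein: 3 per residue, optionally plus a stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the record above.
span = gene_length(89639, 93175)
cds = coding_length(564)

print("gene span (bp):", span)
print("coding bases incl. stop (bp):", cds)
print("remaining non-coding bases in span (bp):", span - cds)
```

The remainder is the part of the listed span that does not encode protein. For a spliced gene this would include introns, and possibly untranslated regions if the record's span covers them.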
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0179423 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCGAAAGCTC GGCAACCTCT CATATTCCAT CAAGCCGAAG GGATTGGAAG GATTCGTTCA TTCTTCACAG TCAGCCGAGA CCGTTGGTAC GGTTATCGTC TCCATTTGCG GAAAGGCTGA GCACAACCCT CGATCTCGGT CGAGCTTTTC TGCAACTATC ATCACTAAAT TCTATCCTTA TAAGCATGTC AACCCCAACA TCTTCCTCGT CCTTTGCGTT GGCGAAAGTT CTTCCCTCGT CGGTCACGGC ATGGATGGCC GATCGCCCAC ACGCCGTCGA CACGATCATG TTGTTTGTGG CCTTCCAGAT CGCCTACGCC GCGACGAATC CCAGCATACA GTGGCAGTAC ATGGCGATTT ACGGTCTCGG TCTGTTGCTC GTAACGAAGG TGGCTCATTC GCCCTTGGAG TTCTTCAAAG GTGGGATCGC TGACACGGCT ACCGATCGCA GTTCCTACGC AATCTTGGCT GGTAGTACCT TTATCAGTTG GATCTTTGCC AAGAGCATCC AAAACGCCAG TATCCTCGGA GCCAGATACG GTATCCTCGG CGGCTTTGCC TACGGCACTT GGTACATAGC CTTCCTTTCC GTTGGCGTTG TTTGCTACTA TTTGCGAACC AATCAGGGCT ATACATCCCT GCAAGAAGCA ATCTTTGAGC GGTACGGGTC GATCGCATCC ATTTCCTACA GTCTTGCTGT ACTTTTTCGT CTCTACCAAG AAATCTGGAG CAACTCGCTC GTGGTGGCTT CTTTCTACGG CGACTACAAT ACGGCTTCCT GGTGGATTGC GGCTCTTTTG TCCACATTCA TTCCCTTTGT CTACGTTTCC TTGGGAGGAC TCCGGTCCTC GCTAATCTCC GACGTCATTC AAGCTCTTCT AGCTGTCATT CTTCTGGTGA CGGTACTCGG TGTGATTGGC AAGCAAGTTA ACGAACTGTC AGACGAGTGT GAGGTCGCTG GACGGGGCGA CTGCAACCTA TTCCAGTGGG ATACCAACGT GGGCGTTGCT ACCAATACTT TGGAAGGGGG TTGGGACTTG GCGATAGTCG GCTTGATTCA GGGATTGTTC AGCTATCCTT TCTTTGATCC AGTTCTCACG GACCGTGCCT TTTTAGCGAG CCCTAAAACA ATGTTGCGAG CCTTCCTTAC TGGCGGCGTC ATTTCCTTCC TGTTTATATT TTTCTTCGGT TTCATTGGCA TTTTCGGCAA TCTTGCTGCA ACCGTGGATG AGACAATTGA CCCTGCTCTC CTTACCGGAA TTAGCACGGG AATTCCTGCC GACGTGGCTC GCTACCTCGG TACCGGCGTC TTTACCATTA CCAATATTAT TTTCATGACA ACTTCTATCA GTACGCTGGA TTCGGCGTTT GCTTCGACGG CCAAACTTTT TGCCGAGCTA CGCACATTCT TTTGGAAATG GAAGCCGGAA AAACTCGCCA ACGTAACCGA TCAACACGTT GCGCTGGGCC GCTTTGCTAT TCGTTTTATT GCCCTCTTGG GCACTTTACC TTTGTTACAG GACCCCAGTG CCCTCGACGC GACGACGGTC AGCGGTACAG TCGTGCTCGG ATTGGGACCA CCTATCATGG CGCTCTATGT TCTTACCGCG TGAGCCCAAC AGCTACTACC CACTGGCCTT TTTGAGTTCG TTTTGGTTGG GAGCGTTGTT GGGCTTGTTG TTTCAGTTGA GTAACGAAAA CCCCAACGCA CTAGATTGGC AACACCTCAC TGTCGGCGAG GGATCGTACG CCAAACTCCT ATGGTTCAAT TTAGTTGGCT CTGTGGCTAC CTTGGGAGCT 
TTTGTCGTTT TCTTTGGCCT AGAAAAGTAT GTGCTCTCTA AAGTGGTTCC CTTGTACGCT TGGCACGCTC AAGTCCTCGA AGTACACGAG GGGAAGTCGG TGCACGGGAT GGGCAACGAT GAGCTGTCAC ACAAGTTGGA TCGATCCAAC GAAGATCACG ACAAAGAATT GGAGAGCATC GACAGCAGTA GCGAAGAAGT CGCCTCGGGT GGGAGCGCTG GAGAAGAAGA CAACGACACG AGCACCAAGC TGGAGGCCGA CAAAGTTTGA AACGTGATGG CCGGGAATTG CTGGTGTTGG CCTACGGAAG CAGTGCACGA ATGCAAGCAG GTTAACGACA GTCAACAAAC ATAAATGTGT TTGTATATAC ATTACCGATT TACAATACAA AGCTAAAGCT TGTAGAGTTC TATAGAAATC GTCAATCAAT CAAGTGCAAT CATAGTCCAA GAGCTTCTTT TTGTAGTCGA AATAGGTCAA CGGCCGCCTC GTATAGCGCC ATGTCGAGCT CATTGTGTTT CCGAATGAGC TGTGCGGTTG CGTCGTCGGG TCGCGACGGC AAGTCCCAGT GCGTGCTACC ACTTCGGTCC CTTGCGTTCC CGGTACGGCT TCCGCAATGA TTGTTTTGTG GCGACGCGTT GGAGTGCGGC AAGCCGCACA CGGTGCCGTT CACGTCCGTG GCCAACCAGG GAAAAACGCG CCCCACCATG GCGGCGGTCG TGTTCAACTC TTCCGTCAGG CCCACCATAG TAAAGAAATC GTGCATGTTG CGAATCGCTT CCGATATGAT TTGCTGTCGT TGTTGCGGAG TCCCGCGGGC TGTGGCGATG GTGGTTTCGT TGAAATTTGT CGAACTCAGG AGATTGTTGG TTTGGTGATT CTGCAGTTGC AGGGCACACA TCCGATCAAG AGTGGAGTTA CCGCTGTCGA TTTCCGCGTA GACGTCCTTC AGATCCCGAC AGCCGTAACA CGCTTTGGTT CGGAAGCGAT ACATGCTCCA GACACGGTCT ACGGGATGAC GCAGGACGGT CACGGCCCGA ATGGGCGTCG TATCAGTGGA AGGAACGATA CCGCCTTCCT CACCATTATC GGATCCACCA CTGGTGGTCC GAGTCCACTG AAACGTAGTC AGATCGCGGA GCGGGGCACA GTAACTCATA ATAGCGGCTT CTTGAACCTT TCCGACACAC TGTGTATCGT CTCCGGACAA ACAGCGCGCG TAACGCGCCG CACTGCATTC GTGTATGTTT TGGTAAGGAA GTGTCCCGTG TTGCGATCGA TAGCGTTGCA TAGCGCAGCG AATCAGTCCA TCCATCGAGG TCCCACCCGT CTTCATGTGG TGCAGGTGCA GAAATTGTCG CGGTTGTACG GGGCCTTCCT CCGTGTCGGA CGTGCGTAAC AGATCGTGCG GTATCCCCAG TTTGGCTCGG AGTTGCTCCA CGACTTGTTG TCCCTGGGTA TTACCATCCG GATCCAAAGG CACCGCCTGT GCGAGTGTCT CGGAAACGGT GGACGGCTGG TAGAGACGTG TGGGTGTGGA AGAAGAGAAT ATATGTGTAC GCGATTGGTA CGTGGCAAAC AAGACAGCCA GCGATACCGC GAAGGTAGCA ACGGCCACGA CGGGACGCAT CGGAAGTGTG ACCAGTCTCG TACGGTGTGG CACAGAACGC CGTCGTTGGA GACAGGAATG GGGTGGACAG TAATGAACAA GGCAAGCGAG ATTTGTGTAG CAGCTTC
|
Protein sequence | MSTPTSSSSF ALAKVLPSSV TAWMADRPHA VDTIMLFVAF QIAYAATNPS IQWQYMAIYG LGLLLVTKVA HSPLEFFKGG IADTATDRSS YAILAGSTFI SWIFAKSIQN ASILGARYGI LGGFAYGTWY IAFLSVGVVC YYLRTNQGYT SLQEAIFERL AVLFRLYQEI WSNSLVVASF YGDYNTASWW IAALLSTFIP FVYVSLGGLR SSLISDVIQA LLAVILLVTV LGVIGKQVNE LSDECEVAGR GDCNLFQWDT NVGVATNTLE GGWDLAIVGL IQGLFSYPFF DPVLTDRAFL ASPKTMLRAF LTGGVISFLF IFFFGFIGIF GNLAATVDET IDPALLTGIS TGIPADVARY LGTGVFTITN IIFMTTSIST LDSAFASTAK LFAELRTFFW KWKPEKLANV TDQHVALGRF AIRFIALLGT LPLLQDPSAL DATTVSDWQH LTVGEGSYAK LLWFNLVGSV ATLGAFVVFF GLEKYVLSKV VPLYAWHAQV LEVHEGKSVH GMGNDELSHK LDRSNEDHDK ELESIDSSSE EVASGGSAGE EDNDTSTKLE ADKV
|
| |
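The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of that cleanup plus a GC-fraction calculation, shown on the first three blocks of the gene sequence and the first two blocks of the protein sequence. The helper names are hypothetical.

```python
def clean(seq: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


# First three 10-base blocks of the gene sequence in the record.
fragment = "TCGAAAGCTC GGCAACCTCT CATATTCCAT"
# First two 10-residue blocks of the protein sequence in the record.
protein_start = "MSTPTSSSSF ALAKVLPSSV"

print(clean(fragment))
print(f"GC fraction of fragment: {gc_content(fragment):.3f}")
print("residues in protein fragment:", len(clean(protein_start)))
```

Applying `gc_content` to the full cleaned gene sequence would give a value to compare with the GC content field, and `len(clean(...))` on the full protein sequence can be compared with the protein length field.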