Gene Information |
Locus tag | PHATRDRAFT_47243 |
Symbol | |
ID | 7202335 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 10094 |
End bp | 14513 |
Gene Length | 4420 bp |
Protein Length | 850 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181469 |
Protein GI | 219122266 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.24929 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GACAAGTTTA TGCTGTAGCC AAGAAGCACC TCCCACCTTG TGTGTAGGAC GTACTCTGTG CGGTCAATCA TTGGGGACCG GAGTACGCAC TAGTACCAAC GCACCTCCAT TACAAAGGAA AGCATATCAT TCGGCAAAGA TATCGTAGAG GCTGGTTTGT TGTTCCGGAA GGAAATCGAC CGCTATTGAC GGCCGAAGAC AAGTCTCCAT GGACTTCGCG AATGGTTTTG GAGACAGTAG TGATGACCTG ATGGAAAACG CAAGGATCGT CAGCCCTTAT GGCGATGGAG AAACCCAAGC GTTTCCTGCT GATCCGTGGG CAAATTTTCA GTACGCACTG CCTGAAGGGT CCTCGCAGTA TAAATATCGT GCTTCTAGAC AAAGACCAAC GATTGGAGGA TCACGATCTC CCTCGGAGCC AATTCCCACC CCACGAGAAA GGATGGTGAC ATCTTCTAAG ATCTCAACTC CTCACCGACA CGGTATGGTA AATCGAGGCC ATGCTAGTCG ATACGAATAT TCTAGTGACA AGCGATGCGA TGACGTACAC GTAATATTTA CAGATCCCTC CGACGGTGTT GGGATCAGCG TTGAAAAAAG GAGCCAACAG GACTTGTTGC GACAAAAGGT TCGGGAGGAC ACCAGTCGTA GGAGCTTACA ACATTCAACC AGCGCACAAA GTATGTACCG GAAACGTAGC GAAAACAAAG GTGCGTTTCC TTCATTTCCA CAATTAGCGC CGAGCAAAGA GAAAGCACTG GTACGTACTC CGAACACCTA AACTTGAATG CATTGACCAA TTTTGAAAAG TGAACTACAC TCCTCTAAAC TACGCCTTCA ATTCGTATTC TTTTGTCTGA AGCGAGGCCA GCGTCCATCT AAACATACTC GGGACCATAT CTCACGGCAT ACTTCGCCAA AATCATCAGT GGAAATAACT TCAGAAAGCA CAAACGCAGA GACCGATGGT CTAAATTTGA ATTGGGCATT CTCGCGTGTT CAAGTTCGAG CGAATAGTGA AACGGAGTCA ACGACGGTTT GGCTAAATAC TGATCACAGG CAGGATTTAG AAGGAGAAGA CATTTGGTTA CAAGAAGAAC ATGCATTCCC TTTGTTGTCA ACGGGGGCTT CAAGCAGATT TTCGGGCCCA TCTGGTAAAG ATGGCCTCCG AAAACGTACT GATAAGAAGT TAAATCGTGT CCGTTTTGCC GAACCGATAC AGAAGCCGCG ATCTGTCCCT TTTCTGAAAA CTGTTTCACG GGAAACTATA CGGGAAGAAT CCTCAGATCC TCGTTCCAAA AAGATCACAC ATAGTGACAA TCGCGCGTGG TTCGTGGCCC AACCGAAGTC AATTCTCCGT CGGCGGCGTT TCGCTGGCGA AACCGTGTCC CATGACCCAC AGTACCCCCA GAAGAATCGC CCATCCTCTC ATCGCGCAGC TCCACAACGG AAGTCTGCTA CATCGTTTTT GGATACACAA GGATCCCTAC TCTCTCCCAT TCATTCAGAT AGGCGGCCTT GGGACCGCAT CTCTGAAACA GGCTCGGAGT CACTCAGTCC TTCGTACAGT GATGTTGAGC GAGAGAAACG CGTTAGTCTT GGTCCCTACC ACCTGCAGGA GCTAAATGAG ATGTATCCCG ATCCTCCTCT TGAGTTGCAG GTAAGACATT TTTGACTTTA CAAAGCCCTA TCTTTTGTAG CCGACCTACT CACCGACATA CATTTTCTTT TAGTTCGACG ACGAGTCAAC TGTAGTACCA GCACGCGCTT CTTTCATCGA CACTGTCGCT GCTGTTGTCG 
TTCAAGCCGC TGTTCGTAGA TTTCTTGCCC AAAAAGTGAT GCATGAGATG GTCGGCAAAG CATATTCCTT TCCGCATTTA GAATCTGACG ATAAGAAATA TCGACCATTG TCGTCGCGAA AGGTAACTCC TGAAAAAAGG TCGTCCCGAA AAAGTATGGT AGGAATTTGG GAGGAGCAGT GTTCATAGAA GTCATGGCTG CGATAAAAAT CCAATCTGCC TTCCGAGGCT TTTGGGTTCG AGATTCGTTG AATGTGGATC ATTTTTGCGC GACTATGATC CAGAAATGGT ATCGACGACA TCATCAGAGG CACCACTATT TTGCAGATCT TTCTCGGATC ATACTGGTCC AGTCCATTTG GAGGCGCAGT ATAGCCAGGG AGCACGCTGC CTTTTTCCTT GGGAGCGTAA TTACAGTTCA GTCGCTGTTT CGCTCGTACA GCGCTCGCAA AAAGCTCTAC TCAGGACTCA CTTGCCTACG AAAGGATACT ATGGCAGCTG TAGTGATCCA ATCGCACTGG CGTACATATG CTTGCGAATG CAACTTTATT CGCGATCTTG TCGATATTTT GATCGTTCAA AGTGTTGTGA GAACTTGGTT AGCAAGACGA CACCTGTCAT CACTACGCTC CAGGGCCCAA AGTATTTCCG GCAAAAAGTC ACCAACAGTA TCAAAAAAAT ACGCGAATCA AGTGGCGGCG CAACCTACTG GAAGTCCTCG ACCTGGAGAG GCCAATCGTA ACTCGGCGAC AGGGCAATGC TACTCCTCGT ATAGGTCTGT CGAAGAGAGT TCGTTCAGCG CTATTCTTGG CAATATAAAG AGCAAGGAGA ACAATCACCT CATTGTGTTG ATTACATCTC AGTCTCTCTC GCGCAATCAA GCTTCCACAA GAAGTAATAT TGGTACAATC TTACGCGTCC ATAATGTCTC ATTCGAGGAA GTGGATGGAG CAAATCCGCT AACCCGAGGA CGACGCGACG AACTCTTTGC TATATCACAA ATGCGCGGCG TGTACCCGCA GTTCTTTGTG GTAGACTATG AAACAGGGCT CACGTTATTT TTCTGCAACA GTGATTCTTT TTTCGGTGCC AATGAAGAAG GCTCTCTACC CAGGATACTC AATATTGCTG GTGTTGTGCA GAGCGCGATC GGAGGACATC AAGAAAGAAA TAGTACCATA GACGAAGCTC CTAAAGCAAA CAAGCACCTG TTTGAGCCAA AGAGGCAAAG CTCACATACT ACGGTTTCAA TTGACAGTGA AACTTCGGAG CCCTCTGTAG GACGGAACAG TTTGCTTTCG ATGTGGAAAA ATCTTGACAA GAAGAACACA TTAGTATTAA ATGGACACAG GAATTGACAA CAACCTGATT GTGAACGCGG AGAGAAGTTG TTAGCTGTTA GTCTGTACAT GTTAGTAGCA ATTTCTACAC TAATGCGTAC GACATGAGTT CATGCATGCA ATTTAAAGGA ACCCCTTCCA TTCGCCGCCT CAAGAAAGAA AACCGTTACT CTTTCTGTTT TATCAAGGCA GACAACTCTG AGGCAAACCC ATGCATTGTA GCTGTCTGCA AGCGCGAACA AGCTTTTGAC ACACTTTTGA TTATTGCAGC TGATGGCAGA TGCGACAGGG CACCGTCGTG GTGTCTCGTA ACAATTGTCT TAATAAGACG AGCAGCTCGC TCAACGCAAT GTGCATCACG ACTATTGAGA AGGGCATTGA GACAGGCATC GTAGCACCTT TCATCCGGCT CTATATCGGG ACGACCATGA ATGTGCAAAT CTAACATGTA GGACAATACC ACGCAAGCAT 
TTTCCGCAGA CTCCTTGGTT CTCGCATTTC TGAGAGACGT CAGTATTGCT GCAAAGACAA GTCTATCGAG CTTCAGCTTG TTTGAGGGAT CGGAGTCTAA AACCTGCATT TCTCGAAACA AATCGAAGGC AACCCTGCCA GATTTTACGT TCCGAAGGGA AGCGAAAGCA AAAATAACAG CAGTATACGC TCGACTTGTC GGTTGAACAC CCAAATTCTT CTGGTGTCCA AGTGTCTGCA ATGCTAAATC AGCGGACTCC TTGGATCTAA AGTTCGCCAT TGCTGAAATT ACAGCGGTGT ACAACCCAAC AGATTTCGGA TCCCATTGAA AAGTTCCTTT CTTAACCCCC GCCTCAATAC GATGTAATAA GGTAGAAGCG GTCACAGTGT CTTCGTACTT TTTCGTTTTT ACCAAAGCAT TGATGAGAAC CTGGTATGGA AACATGTCTA CAGTAAGCCG GTTCTCACCA TCTATCAAGA CGTCGAAAGC CTCTTTGAGC GAGTTTTCCT TACAAAGCAA GCCAATGCAC ATGTTGAAGG TGCCCGAATT TGGAGCACAT GGAAATCCGT GTTGCTCCGA GAGATTTATC ATCGAATTCA GAAGGGATAA GACATATTTC CCCCCGCTCC CTTGAGACAG GCGGAGCCAG CCATGAATAA CAAAATTGAA TGTGTCTGTT GTTGGAGGCG GCCCAGCTTG GCCAATAGCT GCTTCGACCA TAGCGACAAC ATGAGCATGC GCTTGTAGCA TTCTTCCACA ACGGACTTGA GCTTCCATGT AGGACATGTG TGTTGCTAAG TCCGGTCGAA
|
Protein sequence | MDFANGFGDS SDDLMENARI VSPYGDGETQ AFPADPWANF QYALPEGSSQ YKYRASRQRP TIGGSRSPSE PIPTPRERMV TSSKISTPHR HGMVNRGHAS RYEYSSDKRC DDVHVIFTDP SDGVGISVEK RSQQDLLRQK VREDTSRRSL QHSTSAQSMY RKRSENKVEI TSESTNAETD GLNLNWAFSR VQVRANSETE STTVWLNTDH RQDLEGEDIW LQEEHAFPLL STGASSRFSG PSGKDGLRKR TDKKLNRVRF AEPIQKPRSV PFLKTVSRET IREESSDPRS KKITHSDNRA WFVAQPKSIL RRRRFAGETV SHDPQYPQKN RPSSHRAAPQ RKSATSFLDT QGSLLSPIHS DRRPWDRISE TGSESLSPSY SDVEREKRVS LGPYHLQELN EMYPDPPLEL QFDDESTVVP ARASFIDTVA AVVVQAAVRR FLAQKVVPKK YGRNLGGAVF IEVMAAIKIQ SAFRGFWVRD SLNVDHFCAT MIQKWYRRHH QRHHYFADLS RIILVQSIWR RSIAREHAAF FLGSVITVQS LFRSYSARKK LYSGLTCLRK DTMAAVVIQS HWRTYACECN FIRDLVDILI VQSVVRTWLA RRHLSSLRSR AQSISGKKSP TVSKKYANQV AAQPTGSPRP GEANRNSATG QCYSSYRSVE ESSFSAILGN IKSKENNHLI VLITSQSLSR NQASTRSNIG TILRVHNVSF EEVDGANPLT RGRRDELFAI SQMRGVYPQF FVVDYETGLT LFFCNSDSFF GANEEGSLPR ILNIAGVVQS AIGGHQERNS TIDEAPKANK HLFEPKRQSS HTTVSIDSET SEPSVGRNSL LSMWKNLDKK NTLVLNGHRN